2023.0.0
Initial release of oneAPI for NVIDIA® GPUs!
This release was created from the intel/llvm repository at commit 0f579ba.
New Features
Support for CUDA® backend
SYCL™ Compiler
Support for sycl::half type
Support for bf16 builtins operating on storage types
Support for the SYCL builtins from relational, geometric, common and math categories
Support for sub_group extension
Support for group algorithms
Support for group_ballot intrinsic
Support for atomics with scopes and memory orders
Support for multiple streams in each queue to improve concurrent execution
Support for sycl::queue::mem_advise
Support for -ffast-math in CUDA libclc
Support for device side assert
Support for float and double exchange and compare exchange atomic operations in CUDA libclc
Enabled CXX standard library functions
Native event for a default-constructed sycl::event is now in the COMPLETE state
SYCL Library
Added bf16 builtins for fma, fmin and fmax
Support for sycl::aspect::fp16
Added tanh (for floats/halfs) and exp2 (for halfs) native definitions
Support for sycl::get_native(sycl::buffer)
Implemented mem_advise reset and managed concurrent memory checks
Support for element-wise operations on joint_matrix including bfloat16 support
Support for Unified Shared Memory (USM)