v1.2-RC1
Pre-release
Pre-release
pvelesko
released this
04 Sep 07:41
·
42 commits
to main
since this release
What's Changed
- Refactor WaitForThreadExit by @pvelesko in #752
- Fix #757: skip texture tests with iGPU+OpenCL when USM=ON by @franz in #758
- adjust modules due to NFS going down by @pvelesko in #766
- Fix tests were unintentinally skipped by @linehill in #764
- remove Unit_hipMemsetFunctional_ZeroSize_hipMemsetD32 from exclusion list by @pvelesko in #767
- update ROCm-Device-Libs by @pvelesko in #765
- page lock runner test by @pvelesko in #773
- Various improvements by @pvelesko in #774
- Dynamic event pools by @pvelesko in #771
- Update cpp-linter-action version by @pvelesko in #777
- Map device built-ins to compiler built-ins by @linehill in #763
- Level-zero-premature-exit by @pvelesko in #778
- Add sanity check for catching unexpected atomic built-ins by @linehill in #706
- Use a fence for syncing RCL by @pvelesko in #688
- default to Debug by @pvelesko in #776
- Adjustments for future LLVM-18 release by @linehill in #714
- SYCL-HIP Interop - Drop RCL/ICL Quer by @pvelesko in #781
- OpenCL Event Cleanup by @pvelesko in #788
- OpenCL: Fix indirect USM pointer related issues by @linehill in #790
- Add CHIP_LAZY_JIT environment option to control JIT timing by @linehill in #786
- Remove SPIR-V version check in the parser by @linehill in #787
- Backend handles refactor by @pvelesko in #789
- Fixup exluded tests by @pvelesko in #795
- Add CHIP_DEVICE_TYPE to documentation by @karlwessel in #800
- Fix HIP float intrinsics were mapped double built-ins by @linehill in #793
- fix name of the cuda compiler script by @karlwessel in #802
- OpenCL BE: set CHIP_USE_INTEL_USM on by default by @linehill in #791
- Sample and Test Profiling by @pvelesko in #804
- Linter Fix include complaint by @pvelesko in #805
- Changes to reduce kernel launch overheads by @linehill in #794
- Fix Event Collection by @pvelesko in #803
- OpenCL: Skip SVM pointer annotation if possible by @linehill in #785
- Remove a confusing already registered and mapped warning by @linehill in #809
- Level Zero Refactor + Bugfixes by @pvelesko in #817
- Refactor Known Failing Tests by @pvelesko in #822
- Fix called incorrect compiler built-ins by @linehill in #820
- Internalize
__device__
functions by @linehill in #819 - Implement FencedCmdLists by @pvelesko in #823
- Rebase HIP 6.x + Update hip-tests by @pvelesko in #796
- HIPCC Fixes by @pvelesko in #827
- Add CHIP_BUILD_HIPBLAS option by @pvelesko in #831
- OpenCL: Use non-profiling queue, switch to profiling when needed by @linehill in #814
- Fixes scripts/configure_llvm.sh by @linehill in #835
- Use loginfo for printing device info by @pvelesko in #839
- OpenCL: Fix memory leak / OoM and stack overflow by @linehill in #837
- Fix bunch texture cases by @linehill in #842
- Ubuntu Fixes by @pvelesko in #825
- Small Fixes by @pvelesko in #844
- Various small optimizations by @linehill in #816
- Level Zero - Fix OOM & Improve Thread Safety by @pvelesko in #845
- Add a workaround for name mangling issue with PowerVR OpenCL by @franz in #828
- Add libCEED to testing + Update hipBLAS w/sync by @pvelesko in #847
- update spirv_hip_complex.h header by @pvelesko in #856
- rtdevlib: fix function signature mismatches by @linehill in #851
- OpenCL: Support devices with cl_ext_buffer_device_address by @linehill in #830
- Add SKIP_TESTS_WITH_DOUBLES Option by @pvelesko in #826
- [HipBLAS] Fix hiblas.h and hipsolver header conflicts by @pvelesko in #852
- Small Fixes by @pvelesko in #862
- New CUDA compiler by @pvelesko in #858
- LLVM Configure script changes by @pvelesko in #864
- known_failures.yaml add hostname key by @pvelesko in #867
- spirv-extractor link fix by @pvelesko in #871
- update configure_llvm for IPO by @pvelesko in #870
- Submodules track branches by @pvelesko in #872
- Fix math function j1 typo in dp_math.hh by @jjennychen in #876
- fixed cudaMallocManaged function parameter type issue by @jjennychen in #878
- CUDA Compiler Refactor by @pvelesko in #875
- Docker Images + update linter github action by @pvelesko in #879
- Update DockerfileFull by @pvelesko in #881
- Implement missing host-side math functions by @pvelesko in #884
- Adding runtime error conversion for Level0 backend by @jjennychen in #886
- Fix 885 by @pvelesko in #889
- Fix 887 by @pvelesko in #888
- handle relocatable code flags cucc by @pvelesko in #892
- Add more implicit casts to dim3 by @pvelesko in #895
- update HIPCC to preserve ordering by @pvelesko in #899
- spirv_hip_fp16.h header file updates by @jjennychen in #896
- ARM CI by @pvelesko in #903
- skip kernel annotation on CPU by @pvelesko in #905
- use github.sha for docker by @pvelesko in #907
- docker ref fix by @pvelesko in #908
- docker build only on merge to main by @pvelesko in #909
- Expand the use of error maps by @pvelesko in #891
- Ajust known_failures for abort,assert by @pvelesko in #906
- Properly annotate Intel USM kernels by @pvelesko in #911
- Make adjustments for LLVM-19 by @linehill in #901
- Fix device-side functions by @pvelesko in #913
- Enable building of hipFFT by @pvelesko in #912
- OpenCL Backend Fixes by @pvelesko in #914
- hipStreamSemantics Fixes by @pvelesko in #917
- Cleanup by @pvelesko in #918
New Contributors
- @karlwessel made their first contribution in #800
- @jjennychen made their first contribution in #876
Full Changelog: v1.1...v1.2-RC1