Skip to content

Releases: JuliaGPU/AMDGPU.jl

v0.2.11

20 Jul 14:02
62f88d4
Compare
Choose a tag to compare

AMDGPU v0.2.11

Diff since v0.2.10

Merged pull requests:

v0.2.10

16 Jul 18:00
6d29402
Compare
Choose a tag to compare

AMDGPU v0.2.10

Diff since v0.2.9

Merged pull requests:

v0.2.9

09 Jul 18:01
4590e6d
Compare
Choose a tag to compare

AMDGPU v0.2.9

Diff since v0.2.8

Merged pull requests:

v0.2.8

06 Jul 16:00
3073525
Compare
Choose a tag to compare

AMDGPU v0.2.8

Diff since v0.2.7

Merged pull requests:

v0.2.7

20 May 19:17
360c030
Compare
Choose a tag to compare

AMDGPU v0.2.7

Diff since v0.2.6

Closed issues:

  • "Spills" from adjacent views of ROCVector (#130)

Merged pull requests:

v0.2.6

06 Apr 19:08
f836632
Compare
Choose a tag to compare

AMDGPU v0.2.6

Diff since v0.2.5

Closed issues:

  • ROCArrays matrix multiplication not working (#103)
  • Data race in kernel packet writing? (#121)

Merged pull requests:

  • Add mark/wait synchronization system (#116) (@jpsamaroo)
  • CompatHelper: bump compat for "GPUCompiler" to "0.11" (#122) (@github-actions[bot])
  • Replace arrays with Refs in ccall. (#123) (@chriselrod)
  • Fix packet launch (#125) (@jpsamaroo)

v0.2.6 for Zenodo

09 Apr 23:06
f836632
Compare
Choose a tag to compare
v0.2.6 for Zenodo Pre-release
Pre-release
Merge pull request #116 from JuliaGPU/jps/mark-wait

Add mark/wait synchronization system

v0.2.5

29 Mar 19:03
308941e
Compare
Choose a tag to compare

AMDGPU v0.2.5

Diff since v0.2.4

Merged pull requests:

v0.2.4

26 Mar 00:03
9f387fa
Compare
Choose a tag to compare

AMDGPU v0.2.4

Diff since v0.2.3

Closed issues:

  • Implement execution contexts (#16)
  • Add/test broadcasting support to ROCArray (#12)
  • Add queue/device/system sync functionality (#24)
  • Support OpenCL.jl as device runtime (#23)
  • Distribute ROCR/ROCT via artifacts (#6)
  • Allow setting Private and Group segment sizes manually (#56)
  • FATAL ERROR: Symbol "ccalllib_libhsa-runtime64445"not found on AMDGPU (#73)
  • test failures and crashes on 580 (#92)
  • Tests allocate memory indefinitely (#106)
  • Check for invalid workgroup sizes (#110)
  • Add example for gridsize usage and workgroup sizing (#113)

Merged pull requests:

v0.2.3

05 Feb 19:00
3ffacab
Compare
Choose a tag to compare

AMDGPU v0.2.3

Diff since v0.2.2

Closed issues:

  • Add support for trap handlers (#8)
  • Unreachable reached in SIISelLowering.cpp due to unhandled AS (#76)
  • Ensure that CI tests all available external libraries (#85)
  • Only load libhsa-runtime64 major version 1 (#93)

Merged pull requests: