Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

remaining CUDA kernels not yet in main #3173

Open
12 of 29 tasks
jpivarski opened this issue Jul 2, 2024 · 1 comment
Open
12 of 29 tasks

remaining CUDA kernels not yet in main #3173

jpivarski opened this issue Jul 2, 2024 · 1 comment
Assignees
Labels
gpu Concerns the GPU implementation (backend = "cuda')

Comments

@jpivarski
Copy link
Member

jpivarski commented Jul 2, 2024

The following are completed in #3150 for n = 2:

  • awkward_ListArray_combinations
  • awkward_RegularArray_combinations_64

The following are in completed in #3149:

  • awkward_reduce_sum_bool_complex
  • awkward_reduce_sum_complex
  • awkward_reduce_countnonzero_complex
  • awkward_reduce_max_complex
  • awkward_reduce_min_complex
  • awkward_reduce_prod_bool_complex
  • awkward_IndexedArray_local_preparenext_64
  • awkward_ListOffsetArray_reduce_nonlocal_outstartsstops_64
  • awkward_RecordArray_reduce_nonlocal_outoffsets_64

The following is in development in #3149:

  • awkward_reduce_prod_complex

The following are for axis=-1 reduction:

  • awkward_reduce_argmax
  • awkward_reduce_argmin
  • awkward_reduce_argmax_complex
  • awkward_reduce_argmin_complex

The following are for general reduction:

  • awkward_ListOffsetArray_reduce_nonlocal_nextshifts_64
  • awkward_ListOffsetArray_reduce_nonlocal_preparenext_64

The following are for sorting:

  • awkward_ListOffsetArray_argsort_strings
  • awkward_NumpyArray_sort_asstrings_uint8
  • awkward_argsort
  • awkward_sort

The following are for uniqueness checking/computing:

  • awkward_IndexedArray_unique_next_index_and_offsets_64
  • awkward_NumpyArray_unique_strings
  • awkward_unique_offsets
  • awkward_unique_ranges
  • awkward_unique_ranges_bool

The following are for converting strings to_numpy:

  • awkward_NumpyArray_prepare_utf8_to_utf32_padded
  • awkward_NumpyArray_utf8_to_utf32_padded
@jpivarski jpivarski added the gpu Concerns the GPU implementation (backend = "cuda') label Jul 2, 2024
@jpivarski
Copy link
Member Author

@lgray requests the sorting kernels to be next.

@ianna ianna self-assigned this Jul 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
gpu Concerns the GPU implementation (backend = "cuda')
Projects
None yet
Development

No branches or pull requests

3 participants