feat: add some of the kernels using cuda.compute by maxymnaumchyk · Pull Request #3981 · scikit-hep/awkward

maxymnaumchyk · 2026-04-17T11:01:22Z

Closes #3978, closes #3997, closes #3998, closes #3999, closes #4000, closes #4001, closes #4002, closes #4003, closes #4004, closes #4005, closes #4006, closes #4007

These kernels are only ~2 times faster-->
IndexedArray_reduce_next_nonlocal_nextshifts_64 kernel before:

IndexedArray_reduce_next_nonlocal_nextshifts_64 kernel after:

IndexedArray_reduce_next_64 kernel before:

IndexedArray_reduce_next_64 kernel after:

IndexedArray_overlay_mask kernel before:

IndexedArray_overlay_mask kernel after:

…pute

…ing cuda.compute

github-actions · 2026-04-17T11:13:37Z

The documentation preview is ready to be viewed at http://preview.awkward-array.org.s3-website.us-east-1.amazonaws.com/PR3981

maxymnaumchyk · 2026-04-17T11:15:05Z

IndexedArray_reduce_next_64 kernel before:

IndexedArray_reduce_next_64 kernel after:

codecov · 2026-04-17T12:08:30Z

Codecov Report

❌ Patch coverage is 43.53448% with 131 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.04%. Comparing base (6f6d816) to head (a7ac8a7).

Files with missing lines	Patch %	Lines
src/awkward/_connect/cuda/_compute.py	43.53%	131 Missing ⚠️

❌ Your patch check has failed because the patch coverage (43.53%) is below the target coverage (98.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files

Files with missing lines	Coverage Δ
src/awkward/_backends/cupy.py	`100.00% <ø> (ø)`
src/awkward/_connect/cuda/_compute.py	`57.47% <43.53%> (-11.15%)`	⬇️

maxymnaumchyk · 2026-04-17T15:55:45Z

ByteMaskedArray_getitem_nextcarry kernel before:

ByteMaskedArray_getitem_nextcarry kernel after:

maxymnaumchyk · 2026-04-17T16:05:31Z

awkward_ByteMaskedArray_numnull kernel before:

awkward_ByteMaskedArray_numnull kernel after:

…uda.compute

maxymnaumchyk · 2026-04-21T09:30:52Z

awkward_RegularArray_getitem_jagged_expand kernel before:

awkward_RegularArray_getitem_jagged_expand kernel after:

…lay_mask

…uda.compute

maxymnaumchyk · 2026-04-21T17:43:45Z

awkward_UnionArray_simplify_one kernel before:

awkward_UnionArray_simplify_one kernel after:

…ompute

maxymnaumchyk · 2026-04-27T12:21:17Z

awkward_ListArray_broadcast_tooffsets kernel before:

awkward_ListArray_broadcast_tooffsets kernel after:

maxymnaumchyk · 2026-04-27T13:35:01Z

awkward_ListArray_localindex kernel before:

awkward_ListArray_localindex kernel after:

maxymnaumchyk · 2026-04-28T10:55:14Z

awkward_ListArray_compact_offsets kernel before:

awkward_ListArray_compact_offsets kernel after:

…ompute

maxymnaumchyk · 2026-04-28T15:34:06Z

awkward_ListArray_combinations_length kernel before:

awkward_ListArray_combinations_length kernel after:

maxymnaumchyk · 2026-05-04T15:07:40Z

awkward_ListArray_combinations kernel before:

awkward_ListArray_combinations kernel after:

Just in case, I also manually tested the new kernel with n = 3 and replacement = True.

Also, there is a possible optimization that can be done -- to use the calculated offsets directly from awkward_ListArray_combinations_length (now, they are being discarded).

…a.compute

maxymnaumchyk · 2026-05-05T09:47:10Z

The awkward_UnionArray_nestedfill_tags_index kernel turned out to be a little slower: 0.006263 vs 0.006439 seconds. Add just for archive.

…lay_mask

…overlay_mask

ianna

@maxymnaumchyk - Thanks! 12 more kernels migrated to cuda.compute! I'll enable auto-merge. The benchmarks will be updated after it is merged later today. Thanks.

maxymnaumchyk added 3 commits April 15, 2026 12:18

feat: add awkward_IndexedArray_overlay_mask kernel using cuda.compute

883eac9

feat: add awkward_IndexedArray_reduce_next_64 kernel using cuda.com…

ce870c7

…pute

feat: add IndexedArray_reduce_next_nonlocal_nextshifts_64 kernel us…

ed40791

…ing cuda.compute

maxymnaumchyk marked this pull request as ready for review April 17, 2026 13:56

maxymnaumchyk marked this pull request as draft April 17, 2026 13:57

feat: add ByteMaskedArray_getitem_nextcarry kernel using cuda.compute

ed9f4ee

maxymnaumchyk changed the title ~~feat: add some of the awkward_IndexedArray kernels using cuda.compute~~ feat: add some of the awkward_IndexedArray and ByteMaskedArray kernels using cuda.compute Apr 17, 2026

maxymnaumchyk added 2 commits April 20, 2026 13:52

feat: add awkward_ByteMaskedArray_numnull kernel using cuda.compute

394d017

feat: add awkward_RegularArray_getitem_jagged_expand kernel using c…

0ed2175

…uda.compute

maxymnaumchyk and others added 5 commits April 21, 2026 11:59

add an upper bound

2aa4c39

Merge branch 'main' into maxymnaumchyk/3978-awkward_indexedarray_over…

8ee1e6b

…lay_mask

style: pre-commit fixes

f117e70

feat: add awkward_RegularArray_getitem_jagged_expand kernel using c…

d491d37

…uda.compute

feat: add awkward_UnionArray_simplify_one kernel using cuda.compute

167bc94

feat: add awkward_ListArray_broadcast_tooffsets kernel using cuda.c…

93104c6

…ompute

maxymnaumchyk added 2 commits April 27, 2026 15:23

feat: add awkward_ListArray_localindex kernel using cuda.compute

219419c

fix the impl

731e572

maxymnaumchyk added 2 commits April 28, 2026 12:55

feat: add awkward_ListArray_compact_offsets kernel using cuda.compute

0cb3721

feat: add awkward_ListArray_combinations_length kernel using cuda.c…

5982197

…ompute

feat: add awkward_ListArray_combinations kernel using cuda.compute

4479ce2

feat: add awkward_UnionArray_nestedfill_tags_index kernel using cud…

1d1cc3d

…a.compute

maxymnaumchyk and others added 5 commits May 5, 2026 14:29

Merge branch 'main' into maxymnaumchyk/3978-awkward_indexedarray_over…

1b6ee9a

…lay_mask

fix the tests for kernels that are deliberately raising errors

8935bab

compare starts and stops separately

2428e5c

Merge branch 'main' into maxymnaumchyk/3978-awkward_indexedarray_over…

7b9a1dc

…lay_mask

ignore memptr argument for pylint

28d6206

maxymnaumchyk changed the title ~~feat: add some of the awkward_IndexedArray and ByteMaskedArray kernels using cuda.compute~~ feat: add some of the kernels using cuda.compute May 7, 2026

maxymnaumchyk mentioned this pull request May 7, 2026

awkward_UnionArray_nestedfill_tags_index.cu #4008

Open

maxymnaumchyk marked this pull request as ready for review May 7, 2026 09:46

ianna changed the base branch from main to awkward3 May 16, 2026 19:45

ianna and others added 2 commits May 18, 2026 15:23

Merge branch 'awkward3' into maxymnaumchyk/3978-awkward_indexedarray_…

1c891ad

…overlay_mask

style: pre-commit fixes

5b75fc4

ianna approved these changes May 18, 2026

View reviewed changes

ianna and others added 7 commits May 18, 2026 16:02

Add functions for indexing and repeating arrays

2d7b6b8

style: pre-commit fixes

972a456

return unary_transform call for segment_ids

ac3bb21

style: pre-commit fixes

9bd7682

update the awkward_IndexedArray_reduce_next_64 to work with offsets

8d85968

style: pre-commit fixes

1193083

Merge branch 'awkward3' into maxymnaumchyk/3978-awkward_indexedarray_…

a7ac8a7

…overlay_mask

ianna approved these changes May 19, 2026

View reviewed changes

maxymnaumchyk mentioned this pull request May 19, 2026

feat: migrate some more kernels to cuda.compute #4019

Merged

ianna merged commit 85e945d into scikit-hep:awkward3 May 19, 2026
35 of 38 checks passed

Conversation

maxymnaumchyk commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Apr 17, 2026

Uh oh!

maxymnaumchyk commented Apr 17, 2026

Uh oh!

codecov Bot commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

maxymnaumchyk commented Apr 17, 2026

Uh oh!

maxymnaumchyk commented Apr 17, 2026

Uh oh!

maxymnaumchyk commented Apr 21, 2026

Uh oh!

maxymnaumchyk commented Apr 21, 2026

Uh oh!

maxymnaumchyk commented Apr 27, 2026

Uh oh!

maxymnaumchyk commented Apr 27, 2026

Uh oh!

maxymnaumchyk commented Apr 28, 2026

Uh oh!

maxymnaumchyk commented Apr 28, 2026

Uh oh!

maxymnaumchyk commented May 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maxymnaumchyk commented May 5, 2026

Uh oh!

ianna left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

maxymnaumchyk commented Apr 17, 2026 •

edited

Loading

codecov Bot commented Apr 17, 2026 •

edited

Loading

maxymnaumchyk commented May 4, 2026 •

edited

Loading