Adding resize(PadOp) vectorization analysis #3321
base: main
Conversation
This reverts commit d0addc4.
Added support for lowering `TernaryOp::where` with a vectorization factor, i.e.

```
predicate ? loadGlobalToLocal<...>(&dst[0], &src[i_src]) : dst.set(0.0f)
```

Currently this can only be done via manual scheduling. The follow-up PR on vectorization analysis (#3321) will make this applied automatically.
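For readers unfamiliar with the lowered pattern, here is a minimal host-side sketch of its semantics: a vectorized chunk is either copied whole from the source or the destination is filled with the pad value. The function name `predicatedVectorLoad` and the use of `memcpy` as a stand-in for the device-side vector load are assumptions for illustration, not nvfuser APIs.

```cpp
#include <cstring>

// Sketch of `predicate ? loadGlobalToLocal<...>(&dst[0], &src[i_src]) : dst.set(0.0f)`:
// either the whole VEC-wide chunk is copied, or the whole chunk is zero-filled.
template <int VEC>
void predicatedVectorLoad(float* dst, const float* src, bool predicate) {
  if (predicate) {
    // Stands in for the vectorized global-to-local load.
    std::memcpy(dst, src, VEC * sizeof(float));
  } else {
    // Out-of-bounds region of the pad: fill with the pad value.
    for (int i = 0; i < VEC; ++i) {
      dst[i] = 0.0f;
    }
  }
}
```

The key point is that the predicate is evaluated once per vector, not once per element, which is what makes the vectorized load legal.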
!test
Co-authored-by: Naoya Maruyama <naoyam@users.noreply.github.com>
Overall it looks good. I'd just like the few things I commented on to be addressed.
!test --pybench
Initiated testing with Python benchmarks, just in case.
Thanks, I'll address the issues you brought up, and also run through some real-size problems so we get a taste of the perf impact. 🙇
!test --pybench
!test --pybench
Adding conditional support of resize in vectorization analysis. This PR allows vectorized loads on `PadOp` directly, without using a cache load, which improves the performance of the generated kernel.

What's in this PR:

1. Add a propagation rule for resize in vectorization analysis. The propagation rule works as:
   i. For a supported resize: a) project the resize op to the frontier and clear `(frontier.begin(), resize_position)`; b) add the projected extent of the new resize op as `gcd(id_from, resize_op->leftExpand(), resize_op->rightExpand())`.
   ii. For an unsupported resize: clear `[frontier.begin(), resize_position]`; no behavior change.
2. Update `TensorView::cacheAfter` to opt in a set of uses to cache while leaving other uses unchanged. This is necessary for cases where an input is used by a `PadOp` as well as by other operations that rely on a cached load for vectorization.
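The projected-extent rule above can be sketched as follows. This is a minimal standalone illustration, assuming the extents are plain integers; the function name `projectedResizeExtent` is hypothetical and the real analysis operates on symbolic `Val`s in nvfuser.

```cpp
#include <cstdint>
#include <numeric>

// Sketch of the supported-resize rule: the vectorizable extent through a pad
// is limited by the alignment of both pad amounts as well as the extent of
// the incoming iter domain, hence the gcd of all three.
int64_t projectedResizeExtent(int64_t id_from_extent,
                              int64_t left_expand,
                              int64_t right_expand) {
  // gcd(x, 0) == x, so a pad of 0 on one side does not constrain the factor.
  return std::gcd(std::gcd(id_from_extent, left_expand), right_expand);
}
```

For example, an input extent of 16 padded by 4 on the left and 8 on the right would admit a vectorization factor of at most 4, while an unpadded dimension keeps its full extent.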
Follow up to #3261.
Work toward supporting RoPE performance. Design doc: