
Merge OpenAI Triton commit 4dac289 #265

Merged: 13 commits into llvm-target on Jan 16, 2024

Conversation

whitneywhtsang
Contributor

@whitneywhtsang whitneywhtsang commented Jan 16, 2024

This PR changes the Triton base from bbfdc0d to 4dac289 (Jan 11).

Please do not squash and merge this PR.

ptillet and others added 12 commits January 9, 2024 22:21
AMD is enabled by default, but not ripe for usage (not tested). Lots of
work will be necessary to make everything robust and maintainable.
Solves triton-lang/triton#2898 .

With the [MLIR VS
Code](https://marketplace.visualstudio.com/items?itemName=llvm-vs-code-extensions.vscode-mlir)
plugin, here is how the result looks:

<img width="1195" alt="image"
src="https://github.com/openai/triton/assets/23236638/529c02a0-6448-4221-90fc-78d5d416356e">

Further work will involve changing the file extension to `.mlir` rather
than `.ttir`.
…o avoid being dependent on numpy by default (#2904)

Fixes triton-lang/triton#2899 .
…tup.py (#2906)

Init submodule before trying to check if something is in it
…ods of defining target link libraries (#2907)

CMake requires that you either specify the PUBLIC/PRIVATE keyword in
target_link_libraries or you don't; mixing the two styles is not
supported.
…s (#2908)

* Adding a new `tl.clamp(x, min, max, propagate_nan)` function to the
Triton language. It is lowered to the sequence `minimum(maximum(x, min), max)`
in the general case, and to `min.xorsign.abs` inline assembly when the
symmetric pattern `clamp(x, -limit, limit)` is detected.
* Refactoring the `tl.PropagateNan` enum so it is defined directly in
MLIR and exported to the Python frontend.
* New tests for clamp and symmetric clamp.
Those tests are deprecated, since we now have comprehensive
test_conversions.
…t now (#2911)

This PR triton-lang/triton#2887 removes
`third_party/triton_shared`, so the corresponding test should be
removed as well. Otherwise it will fail (and indeed now fails) all the
CI tests.
On Hopper, when storing an mma tensor to shared memory we can use stmatrix
to reduce the number of store instructions. This gives a small
improvement to the epilogue for fp16 output. It will later be combined
with cp.async.bulk to improve performance further.
`DistributedEncodingTrait::getCTAOrder()` returns a SmallVector by
value, and the temporary is destroyed as soon as it is assigned to
`ref`; `ref` then becomes a dangling reference.

To prevent that, we now use a vector instead of an array reference.
@whitneywhtsang whitneywhtsang self-assigned this Jan 16, 2024
@whitneywhtsang whitneywhtsang force-pushed the whitneywhtsang/merge branch 2 times, most recently from e505031 to 7e28281 Compare January 16, 2024 03:41
@whitneywhtsang whitneywhtsang changed the title Merge OpenAI Triton commit 9a38395 Merge OpenAI Triton commit 4dac289 Jan 16, 2024
@whitneywhtsang whitneywhtsang merged commit 7f911ad into llvm-target Jan 16, 2024
2 of 3 checks passed
@whitneywhtsang whitneywhtsang deleted the whitneywhtsang/merge branch January 16, 2024 15:46
8 participants