-
Notifications
You must be signed in to change notification settings - Fork 327
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PyTorch] Userbuffers support in operation-based API #1142
Commits on Aug 20, 2024
-
Add Userbuffers support for column TP linear layer
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for f2da5eb - Browse repository at this point
Copy the full SHA f2da5ebView commit details -
Add Userbuffers support for row TP linear layer
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 90e0a41 - Browse repository at this point
Copy the full SHA 90e0a41View commit details
Commits on Aug 21, 2024
-
Interpret linear+RS as row TP linear
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for a520974 - Browse repository at this point
Copy the full SHA a520974View commit details
Commits on Aug 22, 2024
-
Add Userbuffers support for FP8 row TP linear layer
Assumes FP8 RS, which is not a good assumption. Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for bb2e714 - Browse repository at this point
Copy the full SHA bb2e714View commit details -
Debug bug with incorrect bias pointers in UB GEMM
Bias pointers are not properly offset for different data chunks. Also removed logic for FP8 RS. Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 1e54b88 - Browse repository at this point
Copy the full SHA 1e54b88View commit details
Commits on Aug 23, 2024
-
Add Userbuffers support for linear dgrad
Test passes with row TP, fails with col TP. Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 80b9d42 - Browse repository at this point
Copy the full SHA 80b9d42View commit details
Commits on Aug 24, 2024
-
Add Userbuffers support for linear wgrad
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for e6ad571 - Browse repository at this point
Copy the full SHA e6ad571View commit details -
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for bd5c61e - Browse repository at this point
Copy the full SHA bd5c61eView commit details
Commits on Aug 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for db5a7e2 - Browse repository at this point
Copy the full SHA db5a7e2View commit details
Commits on Aug 27, 2024
-
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for d5f8a8b - Browse repository at this point
Copy the full SHA d5f8a8bView commit details -
Support case where wgrad is optional
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for cd0db1c - Browse repository at this point
Copy the full SHA cd0db1cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6209910 - Browse repository at this point
Copy the full SHA 6209910View commit details -
Configuration menu - View commit details
-
Copy full SHA for 38263fe - Browse repository at this point
Copy the full SHA 38263feView commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for a98e2f2 - Browse repository at this point
Copy the full SHA a98e2f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7aaef65 - Browse repository at this point
Copy the full SHA 7aaef65View commit details
Commits on Sep 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 5d7f48a - Browse repository at this point
Copy the full SHA 5d7f48aView commit details
Commits on Sep 26, 2024
-
Merge branch 'main' into ub-ops
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 880486f - Browse repository at this point
Copy the full SHA 880486fView commit details -
Use recently added convenience functions in Float8Tensor
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 7d8e08b - Browse repository at this point
Copy the full SHA 7d8e08bView commit details -
Configuration menu - View commit details
-
Copy full SHA for fd4e541 - Browse repository at this point
Copy the full SHA fd4e541View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for a0646d2 - Browse repository at this point
Copy the full SHA a0646d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for d77a9fb - Browse repository at this point
Copy the full SHA d77a9fbView commit details
Commits on Oct 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 706d490 - Browse repository at this point
Copy the full SHA 706d490View commit details -
Respect PyT autocast dtype in bprop
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 1ed735d - Browse repository at this point
Copy the full SHA 1ed735dView commit details
Commits on Oct 18, 2024
-
Merge branch 'main' into ub-ops
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for c2709d2 - Browse repository at this point
Copy the full SHA c2709d2View commit details
Commits on Oct 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 98a6cf4 - Browse repository at this point
Copy the full SHA 98a6cf4View commit details
Commits on Oct 31, 2024
-
Merge branch 'main' into ub-ops
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0242d83 - Browse repository at this point
Copy the full SHA 0242d83View commit details -
Configuration menu - View commit details
-
Copy full SHA for a297abb - Browse repository at this point
Copy the full SHA a297abbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 12ca945 - Browse repository at this point
Copy the full SHA 12ca945View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for cbb25d6 - Browse repository at this point
Copy the full SHA cbb25d6View commit details
Commits on Nov 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2fab5c7 - Browse repository at this point
Copy the full SHA 2fab5c7View commit details
Commits on Nov 5, 2024
-
Merge branch 'main' into ub-ops
Signed-off-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 0875f24 - Browse repository at this point
Copy the full SHA 0875f24View commit details