-
Notifications
You must be signed in to change notification settings - Fork 327
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PyTorch] Branching operations #1027
Commits on Jul 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9f61bca - Browse repository at this point
Copy the full SHA 9f61bcaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2cda094 - Browse repository at this point
Copy the full SHA 2cda094View commit details -
Add op that adds extra output to fuser
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 6bf2869 - Browse repository at this point
Copy the full SHA 6bf2869View commit details
Commits on Jul 18, 2024
-
Add fused op for GEMM+bias+add
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 6d78177 - Browse repository at this point
Copy the full SHA 6d78177View commit details -
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for da7a981 - Browse repository at this point
Copy the full SHA da7a981View commit details
Commits on Jul 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 872f863 - Browse repository at this point
Copy the full SHA 872f863View commit details -
Configuration menu - View commit details
-
Copy full SHA for 93c4040 - Browse repository at this point
Copy the full SHA 93c4040View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for 151df0b - Browse repository at this point
Copy the full SHA 151df0bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4e618cd - Browse repository at this point
Copy the full SHA 4e618cdView commit details
Commits on Jul 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for fadbb8a - Browse repository at this point
Copy the full SHA fadbb8aView commit details
Commits on Aug 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8b78f65 - Browse repository at this point
Copy the full SHA 8b78f65View commit details
Commits on Aug 3, 2024
-
Review suggestions from @ptrendx
Output tensor dtype and device take precedence over weight tensor in linear functional API. Move some index calculation to fuser constructor. Avoid some unnecessary dereferences. Signed-off-by: Tim Moon <tmoon@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2679fbf - Browse repository at this point
Copy the full SHA 2679fbfView commit details
Commits on Aug 5, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 33347f4 - Browse repository at this point
Copy the full SHA 33347f4View commit details
Commits on Aug 9, 2024
-
Update transformer_engine/pytorch/ops/fuser.py
Signed-off-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for a5418ab - Browse repository at this point
Copy the full SHA a5418abView commit details -
Configuration menu - View commit details
-
Copy full SHA for 912742b - Browse repository at this point
Copy the full SHA 912742bView commit details