Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[PyTorch] Userbuffers support in operation-based API #1142

Merged
merged 31 commits into from
Nov 6, 2024

Commits on Aug 20, 2024

  1. Add Userbuffers support for column TP linear layer

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 20, 2024
    Configuration menu
    Copy the full SHA
    f2da5eb View commit details
    Browse the repository at this point in the history
  2. Add Userbuffers support for row TP linear layer

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 20, 2024
    Configuration menu
    Copy the full SHA
    90e0a41 View commit details
    Browse the repository at this point in the history

Commits on Aug 21, 2024

  1. Interpret linear+RS as row TP linear

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 21, 2024
    Configuration menu
    Copy the full SHA
    a520974 View commit details
    Browse the repository at this point in the history

Commits on Aug 22, 2024

  1. Add Userbuffers support for FP8 row TP linear layer

    Assumes FP8 RS, which is not a good assumption.
    
    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 22, 2024
    Configuration menu
    Copy the full SHA
    bb2e714 View commit details
    Browse the repository at this point in the history
  2. Debug bug with incorrect bias pointers in UB GEMM

    Bias pointers are not properly offset for different data chunks. Also removed logic for FP8 RS.
    
    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 22, 2024
    Configuration menu
    Copy the full SHA
    1e54b88 View commit details
    Browse the repository at this point in the history

Commits on Aug 23, 2024

  1. Add Userbuffers support for linear dgrad

    Test passes with row TP, fails with col TP.
    
    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 23, 2024
    Configuration menu
    Copy the full SHA
    80b9d42 View commit details
    Browse the repository at this point in the history

Commits on Aug 24, 2024

  1. Add Userbuffers support for linear wgrad

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 24, 2024
    Configuration menu
    Copy the full SHA
    e6ad571 View commit details
    Browse the repository at this point in the history
  2. Add support for grad bias

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 24, 2024
    Configuration menu
    Copy the full SHA
    bd5c61e View commit details
    Browse the repository at this point in the history

Commits on Aug 26, 2024

  1. Configuration menu
    Copy the full SHA
    db5a7e2 View commit details
    Browse the repository at this point in the history

Commits on Aug 27, 2024

  1. Fused cast-transpose-dbias

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 27, 2024
    Configuration menu
    Copy the full SHA
    d5f8a8b View commit details
    Browse the repository at this point in the history
  2. Support case where wgrad is optional

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 27, 2024
    Configuration menu
    Copy the full SHA
    cd0db1c View commit details
    Browse the repository at this point in the history
  3. Expand documentation

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 27, 2024
    Configuration menu
    Copy the full SHA
    6209910 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    38263fe View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    a98e2f2 View commit details
    Browse the repository at this point in the history
  6. Fix linter warnings

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Aug 27, 2024
    Configuration menu
    Copy the full SHA
    7aaef65 View commit details
    Browse the repository at this point in the history

Commits on Sep 9, 2024

  1. Configuration menu
    Copy the full SHA
    5d7f48a View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2024

  1. Merge branch 'main' into ub-ops

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    880486f View commit details
    Browse the repository at this point in the history
  2. Use recently added convenience functions in Float8Tensor

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    7d8e08b View commit details
    Browse the repository at this point in the history
  3. Respect autograd dtype

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    fd4e541 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    a0646d2 View commit details
    Browse the repository at this point in the history
  5. Fix missing imports

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    d77a9fb View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2024

  1. Configuration menu
    Copy the full SHA
    706d490 View commit details
    Browse the repository at this point in the history
  2. Respect PyT autocast dtype in bprop

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    1ed735d View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2024

  1. Merge branch 'main' into ub-ops

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Oct 18, 2024
    Configuration menu
    Copy the full SHA
    c2709d2 View commit details
    Browse the repository at this point in the history

Commits on Oct 19, 2024

  1. Fix linter warnings

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Oct 19, 2024
    Configuration menu
    Copy the full SHA
    98a6cf4 View commit details
    Browse the repository at this point in the history

Commits on Oct 31, 2024

  1. Merge branch 'main' into ub-ops

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Oct 31, 2024
    Configuration menu
    Copy the full SHA
    0242d83 View commit details
    Browse the repository at this point in the history
  2. Debug merge conflicts

    Signed-off-by: Tim Moon <tmoon@nvidia.com>
    timmoon10 committed Oct 31, 2024
    Configuration menu
    Copy the full SHA
    a297abb View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    12ca945 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    cbb25d6 View commit details
    Browse the repository at this point in the history

Commits on Nov 1, 2024

  1. Configuration menu
    Copy the full SHA
    2fab5c7 View commit details
    Browse the repository at this point in the history

Commits on Nov 5, 2024

  1. Merge branch 'main' into ub-ops

    Signed-off-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
    timmoon10 authored Nov 5, 2024
    Configuration menu
    Copy the full SHA
    0875f24 View commit details
    Browse the repository at this point in the history