Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Offloading example #1299

Open
wants to merge 8 commits into
base: main
Choose a base branch
from

Commits on Aug 23, 2024

  1. Added offloading support FP8 attention

    Signed-off-by: Selvaraj Anandaraj <selvaraja@login-eos02.eos.clusters.nvidia.com>
    Selvaraj Anandaraj committed Aug 23, 2024
    Configuration menu
    Copy the full SHA
    0ef5803 View commit details
    Browse the repository at this point in the history

Commits on Sep 4, 2024

  1. Configuration menu
    Copy the full SHA
    84857bb View commit details
    Browse the repository at this point in the history
  2. Update transformer_engine/pytorch/attention.py

    Co-authored-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
    Signed-off-by: Selvaraj Anandaraj <anandaraj@wisc.edu>
    sanandaraj5597 and ksivaman authored Sep 4, 2024
    Configuration menu
    Copy the full SHA
    e59e4ba View commit details
    Browse the repository at this point in the history

Commits on Sep 5, 2024

  1. Fix

    Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
    ksivaman committed Sep 5, 2024
    Configuration menu
    Copy the full SHA
    e54c03b View commit details
    Browse the repository at this point in the history
  2. Merge branch 'main' into main

    Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
    ksivaman authored Sep 5, 2024
    Configuration menu
    Copy the full SHA
    4e249fb View commit details
    Browse the repository at this point in the history

Commits on Oct 29, 2024

  1. Configuration menu
    Copy the full SHA
    7348d21 View commit details
    Browse the repository at this point in the history
  2. Added example for CPU offloading

    Signed-off-by: Selvaraj Anandaraj <selvaraja@login-eos02.eos.clusters.nvidia.com>
    Selvaraj Anandaraj committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    69866e2 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    dc89e7d View commit details
    Browse the repository at this point in the history