🎯
Focusing
- University of Wisconsin–Madison
- https://cfc87.github.io/
- @FanchaoChen
Highlights
- Pro
Pinned
- Megatron-LM (public; forked from NVIDIA/Megatron-LM)
Ongoing research training transformer models at scale
Python
- nanoChatGPT (public; forked from sanjeevanahilan/nanoChatGPT)
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
Python