Pinned Loading
-
alpha_zero
alpha_zero PublicA PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
-
deep_rl_zoo
deep_rl_zoo Public archiveA collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
-
Llama3-FunctionCalling
Llama3-FunctionCalling PublicFine-tune Llama3 model to support function calling
-
InstructLLaMA
InstructLLaMA PublicImplements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.