Pinned
-
time_llm (Public)
A transformer coded from scratch. Designed to make LLMs special-token aware.
Python
-
dpo_sample (Public)
Implementation of DPO for an RLHF pipeline. Just upload a CSV and let the pipeline handle the training.
Python 1
-
Gymnasium-Robotics (Public, forked from bstadie/Gymnasium-Robotics)
Personal environments added to experiment with the robustness of vision encoders for visuomotor learning. A collection of robotics simulation environments for reinforcement learning.
Python 1
-
curiosity_redteam (Public, forked from Improbable-AI/curiosity_redteam)
Official implementation of the ICLR'24 paper "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU).
Jupyter Notebook
-
Metaworld (Public, forked from Farama-Foundation/Metaworld)
Modifications to the environments to make visual data collection easier. A collection of robotics environments geared towards benchmarking multi-goal reinforcement learning.
Python
-
jax_demo (Public)
(WIP) This repo is for those who have deep learning experience with torch and want to learn JAX, a lesser-known but still powerful library.
Jupyter Notebook