Here are
6 public repositories
matching this topic...
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
Updated
Jun 10, 2024
Python
Updated
May 12, 2023
Python
This repository contains the lab work for Coursera course on "Generative AI with Large Language Models".
Updated
Dec 1, 2023
Jupyter Notebook
Developing a LLM response ranking reward model using HFRL except it's GPT-3.5 instead of human.
Updated
Dec 28, 2023
Jupyter Notebook
Library built on TextRL for easy training and usage of fine-tuned models using RLHF, a rewards model, and PPO
Updated
Feb 28, 2024
Python
Improve this page
Add a description, image, and links to the
reward-model
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
reward-model
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.