#

hfrl

Here is 1 public repository matching this topic...

techandy42 / LLM_Reward_Model

Developing a LLM response ranking reward model using HFRL except it's GPT-3.5 instead of human.

language-model reward-model hfrl

Updated Dec 28, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the hfrl topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hfrl topic, visit your repo's landing page and select "manage topics."