This project implements a self-improving language model evaluator trained on synthetic data, inspired by the paper "Self-Taught Evaluators". In each iteration, the model generates synthetic data, evaluates it, and is fine-tuned on the results, eliminating the need for costly human annotations.
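The iterative loop described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `self_training_loop`, `generate_pair`, `judge`, and `fine_tune` are hypothetical names, and the filtering rule (keep only judgments that agree with the synthetic label) is an assumption made for the example.

```python
from typing import Callable, List, Tuple

# (prompt, preferred response, worse response); a hypothetical record layout.
Pair = Tuple[str, str, str]


def self_training_loop(
    model,
    prompts: List[str],
    generate_pair: Callable,  # (model, prompt) -> (better, worse)
    judge: Callable,          # (model, prompt, a, b) -> "A" or "B"
    fine_tune: Callable,      # (model, examples) -> new model
    iterations: int = 2,
):
    """Run generate -> evaluate -> fine-tune for a fixed number of rounds."""
    kept_counts = []
    for _ in range(iterations):
        kept: List[Pair] = []
        for prompt in prompts:
            # 1. Generate a synthetic pair whose ordering is known by construction.
            better, worse = generate_pair(model, prompt)
            # 2. Evaluate: the current model judges the pair.
            #    Assumption: keep only judgments that match the synthetic label.
            if judge(model, prompt, better, worse) == "A":
                kept.append((prompt, better, worse))
        # 3. Fine-tune on the model's own filtered judgments.
        model = fine_tune(model, kept)
        kept_counts.append(len(kept))
    return model, kept_counts


# Toy stand-ins so the sketch runs end to end; a real setup would call an LLM.
def toy_generate_pair(model, prompt):
    return f"good answer to {prompt}", f"bad answer to {prompt}"


def toy_judge(model, prompt, a, b):
    return "A" if a.startswith("good") else "B"


def toy_fine_tune(model, examples):
    return model + 1  # the "model" here is just a version counter


final_model, kept_counts = self_training_loop(
    0, ["q1", "q2"], toy_generate_pair, toy_judge, toy_fine_tune, iterations=2
)
```

Passing the three steps in as callables keeps the loop independent of any particular model or training library, so the toy functions can be swapped for real generation, judging, and fine-tuning code.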
sanowl/Self-Taught-Evaluator