What is Self-Instruct? #3

AaronWard · 2023-10-25T21:02:05Z

AaronWard
Oct 25, 2023
Maintainer

Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off its own generations. Our pipeline generates instruction, input, and output samples from a language model, then prunes them before using them to finetune the original model. Applying our method to vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT_001, which is trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT_001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What is Self-Instruct? #3

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments

Select a reply

What is Self-Instruct? #3

AaronWard Oct 25, 2023 Maintainer

Replies: 0 comments

AaronWard
Oct 25, 2023
Maintainer