- [In-Depth Understanding of the Emergent Abilities of Language Models]
- [Dissecting and Tracing the Origins of GPT-3.5's Capabilities]
- [Towards Complex Reasoning: the Polaris of Large Language Models]
- [GPT3] Language Models are Few-Shot Learners, NeurIPS 2020
- Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity, ACL 2022
- Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers
- Transformers learn in-context by gradient descent
- Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning
- [InstructGPT] Training language models to follow instructions with human feedback, NeurIPS 2022
- [T0] Multitask Prompted Training Enables Zero-Shot Task Generalization, ICLR 2022
- [Flan-T5/PaLM] Scaling Instruction-Finetuned Language Models
- [Flan2022] The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
- InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning, EMNLP 2022
- Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes, ACL 2023 Findings
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models, NeurIPS 2022
- Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models
- Self-Consistency Improves Chain of Thought Reasoning in Language Models, ICLR 2023 (a minimal decoding sketch follows this list)
- Multimodal Chain-of-Thought Reasoning in Language Models
- Complexity-Based Prompting for Multi-Step Reasoning
- Generated Knowledge Prompting for Commonsense Reasoning, ACL 2022
- Generating Training Data with Language Models: Towards Zero-Shot Language Understanding, NeurIPS 2022
- Self-Instruct: Aligning Language Models with Self-Generated Instructions
- Generate rather than Retrieve: Large Language Models are Strong Context Generators, ICLR 2023
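
Several entries above (chain-of-thought prompting, self-consistency, complexity-based prompting) build on the same decoding recipe: prompt the model with a worked reasoning example, sample several reasoning paths at non-zero temperature, and majority-vote over the extracted final answers. The sketch below illustrates that recipe in plain Python; it is a minimal illustration, not code from any of the papers. `sample_reasoning_path` is a hypothetical stand-in for an LLM API call, and the prompt, question, and canned completions are made-up examples.

```python
import random
import re
from collections import Counter

# One worked example ("chain of thought") in the prompt; illustrative only.
COT_PROMPT = """Q: A farm has 3 pens with 4 sheep each. 2 sheep are sold. How many sheep remain?
A: There are 3 * 4 = 12 sheep in total. After selling 2, 12 - 2 = 10 remain. The answer is 10.

Q: {question}
A:"""


def sample_reasoning_path(prompt: str, temperature: float = 0.7) -> str:
    """Hypothetical stand-in for a sampled LLM completion.

    A real implementation would call a language model with temperature > 0
    so that each call can return a different reasoning path; the canned
    outputs below only mimic that behaviour.
    """
    return random.choice([
        "The cafeteria starts with 23 apples, uses 20, so 3 are left; buying 6 more gives 3 + 6 = 9. The answer is 9.",
        "23 - 20 = 3 apples remain, then 3 + 6 = 9 after the purchase. The answer is 9.",
        "23 + 6 = 29 apples, minus the 20 used leaves 29 - 20 = 9. The answer is 9.",
        "23 - 20 = 3, then 3 + 6 = 8. The answer is 8.",  # a faulty path that the vote can overrule
    ])


def extract_answer(completion: str) -> str | None:
    """Pull the final answer out of a 'The answer is X.' style conclusion."""
    match = re.search(r"[Tt]he answer is\s*([^.\n]+)", completion)
    return match.group(1).strip() if match else None


def self_consistent_answer(question: str, num_samples: int = 10) -> str:
    """Sample several chain-of-thought paths and majority-vote the final answers."""
    prompt = COT_PROMPT.format(question=question)
    answers = [extract_answer(sample_reasoning_path(prompt)) for _ in range(num_samples)]
    votes = Counter(a for a in answers if a is not None)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    q = "The cafeteria had 23 apples. They used 20 for lunch and bought 6 more. How many do they have?"
    print(self_consistent_answer(q))  # majority answer, e.g. "9"
```

With `num_samples = 1` this reduces to plain chain-of-thought decoding; the self-consistency paper reports that voting over many sampled paths is substantially more accurate on multi-step arithmetic and commonsense questions.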