https://arxiv.org/abs/2306.09479

Inverse Scaling: When Bigger Isn't Better (Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, Samuel R. Bowman, Ethan Perez)

Inverse scaling: the tasks where performance actually gets worse as model training scale grows have been compiled and written up again.

#llm