Skip to content

Latest commit

 

History

History
3 lines (2 loc) · 207 Bytes

210714 Deduplicating Training Data Makes Language Models Better.md

File metadata and controls

3 lines (2 loc) · 207 Bytes

https://arxiv.org/abs/2107.06499

Deduplicating Training Data Makes Language Models Better (Katherine Lee, Daphne Ippolito, Andrew Nystrom, Chiyuan Zhang, Douglas Eck, Chris Callison-Burch, Nicholas Carlini)