230710 BeaverTails.md

https://arxiv.org/abs/2307.04657

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset (Jiaming Ji, Mickel Liu, Juntao Dai, Xuehai Pan, Chi Zhang, Ce Bian, Chi Zhang, Ruiyang Sun, Yizhou Wang, Yaodong Yang)