Skip to content

Pull requests: allenai/open-instruct

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Decontamination scripts
#392 opened Oct 18, 2024 by pdasigi Draft
Remove experiment name
#391 opened Oct 16, 2024 by vwxyzjn Loading…
Prototype ppo + ray
#390 opened Oct 16, 2024 by vwxyzjn Draft
Add ds configs, better analysis
#388 opened Oct 14, 2024 by natolambert Loading…
GPT model synthetic data generation
#386 opened Oct 12, 2024 by VictoriaGraf Loading…
Faeze configs
#381 opened Oct 8, 2024 by fabrahman Loading…
Ground-Truth RL
#377 opened Oct 7, 2024 by hamishivi Draft
take top bottom of generating n
#369 opened Sep 26, 2024 by mnoukhov Loading…
Create synthetic MMLU via GPT-4
#367 opened Sep 24, 2024 by nouhadziri Loading…
files for multinode dpo
#366 opened Sep 24, 2024 by jacob-morrison Draft
Process reward modeling support
#362 opened Sep 21, 2024 by fabrahman Loading…
Fix and improvements rejection sampling generation
#335 opened Sep 6, 2024 by vwxyzjn Loading…
Adding support for latest OLMo architectures
#331 opened Sep 5, 2024 by natolambert Loading…
Add new DPO config for data mixing
#319 opened Aug 30, 2024 by ValentinaPy Loading…
Win rate plot experiment stuff
#317 opened Aug 29, 2024 by vwxyzjn Loading…
Add new eval Benchmarks for evaluating long context
#282 opened Aug 23, 2024 by nouhadziri Loading…
Added Claude and Gemini models
#273 opened Aug 19, 2024 by nouhadziri Loading…
Add rejection sampling analysis
#253 opened Aug 13, 2024 by vwxyzjn Draft
Does prompt make sense?
#243 opened Aug 12, 2024 by vwxyzjn Loading…
ProTip! Updated in the last three days: updated:>2024-10-16.