Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

allenai / open-instruct Public

Notifications You must be signed in to change notification settings
Fork 168
Star 1.2k

Code
Issues 10
Pull requests 24
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: allenai/open-instruct

Labels 9 Milestones 0

Labels 9 Milestones 0

New pull request New

24 Open 255 Closed

24 Open 255 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Decontamination scripts

#392 opened Oct 18, 2024 by pdasigi • Draft

Remove experiment name

#391 opened Oct 16, 2024 by vwxyzjn

Loading…

1

Prototype ppo + ray

#390 opened Oct 16, 2024 by vwxyzjn • Draft

Add ds configs, better analysis

#388 opened Oct 14, 2024 by natolambert

Loading…

More systematic and reproducible conversion of SFT datasets

#387 opened Oct 14, 2024 by yizhongw

Loading…

2

GPT model synthetic data generation

#386 opened Oct 12, 2024 by VictoriaGraf

Loading…

1

#381 opened Oct 8, 2024 by fabrahman

Loading…

2

Ground-Truth RL

#377 opened Oct 7, 2024 by hamishivi • Draft

take top bottom of generating n

#369 opened Sep 26, 2024 by mnoukhov

Loading…

Onlinedpo Support rm with different vocab size

#368 opened Sep 25, 2024 by vwxyzjn • Draft

Create synthetic MMLU via GPT-4

#367 opened Sep 24, 2024 by nouhadziri

Loading…

1

files for multinode dpo

#366 opened Sep 24, 2024 by jacob-morrison • Draft

Process reward modeling support

#362 opened Sep 21, 2024 by fabrahman

Loading…

Add corresponding lora finetuning with config scripts

#343 opened Sep 10, 2024 by notoookay

Loading…

2

Fix and improvements rejection sampling generation

#335 opened Sep 6, 2024 by vwxyzjn

Loading…

1

Adding support for latest OLMo architectures

#331 opened Sep 5, 2024 by natolambert

Loading…

9

Add new DPO config for data mixing

#319 opened Aug 30, 2024 by ValentinaPy

Loading…

1

Win rate plot experiment stuff

#317 opened Aug 29, 2024 by vwxyzjn

Loading…

Add new eval Benchmarks for evaluating long context

#282 opened Aug 23, 2024 by nouhadziri

Loading…

Try uv for package management (245ms project installation)

#281 opened Aug 22, 2024 by vwxyzjn • Draft

Added Claude and Gemini models

#273 opened Aug 19, 2024 by nouhadziri

Loading…

Add rejection sampling analysis

#253 opened Aug 13, 2024 by vwxyzjn • Draft

2

Does prompt make sense?

#243 opened Aug 12, 2024 by vwxyzjn

Loading…

3

Revamping the human feedback interface frontend with Next.js

#195 opened Jul 18, 2024 by darrensapalo • Draft

ProTip! Updated in the last three days: updated:>2024-10-16.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.