Skip to content

Actions: ReaLLMASIC/nanoGPT

Install Then Test Run Experiments script

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
541 workflow runs
541 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Make n_kv_group 6 by default to enable flash attn
Install Then Test Run Experiments script #68: Pull request #164 synchronize by gkielian
May 13, 2024 04:43 4m 25s gkielian:n_kv_group_6_default
May 13, 2024 04:43 4m 25s
Make n_kv_group 6 by default to enable flash attn
Install Then Test Run Experiments script #67: Pull request #164 synchronize by gkielian
May 13, 2024 04:39 4m 13s gkielian:n_kv_group_6_default
May 13, 2024 04:39 4m 13s
Make n_kv_group 6 by default to enable flash attn
Install Then Test Run Experiments script #66: Pull request #164 opened by gkielian
May 13, 2024 04:18 4m 18s gkielian:n_kv_group_6_default
May 13, 2024 04:18 4m 18s
Merge pull request #163 from Hrancheng/my-pr-135
Install Then Test Run Experiments script #65: Commit 10a0f21 pushed by gkielian
May 11, 2024 14:58 4m 13s master
May 11, 2024 14:58 4m 13s
Modified train.py to enable plotting of input/output statistics for constantmax
Install Then Test Run Experiments script #64: Pull request #163 synchronize by Hrancheng
May 11, 2024 13:41 4m 13s Hrancheng:my-pr-135
May 11, 2024 13:41 4m 13s
Modified train.py to enable plotting of input/output statistics for constantmax
Install Then Test Run Experiments script #63: Pull request #163 synchronize by gkielian
May 11, 2024 07:56 1m 23s Hrancheng:my-pr-135
May 11, 2024 07:56 1m 23s
A Github action workflow to test GQA combination with gating
Install Then Test Run Experiments script #62: Pull request #133 synchronize by Hrancheng
May 11, 2024 06:21 3m 49s Hrancheng:my-pr-119
May 11, 2024 06:21 3m 49s
Add scripts compatible with lichess dataset
Install Then Test Run Experiments script #61: Pull request #157 synchronize by klei22
May 4, 2024 03:51 4m 7s klei22:add_chess_dataset
May 4, 2024 03:51 4m 7s
Merge pull request #159 from gkielian/add_quantized_polymax
Install Then Test Run Experiments script #60: Commit 472d41e pushed by klei22
April 29, 2024 06:22 4m 11s master
April 29, 2024 06:22 4m 11s
Add polymax with relu2 forward pass (PolymaxQuan)
Install Then Test Run Experiments script #59: Pull request #159 synchronize by gkielian
April 29, 2024 05:38 4m 8s gkielian:add_quantized_polymax
April 29, 2024 05:38 4m 8s
Merge pull request #162 from gkielian/update_inspect_script
Install Then Test Run Experiments script #58: Commit 305c75f pushed by klei22
April 29, 2024 05:29 4m 0s master
April 29, 2024 05:29 4m 0s
Add polymax with relu2 forward pass (PolymaxQuan)
Install Then Test Run Experiments script #57: Pull request #159 synchronize by gkielian
April 29, 2024 05:27 4m 3s gkielian:add_quantized_polymax
April 29, 2024 05:27 4m 3s
Merge pull request #161 from klei22/add_korean_parallel_corpora
Install Then Test Run Experiments script #56: Commit c42743a pushed by gkielian
April 29, 2024 05:22 3m 57s master
April 29, 2024 05:22 3m 57s
Add Softplus activation and update inspect script
Install Then Test Run Experiments script #55: Pull request #162 opened by gkielian
April 29, 2024 05:22 4m 3s gkielian:update_inspect_script
April 29, 2024 05:22 4m 3s
Add scripts compatible wtih the korean parallel corpora
Install Then Test Run Experiments script #54: Pull request #161 opened by klei22
April 29, 2024 02:18 4m 13s klei22:add_korean_parallel_corpora
April 29, 2024 02:18 4m 13s
Merge pull request #160 from gkielian/exppolymax_ln_of_a_fix
Install Then Test Run Experiments script #53: Commit a8a138b pushed by klei22
April 28, 2024 18:52 4m 7s master
April 28, 2024 18:52 4m 7s
Fix non euler base calculation for exppolymax
Install Then Test Run Experiments script #52: Pull request #160 opened by gkielian
April 23, 2024 19:49 4m 11s gkielian:exppolymax_ln_of_a_fix
April 23, 2024 19:49 4m 11s
Add polymax with relu2 forward pass (PolymaxQuan)
Install Then Test Run Experiments script #51: Pull request #159 opened by gkielian
April 22, 2024 19:28 4m 2s gkielian:add_quantized_polymax
April 22, 2024 19:28 4m 2s
Merge pull request #158 from klei22/add_ood_addition
Install Then Test Run Experiments script #50: Commit 04ad5f9 pushed by gkielian
April 22, 2024 06:20 4m 5s master
April 22, 2024 06:20 4m 5s
Add scripts for testing out-of-distribution addition
Install Then Test Run Experiments script #49: Pull request #158 synchronize by klei22
April 22, 2024 04:29 3m 55s klei22:add_ood_addition
April 22, 2024 04:29 3m 55s
Add scripts for testing out-of-distribution addition
Install Then Test Run Experiments script #48: Pull request #158 opened by klei22
April 22, 2024 04:13 4m 7s klei22:add_ood_addition
April 22, 2024 04:13 4m 7s
Add scripts compatible with lichess dataset
Install Then Test Run Experiments script #47: Pull request #157 synchronize by klei22
April 21, 2024 07:14 4m 3s klei22:add_chess_dataset
April 21, 2024 07:14 4m 3s
Add scripts compatible with lichess dataset
Install Then Test Run Experiments script #46: Pull request #157 opened by klei22
April 20, 2024 17:57 4m 1s klei22:add_chess_dataset
April 20, 2024 17:57 4m 1s
Merge pull request #156 from gkielian/tidy_implementation
Install Then Test Run Experiments script #44: Commit 80996c6 pushed by klei22
April 20, 2024 15:03 4m 6s master
April 20, 2024 15:03 4m 6s
Tidy implementation
Install Then Test Run Experiments script #43: Pull request #156 opened by gkielian
April 20, 2024 01:31 3m 52s gkielian:tidy_implementation
April 20, 2024 01:31 3m 52s
ProTip! You can narrow down the results and go further in time using created:<2024-04-20 or the other filters available.