Skip to content

Actions: TablewareBox/evals

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
91 workflow runs
91 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update rag_reaction_extract.py
Run unit tests #35: Commit c452aac pushed by TablewareBox
March 9, 2024 05:24 2m 5s main
March 9, 2024 05:24 2m 5s
fix reaction_qa and oled_attribute data & postprocessing
Run unit tests #34: Commit dc20981 pushed by TablewareBox
March 9, 2024 05:23 2m 21s main
March 9, 2024 05:23 2m 21s
fix bad cases in oled_attribute
Run unit tests #33: Commit a184c42 pushed by TablewareBox
March 9, 2024 03:54 2m 13s main
March 9, 2024 03:54 2m 13s
add more oled_attribute data, fix drug scipaper_hasmol data
Run unit tests #32: Commit 1baab00 pushed by TablewareBox
March 9, 2024 03:19 2m 31s main
March 9, 2024 03:19 2m 31s
update drug dataset
Run unit tests #31: Commit 8d44bbb pushed by TablewareBox
March 8, 2024 08:02 2m 7s main
March 8, 2024 08:02 2m 7s
Update problems from macro-F1 score
Run unit tests #30: Commit c29d093 pushed by Linmj-Judy
March 8, 2024 05:19 2m 5s main
March 8, 2024 05:19 2m 5s
update reaction_ord data
Run unit tests #29: Commit 0a7dc20 pushed by TablewareBox
March 8, 2024 00:40 2m 12s main
March 8, 2024 00:40 2m 12s
simplify reaction JSON diff using deepdiff
Run unit tests #28: Commit b6fb516 pushed by TablewareBox
March 8, 2024 00:04 2m 40s main
March 8, 2024 00:04 2m 40s
Diff utils for ORD reaction json comparison
Run unit tests #27: Commit 79260b1 pushed by TablewareBox
March 7, 2024 23:41 2m 1s main
March 7, 2024 23:41 2m 1s
example on ORD-style reaction extraction
Run unit tests #26: Commit 9650bf9 pushed by TablewareBox
March 7, 2024 23:40 2m 24s main
March 7, 2024 23:40 2m 24s
add multimodal prompt truncation for gemini
Run unit tests #25: Commit c0ccfb6 pushed by TablewareBox
March 7, 2024 16:21 2m 12s main
March 7, 2024 16:21 2m 12s
uni-finder compatibility and update for 03.07 version
Run unit tests #24: Commit a52c254 pushed by TablewareBox
March 7, 2024 10:45 2m 14s main
March 7, 2024 10:45 2m 14s
sol_QA_update
Run new evals #8: Pull request #14 synchronize by ChitandaErumanga
March 7, 2024 10:08 2m 15s solubility_reaction
March 7, 2024 10:08 2m 15s
sol_QA_update
Run unit tests #23: Pull request #14 synchronize by ChitandaErumanga
March 7, 2024 10:08 2m 2s solubility_reaction
March 7, 2024 10:08 2m 2s
Asserting no-image in gemini-pro-vision model
Run unit tests #22: Commit 36e6f88 pushed by TablewareBox
March 7, 2024 05:22 2m 9s main
March 7, 2024 05:22 2m 9s
fix gemini
Run unit tests #21: Commit 11fe08c pushed by TablewareBox
March 6, 2024 08:53 2m 15s main
March 6, 2024 08:53 2m 15s
update pubmedqa
Run unit tests #20: Commit 2f14826 pushed by Caixc97
March 5, 2024 08:20 2m 14s main
March 5, 2024 08:20 2m 14s
Merge remote-tracking branch 'origin/main'
Run unit tests #19: Commit 6002593 pushed by Caixc97
March 3, 2024 20:21 2m 3s main
March 3, 2024 20:21 2m 3s
Merge pull request #15 from Caixc97/main
Run unit tests #18: Commit 471b8ed pushed by Caixc97
March 3, 2024 20:20 2m 15s main
March 3, 2024 20:20 2m 15s
sol_QA_update
Run new evals #6: Pull request #14 synchronize by ChitandaErumanga
March 1, 2024 09:59 2m 10s solubility_reaction
March 1, 2024 09:59 2m 10s
sol_QA_update
Run unit tests #16: Pull request #14 synchronize by ChitandaErumanga
March 1, 2024 09:59 2m 8s solubility_reaction
March 1, 2024 09:59 2m 8s
sol_QA_update
Run new evals #5: Pull request #14 opened by ChitandaErumanga
March 1, 2024 06:27 2m 6s solubility_reaction
March 1, 2024 06:27 2m 6s
sol_QA_update
Run unit tests #15: Pull request #14 opened by ChitandaErumanga
March 1, 2024 06:27 2m 19s solubility_reaction
March 1, 2024 06:27 2m 19s
add retry and multiple api_keys support for genimi models
Run unit tests #14: Commit ceffa0b pushed by TablewareBox
February 29, 2024 03:15 2m 7s main
February 29, 2024 03:15 2m 7s
fix balance_chemical_equations: replace CI with Cl, modify incorrect …
Run unit tests #13: Commit 76de743 pushed by TablewareBox
February 29, 2024 02:47 2m 16s main
February 29, 2024 02:47 2m 16s