
Evaluate few-shot performance on an unseen tool after training toolformer #14

Open
dmahan93 opened this issue Feb 26, 2023 · 0 comments

@dmahan93 (Collaborator)

Should compare with a few items:

- Generation with few-shot prompting on the base model, same setup as data generation
- Generation with few-shot prompting on the base model, with a setup similar to inference
- Generation with few-shot prompting on the trained model

Not sure whether there should also be an additional comparison for the trained toolformer using the data-generation hyperparams (pull out API calls > 10%, evaluate k calls, M shot) as well.
