Towards A Unified View of Answer Calibration for Multi-Step Reasoning 🚀

Towards A Unified View of Answer Calibration for Multi-Step Reasoning Hyperspheres

This repository is an offical project for the paper ''Towards A Unified View of Answer Calibration for Multi-Step Reasoning'', accepted by ACL 2024, Natural Language Reasoning and Structured Explanations Workshop.

Usage 🛠️

First put your OpenAI API key in a file named api_key.txt.

Setup

sh setup.sh

Run LLM generation

sh generate_answer.sh

./data/generated/* contains the cached generation results. ./data/restored/* contains the cached reformulated generation results and accuracy results.

Evaluation

sh eval_scores.sh

./scores/* contains the cached evaluation results.

How to Cite 📝

📋 Thank you very much for your interest in our work. If you use or extend our work, please cite the following paper:

@inproceedings{ACL2024_NLRSE_EvalReasoning,
    author    = {Shumin Deng and
                 Ningyu Zhang and
                 Nay Oo and
                 Bryan Hooi},
  title       = {Towards A Unified View of Answer Calibration for Multi-Step Reasoning},
  booktitle   = {Proceedings of the 2nd Workshop on Natural Language Reasoning and Structured Explanations (@ACL 2024)},
  publisher   = {Association for Computational Linguistics},
  pages       = {25--38}
  year        = {2024},
  url         = {https://aclanthology.org/2024.nlrse-1.3}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
eval_classical		eval_classical
eval_roscoe		eval_roscoe
index		index
prompts		prompts
scores		scores
LICENSE		LICENSE
README.md		README.md
api_key.txt		api_key.txt
constants.py		constants.py
eval_scores.sh		eval_scores.sh
eval_utils.py		eval_utils.py
generate_answer.py		generate_answer.py
generate_answer.sh		generate_answer.sh
requirements.txt		requirements.txt
restore_data.py		restore_data.py
setup.sh		setup.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Towards A Unified View of Answer Calibration for Multi-Step Reasoning 🚀

Usage 🛠️

Setup

Run LLM generation

Evaluation

How to Cite 📝

About

Releases

Packages

Languages

License

231sm/Eval_Multi-Step_Reasoning

Folders and files

Latest commit

History

Repository files navigation

Towards A Unified View of Answer Calibration for Multi-Step Reasoning 🚀

Usage 🛠️

Setup

Run LLM generation

Evaluation

How to Cite 📝

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages