llm-evaluation-metrics

Here are 3 public repositories matching this topic...

confident-ai / deepeval

The LLM Evaluation Framework

evaluation-metrics evaluation-framework llm-evaluation llm-evaluation-framework llm-evaluation-metrics

Updated Dec 20, 2024
Python

zhuohaoyu / KIEval

[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models

machine-learning explainable-ai llm llm-evaluation llm-evaluation-toolkit llm-evaluation-framework llm-evaluation-metrics acl2024

Updated Jul 19, 2024
Python

ritwickbhargav80 / quick-llm-model-evaluations

This repo is for a Streamlit application that provides a user-friendly interface for evaluating large language models (LLMs) using the beyondllm package.

streamlit llms retrieval-augmented-generation llm-evaluation-metrics beyondllm

Updated Aug 29, 2024
Python
