Evalica, your favourite evaluation toolkit

Evalica [ɛˈʋalit͡sa] (eh-vah-lee-tsah) is a Python library that transforms pairwise comparisons into ranked lists of items. It offers convenient high-performant Rust implementations of the corresponding methods via PyO3, and additionally provides naïve Python code for most of them. Evalica is fully compatible with NumPy arrays and pandas data frames.

The logo was created using Recraft.

Installation

pip: pip install evalica
Anaconda: conda install conda-forge::evalica

Usage

Imagine that we would like to rank the different meals and have the following dataset of three comparisons produced by food experts.

Item X	Item Y	Winner
`pizza`	`burger`	`x`
`burger`	`sushi`	`y`
`pizza`	`sushi`	`tie`

Given this hypothetical example, Evalica takes these three columns and computes the outcome of the given pairwise comparison according to the chosen model. Note that the first argument is the column Item X, the second argument is the column Item Y, and the third argument corresponds to the column Winner.

>>> from evalica import elo, Winner
>>> result = elo(
...     ['pizza', 'burger', 'pizza'],
...     ['burger', 'sushi', 'sushi'],
...     [Winner.X, Winner.Y, Winner.Draw],
... )
>>> result.scores
pizza     1014.972058
burger     970.647200
sushi     1014.380742
Name: elo, dtype: float64

As a result, we obtain Elo scores of our items. In this example, pizza was the most favoured item, sushi was the runner-up, and burger was the least preferred item.

Item	Score
`pizza`	1014.97
`burger`	970.65
`sushi`	1014.38

Command-Line Interface

Evalica also provides a simple command-line interface, allowing the use of these methods in shell scripts and for prototyping.

$ evalica -i food.csv bradley-terry                
item,score,rank
Tacos,2.509025136024378,1
Sushi,1.1011561298265815,2
Burger,0.8549063627182466,3
Pasta,0.7403814336665869,4
Pizza,0.5718366915548537,5

Refer to the food.csv file as an input example.

Web Application

Evalica has a built-in Gradio application that can be launched as python3 -m evalica.gradio. Please ensure that the library was installed as pip install evalica[gradio].

Implemented Methods

Method	In Python	In Rust
Counting	✅	✅
Average Win Rate	✅	✅
Bradley–Terry	✅	✅
Elo	✅	✅
Eigenvalue	✅	✅
PageRank	✅	✅
Newman	✅	✅

Name		Name	Last commit message	Last commit date
Latest commit History 315 Commits
.github		.github
docs		docs
python/evalica		python/evalica
src		src
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
Cargo.toml		Cargo.toml
Chatbot-Arena.ipynb		Chatbot-Arena.ipynb
Evalica.svg		Evalica.svg
LICENSE		LICENSE
README.md		README.md
Tutorial.ipynb		Tutorial.ipynb
codecov.yml		codecov.yml
conftest.py		conftest.py
food-golden.csv		food-golden.csv
food.csv		food.csv
llmfao-golden.csv		llmfao-golden.csv
llmfao.csv		llmfao.csv
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
simple-golden.csv		simple-golden.csv
simple.csv		simple.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evalica, your favourite evaluation toolkit

Installation

Usage

Command-Line Interface

Web Application

Implemented Methods

Copyright

About

Releases 14

Sponsor this project

Contributors 2

Languages

License

dustalov/evalica

Folders and files

Latest commit

History

Repository files navigation

Evalica, your favourite evaluation toolkit

Installation

Usage

Command-Line Interface

Web Application

Implemented Methods

Copyright

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 14

Sponsor this project

Contributors 2

Languages