DB Benchmark

project structure

db-benchmark
├── README.md
├── compose.yaml            <- compose file for docker containers
├── data                    <- data folder containing .dvc files
│   └── Books.json.gz.dvc
├── dbs                     <- dbs that implement helpers.db_connector class
│   ├── chromadb.py
│   ├── milvus.py
│   ├── qdrant.py
│   ├── vespadb.py
│   └── pgvector.py
├── helpers                 <- helper classes for data loading and db interaction   
│   ├── data_processor.py
│   └── dp_connector.py
├── dvc.yaml                <- dvc pipeline
├── main.py                 <- main file
├── params.yaml             <- dvc experiment parameters
└── requirements.txt

Prerequisites

docker
docker-compose

Setup

Step I: create environment

The first step is to create your environment.

conda create --name <env-name> python=3.11

Step II: install dependencies

The necessary modules for this python project can be installed with the given requirements.txt file.

conda activate <env-name>
pip install -r requirements.txt

Step III: downloading data

If you followed the steps before the dvc command is now available. Run the following command in the git root directory to download the data:

dvc update -R data/

Running the Benchmark

The benchmarking process involves several steps managed by DVC:

Running the DVC Pipeline:
- Execute the main DVC pipeline which orchestrates the data processing and benchmarking tasks with dvc exp run.
Reviewing Results:
- Use the dvc exp show command to visualize and analyze the results of the experiments conducted as part of the benchmark.

Results

We found the following results (times in miliseconds):

	insert	query	remove
chromadb	7.3517	5.8963	6.2306
pgvector	7.0311	2.7662	0.30817
vespa	8.7366	11.556	5.6302
milvus	3.8154	2.0659	2.9517
qdrant	6.9746	2.1554	6.8914

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DB Benchmark

project structure

Prerequisites

Setup

Step I: create environment

Step II: install dependencies

Step III: downloading data

Running the Benchmark

Results

About

Releases

Packages

Contributors 5

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.dvc		.dvc
data		data
dbs		dbs
helpers		helpers
.dvcignore		.dvcignore
.gitignore		.gitignore
README.md		README.md
compose.yaml		compose.yaml
dvc.yaml		dvc.yaml
main.py		main.py
params.yaml		params.yaml
requirements.txt		requirements.txt

oswdm/db-benchmark

Folders and files

Latest commit

History

Repository files navigation

DB Benchmark

project structure

Prerequisites

Setup

Step I: create environment

Step II: install dependencies

Step III: downloading data

Running the Benchmark

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages