MTEB: Massive Text Embedding Benchmark
-
Updated
Jan 15, 2025 - Jupyter Notebook
MTEB: Massive Text Embedding Benchmark
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
SGPT: GPT Sentence Embeddings for Semantic Search
Generative Representational Instruction Tuning
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
Codebase for RetroMAE and beyond.
Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Efficient LLM inference on Slurm clusters using vLLM.
Go module for fetching embeddings from embeddings providers
a vector embedding database with multiple storage engines and AI embedding integrations
Simple script to compute CLIP-based scores given a DALL-e trained model.
A text embedding viewer for the Jupyter environment
Perform topic classification on news articles in several limited-labeled data regimes.
Code for embedding and retrieval research.
Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs
Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.
Flask API for generating text embeddings using OpenAI or sentence_transformers
Topic Embedding, Text Generation and Modeling using diffusion
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
Contextual embedding for text blobs.
Add a description, image, and links to the text-embedding topic page so that developers can more easily learn about it.
To associate your repository with the text-embedding topic, visit your repo's landing page and select "manage topics."