Skip to content
Change the repository type filter

All

    Repositories list

    • diginext

      Public
      A developer-focused platform for app deployment & centralized cloud resource management.
      TypeScript
      GNU General Public License v3.0
      6000Updated Aug 7, 2024Aug 7, 2024
    • Generate ideal question-answers for testing RAG
      Python
      312201Updated Jul 9, 2024Jul 9, 2024
    • lilac

      Public
      Curate better data for LLMs
      Python
      Apache License 2.0
      89001Updated Jun 9, 2024Jun 9, 2024
    • topicGPT

      Public
      Code & Prompts for TopicGPT paper
      Python
      34000Updated Mar 28, 2024Mar 28, 2024
    • ragas

      Public
      Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
      Jupyter Notebook
      Apache License 2.0
      696000Updated Oct 18, 2023Oct 18, 2023
    • agents

      Public
      An Open-source Framework for Autonomous Language Agents
      Python
      Apache License 2.0
      410000Updated Sep 21, 2023Sep 21, 2023
    • Guide for fine-tuning Llama/CodeLlama models
      Python
      MIT License
      79000Updated Sep 17, 2023Sep 17, 2023
    • argilla

      Public
      ✨Argilla: the open-source data curation platform for LLMs
      Python
      Apache License 2.0
      365000Updated Sep 13, 2023Sep 13, 2023
    • h2ogpt

      Public
      Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports LLaMa2, llama.cpp, and more. Demo: https://gpt.h2o.ai/
      Python
      Apache License 2.0
      1.2k000Updated Aug 15, 2023Aug 15, 2023
    • Example models using DeepSpeed
      Python
      Apache License 2.0
      1k000Updated Aug 10, 2023Aug 10, 2023
    • A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
      Python
      Apache License 2.0
      60000Updated Aug 10, 2023Aug 10, 2023
    • discus

      Public
      Generate and enrich datasets on-demand to fine-tune LLMs. Discord: https://discord.gg/t6ADqBKrdZ
      Python
      MIT License
      6000Updated Aug 10, 2023Aug 10, 2023
    • trlx

      Public
      A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
      Python
      MIT License
      471000Updated Aug 8, 2023Aug 8, 2023
    • The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
      89000Updated Aug 2, 2023Aug 2, 2023
    • Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate).
      Python
      Apache License 2.0
      230000Updated Jul 27, 2023Jul 27, 2023
    • llmboxing

      Public
      this is to generate anonymous responses. Useful for RL
      Elixir
      2000Updated Jul 25, 2023Jul 25, 2023
    • Quick Start LLaMA models with multiple methods, and fine-tune 7B/65B with One-Click.
      Python
      GNU General Public License v3.0
      9.5k000Updated Jul 23, 2023Jul 23, 2023
    • WizardLM

      Public
      Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
      Python
      717000Updated Jul 13, 2023Jul 13, 2023
    • Label Studio is a multi-type data labeling and annotation tool with standardized output format
      Python
      Apache License 2.0
      2.4k000Updated Jul 13, 2023Jul 13, 2023
    • Generate question/answer training pairs out of raw text.
      Python
      Apache License 2.0
      28000Updated Jul 8, 2023Jul 8, 2023
    • Dromedary

      Public
      Dromedary: towards helpful, ethical and reliable LLMs.
      Python
      GNU General Public License v3.0
      86000Updated Jun 23, 2023Jun 23, 2023
    • A command-line interface to generate textual and conversational datasets with LLMs.
      Python
      19000Updated Jun 22, 2023Jun 22, 2023
    • Code and documentation to train Stanford's Alpaca models, and generate the data.
      Python
      Apache License 2.0
      4k000Updated Jun 7, 2023Jun 7, 2023
    • Python
      MIT License
      148000Updated May 25, 2023May 25, 2023
    • Original Implementation of Prompt Tuning from Lester, et al, 2021
      Python
      Apache License 2.0
      58000Updated May 24, 2023May 24, 2023
    • LongForm

      Public
      Instruction Tuning Dataset and Models for Long Text Generation with Corpus Extraction
      10000Updated May 23, 2023May 23, 2023
    • Aligning pretrained language models with instruction data generated by themselves.
      Python
      Apache License 2.0
      484000Updated Mar 27, 2023Mar 27, 2023
    • [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
      Python
      10000Updated Mar 7, 2023Mar 7, 2023
    • An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
      Python
      Apache License 2.0
      201000Updated Nov 4, 2022Nov 4, 2022
    • Fork of zero shot prompt consistency
      Python
      Apache License 2.0
      2000Updated Oct 15, 2022Oct 15, 2022