Skip to content
Change the repository type filter

All

    Repositories list

    • gorilla

      Public
      Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
      Python
      Apache License 2.0
      1k000Updated Nov 7, 2024Nov 7, 2024
    • Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
      Python
      Apache License 2.0
      4.3k000Updated Nov 4, 2024Nov 4, 2024
    • titu-stt

      Public
      Titu Speech To Text Development
      0100Updated Nov 1, 2024Nov 1, 2024
    • titu-tts

      Public
      Titu-tts is the high quality text to speech generation model
      0100Updated Oct 29, 2024Oct 29, 2024
    • titulm

      Public
      TituLM Development Library
      Shell
      1100Updated Oct 25, 2024Oct 25, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      1.9k000Updated Oct 22, 2024Oct 22, 2024
    • A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
      Python
      MIT License
      33000Updated Aug 13, 2024Aug 13, 2024
    • tiktoken

      Public
      tiktoken is a fast BPE tokeniser for use with OpenAI's models.
      Python
      MIT License
      856100Updated Jul 7, 2024Jul 7, 2024
    • torchtune

      Public
      A Native-PyTorch Library for LLM Fine-tuning
      Python
      BSD 3-Clause "New" or "Revised" License
      446000Updated Jun 22, 2024Jun 22, 2024
    • LLM training code for MosaicML foundation models
      Python
      Apache License 2.0
      531000Updated May 5, 2024May 5, 2024
    • All-in-one text de-duplication
      Python
      Apache License 2.0
      71000Updated Mar 28, 2024Mar 28, 2024
    • 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
      Python
      Mozilla Public License 2.0
      4.4k000Updated Jan 18, 2024Jan 18, 2024
    • BNSECData

      Public
      0000Updated Jan 16, 2024Jan 16, 2024
    • Python
      Mozilla Public License 2.0
      89000Updated Jan 13, 2024Jan 13, 2024
    • 🐸 - A general purpose model trainer, as flexible as it gets
      Python
      118000Updated Jan 7, 2024Jan 7, 2024
    • 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
      C++
      Mozilla Public License 2.0
      278000Updated Nov 16, 2023Nov 16, 2023
    • WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
      Python
      BSD 4-Clause "Original" or "Old" License
      1.3k000Updated Nov 15, 2023Nov 15, 2023
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
      Python
      Apache License 2.0
      1k000Updated Oct 22, 2023Oct 22, 2023
    • Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition
      Other
      0500Updated Oct 19, 2023Oct 19, 2023
    • This repository contains the dataset download links, dataset description and baseline model link
      Mozilla Public License 2.0
      0000Updated Sep 9, 2023Sep 9, 2023
    • NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
      Python
      Other
      400000Updated Aug 3, 2023Aug 3, 2023
    • Simple text to phones(IPA) converter for multiple languages, modified version available for handling punctuation marks
      Python
      GNU General Public License v3.0
      174000Updated Jun 14, 2023Jun 14, 2023
    • Simple but maybe too simple config management through python data classes. We use it for machine learning.
      Python
      MIT License
      33000Updated Apr 12, 2023Apr 12, 2023
    • Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
      JavaScript
      Mozilla Public License 2.0
      18000Updated Mar 24, 2023Mar 24, 2023
    • 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video games. The pack includes both male and female voices from >30 different voices, and all of the files can be used for commercial purposes (royalty free).
      MIT License
      18000Updated Mar 7, 2023Mar 7, 2023
    • Coqui CLI
      Python
      Apache License 2.0
      10000Updated Jan 22, 2023Jan 22, 2023
    • 🐸 collection of TTS papers
      Mozilla Public License 2.0
      68000Updated Nov 21, 2022Nov 21, 2022
    • 🐸STT integration examples
      Python
      Mozilla Public License 2.0
      46000Updated Sep 23, 2022Sep 23, 2022
    • 🫠 check your data, before you wreck your model
      Python
      MIT License
      5000Updated Aug 11, 2022Aug 11, 2022
    • 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
      MIT License
      140000Updated Jul 27, 2022Jul 27, 2022