Skip to content
Change the repository type filter

All

    Repositories list

    • A package for sampling from Gibbs distributions during inference with LLMs.
      Python
      Apache License 2.0
      0310Updated Oct 2, 2024Oct 2, 2024
    • Python
      32400Updated Sep 27, 2024Sep 27, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      842000Updated Sep 26, 2024Sep 26, 2024
    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      108000Updated Sep 19, 2024Sep 19, 2024
    • Python
      76527Updated Aug 29, 2024Aug 29, 2024
    • doce

      Public
      This is the a repo of DOCE
      Jupyter Notebook
      Apache License 2.0
      0200Updated Aug 25, 2024Aug 25, 2024
    • DeepSPIN's submission to SIGMORPHON 2020
      Python
      MIT License
      1511Updated Jul 25, 2024Jul 25, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.3k000Updated Jul 15, 2024Jul 15, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.3k102Updated Jul 12, 2024Jul 12, 2024
    • SSHN

      Public
      Sparse and Structured Hopfield Networks
      Python
      MIT License
      0200Updated Jul 4, 2024Jul 4, 2024
    • entmax

      Public
      The entmax mapping and its loss, a family of sparse softmax alternatives.
      Python
      MIT License
      43407102Updated Jun 22, 2024Jun 22, 2024
    • COMET

      Public
      A Neural Framework for MT Evaluation
      Python
      Apache License 2.0
      76000Updated Jun 11, 2024Jun 11, 2024
    • robust-mt

      Public
      0000Updated Mar 6, 2024Mar 6, 2024
    • Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.
      Python
      MIT License
      21010Updated Feb 14, 2024Feb 14, 2024
    • 31711Updated Jan 16, 2024Jan 16, 2024
    • Code for alignment for the towerllm project.
      Python
      Apache License 2.0
      393100Updated Nov 29, 2023Nov 29, 2023
    • LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs
      C++
      MIT License
      84131Updated Nov 20, 2023Nov 20, 2023
    • Jupyter Notebook
      MIT License
      3700Updated Nov 10, 2023Nov 10, 2023
    • Shell
      0300Updated Oct 17, 2023Oct 17, 2023
    • Jupyter Notebook
      2700Updated Oct 9, 2023Oct 9, 2023
    • Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2023.
      Jupyter Notebook
      Apache License 2.0
      01600Updated Jul 19, 2023Jul 19, 2023
    • Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.
      Python
      Apache License 2.0
      32100Updated Jun 23, 2023Jun 23, 2023
    • Shell
      01400Updated Jun 13, 2023Jun 13, 2023
    • crest

      Public
      Code for CREST: A Joint Framework for Rationalization and Counterfactual Text Generation, accepted at ACL 2023.
      Python
      MIT License
      1700Updated May 29, 2023May 29, 2023
    • Python
      1500Updated May 28, 2023May 28, 2023
    • This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.
      Python
      MIT License
      41501Updated May 10, 2023May 10, 2023
    • Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
      Python
      MIT License
      6.4k000Updated Mar 3, 2023Mar 3, 2023
    • stopes

      Public
      A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
      Python
      MIT License
      37100Updated Feb 28, 2023Feb 28, 2023
    • Python
      MIT License
      1500Updated Nov 8, 2022Nov 8, 2022
    • Python
      61901Updated Oct 26, 2022Oct 26, 2022