Skip to content
Change the repository type filter

All

    Repositories list

    • miracl

      Public
      A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.
      Apache License 2.0
      416720Updated Jul 31, 2024Jul 31, 2024
    • Website for the MIRACL (Multilingual Information Retrieval Across a Continuum of Languages) Challenge at WSDM 2023
      HTML
      0400Updated Jun 25, 2024Jun 25, 2024
    • nomiracl

      Public
      NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 languages.
      Python
      Apache License 2.0
      21811Updated Mar 14, 2024Mar 14, 2024
    • hagrid

      Public
      A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution
      Apache License 2.0
      23010Updated Aug 2, 2023Aug 2, 2023