    Repositories list

    • Simple package to extract text with coordinates from programmatic PDFs
      C++
      MIT License
      82741 Updated Nov 22, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      34900 Updated Nov 22, 2024
    • docling
      Public
      Get your documents ready for gen AI
      Python
      MIT License
      51011k518 Updated Nov 22, 2024
    • A Python library to define and validate data types in Docling.
      Python
      MIT License
      103433 Updated Nov 21, 2024
    • Python
      MIT License
      104252 Updated Nov 20, 2024
    • Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
      C++
      MIT License
      42421 Updated Oct 23, 2024
    • Interact with the Deep Search platform for new knowledge explorations and discoveries
      Python
      MIT License
      19135811 Updated Oct 17, 2024
    • Running Docling as an API service
      Makefile
      MIT License
      31621 Updated Oct 11, 2024
    • CSS
      MIT License
      11000 Updated Oct 8, 2024
    • quackling
      Public archive
      Build document-native LLM applications
      Python
      MIT License
      15100 Updated Sep 11, 2024
    • Mognet is a fast, simple framework to build distributed applications using task queues.
      Python
      MIT License
      2901 Updated Aug 7, 2024
    • Examples using the Deep Search functionalities
      Python
      MIT License
      144704 Updated Aug 7, 2024
    • PatCID
      Public
      Python
      MIT License
      13330 Updated Aug 2, 2024
    • Python
      MIT License
      0600 Updated Jul 8, 2024
    • Python
      MIT License
      0700 Updated Jul 8, 2024
    • SemTabNet
      Public
      Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"
      Python
      MIT License
      0600 Updated Jul 1, 2024
    • .github
      Public
      0100 Updated Jun 24, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      0710 Updated Mar 25, 2024
    • Repository to detect scientific software in documents for Chan Zuckerberg Initiative workshop
      Python
      MIT License
      0200 Updated Oct 26, 2023
    • langchain
      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      15k100 Updated May 18, 2023
    • Website of the ICDAR 2023 DocLayNet competition
      1100 Updated Apr 26, 2023
    • DocLayNet
      Public
      DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
      Other
      1527630 Updated Feb 1, 2023
    • Example NLP Annotator API used for integrating with the IBM DeepSearch CPS platform
      Python
      Apache License 2.0
      31000 Updated Sep 8, 2022