Skip to content

Repository for testing and demonstrating the capabilities of Docling for document conversion.

License

Notifications You must be signed in to change notification settings

Wiminds/Docling.Testing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Docling logo

Docling Testing

This repository demonstrates how to use Docling for document conversion (PDF, HTML, etc.) to structured formats like Markdown and JSON.

The use I have for docling is particularly for tables.

Installation

  1. Clone the repository:
git clone https://github.ibm.com/Quentin-Lefevre/docling-testing.git

cd docling-testing
  1. Install dependencies:
pip install -r requirements.txt

Usage

Run examples in the examples/ folder to test different document types:

docling /copy/the/path/assets/maintenance-auto.pdf --to md --no-ocr

docling /copy/the/path/assets/meteo_montpellier.html --to md --ocr

Note that the ‘ocr’ option almost never works with pdf files...

Jupyter Notebook

Use the notebook in notebook/guide.ipynb to see my tests and results.