feat: Add PDF Table Extract Tool #127

sachinspanicker · 2024-11-27T11:15:15Z

PDF Table Extract Tool

Description

Add new PDFTableExtractTool for extracting tables from PDF documents and converting them to markdown format.

Features

Extract tables from PDF documents
Convert tables to markdown format
Handle multiple tables and large tables
Support both sync and async operations
Comprehensive error handling

Implementation

Added PDFTableExtractTool class
Added comprehensive test suite
Added documentation with usage examples
Implemented proper error handling
Added type hints and docstrings

Dependencies

Added to pyproject.toml:

PyMuPDF
pandas
tabulate

Testing

All tests passing:

Basic functionality
Error handling
Edge cases
Async operations

Documentation

Added detailed README
Added usage examples
Added inline documentation

- Add PDFTableExtractTool for extracting tables from PDFs - Convert extracted tables to markdown format - Add comprehensive test suite - Add documentation and usage examples - Handle edge cases and error conditions - Support both sync and async operations

joaomdmoura · 2024-12-05T14:57:57Z

Looks good but missing a init import if you dont mind adding it :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add PDF Table Extract Tool #127

feat: Add PDF Table Extract Tool #127

sachinspanicker commented Nov 27, 2024

joaomdmoura commented Dec 5, 2024

feat: Add PDF Table Extract Tool #127

Are you sure you want to change the base?

feat: Add PDF Table Extract Tool #127

Conversation

sachinspanicker commented Nov 27, 2024

PDF Table Extract Tool

Description

Features

Implementation

Dependencies

Testing

Documentation

joaomdmoura commented Dec 5, 2024