What's Changed
- Fix "httpx"-related ReadErrors in
es_connector
by @Criamos in #113 - Merge recent HTTPX-related fixes into
master
by @Criamos in #114 - Improved Exception Handling during website-screenshot fallback and several fixes for
pydantic
ValidationErrors by @Criamos in #115 - Merge fixes from PR 115 into
master
by @Criamos in #116 - Feat: Planet-N crawler // update GitHub workflows by @Criamos in #117
- feat: (optional) OER-Filter Pipeline by @Criamos in #118
- Portal Globales Lernen & updated DocStrings by @Criamos in #119
- feat: parse robots.txt for AI usage indicators ("ccm:ai_allow_usage") by @Criamos in #120
- Upgrade to Python 3.13 and Scrapy v2.12 / feat: robots.txt parsing for "ccm:ai_allow_usage" by @Criamos in #121
- Merge develop into master by @Criamos in #122
- Update headless browser and planet_n_spider v0.0.3 by @Criamos in #123
- Merge PR 123 from develop into master by @Criamos in #124
Full Changelog: v2024.09.04...v2024.12.18