Apify
Pinned Loading
Repositories
- crawlee Public
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee’s past year of commit activity - actor-vector-database-integrations Public
Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
apify/actor-vector-database-integrations’s past year of commit activity - airbyte Public Forked from airbytehq/airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
apify/airbyte’s past year of commit activity - crawlee-python Public
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee-python’s past year of commit activity - fingerprint-suite Public
Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
apify/fingerprint-suite’s past year of commit activity