Skip to content
Change the repository type filter

All

    Repositories list

    • crawlee

      Public
      Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
      TypeScript
      Apache License 2.0
      63815k11214Updated Sep 30, 2024Sep 30, 2024
    • Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
      Python
      Apache License 2.0
      2544k695Updated Sep 30, 2024Sep 30, 2024
    • Utilities and constants shared across Apify projects.
      TypeScript
      Apache License 2.0
      101242Updated Sep 30, 2024Sep 30, 2024
    • apify-cli

      Public
      Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
      TypeScript
      18121357Updated Sep 30, 2024Sep 30, 2024
    • Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
      Python
      Apache License 2.0
      4200Updated Sep 30, 2024Sep 30, 2024
    • openapi

      Public
      An OpenAPI specification for the Apify API.
      JavaScript
      MIT License
      02163Updated Sep 30, 2024Sep 30, 2024
    • airbyte

      Public
      Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
      Python
      Other
      4k000Updated Sep 30, 2024Sep 30, 2024
    • Apify extractor for Keboola Connection
      JavaScript
      Apache License 2.0
      0051Updated Sep 30, 2024Sep 30, 2024
    • Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
      TypeScript
      Apache License 2.0
      959111812Updated Sep 30, 2024Sep 30, 2024
    • workflows

      Public
      Apify's reusable github workflows
      3623Updated Sep 29, 2024Sep 29, 2024
    • This project is the home of Apify's documentation.
      API Blueprint
      Apache License 2.0
      73266424Updated Sep 27, 2024Sep 27, 2024
    • This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
      0074Updated Sep 26, 2024Sep 26, 2024
    • Apify API client for JavaScript / Node.js.
      JavaScript
      Apache License 2.0
      2765165Updated Sep 25, 2024Sep 25, 2024
    • The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
      Python
      Apache License 2.0
      1211591Updated Sep 25, 2024Sep 25, 2024
    • Base Docker images for Apify actors.
      Dockerfile
      Apache License 2.0
      226992Updated Sep 24, 2024Sep 24, 2024
    • This project is the 🏠 home of Apify actor template projects to help users quickly get started.
      Python
      152571Updated Sep 24, 2024Sep 24, 2024
    • Retrieve website content from the top Google Search Results Pages (SERPs)
      0001Updated Sep 24, 2024Sep 24, 2024
    • The official integration for Apify and Haystack 2.0
      Python
      Apache License 2.0
      0100Updated Sep 23, 2024Sep 23, 2024
    • Apify API client for Python
      Python
      Apache License 2.0
      114680Updated Sep 23, 2024Sep 23, 2024
    • A Homebrew tap for Apify tools
      Ruby
      1803Updated Sep 19, 2024Sep 19, 2024
    • TBD
      TypeScript
      MIT License
      0001Updated Sep 18, 2024Sep 18, 2024
    • Apify SDK monorepo
      TypeScript
      Apache License 2.0
      3111997Updated Sep 13, 2024Sep 13, 2024
    • Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
      JavaScript
      Apache License 2.0
      140839911Updated Sep 12, 2024Sep 12, 2024
    • This action simplify creating of release PR
      JavaScript
      Apache License 2.0
      0010Updated Sep 12, 2024Sep 12, 2024
    • idcac

      Public
      I Don't Care About Cookies extension compiled for use with Playwright/Puppeteer
      JavaScript
      GNU General Public License v3.0
      0801Updated Sep 9, 2024Sep 9, 2024
    • An example repository with multiple Apify Actors sharing code between each other.
      JavaScript
      5111Updated Sep 6, 2024Sep 6, 2024
    • Custom Algolia search modal for Apify Documentation.
      TypeScript
      MIT License
      1002Updated Sep 5, 2024Sep 5, 2024
    • Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
      TypeScript
      25000Updated Sep 4, 2024Sep 4, 2024
    • Apify integration for Zapier
      JavaScript
      Apache License 2.0
      1850Updated Aug 27, 2024Aug 27, 2024
    • An example Actor using Standby mode
      Dockerfile
      0001Updated Aug 14, 2024Aug 14, 2024