Robots.txt parser / generator
Updated Sep 18, 2018 - TypeScript
Front-end workflow to start a new project with Eleventy and Webpack; generates a robots.txt.
The repository contains a Google-based robots.txt parser and matcher as a C++ library (compliant with C++17).
🚫🤖 Overrides /robots.txt to disallow all web crawlers, regardless of the settings stored in the database. Compatible with Liferay 7.0, 7.1, 7.2, 7.3, and 7.4.
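For reference, the blanket-disallow robots.txt that such an override serves is only two lines:

```
User-agent: *
Disallow: /
```

A `Disallow: /` rule under the wildcard `User-agent: *` group tells every compliant crawler to skip the entire site.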
This is a Python crawler that disregards robots.txt rules and downloads disallowed resources.
A tool for debugging robots.txt
🌐 A Google Chrome extension that displays the contents of a website's robots.txt and sitemap.xml files.
🤖 Robots.txt generator done right.
A lightweight and simple robots.txt parser in node
A simple Python program that finds the robots.txt file of any website.
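Honoring a site's robots.txt does not require a third-party library in Python: the standard library ships `urllib.robotparser`. A minimal offline sketch (rules supplied inline via `parse()` rather than fetched over the network; the sample rules are illustrative):

```python
import urllib.robotparser

# Sample rules, as they would appear in a site's /robots.txt.
rules = """\
User-agent: *
Disallow: /private/
""".splitlines()

rp = urllib.robotparser.RobotFileParser()
rp.parse(rules)

print(rp.can_fetch("*", "/private/secret.html"))  # False
print(rp.can_fetch("*", "/index.html"))           # True
```

In a real crawler you would call `rp.set_url("https://example.com/robots.txt")` followed by `rp.read()` instead of `parse()`, and check `can_fetch()` before each request.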
A lightweight crawler frontier implementation in TypeScript using Redis.
This is a collection of robots.txt templates
'noindex' is a movement for drawing soft boundaries on the internet for search engines and generative AI crawlers.
A small, tested, no-frills parser of robots.txt files in Swift.
Sitemaps and Robots.txt for websites around the world.
Fully native robots.txt parsing component without any dependencies.
A simple-to-use, multi-threaded web crawler written in C with libcURL and Lexbor.
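Several of the entries above are robots.txt generators rather than parsers. The core of such a tool can be sketched as a function that maps user agents to disallowed paths and renders the grouped text format; the function name and rule shape below are hypothetical, not taken from any listed project:

```python
def generate_robots_txt(rules: dict[str, list[str]], sitemap=None) -> str:
    """Render a robots.txt from a mapping of user agent -> disallowed paths."""
    groups = []
    for agent, paths in rules.items():
        lines = [f"User-agent: {agent}"]
        # An empty "Disallow:" value means nothing is disallowed for this agent.
        lines += [f"Disallow: {p}" for p in paths] if paths else ["Disallow:"]
        groups.append("\n".join(lines))
    if sitemap:
        groups.append(f"Sitemap: {sitemap}")
    # Groups are separated by a blank line, per the robots.txt convention.
    return "\n\n".join(groups) + "\n"


print(generate_robots_txt(
    {"*": ["/admin/", "/tmp/"], "GPTBot": ["/"]},
    sitemap="https://example.com/sitemap.xml",
))
```

Real generators add per-agent `Allow` rules, `Crawl-delay`, and validation on top, but the output format is this simple.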