NodeJS robots.txt parser with support for wildcard (*) matching.
-
Updated
Oct 28, 2024 - JavaScript
NodeJS robots.txt parser with support for wildcard (*) matching.
Simple robots.txt template. Keep unwanted robots out (disallow). White lists (allow) legitimate user-agents. Useful for all websites.
Go robots.txt parser
Alternative robots parser module for Python
A lightweight robots.txt parser for Node.js with support for wildcards, caching and promises.
Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping
The repository contains Google-based robots.txt parser and matcher as a C++ library (compliant to C++17).
Fully native robots.txt parsing component without any dependencies.
Parsers for robots.txt (aka Robots Exclusion Standard / Robots Exclusion Protocol), Robots Meta Tag, and X-Robots-Tag
Add a description, image, and links to the robots-exclusion-standard topic page so that developers can more easily learn about it.
To associate your repository with the robots-exclusion-standard topic, visit your repo's landing page and select "manage topics."