This project provides a flexible, customizable web scraping tool that uses a lexer and parser to process commands written in a custom script format (`.scrape` files). It lets users scrape data from websites with simple commands.
- Basic Scraping Commands: Extract data from websites and save it in different formats such as JSON, CSV, or XML.
- Customizable Options: Include user-agent strings, set delays, specify retry attempts, and use proxies or authentication headers.
- File-Based Command Input: Use `.scrape` files to define scraping tasks for better readability and reusability.
- Batch File Execution: Run the script seamlessly using a `.bat` file for ease of use.
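The exact `.scrape` syntax is defined by the project's own lexer and parser; the sketch below is only a rough illustration of that approach, tokenizing a hypothetical command line (the keywords, option names, and URL are invented for this example and are not the tool's actual grammar):

```python
import re

# Hypothetical command; the real .scrape grammar is defined by the project's parser.
EXAMPLE = 'scrape "https://example.com/products" save as json with delay 2 retries 3'

# Token specification: earlier patterns win when alternatives overlap.
TOKEN_SPEC = [
    ("STRING",  r'"[^"]*"'),       # quoted values such as URLs
    ("NUMBER",  r"\d+"),           # numeric option values (delay, retries, ...)
    ("KEYWORD", r"[A-Za-z_]\w*"),  # commands and option names
    ("SKIP",    r"\s+"),           # whitespace is discarded
]
MASTER_RE = re.compile("|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC))

def tokenize(line):
    """Yield (kind, value) pairs for a single command line."""
    pos = 0
    while pos < len(line):
        match = MASTER_RE.match(line, pos)
        if not match:
            raise SyntaxError(f"Unexpected character {line[pos]!r} at position {pos}")
        if match.lastgroup != "SKIP":
            yield match.lastgroup, match.group()
        pos = match.end()

if __name__ == "__main__":
    for kind, value in tokenize(EXAMPLE):
        print(f"{kind:8} {value}")
```

A parser would then consume these tokens to build a scraping task: the target URL, the output format (JSON, CSV, or XML), and options such as delay and retry count.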
Some advanced features tagged in the documentation, such as validation, filtering, and certain API-related options, are still in development and not yet functional. Please avoid using them, as they may lead to unexpected behavior.
The following advanced features are still in progress:
- `validate fields`
- `filter by`
- `using_api`
- `monitor`
- `parallel`
These features will be fully implemented in future updates.
To get started:

- Install Python (version 3.7 or higher). A quick version check is sketched after these steps.
- Install the dependencies: `pip install -r requirements.txt`
- Run the tool using `console.bat`.
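If it is not obvious which Python interpreter is on the PATH, a quick standard-library check like the one below (not part of the project itself) confirms the 3.7+ requirement before installing dependencies:

```python
import sys

# The tool targets Python 3.7 or higher; fail fast if the interpreter is older.
if sys.version_info < (3, 7):
    raise SystemExit("Python 3.7+ is required, found " + sys.version.split()[0])

print("Python version OK:", sys.version.split()[0])
```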