Justia Lawyer Scraper

This project is a web scraper that extracts lawyer information from Justia's directory of family law attorneys in Chicago, Illinois. The scraper uses Selenium to navigate through multiple pages and collect details such as names, profile links, phone numbers, images, descriptions, and consultation availability.

Features

Automated Pagination: Scrapes all available pages.
Data Extraction: Collects lawyer details (name, profile link, phone, website, etc.).
CSV Export: Saves the scraped data into justia_lawyers_selenium.csv.
Error Handling: Handles missing elements to avoid crashes.
Headless Mode: Runs without opening a browser window.
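
As a rough illustration of the extraction, error-handling, and CSV features listed above, the sketch below builds one row per lawyer card and fills in an empty string when a field is missing. It is not the project's actual code: the CSS selectors, the field names, and the helper names (first_text, parse_card, save_rows) are hypothetical placeholders, and only the output filename comes from this README.

```python
CSS = "css selector"  # the string value of Selenium's By.CSS_SELECTOR

# Hypothetical selectors: placeholders, not taken from the real Justia markup.
FIELD_SELECTORS = {
    "name": ".lawyer-name",
    "phone": ".lawyer-phone",
    "description": ".lawyer-description",
}


def first_text(card, selector, default=""):
    """Return the stripped text of the first match, or `default` if none."""
    # find_elements (plural) returns an empty list instead of raising
    # NoSuchElementException, so a missing field cannot crash the run.
    matches = card.find_elements(CSS, selector)
    return matches[0].text.strip() if matches else default


def parse_card(card):
    """Build one output row (a dict) from a single lawyer card element."""
    return {field: first_text(card, sel) for field, sel in FIELD_SELECTORS.items()}


def save_rows(rows, path="justia_lawyers_selenium.csv"):
    """Write the collected rows to a CSV file with pandas."""
    import pandas as pd  # imported lazily so the helpers above work without pandas

    pd.DataFrame(rows).to_csv(path, index=False)
```

Here card can be any Selenium WebElement that wraps one directory entry. Using find_elements rather than find_element trades an exception for an empty list, which keeps the per-field logic to a single line.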

🛠️ Installation & Setup

1️⃣ Clone the Repository

git clone https://github.com/yourusername/justia-scraper.git
cd justia-scraper

2️⃣ Install Dependencies

Ensure you have Python 3.x installed. Then, install the required packages:

pip install selenium webdriver-manager pandas

3️⃣ Run the Scraper

python justia_scrape.py

📂 Project Structure

justia-scraper/
│── justia_scrape.py             # Main scraping script
│── justia_lawyers_selenium.csv  # Output file (generated after running the script)
│── README.md                    # Documentation
│── requirements.txt             # List of dependencies

📝 Dependencies

selenium → For browser automation
webdriver-manager → Auto-downloads the correct ChromeDriver
pandas → For saving scraped data in CSV format

To install everything listed in requirements.txt in one step, run:

pip install -r requirements.txt
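
To show how selenium and webdriver-manager typically fit together, here is a minimal sketch of starting headless Chrome. It assumes Selenium 4 and a reasonably recent Chrome. The function names (chrome_arguments, build_driver) and the window-size flag are illustrative choices, not necessarily what the project's script does.

```python
def chrome_arguments(headless=True):
    """Return the Chrome command-line flags for a scraping session."""
    args = ["--window-size=1920,1080"]
    if headless:
        # "--headless=new" selects Chrome's current headless mode (Chrome 109+).
        args.append("--headless=new")
    return args


def build_driver(headless=True):
    """Start Chrome with a ChromeDriver binary fetched by webdriver-manager."""
    # Imported here so chrome_arguments() can be used without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.chrome.service import Service
    from webdriver_manager.chrome import ChromeDriverManager

    options = Options()
    for arg in chrome_arguments(headless):
        options.add_argument(arg)

    # install() downloads a matching ChromeDriver if needed and returns its path.
    service = Service(ChromeDriverManager().install())
    return webdriver.Chrome(service=service, options=options)
```

A caller would use driver = build_driver(), load pages with driver.get(url), and call driver.quit() in a finally block so the browser process is always closed.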

Notes

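The scraper runs Chrome in headless mode, so no browser window opens and runs use fewer resources.
Google Chrome must be installed on your system; webdriver-manager downloads ChromeDriver, not the browser itself.
IP Blocking Warning: Running the scraper too often or too quickly may get your IP address blocked. Consider using proxies if needed.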
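
One common way to lower the risk of blocking is to pause for a random interval between pages. The sketch below combines that with a simple pagination loop. It is an assumption-laden example rather than the project's implementation: scrape_page is a hypothetical callback that returns the rows for a given page number, the stop condition (an empty page) is a guess at how the last page shows up, and the 2 to 5 second delay is an arbitrary default.

```python
import random
import time


def scrape_all_pages(scrape_page, delay_range=(2.0, 5.0), max_pages=100,
                     sleep=time.sleep):
    """Call scrape_page(1), scrape_page(2), ... and collect all rows.

    Stops at the first page that yields no rows, or after max_pages.
    """
    rows = []
    for page in range(1, max_pages + 1):
        page_rows = scrape_page(page)
        if not page_rows:
            break  # assumed end of the directory listing
        rows.extend(page_rows)
        # Random pause so requests are not sent at a fixed, rapid rhythm.
        sleep(random.uniform(*delay_range))
    return rows
```

The sleep parameter exists only so the delay can be swapped out when testing; in normal use the default time.sleep is what you want.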

Future Enhancements

Add proxy rotation to avoid detection.
Improve error handling and logging.
Support other lawyer categories or cities.

Developed by Yuri P. 🚀
