Indeed Job Scraper

This project consists of two main components: scrape.py and app.py. The scrape.py script is responsible for scraping job listings from Indeed based on a specified query and location, while app.py provides a web interface to upload the scraped JSON data and display it in a tabular format with filtering and sorting capabilities.

Prerequisites

Python 3.x
Flask
Selenium (for scraping with selenium method)
BeautifulSoup, requests, and other dependencies

You can install the required Python packages using pip: pip install flask selenium beautifulsoup4 requests click

For Selenium, you will also need a WebDriver for your browser (e.g., ChromeDriver for Google Chrome).

Setup

Clone this repository.
Navigate to the project directory.
Ensure all dependencies are installed as mentioned above.
Set up a virtual environment if necessary.

scrape.py

Functionality

scrape.py is designed to scrape job listings from Indeed based on user-provided query and location parameters. It supports two scraping methods: requests and selenium. The script can authenticate if required, handles pagination through multiple pages of search results, extracts detailed features for each job listing, and saves the extracted data into a JSON file.

Usage

To run the scraper, use the following command: python scrape.py --query="your_query" --location="your_location" --method=[requests|selenium] --wait-time=seconds --auth

--query: The job search query (default is "software engineer").
--location: The location for the job search (default is "remote").
--method: The scraping method to use ('requests' or 'selenium') (default is "selenium").
--wait-time: Time in seconds to wait after loading each page for selenium-based scraping (default is 5).
--auth: A flag indicating whether to authenticate before scraping.

Example: python scrape.py --query="data scientist" --location="new york" --method=selenium --wait-time=10 --auth

app.py

Functionality

app.py is a Flask web application that allows users to upload the JSON files generated by scrape.py. It displays the job listings in a table format with filtering and sorting capabilities.

Usage

To run the Flask application, use the following command: flask run

The application will start on http://127.0.0.1:5000/ by default.

table.html

Example Display

The table.html file contains the HTML template for displaying job listings in a tabular format. It includes filters and sorting functionalities for each column except the last two (Indeed Link and Application Link).

Example of how the table might look:

#	Name	Company	Description	Salary	Remote	Requirements	City	State	Indeed Link	Application Link
1	Data Scientist	Tech Corp	Experienced data scientist needed.	$120,000+	Yes	- Python - SQL - Machine Learning	New York	NY	Indeed Link	Application Link
2	Software Eng.	Innovate Inc	Full-stack developer position open.	$100,000+	No	- JavaScript - React	San Jose	CA	Indeed Link	Application Link
3	Machine Lrnng	Smart Labs	ML Engineer with cloud experience.	$140,000+	Yes	- Python - AWS	Chicago	IL	Indeed Link	Application Link

Filters and sorting can be applied to each column by typing in the filter boxes or clicking on the column headers.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.idea		.idea
templates		templates
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
scrape.py		scrape.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Indeed Job Scraper

Table of Contents

Prerequisites

Setup

scrape.py

Functionality

Usage

app.py

Functionality

Usage

table.html

Example Display

About

Uh oh!

Releases

Packages

Uh oh!

Languages

n0hats/indeed_scraper

Folders and files

Latest commit

History

Repository files navigation

Indeed Job Scraper

Table of Contents

Prerequisites

Setup

scrape.py

Functionality

Usage

app.py

Functionality

Usage

table.html

Example Display

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages