YC Company Scraper is a Python script that fetches information about Y Combinator companies from specified batches, extracts GitHub repository links from their websites, and prints the data.
- Retrieve company information from various Y Combinator batches
- Scrape GitHub repository links from company websites
- Print the extracted data
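As a rough illustration of the link-scraping feature, the sketch below pulls GitHub repository URLs out of a page's HTML with a regular expression. The function name `extract_github_links` and the pattern are assumptions made for this example; the actual script may locate links differently.

```python
import re

# Hypothetical pattern: matches github.com/<owner>/<repo> links in raw HTML.
GITHUB_REPO_RE = re.compile(
    r"https?://(?:www\.)?github\.com/([A-Za-z0-9_.-]+)/([A-Za-z0-9_.-]+)"
)


def extract_github_links(html):
    """Return unique GitHub repository URLs found in html, in order of appearance."""
    found = []
    for owner, repo in GITHUB_REPO_RE.findall(html):
        # Drop trailing sentence punctuation and an optional ".git" suffix.
        repo = repo.rstrip(".")
        if repo.endswith(".git"):
            repo = repo[:-4]
        if not repo:
            continue
        url = f"https://github.com/{owner}/{repo}"
        if url not in found:
            found.append(url)
    return found
```

In practice the HTML would come from fetching a company's website (for example with `requests.get(url).text`); links to a bare organization page such as `github.com/acme` are skipped because they name no repository.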
- Python 3.6 or higher
- requests library
- Clone this repository:
git clone git@github.com:dirkjbreeuwer/yc-scraper.git
cd yc-scraper
- Install the requests library:
pip install requests
- Open yc_company_scraper.py and modify the batches list to include the desired Y Combinator batch codes.
- Run the script:
python yc_company_scraper.py
- For each company, the script prints the name, website, YC URL, and any GitHub repository links found on that company's website.
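The configuration and output steps above might look roughly like the following sketch. The batch codes, the dictionary keys, and the `format_company` helper are all hypothetical; check yc_company_scraper.py for the exact batch-code format and output layout it uses.

```python
# Hypothetical batch codes; the script may expect a different format.
batches = ["W21", "S21"]


def format_company(company):
    """Build the printed summary for one company record (a plain dict here)."""
    lines = [
        f"Name: {company['name']}",
        f"Website: {company['website']}",
        f"YC URL: {company['yc_url']}",
    ]
    links = company.get("github_links") or []
    if links:
        lines.extend(f"GitHub: {link}" for link in links)
    else:
        lines.append("GitHub: none found")
    return "\n".join(lines)


# Placeholder record for illustration only; not real scraped data.
example = {
    "name": "Example Co",
    "website": "https://example.com",
    "yc_url": "https://example.com/yc/example-co",
    "github_links": ["https://github.com/example/repo"],
}
print(format_company(example))
```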
This script is for educational purposes only. Always respect the websites' terms of service and robots.txt when scraping data.
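One way to honor robots.txt before fetching a page is Python's standard `urllib.robotparser` module. The sketch below is an assumption about how such a check could be added, not something the script is known to do; the helper name `is_allowed` is made up for this example.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Placeholder rules for illustration only.
rules = "User-agent: *\nDisallow: /private/\n"
print(is_allowed(rules, "https://example.com/about"))
print(is_allowed(rules, "https://example.com/private/page"))
```

Against a live site, `RobotFileParser.set_url()` followed by `read()` downloads the site's robots.txt instead of parsing a string.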