Skip to content

aeleraqi/GoogleNewsScraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Google News Scraper 📰

Language Stars

A Python notebook for extracting news articles from Google News — built for journalists, researchers, and data scientists.

📖 About

GoogleNewsScraper automates the retrieval of news articles from Google News based on custom queries, date ranges, and topics. It outputs structured data ready for analysis, NLP processing, or journalism workflows.

✨ Features

  • Search Google News by keyword, topic, or category
  • Filter results by date range and language
  • Extract headlines, URLs, source names, and publication dates
  • Export to CSV/JSON for downstream analysis
  • Configurable delays to respect rate limits

🚀 Getting Started

git clone https://github.com/aeleraqi/GoogleNewsScraper.git
cd GoogleNewsScraper
pip install -r requirements.txt
jupyter notebook GoogleNewsScraper.ipynb

💡 Example

query = "artificial intelligence"
articles = scrape_google_news(query, max_results=50, language="en")
# Returns DataFrame with title, url, source, date

🧰 Requirements

  • Python 3.8+
  • requests, BeautifulSoup4, pandas

Author: Amr Eleraqi — Data Analyst | NLP Specialist | Machine Learning Expert | Educator
Affiliation: Toronto Metropolitan University, Ontario, Canada
ORCID GitHub

About

Google News Scraper, a Python notebook designed to extract news articles from Google News.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors