Skip to content

gl0bsec/get_gdelt_data

Repository files navigation

GDELT Data Collector

A small utility for fetching, filtering and saving GDELT event data.

Features

  • Download the Events table for a given date range
  • Filter events using simple text rules loaded from YAML or JSON
  • Save the resulting records as a Parquet file
  • Access via a minimal CLI or use the Python functions directly

Installation

pip install -r requirements.txt

Quick start (CLI)

python -m gdelt_data.cli 2024-01-01 2024-01-07 \
  --filters my_filters.yaml \
  --output events.parquet

Create a filters file with save_filter_rules_template() or craft your own to control which events are kept.

Library example

from datetime import datetime
from gdelt_data import collect_gdelt_data

collect_gdelt_data(
    start_date=datetime(2024, 1, 1),
    end_date=datetime(2024, 1, 7),
    output_file="events.parquet",
)

Building filters interactively

from gdelt_data import interactive_filter_builder
filters = interactive_filter_builder()

The returned dictionary can be passed to collect_gdelt_data or written to a YAML/JSON file for reuse.

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages