API Data Extraction with Python
This project demonstrates a simple and common Data Engineering pattern:
extracting data from a REST API, transforming it into a structured format, and storing it as a Parquet file.
Technologies Used
- Python
- Requests
- Pandas
- PyArrow (Parquet)
What this project does
- Consumes a public REST API
- Validates the HTTP response
- Converts JSON data into a Pandas DataFrame
- Adds extraction metadata
- Stores the data in Parquet format