In this project, you will build an end-to-end data engineering pipeline for real-time stock market data using Apache Kafka.
It uses several technologies, including Python, Amazon Web Services (AWS), Apache Kafka, Glue, Athena, and SQL.
- Programming Language - Python
- Amazon Web Services (AWS)
  - S3 (Simple Storage Service)
  - Athena
  - Glue Crawler
  - Glue Catalog
  - EC2
- Apache Kafka
You can use any dataset; the focus here is on the operational side of data engineering, that is, building the data pipeline.
Here is the dataset used - https://github.com/mihirkudale/Stock-Market-Real-Time-Data-Engineering-Project/blob/main/indexProcessed.csv
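
Since any dataset works, one way to imitate a real-time feed is to pick rows from a CSV at random and emit each one as a JSON message. The sketch below is a minimal illustration under stated assumptions, not the project's actual producer: the function names `load_rows` and `simulate_stream`, the `send` callback, and the sample columns are all hypothetical, and the column names are not taken from `indexProcessed.csv`. In a real setup, `send` could wrap a Kafka producer's send call for a chosen topic; that wiring is left out so the example runs without a broker.

```python
import csv
import io
import json
import random
import time
from typing import Callable, Dict, List, Optional


def load_rows(csv_text: str) -> List[Dict[str, str]]:
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def simulate_stream(
    rows: List[Dict[str, str]],
    send: Callable[[bytes], None],
    n_events: int,
    delay_s: float = 0.0,
    seed: Optional[int] = None,
) -> int:
    """Pass n_events randomly chosen rows to `send` as UTF-8 JSON bytes.

    `send` is a hypothetical hook: any callable that accepts bytes,
    such as a thin wrapper around a Kafka producer.
    """
    if not rows:
        raise ValueError("rows must not be empty")
    rng = random.Random(seed)
    sent = 0
    for _ in range(n_events):
        row = rng.choice(rows)  # sample with replacement to mimic a live feed
        send(json.dumps(row).encode("utf-8"))
        sent += 1
        if delay_s:
            time.sleep(delay_s)  # optional pause between events
    return sent


# Hypothetical sample data; these columns are illustrative only.
SAMPLE_CSV = """Index,Date,Close
AAA,2021-01-04,100.5
BBB,2021-01-04,200.25
"""

if __name__ == "__main__":
    sample_rows = load_rows(SAMPLE_CSV)
    # Print each message instead of publishing it to a broker.
    simulate_stream(
        sample_rows,
        send=lambda payload: print(payload.decode("utf-8")),
        n_events=3,
        seed=0,
    )
```

Injecting `send` as a callback keeps the sampling logic separate from the transport, so the same loop can be tested locally and later pointed at a Kafka topic.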