Skip to content

Real-Time ChatGPT-Focused Tweets Analysis using Big Data Technologies ( kafka, zookeeper, apache nifi, elasticsearch, kibana)

Notifications You must be signed in to change notification settings

Mouez-Yazidi/ChatGPT-tweets-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Real-Time ChatGPT-Focused Tweets Analysis using Big Data Technologies

Overview

This pipeline is designed for real-time analysis of tweets specifically discussing ChatGPT, utilizing big data technologies for efficient processing and analysis. The pipeline ingests tweets in real-time using Apache Kafka, which allows for high-throughput data collection. The tweets are then stored in Elasticsearch for efficient indexing and querying. Apache NiFi is used for data flow management, ensuring that the tweets are properly routed and transformed as they move through the pipeline. The tweets are and visualized using Kibana for easy understanding and insights. With this pipeline, users can quickly and effectively analyze large volumes of ChatGPT-focused tweets in real-time.

An example image

Pre-Requisites

Usage

Clone the projet:

git clone https://github.com/Mouez-Yazidi/ChatGPT-tweets-analysis.git
cd ChatGPT-tweets-analysis

Start the cluster:

docker-compose -f 'docker-compose-expose.yml' up --build

after running the cluster, you can check the running containers using:

docker ps
container name url
apache nifi https://localhost:8443
kibana https://localhost:5601
elastisearch https://localhost:9200
kafka https://localhost:9092

About

Real-Time ChatGPT-Focused Tweets Analysis using Big Data Technologies ( kafka, zookeeper, apache nifi, elasticsearch, kibana)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published