Skip to content
/ PARL Public

"PARL" offers advanced features, including real-time Speech to Text with high accuracy, Topic Modeling, Sentiment Analysis, and Text Summarization.

Notifications You must be signed in to change notification settings

ManciSee/PARL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

80 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PARL: Processing Analysis Real Time Language Application

This thesis represents my bachelor’s project in Computer Science at the University of Catania, focusing on the TAP course taught by Professor Salvatore Nicotra. “PARL” is an innovative solution aimed at enhancing the experience of speakers and listeners through real-time Speech to Text and advanced analysis of audio files. Implemented on a containerized distributed architecture with Docker, the platform leverages open-source technologies such as Kafka, Spark, Fluent Bit, Elasticsearch, and Kibana to optimize the process.

"PARL" offers advanced features, including real-time Speech to Text with high accuracy, Topic Modeling, Sentiment Analysis, and Text Summarization. Targeted at a diverse audience, including professionals in the audiovisual industry, journalists, academic researchers, and businesses, the project promises to significantly improve the experience by utilizing stream processing tools. These tools enable real-time processing of audio signals and the execution of operations for text comprehension and analysis, opening new opportunities for semantic and sentiment analysis in the field of information and communication.

ℹ️ How to use

  1. Clone the repository
git clone https://github.com/ManciSee/PARL.git
cd folder_name
  1. Install the requirements that are inside the transcript folder
cd transcript
pip install -r requirements.txt
  1. Build and run the docker container
cd folder_name
docker compose-up -d
  1. Run the Flask server
cd transcript
python3 transcript.py
  1. Open the server on the port 8880
  2. Before starting the recording, choose the correct language in the html file or upload the WAV file
  3. Enjoy!

⚠️ Disclaimer

If the docker compose doesn't work, run the container in this order:

  1. Zookeper
  2. Kafka
  3. Fluent Bit
  4. Elasticsearch
  5. Spark
  6. Kibana

If your browser doesn't support the Web Speech API, change the browser... Unfortunately at the moment only Google Chrome (not Brave or other) seems to be working at its best.


After downloading the repository, before being able to start everything, it is necessary to change the various ports and IP addresses of the server. In some files, sections of code for localhost have been commented.

🎙️ Demo

The first video demonstrates using the microphone for accurate voice transcriptions, while the second video showcases uploading a WAV file for a quick transcription. Both videos have been sped up for more efficient viewing. Remember to choose the appropriate Whisper model for your needs to optimize transcription based on your desired speed and accuracy.

Mini_PARL_Mic.mp4
Mni_PARL_Video.mp4

👥 License

GNU GPLv3

About

"PARL" offers advanced features, including real-time Speech to Text with high accuracy, Topic Modeling, Sentiment Analysis, and Text Summarization.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published