The app analyses the speech of each country at #UNGA79 (the 79th UN General Assembly, 2024). It presents:
- A summary of the speech
- A list of risks identified in the speech
- Other countries mentioned and the sentiment towards them
- A haiku
- An audio clip with Yoda's advice in Carl Sagan's voice
The process is the following:
```mermaid
flowchart LR
    data_collection(Download URLs) --> transcripts(Get transcripts)
    transcripts --> summary(LLM summary)
    transcripts --> haiku(LLM haiku)
    transcripts --> risks(LLM risks)
    transcripts --> other_countries(LLM countries mentioned)
    transcripts --> yoda(LLM yoda's advice)
    summary --> streamlit(Streamlit App)
    haiku --> streamlit(Streamlit App)
    risks --> streamlit(Streamlit App)
    other_countries --> streamlit(Streamlit App)
    yoda --> TTS(TTS audio)
    TTS --> streamlit(Streamlit App)
```
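The per-transcript part of this pipeline can be sketched as a set of prompt templates applied to each speech. The prompt wording and the `call_llm` helper below are illustrative placeholders, not the project's actual code:

```python
# Hypothetical sketch of the per-transcript LLM tasks from the flowchart.
# `call_llm` stands in for whatever LLM client the project actually uses.

PROMPTS = {
    "summary": "Summarise the following UNGA speech:\n{speech}",
    "haiku": "Write a haiku capturing the essence of this speech:\n{speech}",
    "risks": "List the risks identified in this speech:\n{speech}",
    "other_countries": (
        "List other countries mentioned and the sentiment towards them:\n{speech}"
    ),
    "yoda": "Give advice about this speech in the style of Yoda:\n{speech}",
}

def analyse_transcript(transcript: str, call_llm) -> dict:
    """Run every LLM task from the flowchart over one transcript."""
    return {
        task: call_llm(template.format(speech=transcript))
        for task, template in PROMPTS.items()
    }
```

The results of `analyse_transcript` for each country can then be saved and rendered by the Streamlit app.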
- Summary of the speech
- List of countries mentioned and sentiment
- Risks mentioned in the speech
- Top 3 most important ideas mentioned
- Advice in Yoda style
- Emotion Detection: Detect emotions in the speech, such as happiness, sadness, anger, or excitement
- Inference Generation: Use LLMs to generate inferences based on the speech content, such as predicting potential consequences of policy decisions or anticipating international reactions.
- Speech Emotional Arc Analysis: Analyze the emotional tone of the speech over time to identify potential shifts or arcs in sentiment. This can provide insight into the leader's communication strategy or audience engagement.
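As a rough illustration of the emotional-arc idea, one could score fixed-size windows of the speech with any sentiment model and track how the score moves over time. `score_sentiment` below is a placeholder for a real scorer (e.g. an LLM call), not part of this repository:

```python
# Sketch of an emotional-arc analysis: split the transcript into chunks,
# score each chunk, and return the sequence of scores over the speech.

def emotional_arc(transcript: str, score_sentiment, n_chunks: int = 10) -> list:
    """Split the speech into roughly n_chunks parts and score each one."""
    words = transcript.split()
    size = max(1, len(words) // n_chunks)
    chunks = [" ".join(words[i:i + size]) for i in range(0, len(words), size)]
    return [score_sentiment(chunk) for chunk in chunks]
```

Plotting the returned scores against chunk index would show shifts in tone across the speech.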
```sh
# Clone this repository
git clone https://github.com/darenasc/unga79.git
# Change directory to the repository
cd unga79
# Install dependencies
pip install pipenv
python3 -m pipenv install
python3 -m pipenv install --dev
# Activate the Python environment
pipenv shell
# Run the Streamlit app
streamlit run app/app.py
```

Note: Installing the TTS model is not needed to interact with the app, as the audio files are included in the repository in `app/audio/`.
For the voice generation I'm using F5-TTS. I generated a 10-second reference sample of Carl Sagan's voice and passed it the texts generated by the LLM.
The following is the process to install and run the script to generate the audios.
```sh
# Clone the repository
git clone https://github.com/SWivid/F5-TTS.git
cd F5-TTS
# Run the following two brew lines only if you are using an Apple Silicon chip
brew update
brew install ffmpeg
# Install torch in a second environment
pipenv install torch torchaudio
pipenv install -e .
# Launch the gradio GUI
f5-tts_infer-gradio
# Or run shell commands replacing the arguments between "<>" to generate audio
f5-tts_infer-cli \
    --model "F5-TTS" \
    --ref_audio "</YOUR/REFERENCE/AUDIO.wav>" \
    --ref_text "<The text in the reference audio to create the voice.>" \
    --gen_file "</THE/FILE/WITH/THE/CONTENT/TO/VOICE.txt>" \
    --output_dir "</OUTPUT/DIR>" \
    --output_file "<OUTPUT_FILE.wav>"
```