mp2anki: Audio lesson summarizer and Anki card generator

This Python script automatically transcribes audio/video recordings of lessons (e.g., lectures, podcasts), generates summaries, and creates Anki flashcards for key vocabulary terms using the Google Gemini API and the FasterWhisper speech-to-text model.

Features

Automatic Transcription: Uses FasterWhisper to convert audio or video files to text.
Vocabulary Extraction and Anki Flashcard Generation: Extracts key vocabulary terms from the Whisper transcript using the Gemini API and formats them as Anki flashcards that can be easily imported into Anki.
Lesson Summary Creation: Generates a comprehensive summary of the lesson content using the Gemini API.
Organized Output: Saves Anki flashcard decks and summaries as separate files for easy access.
Fire & Forget: Drop in multiple recordings and get all your output files when the script finishes.

Dependencies

Python 3.8 or higher
google-generativeai: For interacting with the Google Gemini API. Install with:
```
pip install google-generativeai
```
Preferably, install it inside a Python virtual environment.
FasterWhisperXXL: A fast and accurate speech-to-text model. Follow the installation instructions from the official repository: https://github.com/Purfview/whisper-standalone-win
ffmpeg: Used by Whisper. Install on Debian-based systems with:
```
sudo apt update && sudo apt install ffmpeg
```
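Before a first run, it can help to confirm that ffmpeg is actually reachable. The following is a minimal sketch, assuming ffmpeg is expected on your PATH; the helper name find_missing_tools is hypothetical and not part of mp2anki.py.

```python
import shutil


def find_missing_tools(tools):
    """Return the names in `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = find_missing_tools(["ffmpeg"])
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("All required tools found.")
```

An empty list means every tool was found. The FasterWhisper executable is not checked here because it is located through the LOCATION variable rather than PATH.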

Setup

API key:
- Obtain a Google AI Studio API key.
- Set the API_KEY variable in the script to your API key.
Project ID:
- Find your Google Cloud project ID (shown on the same page as your API key in Google AI Studio).
- Set the PROJECT_ID variable in the script.
Main directory:
- Set MAIN_DIR to the directory where you want to store the audio files, transcripts, Anki flashcards, and summaries.
FasterWhisper executable path:
- Set LOCATION in the script to the path to your FasterWhisper executable (e.g., /path/to/FasterWhisper/main).
Audio or video files:
- Place your audio or video files in the MAIN_DIR.
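
The settings above can be sanity-checked before launching a long run. This is only a sketch: the variable names API_KEY, PROJECT_ID, MAIN_DIR and LOCATION come from this README, while check_config is a hypothetical helper that does not exist in mp2anki.py, and the values shown are placeholders.

```python
from pathlib import Path


def check_config(api_key, project_id, main_dir, location):
    """Return a list of human-readable configuration problems (empty if none)."""
    problems = []
    if not api_key:
        problems.append("API_KEY is empty")
    if not project_id:
        problems.append("PROJECT_ID is empty")
    if not Path(main_dir).is_dir():
        problems.append(f"MAIN_DIR is not a directory: {main_dir}")
    if not Path(location).is_file():
        problems.append(f"LOCATION is not a file: {location}")
    return problems


if __name__ == "__main__":
    # Placeholder values for illustration only; use your own settings.
    for problem in check_config("", "", "/path/to/lessons", "/path/to/FasterWhisper/main"):
        print("Config problem:", problem)
```

This only checks that the values are present and that the paths exist. It does not verify that the API key is valid.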

Usage

Add your files to MAIN_DIR:
- They can be in any video or audio format. Rename them according to their content first, since the output files are named after each recording. All media files in this directory will be processed. You can use subdirectories to store files between runs.
Run the script:
```
python mp2anki.py
```
Output:
- For each media file, the script will:
  - Transcribe the audio using Whisper.
  - Generate Anki flashcards and a lesson summary using the Gemini API.
  - Save the Anki flashcards to a file named Anki_[recording_filename].txt.
  - Save the lesson summary and the key concepts to review to a file named Summary_key_concepts_[recording_filename].md.
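
To preview which files a run would pick up, the "all media files in this directory" rule can be approximated as below. This is a sketch under assumptions: it guesses the media type from the file extension with Python's mimetypes module, which may not match how mp2anki.py actually selects files, and list_media_files is a hypothetical name.

```python
import mimetypes
from pathlib import Path


def list_media_files(main_dir):
    """Return audio/video files directly inside main_dir (subdirectories are skipped)."""
    media = []
    for path in sorted(Path(main_dir).iterdir()):
        if not path.is_file():
            continue  # subdirectories are left alone
        mime, _ = mimetypes.guess_type(path.name)
        if mime and mime.split("/")[0] in ("audio", "video"):
            media.append(path)
    return media


if __name__ == "__main__":
    for media_file in list_media_files("."):
        print(media_file.name)
```

Only the top level of the directory is scanned, which matches the idea of parking files in subdirectories between runs.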

Notes

The script uses a prompt template to guide the Gemini API in generating the desired outputs. You can customize this template if needed. If you do, also update the markers the script uses to split the API response into multiple files, since they may change depending on your template.
The AI Studio API occasionally returns errors. This is relatively rare and will hopefully be fixed in the future.
Make sure you have configured your Google Cloud project and credentials correctly for the Gemini API.
The accuracy of the transcription, the quality of the summaries and the relevance of the Anki cards depend on the quality of the audio recording and the performance of the Whisper and Gemini models. Please check the cards manually before importing them into Anki.
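
To illustrate why the markers matter when you customize the template, here is a minimal sketch of marker-based splitting. The marker strings and the function name split_response are made up for this example; the real markers are whatever mp2anki.py and its prompt template define, and the sketch assumes the cards section comes before the summary.

```python
def split_response(text, cards_marker, summary_marker):
    """Split a model response into (cards, summary) using two marker strings.

    Assumes the cards section comes first and each marker appears once.
    Raises ValueError if a marker is missing or the order is unexpected.
    """
    cards_start = text.find(cards_marker)
    summary_start = text.find(summary_marker)
    if cards_start == -1 or summary_start == -1:
        raise ValueError("marker not found in response")
    if summary_start < cards_start:
        raise ValueError("markers appear in unexpected order")
    cards = text[cards_start + len(cards_marker):summary_start].strip()
    summary = text[summary_start + len(summary_marker):].strip()
    return cards, summary


if __name__ == "__main__":
    # Hypothetical markers and response text, for illustration only.
    response = "###CARDS###\nterm;definition\n###SUMMARY###\nThe lesson covered X.\n"
    cards, summary = split_response(response, "###CARDS###", "###SUMMARY###")
    print(cards)
    print(summary)
```

If a customized template makes the model emit different headings, a split like this fails with an error instead of silently writing everything to one file, which is why the markers and the template need to change together.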

Example

If you have an audio file named lesson1.mp3 in your MAIN_DIR, the script will create the following files:

Anki_lesson1.txt, containing the Anki flashcards, which can be imported into Anki as a TXT file.
Summary_key_concepts_lesson1.md, containing the lesson summary and the key concepts of the lesson.
lesson1.txt, containing the transcription created by Whisper.
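
The naming pattern in this example can be expressed as a small function. This is a sketch of the convention described in this README, not code taken from mp2anki.py; output_names is a hypothetical helper.

```python
from pathlib import Path


def output_names(recording):
    """Return the three output filenames derived from a recording's filename."""
    stem = Path(recording).stem  # "lesson1.mp3" -> "lesson1"
    return {
        "cards": f"Anki_{stem}.txt",
        "summary": f"Summary_key_concepts_{stem}.md",
        "transcript": f"{stem}.txt",
    }


if __name__ == "__main__":
    for kind, name in output_names("lesson1.mp3").items():
        print(kind, "->", name)
```

Because every output name is built from the recording's filename, renaming recordings before a run is the simplest way to keep the generated decks and summaries easy to identify.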
