The YouTube Slide Extractor is a desktop application, built with Python and Tkinter, that extracts slides (key frames) from a YouTube video based on a given frame interval and similarity threshold, then generates a PDF from the extracted slides.

divA2805/LectureCapture

🎞️ LectureCapture

Automatically extract meaningful lecture slides from YouTube videos using AI and generate a clean PDF.
Save time, skip screenshots, and get organized notes with a single click.


📌 Features

  • 🎥 Download lectures directly from YouTube
  • 🧠 Detect and extract unique slides using deep frame comparison
  • 🔍 Integrated OCR for reading slide content
  • 📄 Auto-generate a clean PDF of extracted slides
  • 🖼️ GUI built with Tkinter
  • 📊 Progress bar with real-time updates
  • 🧵 Threaded download & extraction (non-blocking)
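The README does not show how app.py keeps the window responsive, so the following is only a minimal sketch of one common pattern: the long-running work runs on a worker thread and reports progress through a queue that the GUI polls. The names `run_in_background`, `drain`, and `fake_extraction` are hypothetical and are not taken from the repository.

```python
import queue
import threading
from typing import Callable, List


def run_in_background(task: Callable[[Callable[[int], None]], None],
                      progress_queue: "queue.Queue[int]") -> threading.Thread:
    """Start `task` on a worker thread; the task reports percentages via the queue."""
    def report(percent: int) -> None:
        progress_queue.put(percent)

    worker = threading.Thread(target=task, args=(report,), daemon=True)
    worker.start()
    return worker


def drain(progress_queue: "queue.Queue[int]") -> List[int]:
    """Collect every pending progress value without blocking."""
    values: List[int] = []
    while True:
        try:
            values.append(progress_queue.get_nowait())
        except queue.Empty:
            return values


def fake_extraction(report: Callable[[int], None]) -> None:
    """Hypothetical stand-in for the real download and slide extraction."""
    for percent in (25, 50, 75, 100):
        report(percent)
```

In a Tkinter app, `drain` would typically be called from a function that reschedules itself with `root.after(...)` and updates a `ttk.Progressbar`, since widgets should only be touched from the main thread.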

🧰 Built With

Python Tkinter Pillow OpenCV Tesseract OCR ReportLab yt-dlp

📂 Project Structure

LectureCapture/
│
├── slide_extractor.py       # Core logic for detecting unique slides
├── app.py                   # Tkinter GUI interface
├── requirements.txt         # All dependencies
├── output/                  # Folder for storing extracted slides and final PDF
└── README.md

⚙️ Setup Instructions

  1. Clone or download the repository to your local machine:

    git clone https://github.com/divA2805/LectureCapture.git
    cd LectureCapture
  2. Make sure slide_extractor.py is in the same directory as app.py, or that app.py can otherwise import it.


🚀 Getting Started

1️⃣ Install Dependencies

Make sure you have Python ≥ 3.8 installed.

    pip install -r requirements.txt

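Also install Tesseract OCR. The command below is for Debian/Ubuntu; on other systems, use your platform's package manager or installer:

    sudo apt install tesseract-ocr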

2️⃣ Run the App

    python app.py

A GUI will open where you can:

  • Paste the YouTube link
  • Choose a download location
  • Click "Start" to begin download, slide extraction, and PDF generation

🧠 How It Works

  • The video is downloaded using yt-dlp
  • Frames are sampled at a fixed frame interval
  • Frames whose similarity exceeds the threshold are treated as duplicates and removed using OpenCV comparison
  • OCR reads any visible slide text (optional, for enhancements)
  • Final slides are compiled into a professional PDF using ReportLab
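The sampling and de-duplication steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in slide_extractor.py: frames are plain lists of grayscale values instead of OpenCV images so the example runs without dependencies, the similarity metric (one minus the normalized mean absolute pixel difference) is an assumed stand-in for whatever OpenCV comparison the project uses, and the function names are hypothetical.

```python
from typing import List, Sequence

# Flattened grayscale pixels in 0-255; a stand-in for an OpenCV image.
Frame = Sequence[int]


def sample_indices(total_frames: int, frame_interval: int) -> List[int]:
    """Indices of the frames to inspect: every `frame_interval`-th frame."""
    if frame_interval < 1:
        raise ValueError("frame_interval must be >= 1")
    return list(range(0, total_frames, frame_interval))


def similarity(a: Frame, b: Frame) -> float:
    """Similarity in [0, 1]: 1.0 for identical frames, 0.0 for maximally different."""
    if len(a) != len(b):
        raise ValueError("frames must have the same size")
    if not a:
        return 1.0
    total_diff = sum(abs(x - y) for x, y in zip(a, b))
    return 1.0 - total_diff / (255.0 * len(a))


def select_slides(frames: Sequence[Frame], frame_interval: int,
                  threshold: float) -> List[int]:
    """Keep a sampled frame only if it is less similar than `threshold`
    to the most recently kept slide."""
    kept: List[int] = []
    for i in sample_indices(len(frames), frame_interval):
        if not kept or similarity(frames[kept[-1]], frames[i]) < threshold:
            kept.append(i)
    return kept


# Toy "video": four frames of one slide followed by four frames of another.
slide_a = [10] * 16
slide_b = [200] * 16
video = [slide_a] * 4 + [slide_b] * 4
print(select_slides(video, frame_interval=2, threshold=0.95))  # [0, 4]
```

Comparing against the last kept slide, rather than the previous sampled frame, is a design choice in this sketch: it keeps slow transitions from slipping through as a chain of small changes.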

📸 Sample Output

✅ Clean slides PDF from a 1-hour technical lecture in under 2 minutes
📁 Slides saved in output/ with timestamped filenames
🧾 Final PDF available for download


🙋‍♂️ Author

Made with ❤️

