🎬 SRT ChatGPT Translator

A production-ready Python CLI tool that translates SubRip (.srt) subtitle files using OpenAI's API while preserving exact structure, timing, and formatting.

📋 Table of Contents

🚀 Key Features
⚙️ Installation & Setup
🚀 Quick Start Scripts
💻 Command Line Usage
🔧 Environment Variables
⚡ How It Works
📝 Complete Example
🛠️ Error Handling
📋 Requirements
🧪 Development
☕ Support the Project
📄 License

🚀 Key Features

Structure Preservation: Maintains exact SRT structure including cue indices, timestamps, and line counts
HTML Tag Protection: Preserves inline HTML tags (<i>, <b>, <font>, etc.) and entities (&, <, etc.)
Word Replacement System: Replaces specific terms in translations using source --> target format matching files
Smart Credits Management: Automatically detects, replaces, and intelligently inserts translator credits
Word Removal: Completely removes unwanted words or patterns from translations
Structured Output: Uses OpenAI's Responses API with JSON schema for reliable translation
Batch Processing: Process entire directories of SRT files
Robust Error Handling: Multiple retry strategies with graceful fallbacks

⚙️ Installation & Setup

1️⃣ Clone and Install

git clone https://github.com/ntamasM/srt-translator.git
cd srt-translator
pip install -e .

2️⃣ Set up your OpenAI API key

cp .env.example .env
# Edit .env and set your OPENAI_API_KEY

3️⃣ Create the recommended data structure

Create the following folder structure in the root directory:

data/
├── subtitles/          # Source SRT files to translate
├── translated/         # Output directory for translated files
├── matching/          # Word replacement files (source --> target)
└── remove/            # Word removal files

🚀 Quick Start Scripts

For convenience, the repository includes ready-to-use scripts for batch translation:

🪟 Windows Users (`run_translation.ps1`)

.\run_translation.ps1

🐧 Linux/Mac Users (`run_translation.sh`)

./run_translation.sh

Both scripts automatically:

Translate ALL SRT files in data\subtitles\ directory
Output translated files to data\translated\ directory
Use English to Greek translation with matching terms
Apply case-insensitive matching from data\matching\animeMatchingToEl.txt

⚠️ Before Running Scripts

🔑 Set up your API key: Make sure your OpenAI API key is configured in the .env file
📁 Prepare your files: Place your SRT files in the data/subtitles/ directory
🔧 Customize if needed: Edit the script files to change source/target languages, matching files, or other parameters

💻 Command Line Usage

📝 Basic Commands

📄 Single File Translation

srt-translate input.srt output.srt

📦 Batch Processing

srt-translate --input-dir ./subtitles --output-dir ./translated

🔄 With Word Replacement

srt-translate input.srt output.srt --matching anime_terms.txt

🎯 Complete Example

srt-translate input.srt output.srt \
  --src en --tgt el \
  --matching anime_terms.txt --matching-ci \
  --removal-file profanity.txt \
  --translator-name "Your Name"

⚙️ Command Line Options

📋 Positional Arguments

Argument	Required	Description
`input_file`	Yes*	Input SRT file path (when not using `--input-dir`)
`output_file`	Yes*	Output SRT file path (when using `input_file`)

*Either use input_file/output_file for single file mode OR --input-dir/--output-dir for batch mode.

🛠️ Optional Arguments

Option	Default	Type	Description
`--input-dir`	None	String	Input directory containing SRT files (batch)
`--output-dir`	None	String	Output directory for translated files (batch)
`--src`	`en`	String	Source language code
`--tgt`	`el`	String	Target language code
`--model`	`gpt-4o-mini`	String	OpenAI model to use
`--temperature`	`0.2`	Float	Sampling temperature (0-2)
`--top-p`	`0.1`	Float	Top-p sampling parameter (0-1)
`--matching`	None	String	Path to word replacement file
`--matching-ci`	False	Flag	Case-insensitive word replacement
`--removal-file`	None	String	Path to word removal file
`--translator-name`	`Ntamas`	String	Name of translator to use in credits
`--replace-old-credits`	True	Flag	Replace existing translator credits
`--add-new-credits`	True	Flag	Intelligently add translator credits
`--append-credits-at-the-end`	False	Flag	Force credits at end instead of finding gaps

🔧 Environment Variables

Set your OpenAI API key in a .env file or environment:

OPENAI_API_KEY=your_api_key_here

⚡ How It Works

🔄 Word Replacement System

The matching system supports post-translation word replacement using a simple source --> target format:

🔄 Process Flow

Translation First: AI translates the subtitle normally
🔄 Word Replacement: After translation, specific terms are replaced using your matching file
🎯 Intelligent Matching: Uses word boundaries to avoid partial replacements

📄 Matching File Format

Create a text file with source --> target format:

# Comments start with #
# English --> Greek translations for anime terms

Demon Slayer Corps --> Σώμα Εξολοθρευτών Δαιμόνων
Water Breathing --> Αναπνοή του Νερού
Thunder Breathing --> Αναπνοή της Βροντής
Nichirin Blade --> Λεπίδα Νιτσιρίν
Total Concentration Breathing --> Αναπνοή Ολικής Συγκέντρωσης
Final Selection --> Τελική Δοκιμασία

# Character names (keep same)
Tanjiro --> Tanjiro
Nezuko --> Nezuko

💡 Example Process

Original: "The Demon Slayer Corps uses Water Breathing techniques."

Step 1 - AI Translation: "Το Σώμα Εξολοθρευτών Δαιμόνων χρησιμοποιεί τεχνικές Water Breathing."

Step 2 - Word Replacement: "Το Σώμα Εξολοθρευτών Δαιμόνων χρησιμοποιεί τεχνικές Αναπνοή του Νερού."

🗑️ Word Removal

Remove unwanted words or patterns from subtitles using the --removal-file option:

📄 Removal File Format

damn
shit
hell
{\an8}
[MUSIC]

🎯 Smart Pattern Matching

📝 Normal words: Uses word boundaries (removes "word" from "word text" but not from "password")
🔧 Special patterns: Removes pattern anywhere it appears (removes {\an8} from {\an8}text)

📝 Smart Credits Management

⚙️ Credit Options

--replace-old-credits (default: True): Replaces existing translator credits with yours
--add-new-credits (default: True): Intelligently adds translator credits
--append-credits-at-the-end (default: False): Forces credits at the end

⚡ How It Works

📊 Gap Analysis: Analyzes timing gaps between subtitles (≥5 seconds)
🎯 Optimal Placement: Inserts credits in the largest suitable gap
🔄 Fallback: If no suitable gap exists, credits are added at the end
⚙️ Force End Option: Use --append-credits-at-the-end to always put credits at the end

🔄 Processing Order

The tool processes subtitles in the following order:

🔄 Credit Replacement: Replace existing translator credits (if enabled)
🗑️ Word Removal: Remove specified words from original text
🌐 Translation: Translate remaining text using OpenAI
🔄 Word Replacement: Apply word replacements from matching file
🏗️ Structure Restoration: Restore formatting and timing
📝 Smart Credits Insertion: Add translator credits in optimal location

📝 Complete Example

📄 Input File (`sample.srt`)

1
00:00:01,000 --> 00:00:03,500
Hello, this is a <i>sample</i> subtitle with Demon Slayer Corps.

2
00:00:04,000 --> 00:00:06,500
Character says: "Thank you, sensei!" about Water Breathing.

3
00:00:07,000 --> 00:00:09,500
Translated by Original Translator

📄 Matching File (`anime_terms.txt`)

Demon Slayer Corps --> Σώμα Εξολοθρευτών Δαιμόνων
Water Breathing --> Αναπνοή του Νερού

💻 Command

srt-translate sample.srt output.srt \
  --matching anime_terms.txt \
  --translator-name "Ntamas"

📄 Output File (`output.srt`)

1
00:00:01,000 --> 00:00:03,500
Γεια σας, αυτός είναι ένας <i>δείγμα</i> υπότιτλος με Σώμα Εξολοθρευτών Δαιμόνων.

2
00:00:04,000 --> 00:00:06,500
Ο χαρακτήρας λέει: "Ευχαριστώ, sensei!" για Αναπνοή του Νερού.

3
00:00:07,000 --> 00:00:09,500
Translated by Ntamas with AI

🛠️ Error Handling

The tool implements multiple retry strategies:

📦 Batch Translation: Attempts to translate all lines in one API call
📝 Indexed Translation: Adds line numbers to help model maintain structure
📄 Line-by-Line Fallback: Translates each line individually if batch fails

If translation fails completely, the original line is preserved.

📋 Requirements

Python 3.9+
OpenAI API key
Dependencies: openai, srt, python-dotenv, tqdm

🧪 Development

🧪 Running Tests

pytest

📁 Project Structure

src/srt_chatgpt_translator/
├── __init__.py          # Package initialization
├── cli.py              # Command-line interface
├── translate.py        # Main translation logic
├── openai_client.py    # OpenAI API wrapper
├── placeholders.py     # HTML/word protection & replacement
├── credits.py          # Credits detection & replacement
└── word_removal.py     # Word removal functionality

☕ Support the Project

If this tool has been helpful for your subtitle translation projects, consider supporting its development!

🌟 Other Ways to Support

⭐ Star this repository on GitHub
🐦 Share it on social media - mention @ntamasM
🐛 Report bugs or suggest features
📖 Contribute to the documentation
💬 Spread the word to other subtitle translators

Every bit of support helps maintain and improve this tool! 🚀

📄 License

MIT License - see LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
examples		examples
src/srt_chatgpt_translator		src/srt_chatgpt_translator
tests		tests
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
run_translation.ps1		run_translation.ps1
run_translation.sh		run_translation.sh

License

ntamasM/srt-translator

Folders and files

Latest commit

History

Repository files navigation

🎬 SRT ChatGPT Translator

📋 Table of Contents

🚀 Key Features

⚙️ Installation & Setup

1️⃣ Clone and Install

2️⃣ Set up your OpenAI API key

3️⃣ Create the recommended data structure

🚀 Quick Start Scripts

🪟 Windows Users (run_translation.ps1)

🐧 Linux/Mac Users (run_translation.sh)

⚠️ Before Running Scripts

💻 Command Line Usage

📝 Basic Commands

📄 Single File Translation

📦 Batch Processing

🔄 With Word Replacement

🎯 Complete Example

⚙️ Command Line Options

📋 Positional Arguments

🛠️ Optional Arguments

🔧 Environment Variables

⚡ How It Works

🔄 Word Replacement System

🔄 Process Flow

📄 Matching File Format

💡 Example Process

🗑️ Word Removal

📄 Removal File Format

🎯 Smart Pattern Matching

📝 Smart Credits Management

⚙️ Credit Options

⚡ How It Works

🔄 Processing Order

📝 Complete Example

📄 Input File (sample.srt)

📄 Matching File (anime_terms.txt)

💻 Command

📄 Output File (output.srt)

🛠️ Error Handling

📋 Requirements

🧪 Development

🧪 Running Tests

📁 Project Structure

☕ Support the Project

🌟 Other Ways to Support

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

🪟 Windows Users (`run_translation.ps1`)

🐧 Linux/Mac Users (`run_translation.sh`)

📄 Input File (`sample.srt`)

📄 Matching File (`anime_terms.txt`)

📄 Output File (`output.srt`)

Packages