Releases · lukaszliniewicz/Pandrator

04 Nov 05:29

v.0.29

3516f9d

0.29 Latest


This is a very small update that fixes several dependency-related bugs and improves the training workflow (specifically, the segmentation and segment refinement of source audio). The launcher was updated as well, so if you were affected, please replace the old launcher executable with the one from this release and then update Pandrator.

Self-contained packages

I've prepared packages (archives) that you can simply unpack - everything is preinstalled in its own portable conda environment. You can download them from here.

You can use the launcher to start Pandrator, update it and install new features.

Package	Contents	Unpacked Size
1	Pandrator and Silero	4GB
2	Pandrator and XTTS (CPU only)	7GB
3	Pandrator and XTTS (Nvidia GPU Support)	14GB
4	Pandrator, XTTS, RVC, WhisperX (for dubbing and training) and XTTS fine-tuning	36GB

Installer

You may use the installer/launcher below, which was created from the pandrator_installer_launcher.py file in the repository, or use the source file directly. Please remember to run the executable as an administrator. It's possible that Windows or your antivirus software will flag it as a threat. You may whitelist it, or, if you're not comfortable doing that, review the code in the repository and install Pandrator manually.


02 Nov 05:08

lukaszliniewicz

v.0.28

3516f9d

0.28

This update includes several enhancements to the Easy XTTS Trainer, aimed at improving the quality of trained models and providing more control over the training process.

Improved Audio Segmentation: The trainer now identifies optimal split points between segments by locating the quietest points in the audio. This method results in cleaner transitions between segments, reducing the likelihood of abrupt cutoffs or the inclusion of fragments of the previous or next segment, which in turn improves the overall quality and naturalness of the synthesized speech and helps eliminate artifacts.
Integrated Audio Preprocessing: You can now apply the following audio processing steps directly within Pandrator as a part of the training workflow:
- Normalization: Normalize audio to a target LUFS value (default -16.0). Use --normalize <value> to specify a different target.
- De-essing: Reduce sibilance with the --dess flag.
- Noise Reduction: Apply DeepFilterNet noise reduction with --denoise.
- Dynamic Range Compression: Use the --compress option with profiles for male, female, or neutral voices.
- Sample Rate Control: Use --sample-rate to explicitly set the sample rate (22050Hz or 44100Hz). 22050Hz is recommended.
Training Options:
- Training/Validation Split: The --training-proportion argument (e.g., --training-proportion 8_2) now controls the train/validation split ratio.
- Segmentation Methods: The trainer supports three segmentation methods: maximise-punctuation, punctuation-only, and mixed. The --method-proportion argument controls the ratio for the mixed method.
Pandrator Integration: Trained models and reference audio samples (two: a random one from the 10% longest segments and the fastest one from the 70% longest segments) are automatically made available in Pandrator for immediate generation, as in previous versions.
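
As a rough illustration of the quietest-point idea described under Improved Audio Segmentation, the sketch below scans a search range in fixed-size windows and returns the centre of the window with the lowest RMS energy. It is not the trainer's actual code: the function name `quietest_split`, its parameters and the 20-sample window are assumptions made for this example, and positions are given in samples rather than seconds.

```python
import math


def quietest_split(samples, lo, hi, win):
    """Return the index at the centre of the lowest-RMS window in samples[lo:hi].

    Hypothetical helper for illustration only. `samples` is a sequence of
    floats, `lo`/`hi` bound the search range, `win` is the window length.
    """
    lo = max(0, lo)
    hi = min(len(samples), hi)
    if win <= 0 or hi - lo < win:
        raise ValueError("search range is shorter than one analysis window")

    best_index, best_rms = lo, float("inf")
    # Step through non-overlapping windows and keep the quietest one.
    for start in range(lo, hi - win + 1, win):
        frame = samples[start:start + win]
        rms = math.sqrt(sum(x * x for x in frame) / win)
        if rms < best_rms:
            best_index, best_rms = start + win // 2, rms
    return best_index


# Synthetic signal: 400 loud samples, a 100-sample silent gap, 500 loud samples.
tone = [0.5 if i % 2 == 0 else -0.5 for i in range(400)]
signal = tone + [0.0] * 100 + tone + tone[:100]
print(quietest_split(signal, 200, 800, 20))  # prints 410
```

Splitting at the returned index places the cut inside the silent gap instead of in the middle of speech, which is the effect the release note describes.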

These changes provide more precise control over the training process and should result in higher-quality custom XTTS voices.
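
To make the `8_2` notation of --training-proportion concrete, here is a minimal sketch that reads such a value as "8 parts training, 2 parts validation" and splits a list of segments accordingly. That reading of the format is an assumption, as are the helper names `parse_proportion` and `split_segments`; the trainer itself may shuffle or round differently.

```python
def parse_proportion(value):
    """Parse a string such as "8_2" into integer parts (8, 2).

    Hypothetical helper; assumes the format is <train>_<validation>.
    """
    parts = value.split("_")
    if len(parts) != 2:
        raise ValueError(f"expected <train>_<validation>, got {value!r}")
    train, val = int(parts[0]), int(parts[1])
    if train <= 0 or val < 0:
        raise ValueError("training part must be positive, validation part non-negative")
    return train, val


def split_segments(segments, proportion):
    """Split segments into (training, validation) lists, keeping their order."""
    train, val = parse_proportion(proportion)
    # Integer arithmetic avoids floating-point rounding surprises.
    n_train = len(segments) * train // (train + val)
    return segments[:n_train], segments[n_train:]


training, validation = split_segments(list(range(10)), "8_2")
print(len(training), len(validation))  # prints 8 2
```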

Self-contained packages

I've prepared packages (archives) that you can simply unpack - everything is preinstalled in its own portable conda environment. You can download them from here.

You can use the launcher to start Pandrator, update it and install new features.

Package	Contents	Unpacked Size
1	Pandrator and Silero	4GB
2	Pandrator and XTTS (CPU only)	7GB
3	Pandrator and XTTS	14GB
4	Pandrator, XTTS, RVC, WhisperX (for dubbing) and XTTS fine-tuning	36GB

Installer


26 Oct 02:06

lukaszliniewicz

v.0.27

248f669

0.27

EDIT (28 Oct): There was a bug that prevented Pandrator from launching under certain circumstances. It has been fixed. If you were affected, please download the launcher from this release and use the update option.

This is a very small update. Using PyCropPDF, you can now crop PDFs before text extraction (to remove headers and footers) and remove pages that are not needed for TTS (like the title page or the table of contents).

You can use the Update option in the Launcher.

Self-contained packages

I've prepared packages (archives) that you can simply unpack - everything is preinstalled in its own portable conda environment. You can download them from here.

You can use the launcher to start Pandrator, update it and install new features.

Package	Contents	Unpacked Size
1	Pandrator and Silero	4GB
2	Pandrator and XTTS	14GB
3	Pandrator, XTTS, RVC, WhisperX (for dubbing) and XTTS fine-tuning	36GB

Installer


19 Oct 01:32

lukaszliniewicz

v.0.26

7b1fbf0

0.26

This release focuses on the installer. Chocolatey now replaces winget because it installs the C++ Build Tools more reliably, and the way the XTTS server is started has been improved. Hopefully this will solve the issue some people experienced with the server not coming online when started from the launcher.

Self-contained packages

I've prepared packages (archives) that you can simply unpack - everything is preinstalled in its own portable conda environment. You can download them from here.

You can use the launcher to start Pandrator, update it and install new features.

Package	Contents	Unpacked Size
1	Pandrator and Silero	4GB
2	Pandrator and XTTS	14GB
3	Pandrator, XTTS, RVC, WhisperX (for dubbing) and XTTS fine-tuning	36GB

Installer


12 Oct 04:03

lukaszliniewicz

v.0.25

696e941

0.25

Changes

Introduced marking sentences for regeneration and saving them as a list, using a button, the "m" key or a right-click. This is useful when generating a longer text: you can mark problematic sentences and work on them later. A right-click saves both the currently playing sentence and the previous one, while the "m" key saves only the current one (if you're not looking at the playlist while listening, it might otherwise be difficult to catch the right sentence in time).
Added downloading videos from YouTube (and other web sources) using yt-dlp (for the dubbing/subtitle/translation workflow).
Refined the metadata options and handling.
Small bug fixes and improvements.

Self-contained packages

I've prepared packages (archives) that you can simply unpack - everything is preinstalled in its own portable conda environment. You can use the launcher to start Pandrator, update it and install new features, depending on the version of the package you downloaded.

Package	Contents	Unpacked Size	Link
1	Pandrator and Silero	4GB	Download
2	Pandrator and XTTS	14GB	Download
3	Pandrator, XTTS, RVC, WhisperX (for dubbing) and XTTS fine-tuning	36GB	Download

Installer


10 Oct 01:13

lukaszliniewicz

v.0.2

f8fc2b5

0.2

Changes

The UI takes the whole width of the screen now and consists of two parts - the settings on the left and the generated sentences player/editor on the right.
Preprocessing of long files is now parallelised, which makes it 3-4x faster.
Introduced metadata: the ability to set the album title, the artist, the genre and upload a cover image.
Added support for .m4b.
Added support for chapter detection (at the moment only for epub files) and chapter markers in m4b files (if you want to have the smallest file possible, use opus - it performs very well for speech even at 16k!)
Small improvements in the training workflow (a folder with reference samples is automatically created in the tts_voices folder when training finishes) and fixes for the RVC workflow.

Pre-Installed Packages

You may download self-contained packages that only require unpacking from here. You don't have to install anything, all components are included in portable conda environments. You may install additional components at any time using the launcher. But please remember to update Pandrator from the launcher.

Installer


06 Oct 02:38

lukaszliniewicz

v.0.15

48e7ccc

0.15

Changes

Besides bug fixes and small UI improvements, I've added the ability to fine-tune (train) a custom XTTS model. It is very simple: select a file or a folder with multiple audio files, give the model a name, and training is performed fully automatically. The trained model will appear in the "XTTS Model" dropdown in the GUI after you click "Connect to server". An Nvidia GPU with at least 8 GB of VRAM is required. As little as 10 minutes of audio is enough to improve voice cloning results significantly compared with zero-shot, though I recommend at least 30 minutes. You may experiment with increasing the number of epochs and gradient accumulation steps. When using a custom model, you still have to provide a voice file. You may upload one of the segments produced from the source audio (they are located in Pandrator/easy_xtts_trainer/<model_name>/audio_sources/processed). Training models requires installing a tool through the launcher (if you have an existing installation, just download the newest launcher executable, put it in the folder that contains the Pandrator folder, and install the tool from there).

Pre-Installed Packages

Installer


28 Sep 03:30

lukaszliniewicz

v.0.1

06d759f

0.1

In this release, I've:

fixed splitting of Chinese and Japanese sentences,
added the option to regenerate all sentences,
changed the RVC implementation to RVC Python and added it to the installer as an optional tool (RVC model files are now kept in the rvc_models folder inside the Pandrator folder, each in its own directory; when uploading RVC models through the UI, please make sure that the .pth and .index files have the same name),
completely reworked the dubbing workflow by offloading most of it to a separate CLI app, Subdub, which I made for this purpose and which is installed together with Pandrator when using the installer script or executable. It is now possible to select a video file, transcribe it (using WhisperX), translate the subtitles (using LLMs, including proprietary ones, or the DeepL API, which is free up to 500,000 characters a month), generate speech using the standard Pandrator workflow, mix the dubbing audio with the original soundtrack and save the result to the video; it's also possible to load an .srt file and a video if transcription is not necessary,
added logging to a file and log preview in the UI,
made Pandrator connect automatically to the chosen TTS engine if opened through the launcher,
improved the UI a little.
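
The naming rule for RVC models mentioned above (a .pth and an .index file with the same name, each model in its own directory under rvc_models) can be checked with a short script. This is only a sketch: the function name `find_mismatched_rvc_models` is made up for this example, and it assumes one .pth/.index pair per model folder.

```python
from pathlib import Path


def find_mismatched_rvc_models(rvc_models_dir):
    """Return the names of model folders whose .pth and .index names differ.

    Hypothetical helper for illustration; folders with no .pth file at all
    are reported as well.
    """
    problems = []
    for model_dir in sorted(Path(rvc_models_dir).iterdir()):
        if not model_dir.is_dir():
            continue
        # Compare file names without their extensions.
        pth_names = {p.stem for p in model_dir.glob("*.pth")}
        index_names = {p.stem for p in model_dir.glob("*.index")}
        if not pth_names or pth_names != index_names:
            problems.append(model_dir.name)
    return problems
```

Calling `find_mismatched_rvc_models("Pandrator/rvc_models")` would then list the folders that need renaming before the models are used.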


10 Sep 02:24

lukaszliniewicz

v.0.0.9.5

13e56cb

0.0.9.5

I've added an update function to the installer/launcher (to update Pandrator) and fixed some issues with RVC installation and processing.

The .exe installer/launcher was created using PyInstaller from pandrator_installer_launcher.py in the repository. Please remember to run it as an administrator if you want it to install Git, FFmpeg, C++ Build Tools and/or calibre.

It's possible that your antivirus software flags the installer as a threat, because it is not signed. In that case, add it as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.


06 Sep 17:50

lukaszliniewicz

v.0.0.9

f3c702d

0.0.9

I've created a single installer/launcher that supports all TTS engines and RVC. This should make it much simpler to experiment with different tools. Also, advanced settings have been exposed for XTTS, including speed and temperature, and support for fine-tuned models has been added.

