v2.9.3
Dataset Processor Tools is a comprehensive set of tools designed for processing image datasets for machine learning. With these tools, you can easily discard low-resolution images, resize images while preserving their aspect ratio, generate tags using a pre-trained model, mass edit .txt files with tags, and even manually edit .txt files with ease.
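As a rough illustration of the first two operations (not the tool's actual implementation), the sketch below shows one way to flag a low-resolution image and to compute a resize target that preserves the aspect ratio. The function names, the pixel-count threshold, and the "longest side" convention are all assumptions made for this example.

```python
def is_low_resolution(width: int, height: int, min_pixels: int = 512 * 512) -> bool:
    """Return True when the image has fewer pixels than the (assumed) threshold."""
    return width * height < min_pixels


def fit_longest_side(width: int, height: int, target: int) -> tuple[int, int]:
    """Scale (width, height) so the longest side equals `target`, keeping the aspect ratio."""
    if width <= 0 or height <= 0 or target <= 0:
        raise ValueError("dimensions must be positive")
    scale = target / max(width, height)
    # Round to the nearest pixel and never let a side collapse to zero.
    return max(1, round(width * scale)), max(1, round(height * scale))
```

For example, `fit_longest_side(2000, 1000, 1024)` gives `(1024, 512)`, so a 2:1 image stays 2:1 after resizing.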
The download comes bundled with all the necessary dependencies, including the OnnxRuntime .dll/.so files, to ensure a seamless experience. Please note that some of these files are quite large, but they are required for the tools to work.
For detailed instructions and examples, be sure to visit the Wiki page. You'll find comprehensive guidance on how to make the most of Dataset Processor Tools.
New Features:
- Gemini Guided Captioning:
- Added Gemini Captioning feature that uses the official Python package.
- Developer notes: this feature requires Python 3.10.x to be installed (and added to the PATH environment variable), since it uses the Python library to generate the captions; the official REST API is limited when it comes to content filtering and blocking.
- It will use existing tags in .txt files, if available, to help guide the Gemini captions.
- Extended user settings for the Process Tags page based on user feedback
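The tag-guided captioning idea can be sketched roughly as follows. This is only an illustration under stated assumptions, not the tool's code: it assumes the tags sit in a comma-separated .txt file with the same base name as the image, and the function names and prompt wording are hypothetical. The call to the Gemini package itself is left out on purpose.

```python
from pathlib import Path


def read_existing_tags(image_path: Path) -> list[str]:
    """Read comma-separated tags from the .txt file next to the image, if any."""
    tag_file = image_path.with_suffix(".txt")
    if not tag_file.is_file():
        return []
    text = tag_file.read_text(encoding="utf-8")
    return [tag.strip() for tag in text.split(",") if tag.strip()]


def build_caption_prompt(tags: list[str]) -> str:
    """Build a (hypothetical) captioning prompt, optionally guided by existing tags."""
    prompt = "Describe this image in one detailed caption."
    if tags:
        prompt += " Use these tags as hints: " + ", ".join(tags) + "."
    return prompt
```

When no .txt file exists, the prompt falls back to the plain instruction, which matches the "if available" behaviour described in the notes.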
Requirements:
This software requires the following runtimes (or newer versions), so be sure to install them:
.NET Desktop Runtime 8.0.2 or newer
For Windows users: Visual C++ Redistributable for Visual Studio 2019 for running the Model
Python 3.10.x to use Gemini Captioning
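To confirm that the Python found on your PATH matches the 3.10.x requirement, a small check like the one below can help. It is a minimal sketch and not part of the tool; the function name `is_supported_python` is made up for this example.

```python
import sys


def is_supported_python(version: tuple[int, int] = sys.version_info[:2]) -> bool:
    """Return True for any Python 3.10.x interpreter."""
    return tuple(version) == (3, 10)


if __name__ == "__main__":
    if is_supported_python():
        print("Python 3.10.x detected.")
    else:
        found = f"{sys.version_info.major}.{sys.version_info.minor}"
        print(f"Found Python {found}; Gemini Captioning expects 3.10.x.")
```

Run it with the same `python` command the tool would pick up from PATH, so that the check reflects the interpreter actually in use.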
Linux specific requirements:
Follow this installation guide from the LibVLCSharp GitHub documentation:
- sudo apt update
- sudo apt install vlc libvlc-dev
- sudo apt install vlc
For macOS users: please check this Issue on how to build it yourself, and don't forget to read my comment!