Trinetra is a powerful AI-driven application that helps users prepare, clean, and enhance various types of data including CSV files, PDFs, and images. The application leverages advanced AI capabilities to automate data quality checks, implement data cleaning operations, and provide intelligent data processing solutions.
- Automated data quality checks
- Intelligent data cleaning and preparation
- Interactive data preview and analysis
- Summary statistics generation
- AI-powered data transformation suggestions
- Automatic page orientation correction
- OCR (Optical Character Recognition) capabilities
- Noise and artifact removal
- Text formatting and alignment fixes
- Blank page detection and removal
- Interactive image enhancement controls
- Brightness, contrast, sharpness, and saturation adjustment
- Quality issue detection
- Format optimization
- Batch processing capabilities
- Clone the repository:
git clone https://github.com/yourusername/Trinetra.git
cd Trinetra
- Install the required dependencies:
pip install -r requirements.txt
- Set up environment variables:
- Create a
.env
file in the root directory - Add your Google API key:
GOOGLE_API_KEY=your_api_key_here
- Start the application:
streamlit run streamlit_app.py
-
Access the web interface through your browser at
http://localhost:8501
-
Use the sidebar to upload your files:
- CSV files for data preparation
- PDF files for document processing
- Images for enhancement
-
Navigate between tabs:
- Data Quality Explorer: View and analyze data quality issues
- AI Data Prep: Select issues to resolve and apply AI-powered solutions
Trinetra/
├── backend/
│ ├── data_preparation_gemini.py
│ ├── data_quality_checks.py
│ ├── image_processing.py
│ └── pdf_processing.py
├── streamlit_app.py
├── requirements.txt
└── README.md
- streamlit
- pandas
- numpy
- scikit-learn
- scipy
- statsmodels
- google-generativeai
- datasketch
- python-dotenv
- PyMuPDF
- pytesseract
- Pillow
- opencv-python
- pywavelets
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Google Generative AI for powering the intelligent data preparation features
- Streamlit for the interactive web interface
- The open-source community for various data processing libraries