A comprehensive document analysis and LaTeX conversion automation tool built with React and TypeScript. This project provides intelligent document processing capabilities to convert Word documents (.docx) into LaTeX format with automatic table detection, equation recognition, and formatting preservation.
- Document Analysis: Intelligent parsing of .docx files with content extraction
- LaTeX Generation: Automatic conversion to LaTeX format with proper formatting
- Table Detection: Advanced table recognition and LaTeX table generation
- Equation Support: Mathematical equation detection and LaTeX conversion
- Multiple Format Support: ACM, IEEE, and Springer formatting styles
- Real-time Preview: Live preview of generated LaTeX output
- Batch Processing: Handle multiple documents efficiently
- Frontend: React 19.1.0, TypeScript 4.9.5
- Document Processing: JSZip for .docx parsing
- UI Components: Lucide React icons
- Testing: Jest, React Testing Library
- Build Tools: React Scripts 5.0.1
- Clone the repository:
git clone https://github.com/akshitharsola/overleaf-automation.git
cd overleaf-automation
- Navigate to the main application:
cd docx-analyzer
- Install dependencies:
npm install
- Start the development server:
npm start
The application will be available at http://localhost:3000
overleaf-automation/
βββ docx-analyzer/ # Main React application
β βββ src/
β β βββ components/ # React components
β β βββ utils/ # Utility functions
β β βββ types/ # TypeScript definitions
βββ DATES/ # Development history and iterations
βββ GUIDEs/ # Implementation guides and documentation
βββ LATEXs/ # LaTeX generation components
βββ Main I/ # Instructions and limitations
βββ README.md # This file
- Upload Document: Select a .docx file using the file input
- Choose Format: Select your preferred LaTeX format (ACM, IEEE, Springer)
- Process: Click "Analyze Document" to start processing
- Review: Examine the generated LaTeX output
- Export: Copy or download the LaTeX code
This project has undergone multiple iterations and improvements:
- 09/07/2025: Initial table detection and format examples
- 10/07/2025: Added document support and LaTeX integration
- 11-12/07/2025: Enhanced equation detection capabilities
- 12/07/2025: Major refactoring with improved UI
- 13/07/2025: Multiple attempts at unified document processing
- 14/07/2025: Final enhanced unified processor implementation
Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
This project is open source and available under the MIT License.
Thanks to these wonderful people who have contributed to this project:
![]() Akshit Harsola Project Lead & Developer π» π π¨ π§ |
Claude AI AI Assistant & Co-Developer π» π π€ β‘ |
- π» Code - π Documentation - π¨ Design - π§ Maintenance - π€ AI Development - β‘ Performance
- Large document processing may require optimization
- Complex table structures might need manual adjustment
- Equation detection accuracy depends on document formatting
- Support for additional document formats (.doc, .pdf)
- Enhanced equation recognition algorithms
- Cloud-based document processing
- Integration with Overleaf API
- Advanced formatting preservation
- Batch processing improvements
For support, please open an issue in the GitHub repository or contact the development team.
Built with β€οΈ by Akshit Harsola and Claude AI