MedScan AI is a modular, multimodal medical assistant built to streamline clinical data processing. It intelligently integrates and interprets information from diverse sources:
- ๐ผ๏ธ Radiology images (e.g., X-rays, CT scans)
- โ๏ธ Handwritten doctor notes
- ๐๏ธ Audio transcripts
By leveraging state-of-the-art open-source models like Qwen2.5-VL, Whisper, and Qwen-Omni, MedScan AI generates comprehensive and accurate clinical summaries.
-
๐ฌ Advanced Image Analysis
Uses Qwen2.5-VL to interpret radiology images with medical-grade accuracy. -
๐ Accurate Handwriting OCR
Employs Tesseract to digitize and extract insights from handwritten doctor notes. -
๐ง Robust Audio Transcription
Converts medical conversations into text using Whisper. -
๐งฉ Intelligent Medical Reasoning
Integrates and analyzes multimodal data with Qwen-Omni to generate clinical insights. -
โ๏ธ Seamless Orchestration
Orchestrated byorchestrator.py
pipelines for streamlined and efficient data processing.