Computer Vision and Machine Learning Researcher
I am a Master's student at the University of Surrey currently working on my thesis at the SketchX Lab under the guidance of Subhadeep Koley and Professor Yi-Zhe Song. My research centers on Deep Learning, Medical Imaging, and Foundation Modelsโdeveloping AI-driven diagnostic tools, advanced X-ray classification systems, and innovative approaches for sketch understanding.
I have actively contributed to multiple international research teams, collaborating with esteemed groups such as SWLEOC, Bagci Lab, and INRIA, STARS Team. My research experience spans collaborations across the US, UK, France, Germany, Japan, and India.
- SAM-Mamba
Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation (WACV 2025) - CDANet
Dual Attention-Based CNN for Brain Tumor Classification (ICIP 2022) - GT-Net
Global Transformer Network for Multiclass Brain Tumor Classification - ARM-Net
Attention-Guided Residual Multiscale CNN for Brain Tumor Classification
- Google Landmark Recognition 2020: Achieved Top 11%
- Data Science Bowl 2019: Ranked within Top 14%
- Japanese Manga Translation:
Developed an integrated pipeline using DETR for text detection, TrOCR for recognition, and Transformer models for translation. - Anomaly Detection:
Implemented an unsupervised Student-Teacher Model paired with HistAuGAN to achieve robust anomaly detection.
- Portfolio: tapaskumardutta1.github.io
- LinkedIn: Tapas Dutta
- Google Scholar: Research
- GitHub: TapasKumarDutta1
- Email: tapasduttatom13@gmail.com
- Kaggle: tapaskd123
I am enthusiastic about potential research collaborations and opportunities. Please feel free to reach out if you would like to discuss research ideas or opportunities.