PhD Researcher in Computer Vision at Free University of Bozen-Bolzano
Specializing in Multimodal Video Understanding and Vision-Language Models for sports performance analysis ⚽
🔬 Research Focus: Efficient architectures for action quality assessment, skill evaluation, and AI-driven feedback generation
✍️ Technical Writer @ Towards AI • 20+ articles • 25K+ reads
Building lightweight VLMs with state-of-the-art performance
- SkillFormer: Multi-view action quality assessment (4.5× fewer parameters)
- PATS: Proficiency-aware temporal sampling (+26% performance gains)
- ProfVLM: Lightweight vision-language model for skill assessment (20× fewer parameters)
- PATS: Proficiency-Aware Temporal Sampling for Multi-View Sports Skill Assessment (IEEE STAR 2025)
- SkillFormer: Unified Multi-View Video Understanding for Proficiency Estimation (ICMV 2025)
- Gate-Shift-Pose: Enhancing Action Recognition in Sports with Skeleton Information (WACVW 2025)
