Skip to content
View ankitshah009's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ankitshah009

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ankitshah009/README.md

Hi πŸ‘‹, I'm Ankit Shah

ankitshah009

ankits0052

🌐 Website: https://ankitshah009.github.io/


πŸ‘¨β€πŸ’Ό About Me

I’m a Full Stack LLM Development Associate Director at Accenture (Center for Advanced AI) and a Ph.D. graduate from the Language Technologies Institute (LTI) at Carnegie Mellon University.

My work focuses on designing, building, and scaling production-grade AI systems, spanning:

  • Large Language Models (LLMs)
  • Multimodal and audio-centric learning systems
  • End-to-end AI platforms from research to deployment

I operate at the intersection of research rigor, real-world constraints, and system reliability, leading teams that translate cutting-edge ideas into deployed AI solutions.


🧠 Current Focus Areas

  • Full-stack LLM system design (data β†’ models β†’ orchestration β†’ evaluation β†’ governance)
  • Multimodal learning (audio, speech, language)
  • Weakly- and semi-supervised learning at scale
  • AI platform architecture for enterprise environments
  • Bridging academic research with production engineering

πŸŽ“ Research Background

  • Ph.D., Language Technologies Institute (LTI), Carnegie Mellon University
  • Research emphasis on:
    • Computational audition
    • Weakly labeled and large-scale learning
    • Multimodal representation learning
  • Contributor and organizer in the DCASE sound event detection benchmark (Task 4)

My academic work is closely aligned with real-world AI deployment challenges, especially where labeled data is scarce or noisy.


πŸ“¦ About This GitHub

This GitHub contains a mix of:

  • Research codebases from my academic work
  • Experimental systems for audio and multimodal learning
  • Tools and prototypes exploring scalable ML systems

Note: A significant portion of my recent work is developed privately or in collaboration with industry and academic partners.


🀝 Collaboration

I’m open to:

  • Research collaborations in multimodal and trustworthy AI
  • Conversations with senior engineers and researchers
  • Select industry and platform-level partnerships

πŸ“« Best ways to reach me:

Languages and Tools:

c cplusplus java javascript python

Pinned Loading

  1. Task-4-Large-scale-weakly-supervised-sound-event-detection-for-smart-cars Task-4-Large-scale-weakly-supervised-sound-event-detection-for-smart-cars Public

    Task 4 Large-scale weakly supervised sound event detection for smart cars

    Python 68 32

  2. all-about-ai-residency all-about-ai-residency Public

    AI residency programs information

    478 57

  3. High-Radix-Adaptive-CORDIC High-Radix-Adaptive-CORDIC Public

    High Radix Adaptive CORDIC Algorithm - Improvement over Traditional CORDIC

    Verilog 14 3

  4. WALNet-Weak_Label_Analysis WALNet-Weak_Label_Analysis Public

    Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.

    Python 32 3

  5. CarCrash_forecasting_and_detection CarCrash_forecasting_and_detection Public

    Python 55 8

  6. youtube-dl-with-aria youtube-dl-with-aria Public

    Wrapper to download faster youtube-dl with aria to support multi-threaded-output

    Shell 30 6