Bio-data scientist | Multi-omics pipeline architect | Computational microbiome neurobiologist
I build fault-tolerant, production-ready pipelines that turn terabytes of raw omics data into testable biological insight. Recent work spans:
- Streaming multi-GB+ HMDB XML files on low-RAM machines using Python
iterparse
and multithreading - Automating ligand preparation with Open Babel and AutoDockTools for structure-based drug discovery
- Dissecting the molecular dialogue between the human microbiome and brain through Human Accelerated Regions (HARs)
- Designing standardized, reproducible multi-omics workflows for large‐scale microbiome projects
- Mapping metabolomic signatures that stratify COVID-19 severity
- Using Mendelian randomization to link host genetics, gut microbes, and neuro-psychiatric risk
Category | Toolkit |
---|---|
Languages | Python 3 (asyncio, pandas), R (tidyverse), Bash/Batch |
Data Engineering | multiprocessing, xml.etree.ElementTree , Airflow |
Bio-/Cheminformatics | HMDB, Open Babel, AutoDock Vina, PDB |
Multi-omics Analytics | MetaboAnalystR, NMDC EDGE, Cytoscape |
Statistics | Mendelian Randomization, multivariate metabolomics |
Full list → Professional portfolio
- I’m keen to collaborate on multi-omics standardization, resource-constrained bio-computing, and translational microbiome research.*
Channel | Handle |
---|---|
mscphd2301171011@iiti.ac.in |
|
/in/sidsin0809 | |
ORCID | 0000-0002-1388-6221 |
When I’m not wrangling cells and code, you’ll find me sketching data-driven sci-fi storyboards.