-
Indian Institute Of Technology Delhi
- New Delhi
- https://www.linkedin.com/in/abdur75648/
- https://orcid.org/0000-0002-9547-2435
- @abdur75648
- in/abdur75648
Highlights
- Pro
-
AI-Camera Public
AI Camera: High-Performance Real-Time Object Detection & Tracking
-
MedicalGPT Public
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)
-
Echo-DND Public
Official implementation of Echo-DND: A Dual Noise Diffusion Model for Robust and Precise Left Ventricle Segmentation in Echocardiography
-
Lightweight, deadlock-free multithreaded pipeline framework for fast, modular Python data and ML model workflows. Easily extensible for real-time or batch processing tasks.
-
ffmpeg-gpu Public
Dockerised setup for using GPU-based video decoding and encoding with FFmpeg and NVIDIA's NVDEC/NVENC, integrated with TorchAudio
-
GFPGAN_Video_SR_Colab Public
Colab Notebook for Video Super-Resolution using GFPGAN
Jupyter Notebook UpdatedJan 9, 2025 -
End-To-End-Urdu-OCR-WebApp Public
End-to-End Urdu OCR: A Demo Web App For UTRNet
-
urdu-text-detection Public
Text line detection for Urdu OCR (UTRNet)
-
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)
-
-
V-Zen Public
V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM Resources
-
DINet-Inference Public
Create high-resolution visually dubbed videos with DINet
-
AI-Graphic-Designer Public
An AI-powered graphic design tool that generates customized logos based on user input.
-
Code Template For Cll788 Assignment 5
-
Num10k-Dataset Public
Num10k Dataset: A synthetically created dataset of numerical digits, designed for Optical Character Recognition (OCR) tasks.
UpdatedApr 12, 2024 -
ChatterBox-Finetuning Public
SOTA Model For Multi-round Multimodal Referring and Grounding
-
CogAgent Public
Forked from zai-org/CogVLMState-of-the-art-level Multimodal LLM
Python Other UpdatedMar 26, 2024 -
This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scra…
-
-
Seq2Seq-NMT-PyTorch Public
PyTorch implementation of a Neural Machine Translation model for Cherokee to English translation, based on Stanford’s CS224N: A learning resource for understanding NMT models.
Jupyter Notebook UpdatedDec 25, 2023 -
simple-agi Public
SimpleAGI: A versatile autonomous agent compatible with advanced language models, designed for automating tasks, creating art, analyzing data, and more.
-
SuperAGI Public
Forked from TransformerOptimus/SuperAGI<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
Python MIT License UpdatedNov 30, 2023 -
-
Inference Notebook For DiT: Scalable Diffusion Models with Transformer
Jupyter Notebook UpdatedOct 26, 2023 -
urdu-synth Public
High-quality synthetic text data generation for Urdu Text Recognition
-
-
AttSwinUNet For LV Segmentation in Echocardiograms (The dataset used is CAMUS)
-
A Java Program for Automated Job Scheduling and Resource Management Using Data Structures
Java Apache License 2.0 UpdatedJul 1, 2023 -
ChatGPT-API-Python Public
Building a Chatbot in Python using OpenAI's Official ChatGPT API
-