Skip to content
View MuhammadBilal848's full-sized avatar
😼
Motivated
😼
Motivated

Highlights

  • Pro

Block or report MuhammadBilal848

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code and Slides

Jupyter Notebook 1,614 498 Updated Mar 16, 2025

UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)

Python 50 10 Updated Oct 8, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,723 2,810 Updated Aug 15, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,956 246 Updated Apr 8, 2025

πŸ’‘ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Python 10,702 678 Updated Apr 8, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 7,901 505 Updated Apr 2, 2025

πŸ€— smolagents: a barebones library for agents that think in python code.

Python 16,772 1,470 Updated Apr 10, 2025

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,885 194 Updated Nov 14, 2024

A diffusers pipeline for zero shot stylised portrait creation

Python 481 25 Updated Sep 25, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 14,901 1,623 Updated Apr 10, 2025

Document to Markdown OCR library with Llama 3.2 vision

TypeScript 2,255 218 Updated Jan 20, 2025

Face Analysis: Detection, Age Gender Estimation & Recognition

Python 310 54 Updated Jun 12, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,821 786 Updated Aug 12, 2024

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 7,072 605 Updated Feb 29, 2024

πŸ€— AutoTrain Advanced

Python 4,357 558 Updated Jan 21, 2025

Go ahead and axolotl questions

Python 9,059 989 Updated Apr 11, 2025

Animation engine for explanatory math videos

Python 76,682 6,649 Updated Mar 20, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,049 2,435 Updated Apr 10, 2025

πŸ… Collection of Kaggle Solutions and Ideas πŸ…

HTML 5,140 1,923 Updated Apr 5, 2025

The Microsoft Bot Framework provides what you need to build and connect intelligent bots that interact naturally wherever your users are talking, from text/sms to Skype, Slack, Office 365 mail and …

Python 749 296 Updated Apr 7, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 7,898 752 Updated Feb 27, 2025

πŸ”Š Text-Prompted Generative Audio Model

Jupyter Notebook 37,417 4,429 Updated Aug 19, 2024

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 36,559 6,064 Updated Jul 26, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 30,446 3,789 Updated Aug 6, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 79,750 9,585 Updated Jan 4, 2025
Jupyter Notebook 483 60 Updated Aug 23, 2023

πŸš€πŸ€– Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 38,938 3,452 Updated Apr 10, 2025

Faster Whisper transcription with CTranslate2

Python 15,331 1,289 Updated Mar 20, 2025
Next