Starred repositories
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🔊 Text-Prompted Generative Audio Model
A simple screen-parsing tool towards a pure-vision-based GUI agent
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
10 Lessons to Get Started Building AI Agents
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
AirLLM 70B inference with a single 4GB GPU
This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…
CoTracker is a model for tracking any point (pixel) on a video.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Intelligent multilingual AI video dubbing/translation tool - Linly-Dubbing: "AI-empowered, language without borders"
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
All the demos from my YouTube channel: Yeyu Lab
[CVPR 2025] Code for Segment Any Motion in Videos
Code samples for the Goodreads datasets
Generate a video script, voice and a talking face completely with AI
Following emerging Large Language Model Operations (LLM Ops) best practices in the industry, you’ll learn all about the key technologies that enable Generative AI practitioners like you to leverage…
App for Chord Sequence Detection
This script dumps YouTube video comments to a CSV file from YouTube video links. The links can be supplied in a variable, a list, or a CSV file.
A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics
Deep Learning based reverse image search engine for your local computer.
📸 Simplifying photo management on desktop via mobile with a client-server setup. Upload, organize, and share photos seamlessly using local storage and AI for automatic tagging and grouping.
A good understanding of the Fourier Transform and its coding implementation using Python