Stars
A latent text-to-image diffusion model
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🔊 Text-Prompted Generative Audio Model
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A guidance language for controlling large language models.
LAVIS - A One-stop Library for Language-Vision Intelligence
Examples and guides for using the Gemini API
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Anthropic's educational courses
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Zero-Shot Speech Editing and Text-to-Speech in the Wild
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
The Udacity open source self-driving car project
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…
A simple screen-parsing tool toward a pure-vision-based GUI agent
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
An Open Source text-to-speech system built by inverting Whisper.
Notebooks using the Hugging Face libraries 🤗
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
Prompt Engineering | Prompt Versioning | Use GPT or other prompt-based models to get structured output. Join our Discord for prompt engineering, LLMs, and other recent research
Solve puzzles. Improve your PyTorch.
Optimized Stable Diffusion modified to run on lower GPU VRAM
🚀 Curated collection of amazing Python scripts, from basic to advanced, including task-automation scripts.
This is a book for getting started with Phi-3, a family of open-source AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…
Easily compute CLIP embeddings and build a CLIP retrieval system with them
SDK for interacting with stability.ai APIs (e.g. Stable Diffusion inference)
Metric depth estimation from a single image