- Chengdu
Stars
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
🎨 Fill in masked parts of images with FLUX.1-dev 🖌️
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
A python script used to general a batch of text-to-image photos using AI for tiktok videos. Uses the Flux-Dev model and images can also be used for general purpose reasons.
hughescr / ai-toolkit
Forked from ostris/ai-toolkitVarious AI scripts. Mostly Stable Diffusion stuff.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
SEED-Story: Multimodal Long Story Generation with Large Language Model
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
A browser extension for automating your browser by connecting blocks
All the world is a play, we are but actors in it.
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻中国独立开发者项目列表 -- 分享大家都在做什么
WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"
CoreNet: A library for training deep neural networks
kinivi / hand-gesture-recognition-mediapipe
Forked from Kazuhito00/hand-gesture-recognition-using-mediapipeThis is a sample program that recognizes hand signs and finger gestures with a simple MLP using the detected key points. Handpose is estimated using MediaPipe.
Face and iris detection for Python based on MediaPipe
Live2D Virtual Human for Chatting based on Unity
MediaPipeのPythonパッケージのサンプルです。2024/9/1時点でPython実装のある15機能について用意しています。
Nightly release of ControlNet 1.1
✨ Standard library for JavaScript and Node.js. ✨
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
High-performance In-browser LLM Inference Engine
GPT based autonomous agent designed to create personalized newspapers tailored to user preferences.
Code and documentation to train Stanford's Alpaca models, and generate the data.