Stars
Create your self-hosted, open-source Operator model.
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which aims to generate realistic composite image.
Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
One-click training of your own GPT. Training a GPT has never been easier. / 训练一个GPT原来可以这么简单?
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Align Anything: Training All-modality Model with Feedback
Manuscript is a revolutionary blockchain data streaming framework. With Manuscript, you can seamlessly integrate on-chain and off-chain data into target data storage for unrestricted querying and a…
A React-based virtual avatar component for real-time gameplay analysis and emotional support. Integrate with screen capture to provide insights and companionship through advanced LLM integration.
以开源为核心的IDaas/IAM平台,用于管理企业内员工账号、权限、身份认证、应用访问,帮助整合部署在本地或云端的内部办公系统、业务系统及三方 SaaS 系统的所有身份,实现一个账号打通所有应用的服务。
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Creating a simple Go module for Backend Teams' DevOps Workflow
It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionality
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024 (Distinguished Paper Award)
MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs.
The first open autoregressive foundational video AI model.
Using Wasserstein Generative Adversarial Network to fool intrusion detection systems (IDS) into believing that malicious traffic is normal traffic.