Skip to content
View larsoncs's full-sized avatar

Block or report larsoncs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fully open reproduction of DeepSeek-R1

Python 18,529 1,560 Updated Feb 10, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

475 30 Updated Jan 28, 2025

A visual form designer/generator base on Vue.js, make form development simple and efficient.(基于Vue3的可视化表单设计器,拖拽式操作让你快速构建一个表单, 让表单开发简单而高效。)

Vue 418 79 Updated May 20, 2024

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

TypeScript 75,524 6,944 Updated Feb 11, 2025

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)

C++ 7,015 1,613 Updated Jan 19, 2025

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-…

Python 2,229 477 Updated Oct 6, 2021

TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning

Python 218 16 Updated Dec 11, 2024

UGround: Universal GUI Visual Grounding for GUI Agents

Python 158 10 Updated Jan 30, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 200 11 Updated Jan 17, 2025

Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Jupyter Notebook 925 52 Updated Feb 11, 2025

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,338 426 Updated May 29, 2024

An open-sourced end-to-end VLM-based GUI Agent

Python 697 53 Updated Jan 27, 2025

AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification.

182 15 Updated Jul 19, 2024

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 268 18 Updated Jan 2, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 22,269 2,213 Updated Feb 11, 2025

Database diagrams editor that allows you to visualize and design your DB with a single query.

TypeScript 13,428 656 Updated Feb 10, 2025

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 68,186 8,359 Updated Feb 4, 2025

Penpot: The open-source design tool for design and code collaboration

Clojure 36,328 1,840 Updated Feb 11, 2025

The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".

Python 131 5 Updated Dec 14, 2024

Text2Diagram is an AI based diagramming tool that uses Natural language text to create diagrams.

TypeScript 296 29 Updated Sep 27, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 92,068 8,796 Updated Feb 10, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,129 270 Updated Nov 5, 2024

Official code, datasets and checkpoints for "Timer: Generative Pre-trained Transformers Are Large Time Series Models" (ICML 2024)

Python 488 53 Updated Jan 11, 2025

[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo

Python 710 38 Updated Oct 4, 2024

Building a modern alternative to Salesforce, powered by the community.

TypeScript 25,198 2,669 Updated Feb 10, 2025

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 866 80 Updated Dec 24, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,632 215 Updated Dec 5, 2024

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,293 2,292 Updated Jun 26, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,575 930 Updated Aug 21, 2024

Multilingual Voice Understanding Model

Python 4,335 382 Updated Jan 8, 2025
Next