Skip to content
View Zager-Zhang's full-sized avatar
🗽
Power
🗽
Power

Highlights

  • Pro

Organizations

@SYSU-STAR

Block or report Zager-Zhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
354 results for source starred repositories
Clear filter
33 Updated Mar 11, 2025

Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI

Python 693 64 Updated May 22, 2024
Python 7 Updated Mar 7, 2025

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

Python 176 Updated Mar 11, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 10,430 1,092 Updated Mar 12, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 30,789 4,739 Updated Mar 12, 2025

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 202 12 Updated Mar 7, 2025

[ICRA'25] One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation

Python 18 3 Updated Mar 5, 2025

Protocol Buffers - Google's data interchange format

C++ 66,829 15,633 Updated Mar 12, 2025

The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"

Python 409 21 Updated Mar 10, 2025

[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Python 647 27 Updated Mar 11, 2025

PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation

Python 94 10 Updated Nov 11, 2024

Primitive-Swarm: An Ultra-lightweight and Scalable Planner for Large-scale Aerial Swarms

C++ 66 4 Updated Sep 2, 2024

[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors

Python 1,317 98 Updated Mar 11, 2025
Python 31 6 Updated Feb 28, 2025

Integrate the DeepSeek API into popular softwares

28,045 3,019 Updated Mar 11, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,270 790 Updated Mar 1, 2025

[IEEE RA-L'25] NavRL: Learning Safe Flight in Dynamic Environments (NVIDIA Isaac/Python/ROS1/ROS2)

C++ 161 7 Updated Mar 10, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

18,831 1,816 Updated Sep 19, 2024

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Python 1,535 480 Updated Mar 10, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,495 1,661 Updated Feb 26, 2025

Open-sourced code for "HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit".

C++ 151 10 Updated Feb 25, 2025

A most Frontend Collection and survey of vision-language model papers, and models GitHub repository

75 7 Updated Mar 12, 2025

[RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation

Python 46 2 Updated Jan 1, 2025

[IJRR2024] The official repository for the WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Natural Environments

Python 73 3 Updated Oct 13, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 15,193 1,754 Updated Mar 2, 2025

深度学习经典、新论文逐段精读

29,426 2,596 Updated Nov 17, 2024

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

461 19 Updated Feb 13, 2025

3D高斯论文,持续更新,欢迎交流讨论。

Python 1,639 61 Updated Mar 10, 2025
Next