-
HKUST(GZ) && @SYSU-STAR
Highlights
- Pro
Lists (24)
Sort Name ascending (A-Z)
Aerial Reconstruction
🤓Course
Data Structure
Depth Estimation
Hardware
Large Language Models
Manipulation
Multi-robot System
Object Detection/Segmentation
Object Goal Navigation
🤓Papar list
🤓Phd Survival Guide
Pre-training Models
Reconstruction
Robot Exploration
Robot Simulator
Scene Graph
Semantic Dataset
Semantic Mapping
🤓Tools
Trajectory Planner
Vision Language Action
Vision Language Models
Vision Language Navigation
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Python packaging and dependency management made easy
No fortress, purely open ground. OpenManus is Coming.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Python sample codes and textbook for robotics algorithms.
A generative world for general-purpose robotics & embodied AI learning.
Fully open reproduction of DeepSeek-R1
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
DeepSeek Coder: Let the Code Write Itself
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
Janus-Series: Unified Multimodal Understanding and Generation Models
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
An open source implementation of CLIP.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Hydra is a framework for elegantly configuring complex applications
ASCII generator (image to text, image to image, video to video)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation