wolf943134497

davci wolf943134497

32 followers · 509 following

Lists (1)

Sort

🚀 My stack

8 repositories

Starred repositories

OpenDriveLab / BeTop

[NeurIPS 2024] Behavioral Topology (BeTop), a multi-agent behavior formulation for interactive motion prediction and planning

Python 93 5 Updated Nov 12, 2024

megvii-research / FQ-ViT

[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer

Python 321 48 Updated Apr 11, 2023

RodeWayne / SR-FoT

Official code for paper "SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning tasks"

2 Updated Jan 22, 2025

Ucas-HaoranWei / Slow-Perception

Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step

Python 100 4 Updated Jan 26, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 1,183 62 Updated Feb 4, 2025

cxlz / Int2Planner

4 Updated Dec 13, 2024

gaoyinfeng / PIWM

(T-IV) Dream to Drive with Predictive Individual World Model

Python 16 Updated Jan 14, 2025

gregor-ge / Centurio

Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model

Python 2 Updated Feb 4, 2025

NVlabs / TokenBench

A Video Tokenizer Evaluation Dataset

Python 94 6 Updated Jan 13, 2025

DEEP-PolyU / Awesome-GraphRAG

A curated list of resources on graph-based retrieval-augmented generation (GraphRAG) for customized large language models.

440 78 Updated Feb 5, 2025

Vchitect / Vchitect-2.0

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 861 18 Updated Jan 26, 2025

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,534 68 Updated Aug 15, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,446 1,145 Updated Feb 3, 2025

donghao51 / Awesome-Multimodal-Adaptation

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

19 Updated Feb 4, 2025

penfever / wildchat-50m

Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.

Jupyter Notebook 21 1 Updated Jan 31, 2025

HxLyn3 / ADMPO

Any-step Dynamics Model for Policy Optimization

Python 40 3 Updated Feb 1, 2025

LMD0311 / HERMES

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

61 1 Updated Jan 27, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 558 38 Updated Feb 5, 2025

Physical-Intelligence / openpi

Python 841 55 Updated Feb 4, 2025

showlab / MakeAnything

Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation"

Python 25 1 Updated Feb 5, 2025

wayveai / LingoQA

[ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"

Python 157 6 Updated Sep 26, 2024

cumulo-autumn / StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,915 729 Updated Dec 4, 2024

slamcore / semlaps

Code and data for "SeMLaPS: Real-time Semantic Mapping with Latent Prior Networks and Quasi-Planar Segmentation"

Python 13 Updated May 24, 2024

umd-huang-lab / tracevla

Python 16 1 Updated Jan 8, 2025

zli12321 / VLM-surveys

A most Frontend Collection and survey of vision-language model papers, and models GitHub repository

27 3 Updated Feb 3, 2025

zhiheLu / Ensemble_VLM

Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"

Python 23 2 Updated Feb 2, 2025

jackfsuia / nanoRLHF

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 18 2 Updated Feb 5, 2025

ErlichLiu / DeepClaude

DeepSeek r1 and Claude 3.5 Sonnet achieve the best combination, fully unleashing the power of the strongest models. Supports OpenAI streaming output and can run on your favorite ChatBox!

Python 319 84 Updated Feb 5, 2025

TEGRAXD / mozha-r1

Mozha-R1 is an AI-powered application utilizing DeepSeek R1 Distill Model. This project designed to run locally on Windows and Linux (AMD64 & ARM64). This application provides an API interface.

Python 1 Updated Feb 3, 2025

hardikjp7 / DeepSeek-R1-RAG-for-Document-QA

🐋 DeepSeek-R1: Retrieval-Augmented Generation for Document Q&A 📄

Python 6 1 Updated Feb 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly