-
Nankai University, VCIP
- Tianjin
-
23:38
(UTC +08:00)
Lists (2)
Sort Name ascending (A-Z)
Stars
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
[ICLR 2025] Reconstructive Visual Instruction Tuning
Fully open reproduction of DeepSeek-R1
Investigating CoT Reasoning in Autoregressive Image Generation
PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Jittor version code for "CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation"
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Techniques for deep learning with satellite & aerial imagery
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
This repository contains the official implementation for the paper "RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark".
Train a 1B LLM with 1T tokens from scratch by personal
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines.
Offical implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Offical implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection"
Official repository of the paper "MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation"
Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
Towards Robust Evaluation for Geospatial Foundation Models