Commit

add 11 Papers
amusi committed Mar 23, 2024
1 parent 5238289 commit 8f8992a
Showing 1 changed file with 71 additions and 3 deletions.
74 changes: 71 additions & 3 deletions README.md
@@ -68,6 +68,7 @@ CVPR 2024 decisions are now available on OpenReview!
- [Medical Image](#Medical-Image)
- [Image Generation](#Image-Generation)
- [Video Generation](#Video-Generation)
- [3D Generation](#3D-Generation)
- [Video Understanding](#Video-Understanding)
- [Action Detection](#Action-Detection)
- [Text Detection](#Text-Detection)
@@ -239,6 +240,11 @@ CVPR 2024 decisions are now available on OpenReview!

# NeRF

**PIE-NeRF🍕: Physics-based Interactive Elastodynamics with NeRF**

- Paper: https://arxiv.org/abs/2311.13099
- Code: https://github.com/FYTalon/pienerf/

<a name="DETR"></a>

# DETR
@@ -284,7 +290,12 @@ CVPR 2024 decisions are now available on OpenReview!
**Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding**

- Paper: https://arxiv.org/abs/2311.08046
- Code: https://github.com/PKU-YuanGroup/Chat-UniVi

**OneLLM: One Framework to Align All Modalities with Language**

- Paper: https://arxiv.org/abs/2312.03700
- Code: https://github.com/csuhan/OneLLM

<a name="NAS"></a>

@@ -294,6 +305,11 @@ CVPR 2024 decisions are now available on OpenReview!

# ReID (Re-Identification)

**Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification**

- Paper: https://arxiv.org/abs/2403.10254
- Code: https://github.com/924973292/EDITOR

<a name="Diffusion"></a>

# Diffusion Models
@@ -313,7 +329,24 @@ CVPR 2024 decisions are now available on OpenReview!
**DeepCache: Accelerating Diffusion Models for Free**

- Paper: https://arxiv.org/abs/2312.00858
- Code: https://github.com/horseee/DeepCache

**DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations**

- Homepage: https://tianhao-qi.github.io/DEADiff/
- Paper: https://arxiv.org/abs/2403.06951
- Code: https://github.com/Tianhao-Qi/DEADiff_code

**SVGDreamer: Text Guided SVG Generation with Diffusion Model**

- Paper: https://arxiv.org/abs/2312.16476
- Homepage: https://ximinng.github.io/SVGDreamer-project/

**InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model**

- Paper: https://arxiv.org/abs/2312.05849
- Code: https://github.com/jiuntian/interactdiffusion

<a name="Vision-Transformer"></a>

@@ -333,6 +366,11 @@ CVPR 2024 decisions are now available on OpenReview!

# Vision-Language

**PromptKD: Unsupervised Prompt Distillation for Vision-Language Models**

- Paper: https://arxiv.org/abs/2403.02781
- Code: https://github.com/zhengli97/PromptKD

<a name="Object-Detection"></a>

# Object Detection
@@ -426,6 +464,11 @@ CVPR 2024 decisions are now available on OpenReview!
- Paper: https://arxiv.org/abs/2403.10145
- Code: https://github.com/AIR-THU/DAIR-RCooper

**Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving**

- Paper: https://arxiv.org/abs/2403.07535
- Code: https://github.com/Junda24/AFNet

<a name="3D-Point-Cloud"></a>

# 3D Point Cloud
@@ -531,7 +574,17 @@ CVPR 2024 decisions are now available on OpenReview!
**Multi-Instance Generation Controller for Text-to-Image Synthesis**

- Paper: https://arxiv.org/abs/2402.05408
- Code: https://github.com/limuloo/migc

**SVGDreamer: Text Guided SVG Generation with Diffusion Model**

- Paper: https://arxiv.org/abs/2312.16476
- Homepage: https://ximinng.github.io/SVGDreamer-project/

**InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model**

- Paper: https://arxiv.org/abs/2312.05849
- Code: https://github.com/jiuntian/interactdiffusion

<a name="Video-Generation"></a>

@@ -549,6 +602,21 @@ CVPR 2024 decisions are now available on OpenReview!
- Paper: https://arxiv.org/abs/2311.17982
- Code: https://github.com/Vchitect/VBench

<a name="3D-Generation"></a>

# 3D Generation

**CityDreamer: Compositional Generative Model of Unbounded 3D Cities**

- Homepage: https://haozhexie.com/project/city-dreamer/
- Paper: https://arxiv.org/abs/2309.00610
- Code: https://github.com/hzxie/city-dreamer

**LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching**

- Paper: https://arxiv.org/abs/2311.11284
- Code: https://github.com/EnVision-Research/LucidDreamer

<a name="Video-Understanding"></a>

# Video Understanding
