Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add rtdetr final #8094

Merged
merged 4 commits into from
Apr 18, 2023
Merged

add rtdetr final #8094

merged 4 commits into from
Apr 18, 2023

Conversation

lyuwenyu
Copy link
Collaborator

  • add pp rt-detr

ghostxsl and others added 2 commits April 13, 2023 16:09
add yoloe reader

alter reference points to unsigmoid

add 3_0_6_neck_256 base

fix amp training

alter usage in paddle-inference

update new base

alter ext_ops

add hybrid encoder
@paddle-bot
Copy link

paddle-bot bot commented Apr 18, 2023

Thanks for your contribution!

@lyuwenyu lyuwenyu changed the title Rtdetr final l add rtdetr final Apr 18, 2023
configs/rtdetr/README.md Outdated Show resolved Hide resolved
value_spatial_shapes,
value_level_start_index,
value_mask=None):
"""
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

说明下和基类的差异



@register
class PPDETRTransformer(nn.Layer):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

改为RTDETR?

DETR:
backbone: ResNet
neck: HybridEncoder
transformer: PPDETRTransformer
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PPDETR->RTDETR

# DETRs Beat YOLOs on Real-time Object Detection

## Introduction
We propose a **R**eal-**T**ime **DE**tection **TR**ansformer (RT-DETR), the first real-time end-to-end object detector to our best knowledge. Specifically, we design an efficient hybrid encoder to efficiently process multi-scale features by decoupling the intra-scale interaction and cross-scale fusion, and propose IoU-aware query selection to improve the initialization of object queries. In addition, our proposed detector supports flexibly adjustment of the inference speed by using different decoder layers without the need for retraining, which facilitates the practical application of real-time object detectors. Our RT-DETR-L achieves 53.0% AP on COCO val2017 and 114 FPS on T4 GPU, while RT-DETR-X achieves 54.8% AP and 74 FPS, outperforming all YOLO detectors of the same scale in both speed and accuracy. Furthermore, our RT-DETR-R50 achieves 53.1% AP and 108 FPS, outperforming DINO-Deformable-DETR-R50 by 2.2% AP in accuracy and by about 21 times in FPS.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

可以加下arxiv链接,test集数据也加下吧

@lyuwenyu lyuwenyu requested a review from jerrywgz April 18, 2023 07:49
@lyuwenyu lyuwenyu merged commit 5d1f888 into PaddlePaddle:develop Apr 18, 2023
@qianyizhang
Copy link

checkpoint、模型能转换成TRT engine么

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants