From 089a329fc27856d7262fbc84b1a554a3fa9374a6 Mon Sep 17 00:00:00 2001 From: wyb7 Date: Tue, 26 Apr 2022 14:19:29 +0800 Subject: [PATCH] add several papers --- README.md | 13 ++++++++++++- 1 file changed, 12 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index a7d6ddc..fcf6940 100644 --- a/README.md +++ b/README.md @@ -48,8 +48,19 @@ Markdown format: #### Useful Toolkit - [Extracting CNN features from video frames by MXNet](https://github.com/xuchaoxi/video-cnn-feat) - ## Papers +### 2022 + +`[Dong et al. TCSVT22]` Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval. TCSVT, 2022. [[paper]]( https://arxiv.org/abs/2201.09168) [[code]](https://github.com/LiJiaBei-7/rivrl) + +`[Gabeur et al. WACV22]` Masking Modalities for Cross-modal Video Retrieval. WACV, 2022. [[paper]]( https://openaccess.thecvf.com/content/WACV2022/papers/Gabeur_Masking_Modalities_for_Cross-Modal_Video_Retrieval_WACV_2022_paper.pdf) + +`[Cao et al. AAAI22]` Visual Consensus Modeling for Video-Text Retrieval. AAAI, 2022. [[paper]]( https://www.aaai.org/AAAI22Papers/AAAI-12427.CaoS.pdf) [[code]](https://github.com/sqiangcao99/VCM) + +`[Wang et al. TMM]` Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval. IEEE Transactions on Multimedia, 2022. [[paper]](https://ieeexplore.ieee.org/abstract/document/9707645/metrics#metrics) [[code]](https://github.com/sqiangcao99/VCM) + +`[Song et al. TOMM22]` Adversarial Multi-Grained Embedding Network for Cross-Modal Text-Video Retrieval. TOMM, 2022. [[paper]]( https://dl.acm.org/doi/10.1145/3483381) + ### 2021 * `[Dong et al. TPAMI21]` Dual Encoding for Video Retrieval by Text. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021. [[paper]](https://arxiv.org/abs/2009.05381) [[code]](https://github.com/danieljf24/hybrid_space)