update ImageNet pretrained_model for VideoSwin_small
HydrogenSulfate committed Aug 30, 2022
1 parent 05cee06 commit 61ff2ba
Showing 2 changed files with 14 additions and 10 deletions.
12 changes: 7 additions & 5 deletions docs/en/model_zoo/recognition/videoswin.md
@@ -33,7 +33,9 @@ K400 data download and preparation please refer to [Kinetics-400 data preparatio
1. Download the ImageNet-pretrained image model [swin_base_patch4_window7_224.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams) to initialize the backbone, or fetch it with `wget`:

```bash
wget https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams
wget https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams # ImageNet pretrained model for VideoSwin_base

# wget https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_small_patch4_window7_224.pdparams # ImageNet pretrained model for VideoSwin_small
```

2. Open `configs/recognition/videoswin/videoswin_base_k400_videos.yaml` and fill in the path to the downloaded weights after `pretrained:`
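
The two steps above can be sketched as a pair of small shell helpers. This is only an illustration under stated assumptions: `pretrained_url` and `set_pretrained` are hypothetical names, not part of PaddleVideo, and the sketch assumes the config holds the `pretrained:` key on a single line and contains only one such key. The URLs follow the pattern used in the results table of this document.

```bash
# Hypothetical helpers (not part of PaddleVideo); a sketch, not an official tool.

# Print the ImageNet-pretrained weight URL for a VideoSwin variant: base or small.
pretrained_url() {
  case "$1" in
    base|small)
      echo "https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_${1}_patch4_window7_224.pdparams"
      ;;
    *)
      echo "unknown variant: $1" >&2
      return 1
      ;;
  esac
}

# Rewrite the value after `pretrained:` in a YAML config, keeping the indentation.
# Assumes a single-line key and a weights path that contains no `|` or `&`.
# $1 = config file, $2 = path to the downloaded weights.
set_pretrained() {
  _tmp="$(mktemp)" || return 1
  sed -E "s|^([[:space:]]*pretrained:).*|\1 \"$2\"|" "$1" > "$_tmp" \
    && cat "$_tmp" > "$1"
  _status=$?
  rm -f "$_tmp"
  return "$_status"
}

# Example usage (commented out so that sourcing this file touches nothing):
# wget "$(pretrained_url base)"
# set_pretrained configs/recognition/videoswin/videoswin_base_k400_videos.yaml \
#   ./swin_base_patch4_window7_224.pdparams
```

Writing through a temporary file avoids the differences between GNU and BSD `sed -i`, and copying the result back with `cat` keeps the original file's permissions.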
@@ -84,10 +86,10 @@ K400 data download and preparation please refer to [Kinetics-400 data preparatio

With the following test configuration, the metrics on the Kinetics-400 validation set are:

| backbone | Sampling method | num_seg | target_size | Top-1 | checkpoints |
| :--------------------: | :-------------: | :-----: | :---------: | :---- | :------------------------------------------------------------------------------------------------------------------------: |
| Swin-Transformer_base | UniformCrop | 32 | 224 | 82.40 | [SwinTransformer_k400_base.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_base_k400.pdparams) |
| Swin-Transformer_small | UniformCrop | 32 | 224 | 80.18 | [SwinTransformer_k400_small.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_small_k400.pdparams) |
| backbone | Sampling method | num_seg | target_size | Top-1 | checkpoints | pretrain model |
| :--------------------: | :-------------: | :-----: | :---------: | :---- | :------------------------------------------------------------------------------------------------------------------------: | :----: |
| Swin-Transformer_base | UniformCrop | 32 | 224 | 82.40 | [SwinTransformer_k400_base.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_base_k400.pdparams) | [swin_base_patch4_window7_224.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams) |
| Swin-Transformer_small | UniformCrop | 32 | 224 | 80.18 | [SwinTransformer_k400_small.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_small_k400.pdparams) | [swin_small_patch4_window7_224.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_small_patch4_window7_224.pdparams) |

## Inference

12 changes: 7 additions & 5 deletions docs/zh-CN/model_zoo/recognition/videoswin.md
@@ -35,7 +35,9 @@ For K400 data download and preparation, please refer to [Kinetics-400 data preparation](../../dataset/k400.
1. Download the ImageNet-pretrained image model [swin_base_patch4_window7_224.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams) to initialize the backbone, or fetch it with `wget`:

```bash
wget https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams
wget https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams # ImageNet pretrained model for VideoSwin_base

# wget https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_small_patch4_window7_224.pdparams # ImageNet pretrained model for VideoSwin_small
```

2. Open `configs/recognition/videoswin/videoswin_base_k400_videos.yaml` and fill in the path to the downloaded weights after `pretrained:`
@@ -86,10 +88,10 @@ For K400 data download and preparation, please refer to [Kinetics-400 data preparation](../../dataset/k400.

With the following test configuration, the metrics on the Kinetics-400 validation set are:

| backbone | Sampling method | num_seg | target_size | Top-1 | checkpoints |
| :--------------------: | :-------------: | :-----: | :---------: | :---- | :------------------------------------------------------------------------------------------------------------------------: |
| Swin-Transformer_base | UniformCrop | 32 | 224 | 82.40 | [SwinTransformer_k400_base.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_base_k400.pdparams) |
| Swin-Transformer_small | UniformCrop | 32 | 224 | 80.18 | [SwinTransformer_k400_small.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_small_k400.pdparams) |
| backbone | Sampling method | num_seg | target_size | Top-1 | checkpoints | pretrain model |
| :--------------------: | :-------------: | :-----: | :---------: | :---- | :------------------------------------------------------------------------------------------------------------------------: | :----: |
| Swin-Transformer_base | UniformCrop | 32 | 224 | 82.40 | [SwinTransformer_k400_base.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_base_k400.pdparams) | [swin_base_patch4_window7_224.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_base_patch4_window7_224.pdparams) |
| Swin-Transformer_small | UniformCrop | 32 | 224 | 80.18 | [SwinTransformer_k400_small.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/VideoSwin_small_k400.pdparams) | [swin_small_patch4_window7_224.pdparams](https://videotag.bj.bcebos.com/PaddleVideo-release2.2/swin_small_patch4_window7_224.pdparams) |

## Inference
