Hi. First of all, thanks to everyone who participated in this research. Very thorough analysis in the paper.
As reported by others in issue 3, PLKSR seems to be unstable for real-world SISR. GAN training is notoriously unstable and causes issues even at lower learning rates.
So in an attempt to make it more stable, I have released a simple modification to PLKSR, named RealPLKSR:
- Normalization was missing, as pointed out by @dslisleedh. From my understanding, layer norm was avoided because of its impact on inference latency. I tested multiple methods, including Instance Norm, Layer Norm, Batch Norm, Group Norm and RMSNorm. Because we usually train at low batch sizes (<16), Group Normalization performed best in my experiments out of those tested, and its impact on inference latency was minimal (~5% at most). I also tested the number of groups: increasing it leads to better regularization but hurts convergence speed. A value of `4` offered a good balance across all tests.
- Replaced GELU with Mish in the channel mixer. Mish showed better, more stable convergence than GELU.
- Added `nn.Dropout2d` to the last conv, as proposed in "Reflash Dropout in Image Super-Resolution". Although not ideal, dropout is a simple method to increase generalization in real-world SISR. (A sketch of all three changes follows this list.)
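To make the three changes concrete, here is a minimal PyTorch sketch. This is not the released implementation: the block layout and the names and defaults such as `RealPLKSRSketch`, `PartialLargeKernelConv`, `split_ratio`, `dim` and `n_blocks` are illustrative assumptions; only `nn.GroupNorm` with 4 groups, `nn.Mish`, and `nn.Dropout2d` correspond directly to the changes above.

```python
# Minimal PyTorch sketch of the three RealPLKSR changes listed above.
# Assumptions: block layout, `dim`, `n_blocks`, and `split_ratio` defaults
# are illustrative, not the released implementation.
import torch
import torch.nn as nn


class PartialLargeKernelConv(nn.Module):
    """Large-kernel conv applied to only the first k channels (the PLKSR idea)."""

    def __init__(self, dim: int, kernel_size: int = 17, split_ratio: float = 0.25):
        super().__init__()
        self.k = int(dim * split_ratio)
        self.conv = nn.Conv2d(self.k, self.k, kernel_size, padding=kernel_size // 2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x1, x2 = x[:, : self.k], x[:, self.k :]
        return torch.cat([self.conv(x1), x2], dim=1)


class RealPLKSRBlockSketch(nn.Module):
    def __init__(self, dim: int = 64):
        super().__init__()
        # Change 2: channel mixer with Mish instead of GELU.
        self.mixer = nn.Sequential(
            nn.Conv2d(dim, dim * 2, 1),
            nn.Mish(),
            nn.Conv2d(dim * 2, dim, 1),
        )
        self.plk = PartialLargeKernelConv(dim)
        # Change 1: GroupNorm with 4 groups; cheap at inference and,
        # unlike BatchNorm, independent of batch size.
        self.norm = nn.GroupNorm(num_groups=4, num_channels=dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.norm(self.plk(self.mixer(x)))


class RealPLKSRSketch(nn.Module):
    def __init__(self, dim: int = 64, n_blocks: int = 28, scale: int = 4, drop: float = 0.1):
        super().__init__()
        self.head = nn.Conv2d(3, dim, 3, padding=1)
        self.body = nn.Sequential(*[RealPLKSRBlockSketch(dim) for _ in range(n_blocks)])
        # Change 3: Dropout2d at the last conv ("Reflash Dropout");
        # active only in train() mode, so inference latency is unchanged.
        self.dropout = nn.Dropout2d(drop)
        self.tail = nn.Sequential(
            nn.Conv2d(dim, 3 * scale * scale, 3, padding=1),
            nn.PixelShuffle(scale),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.tail(self.dropout(self.body(self.head(x))))
```

For example, `RealPLKSRSketch(scale=4)(torch.rand(1, 3, 64, 64))` produces a `(1, 3, 256, 256)` output. Since `nn.Dropout2d` is a no-op in `eval()` mode, it adds no inference cost.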
Pretrained models:
| scale | download |
|---|---|
| 4x GAN | GDrive |
| 4x | GDrive |
| 2x | GDrive |
Training can be done on neosr using either of the provided configurations: a paired dataset, or the Real-ESRGAN degradation pipeline (a toy sketch of such a degradation chain is shown below).
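For intuition, here is a toy, single-pass degradation sketch in the spirit of the Real-ESRGAN pipeline. The actual pipeline is a randomized second-order chain; the kernel size, noise level, and JPEG quality below are all illustrative assumptions, as is the `degrade` helper itself.

```python
# Toy, single-pass degradation sketch in the spirit of the Real-ESRGAN
# pipeline. Purely illustrative: the real pipeline is a randomized
# second-order chain, and every parameter below is an assumption.
import io

import torch
import torch.nn.functional as F
from PIL import Image
from torchvision.transforms.functional import gaussian_blur, to_pil_image, to_tensor


def degrade(hr: torch.Tensor, scale: int = 4) -> torch.Tensor:
    """hr: (3, H, W) float tensor in [0, 1] -> degraded LR of shape (3, H/scale, W/scale)."""
    x = gaussian_blur(hr, kernel_size=7, sigma=1.2)              # blur
    x = F.interpolate(x[None], scale_factor=1 / scale,
                      mode="bicubic", align_corners=False)[0]    # downscale
    x = (x + 0.02 * torch.randn_like(x)).clamp(0.0, 1.0)         # gaussian noise
    buf = io.BytesIO()                                           # JPEG round-trip
    to_pil_image(x).save(buf, format="JPEG", quality=60)
    buf.seek(0)
    return to_tensor(Image.open(buf).convert("RGB"))
```

Pairing such degraded LR crops with their clean HR sources is what exposes the model to real-world artifacts during training.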
Credits are acknowledged inside the code, which is released under the same license as PLKSR (MIT). I hope this helps PLKSR see wider use under real-world degradations. It's a really impressive network. Thanks again for your research 👍