Skip to content
/ DeepSVC Public

Official codes for "DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision"

Notifications You must be signed in to change notification settings

LHB116/DeepSVC

Repository files navigation

DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision

If our open source codes are helpful for your research, please cite our paper:

@inproceedings{lin2023deepsvc,
  title={DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision},
  author={Lin, Hongbin and Chen, Bolin and Zhang, Zhichen and Lin, Jielian and Wang, Xu and Zhao, Tiesong},
  booktitle={Proceedings of the 31st ACM International Conference on Multimedia},
  pages={9205--9214},
  year={2023}
}

Dependency

  • see env.txt

Test codes

Semantic Layer

Structure and Texture Layer

  • Run test_video.py, please change data path in the file.

Training your own models

Semantic Layer

Structure and Texture Layer,training the PSNR/MS-SSIM models

  • Download the training data. We train the models on the Vimeo90k dataset (Download link).

  • Run main.py to train the PSNR/MS-SSIM models. We first pretrian model with key frame coded with bpg and lambda=2048. Then load the pretrianed weights, train with key frame coded with key frame coded with AI codecs (in image_model.py). More detail see main.py.

About

Official codes for "DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages