CUC-MIPG · earthseaon · Sep 20, 2024 · Sep 21, 2024 · Sep 21, 2024 · Sep 21, 2024
diff --git a/.gitattributes b/.gitattributes
@@ -0,0 +1 @@
+*.ckpt filter=lfs diff=lfs merge=lfs -text
diff --git a/Finetuned-VQGAN/README.md b/Finetuned-VQGAN/README.md
@@ -0,0 +1,65 @@
+# Extreme Image Compression using Fine-tuned VQGAN Models (DCC 2024)
+This repository is the official implementation of VQGAN-Comporession.
+
+[Qi Mao](https://sites.google.com/view/qi-mao/), [Tinghan Yang](), [Yinuo Zhang](), [Zijian Wang](), [Meng Wang](https://scholar.google.com/citations?user=6vnhEIgAAAAJ&hl=zh-TW&oi=sra), [Shiqi Wang](), [Libiao Jin](), [Siwei Ma](https://scholar.google.com/citations?user=y3YqlaUAAAAJ&hl=zh-TW&oi=sra)
+
+<p align="center">
+<img src="assets/Framework.png"width="1060px"/>  
+<br>
+<em> Figure:  Overview of the proposed VQGAN-based image coding framework.</em>
+</p>
+
+## Acknowledgement
+The framework is based on [VQGAN](https://github.com/CompVis/taming-transformers). We modify taming.modules.vqgan and add train.py and reconstruction.py for usage.
+
+## Introduction
+In this work, we propose a simple yet effective coding framework by introducing vector quantization (VQ)--based generative models into the image compression domain.
+
+[[Paper](https://arxiv.org/abs/2307.08265)] 
+
+<p align="center">
+<img src="assets/sub.jpg"width="1060px"/>  
+<br>
+<em> Figure: Our Results. </em> </p>
+
+## Installation
+Our method is tested using cuda11.3 on a single A100. The preparation work mainly includes configuring the environment.
+```bash
+conda env create -f environment.yaml
+conda activate vqgan
+```
+### Reconstruction
+If you want reconstruction imge with pretrained model, please download [Google driver](https://drive.google.com/drive/folders/14I_RnQ3cA6etdKGPVMFdmmVgMtBTB5rn?usp=sharing) from [Baidu cloud](https://pan.baidu.com/s/1zBeWKh6vgof13iTBwtA65A?pwd=kfl7) (code: kfl7) and put in `logs/`
+
+Some evaluation dataset can be downloaded from 
+[kodak dataset](http://r0k.us/graphics/kodak/) and [CLIC](http://challenge.compression.cc/tasks/) and put in `data/`
+```bash
+python reconstruction.py --logs_path $model_dir --dataset $dataset_name
+```
+An example: After evaluation on the Kodak dataset, fine tune the pre trained model of [vqgan_imagenet_f16_16384](https://heibox.uni-heidelberg.de/d/a7530b09fed84f80a887/) to a codebook size of 1024.
+```bash
+python reconstruction.py --logs_path logs/kmeans_tune/16384_kmeans_1024_epoch/epoch1/ --dataset Kodak/
+```
+The result is saved at `rec/Kodak/`
+
+### Train
+Prepare the dataset according to the instructions of the original [VQGAN](https://github.com/CompVis/taming-transformers?tab=readme-ov-file#data-preparation) project, but our training involves freezing the codec and only updating the codebook for fine-tuning. You can use the following code to achieve this:
+```bash
+python train.py --base configs/custom_vqgan.yaml -t True --gpus 0, --is_frozen
+```
+The fine-tune model is saved at `logs/`
+
+## Citation
+```
+@inproceedings{mao2024extreme,
+  title={Extreme image compression using fine-tuned vqgans},
+  author={Mao, Qi and Yang, Tinghan and Zhang, Yinuo and Wang, Zijian and Wang, Meng and Wang, Shiqi and Jin, Libiao and Ma, Siwei},
+  booktitle={2024 Data Compression Conference (DCC)},
+  pages={203--212},
+  year={2024},
+  organization={IEEE}
+}
+``` 
+
+## Contact
+Feel free to contact us if there is any question. (Qi Mao, qimao@cuc.edu.cn; Tinghan Yang, yangtinghan@cuc.edu.cn)
diff --git a/assets/Framework.png → Finetuned-VQGAN/assets/Framework.png b/assets/Framework.png → Finetuned-VQGAN/assets/Framework.png
diff --git a/assets/R-D.png → Finetuned-VQGAN/assets/R-D.png b/assets/R-D.png → Finetuned-VQGAN/assets/R-D.png
diff --git a/assets/sub.jpg → Finetuned-VQGAN/assets/sub.jpg b/assets/sub.jpg → Finetuned-VQGAN/assets/sub.jpg
diff --git a/bpp_use_torchac.py → Finetuned-VQGAN/bpp_use_torchac.py b/bpp_use_torchac.py → Finetuned-VQGAN/bpp_use_torchac.py
diff --git a/environment.yaml → Finetuned-VQGAN/environment.yaml b/environment.yaml → Finetuned-VQGAN/environment.yaml
diff --git a/main.py → Finetuned-VQGAN/main.py b/main.py → Finetuned-VQGAN/main.py
diff --git a/quality.py → Finetuned-VQGAN/quality.py b/quality.py → Finetuned-VQGAN/quality.py
diff --git a/reconstruction.py → Finetuned-VQGAN/reconstruction.py b/reconstruction.py → Finetuned-VQGAN/reconstruction.py
@@ -161,40 +161,55 @@ def compute_bpp_zip(file_path, model, z):
 def parse_args():
     parser = argparse.ArgumentParser('', add_help=False)
     parser.add_argument('--logs_path', default='logs/imagenet_f16_16384/', type=str)
-    parser.add_argument('--dataset', default='Kodak/', type=str)
+    parser.add_argument('--dataset', default='/home/CodingG/yth/dataset/Kodak/', type=str)
     return parser.parse_args()
 
 if __name__=='__main__':
     args = parse_args()
     torch.set_grad_enabled(False)
 
     # Load Model
-    config_path = args.logs_path+'configs/model.yaml'
-    ckpt_path = args.logs_path+'checkpoints/last.ckpt'
+    config_path = args.logs_path+'configs/' + os.listdir(args.logs_path+'configs/') [0]
+    ckpt_path = args.logs_path+'checkpoints/'+ os.listdir(args.logs_path+'checkpoints/')[0]
     model = load_model(config_path=config_path, ckpt_path=ckpt_path)
 
     # set path
-    name = args.dataset.replace('/','')
-    rec_path = 'rec/' + args.dataset
+    name = args.dataset.split('/')[-2]
+    if args.logs_path.find('epoch') != -1:
+        model_name = args.logs_path.split('/')[-3]
+    else:
+        model_name = args.logs_path.split('/')[-2]
+    rec_path = 'rec/' + name + '/'
+    if not os.path.exists(rec_path):
+        os.makedirs(rec_path)
+    rec_path = rec_path + model_name + '/'
     if not os.path.exists(rec_path):
         os.makedirs(rec_path)
-    index_path = 'index/' + args.dataset
+
+    index_path = 'index/' + name + '/'
     if not os.path.exists(index_path):
         os.makedirs(index_path)
-    tmp_path = 'tmp/' + args.dataset
+    index_path = index_path + model_name + '/'
+    if not os.path.exists(index_path ):
+        os.makedirs(index_path)
+
+    tmp_path = 'tmp/' + name + '/'
+    if not os.path.exists(tmp_path):
+        os.makedirs(tmp_path)
+    tmp_path = tmp_path + model_name + "/"
     if not os.path.exists(tmp_path):
         os.makedirs(tmp_path)
-
     # Load Model
     config = load_config(config_path, display=False)
 
 
     # read image
-    img_path = 'data/' + args.dataset
+    img_path = args.dataset
     filenames = os.listdir(img_path)
     file_list = []
     bpp_list = []
     psnr_list = []
+    lpips_list = []
     for img in os.listdir(img_path):
         file_list.append(img[:-4])
         image = PIL.Image.open(img_path + img)
@@ -215,7 +230,31 @@ def parse_args():
         img2 = cv2.imread(save_img)
         psnr_list.append(psnr(img1, img2))
 
+        # Caculate lpips
+        import lpips
+        use_gpu = True  # Whether to use GPU
+        spatial = True  # Return a spatial map of perceptual distance.
+        # Linearly calibrated models (LPIPS)
+        loss_fn = lpips.LPIPS(net='alex', spatial=spatial)  # Can also set net = 'squeeze' or 'vgg'
+        # loss_fn = lpips.LPIPS(net='alex', spatial=spatial, lpips=False) # Can also set net = 'squeeze' or 'vgg'
+        if (use_gpu):
+            loss_fn.cuda()
+        total = 0
+        LP = []
+        try:
+            dummy_im0 = lpips.im2tensor(lpips.load_image(img_path+img))
+            dummy_im1 = lpips.im2tensor(lpips.load_image(save_img))
+            if (use_gpu):
+                dummy_im0 = dummy_im0.cuda()
+                dummy_im1 = dummy_im1.cuda()
+            dist = loss_fn.forward(dummy_im0, dummy_im1)
+            d = dist.mean().item()
+            lpips_list.append(d)
+        except:
+            print(f'the image path: {img_path+img},{save_img} is wrong!')
+            exit()
         # Arithmetic encoding
+        print('model.quantize.embedding.weight.size():',model.quantize.embedding.weight.size())
         idx_cdf_uniform = pmf_to_cdf(get_uniform_pmf(model.quantize.embedding.weight.size(), index))
         byte_stream = torchac.encode_float_cdf(cdf_float=idx_cdf_uniform, sym=index.to(dtype=torch.int16).cpu(),
                                                check_input_bounds=True)
@@ -243,22 +282,26 @@ def parse_args():
         num_bits = os.path.getsize(save_tmp) * 8
         bpp = num_bits / num_pixel
         bpp_list.append(bpp)
+
     average_bpp = sum(bpp_list) / len(bpp_list)
     average_psnr= sum(psnr_list) / len(psnr_list)
+    average_lpips = sum(lpips_list) / len(lpips_list)
     bpp_list.append(average_bpp)
     psnr_list.append(average_psnr)
+    lpips_list.append(average_lpips)
     file_list.append('Average')
     data = {
         'Image Name': file_list,
         'Bits Per Pixel (BPP)': bpp_list,
-        'PSNR Value': psnr_list
+        'PSNR Value': psnr_list,
+        'LPIPS Value': lpips_list,
     }
 
     df = pd.DataFrame(data)
 
     # Write the DataFrame to an Excel file
-    output_file = 'bpp/' + name + '.xlsx'
+    output_file = f'bpp/{name}_{model_name}_bpp.xlsx'
     df.to_excel(output_file, index=False, engine='xlsxwriter')
 
-    print(f'Finish Model:{args.logs_path} test! Avg bpp = {average_bpp} psnr = {average_psnr}')
+    print(f'Finish Model:{args.logs_path} test! Avg bpp = {average_bpp} psnr = {average_psnr} lpips = {average_lpips}')
     print(f'Save bpp.csv to {output_file}')
diff --git a/scripts/extract_depth.py → Finetuned-VQGAN/scripts/extract_depth.py b/scripts/extract_depth.py → Finetuned-VQGAN/scripts/extract_depth.py
diff --git a/scripts/extract_segmentation.py → ...ned-VQGAN/scripts/extract_segmentation.py b/scripts/extract_segmentation.py → ...ned-VQGAN/scripts/extract_segmentation.py
diff --git a/scripts/extract_submodel.py → Finetuned-VQGAN/scripts/extract_submodel.py b/scripts/extract_submodel.py → Finetuned-VQGAN/scripts/extract_submodel.py
diff --git a/scripts/make_samples.py → Finetuned-VQGAN/scripts/make_samples.py b/scripts/make_samples.py → Finetuned-VQGAN/scripts/make_samples.py
diff --git a/scripts/make_scene_samples.py → ...tuned-VQGAN/scripts/make_scene_samples.py b/scripts/make_scene_samples.py → ...tuned-VQGAN/scripts/make_scene_samples.py
diff --git a/scripts/reconstruction_usage.ipynb → ...-VQGAN/scripts/reconstruction_usage.ipynb b/scripts/reconstruction_usage.ipynb → ...-VQGAN/scripts/reconstruction_usage.ipynb
diff --git a/scripts/sample_conditional.py → ...tuned-VQGAN/scripts/sample_conditional.py b/scripts/sample_conditional.py → ...tuned-VQGAN/scripts/sample_conditional.py
diff --git a/scripts/sample_fast.py → Finetuned-VQGAN/scripts/sample_fast.py b/scripts/sample_fast.py → Finetuned-VQGAN/scripts/sample_fast.py
diff --git a/scripts/taming-transformers.ipynb → ...d-VQGAN/scripts/taming-transformers.ipynb b/scripts/taming-transformers.ipynb → ...d-VQGAN/scripts/taming-transformers.ipynb
diff --git a/setup.py → Finetuned-VQGAN/setup.py b/setup.py → Finetuned-VQGAN/setup.py
diff --git a/taming/__pycache__/util.cpython-38.pyc → ...AN/taming/__pycache__/util.cpython-38.pyc b/taming/__pycache__/util.cpython-38.pyc → ...AN/taming/__pycache__/util.cpython-38.pyc
diff --git a/taming/data/__pycache__/base.cpython-38.pyc → ...ming/data/__pycache__/base.cpython-38.pyc b/taming/data/__pycache__/base.cpython-38.pyc → ...ming/data/__pycache__/base.cpython-38.pyc
diff --git a/...ng/data/__pycache__/custom.cpython-38.pyc → ...ng/data/__pycache__/custom.cpython-38.pyc b/...ng/data/__pycache__/custom.cpython-38.pyc → ...ng/data/__pycache__/custom.cpython-38.pyc
diff --git a/...g/data/__pycache__/faceshq.cpython-38.pyc → ...g/data/__pycache__/faceshq.cpython-38.pyc b/...g/data/__pycache__/faceshq.cpython-38.pyc → ...g/data/__pycache__/faceshq.cpython-38.pyc
diff --git a/.../__pycache__/helper_types.cpython-311.pyc → .../__pycache__/helper_types.cpython-311.pyc b/.../__pycache__/helper_types.cpython-311.pyc → .../__pycache__/helper_types.cpython-311.pyc
diff --git a/...a/__pycache__/helper_types.cpython-38.pyc → ...a/__pycache__/helper_types.cpython-38.pyc b/...a/__pycache__/helper_types.cpython-38.pyc → ...a/__pycache__/helper_types.cpython-38.pyc
diff --git a/...ng/data/__pycache__/sflckr.cpython-38.pyc → ...ng/data/__pycache__/sflckr.cpython-38.pyc b/...ng/data/__pycache__/sflckr.cpython-38.pyc → ...ng/data/__pycache__/sflckr.cpython-38.pyc
diff --git a/...ng/data/__pycache__/utils.cpython-311.pyc → ...ng/data/__pycache__/utils.cpython-311.pyc b/...ng/data/__pycache__/utils.cpython-311.pyc → ...ng/data/__pycache__/utils.cpython-311.pyc
diff --git a/taming/data/__pycache__/utils.cpython-38.pyc → ...ing/data/__pycache__/utils.cpython-38.pyc b/taming/data/__pycache__/utils.cpython-38.pyc → ...ing/data/__pycache__/utils.cpython-38.pyc
diff --git a/taming/data/ade20k.py → Finetuned-VQGAN/taming/data/ade20k.py b/taming/data/ade20k.py → Finetuned-VQGAN/taming/data/ade20k.py
diff --git a/taming/data/annotated_objects_coco.py → ...GAN/taming/data/annotated_objects_coco.py b/taming/data/annotated_objects_coco.py → ...GAN/taming/data/annotated_objects_coco.py
diff --git a/taming/data/annotated_objects_dataset.py → .../taming/data/annotated_objects_dataset.py b/taming/data/annotated_objects_dataset.py → .../taming/data/annotated_objects_dataset.py
diff --git a/taming/data/annotated_objects_open_images.py → ...ing/data/annotated_objects_open_images.py b/taming/data/annotated_objects_open_images.py → ...ing/data/annotated_objects_open_images.py
diff --git a/taming/data/base.py → Finetuned-VQGAN/taming/data/base.py b/taming/data/base.py → Finetuned-VQGAN/taming/data/base.py
diff --git a/taming/data/coco.py → Finetuned-VQGAN/taming/data/coco.py b/taming/data/coco.py → Finetuned-VQGAN/taming/data/coco.py
diff --git a/.../data/conditional_builder/objects_bbox.py → .../data/conditional_builder/objects_bbox.py b/.../data/conditional_builder/objects_bbox.py → .../data/conditional_builder/objects_bbox.py
diff --git a/...ditional_builder/objects_center_points.py → ...ditional_builder/objects_center_points.py b/...ditional_builder/objects_center_points.py → ...ditional_builder/objects_center_points.py
diff --git a/taming/data/conditional_builder/utils.py → .../taming/data/conditional_builder/utils.py b/taming/data/conditional_builder/utils.py → .../taming/data/conditional_builder/utils.py
diff --git a/taming/data/custom.py → Finetuned-VQGAN/taming/data/custom.py b/taming/data/custom.py → Finetuned-VQGAN/taming/data/custom.py
diff --git a/taming/data/faceshq.py → Finetuned-VQGAN/taming/data/faceshq.py b/taming/data/faceshq.py → Finetuned-VQGAN/taming/data/faceshq.py
diff --git a/taming/data/helper_types.py → Finetuned-VQGAN/taming/data/helper_types.py b/taming/data/helper_types.py → Finetuned-VQGAN/taming/data/helper_types.py
diff --git a/taming/data/image_transforms.py → ...ned-VQGAN/taming/data/image_transforms.py b/taming/data/image_transforms.py → ...ned-VQGAN/taming/data/image_transforms.py
diff --git a/taming/data/imagenet.py → Finetuned-VQGAN/taming/data/imagenet.py b/taming/data/imagenet.py → Finetuned-VQGAN/taming/data/imagenet.py
diff --git a/taming/data/open_images_helper.py → ...d-VQGAN/taming/data/open_images_helper.py b/taming/data/open_images_helper.py → ...d-VQGAN/taming/data/open_images_helper.py
diff --git a/taming/data/sflckr.py → Finetuned-VQGAN/taming/data/sflckr.py b/taming/data/sflckr.py → Finetuned-VQGAN/taming/data/sflckr.py
diff --git a/taming/data/utils.py → Finetuned-VQGAN/taming/data/utils.py b/taming/data/utils.py → Finetuned-VQGAN/taming/data/utils.py
diff --git a/taming/lr_scheduler.py → Finetuned-VQGAN/taming/lr_scheduler.py b/taming/lr_scheduler.py → Finetuned-VQGAN/taming/lr_scheduler.py
diff --git a/...pycache__/cond_transformer.cpython-38.pyc → ...pycache__/cond_transformer.cpython-38.pyc b/...pycache__/cond_transformer.cpython-38.pyc → ...pycache__/cond_transformer.cpython-38.pyc
diff --git a/.../models/__pycache__/vqgan.cpython-311.pyc → .../models/__pycache__/vqgan.cpython-311.pyc b/.../models/__pycache__/vqgan.cpython-311.pyc → .../models/__pycache__/vqgan.cpython-311.pyc
diff --git a/...g/models/__pycache__/vqgan.cpython-38.pyc → ...g/models/__pycache__/vqgan.cpython-38.pyc b/...g/models/__pycache__/vqgan.cpython-38.pyc → ...g/models/__pycache__/vqgan.cpython-38.pyc
diff --git a/taming/models/cond_transformer.py → ...d-VQGAN/taming/models/cond_transformer.py b/taming/models/cond_transformer.py → ...d-VQGAN/taming/models/cond_transformer.py
diff --git a/taming/models/dummy_cond_stage.py → ...d-VQGAN/taming/models/dummy_cond_stage.py b/taming/models/dummy_cond_stage.py → ...d-VQGAN/taming/models/dummy_cond_stage.py
diff --git a/taming/models/vqgan.py → Finetuned-VQGAN/taming/models/vqgan.py b/taming/models/vqgan.py → Finetuned-VQGAN/taming/models/vqgan.py
diff --git a/...g/modules/__pycache__/util.cpython-38.pyc → ...g/modules/__pycache__/util.cpython-38.pyc b/...g/modules/__pycache__/util.cpython-38.pyc → ...g/modules/__pycache__/util.cpython-38.pyc
diff --git a/taming/modules/autoencoder/lpips/vgg.pth → .../taming/modules/autoencoder/lpips/vgg.pth b/taming/modules/autoencoder/lpips/vgg.pth → .../taming/modules/autoencoder/lpips/vgg.pth
diff --git a/...nmodules/__pycache__/model.cpython-38.pyc → ...nmodules/__pycache__/model.cpython-38.pyc b/...nmodules/__pycache__/model.cpython-38.pyc → ...nmodules/__pycache__/model.cpython-38.pyc
diff --git a/taming/modules/diffusionmodules/model.py → .../taming/modules/diffusionmodules/model.py b/taming/modules/diffusionmodules/model.py → .../taming/modules/diffusionmodules/model.py
diff --git a/...iminator/__pycache__/model.cpython-38.pyc → ...iminator/__pycache__/model.cpython-38.pyc b/...iminator/__pycache__/model.cpython-38.pyc → ...iminator/__pycache__/model.cpython-38.pyc
diff --git a/taming/modules/discriminator/model.py → ...GAN/taming/modules/discriminator/model.py b/taming/modules/discriminator/model.py → ...GAN/taming/modules/discriminator/model.py
diff --git a/taming/modules/losses/__init__.py → ...d-VQGAN/taming/modules/losses/__init__.py b/taming/modules/losses/__init__.py → ...d-VQGAN/taming/modules/losses/__init__.py
diff --git a/...osses/__pycache__/__init__.cpython-38.pyc → ...osses/__pycache__/__init__.cpython-38.pyc b/...osses/__pycache__/__init__.cpython-38.pyc → ...osses/__pycache__/__init__.cpython-38.pyc
diff --git a/...s/losses/__pycache__/lpips.cpython-38.pyc → ...s/losses/__pycache__/lpips.cpython-38.pyc b/...s/losses/__pycache__/lpips.cpython-38.pyc → ...s/losses/__pycache__/lpips.cpython-38.pyc
diff --git a/...s/__pycache__/vqperceptual.cpython-38.pyc → ...s/__pycache__/vqperceptual.cpython-38.pyc b/...s/__pycache__/vqperceptual.cpython-38.pyc → ...s/__pycache__/vqperceptual.cpython-38.pyc
diff --git a/taming/modules/losses/lpips.py → ...uned-VQGAN/taming/modules/losses/lpips.py b/taming/modules/losses/lpips.py → ...uned-VQGAN/taming/modules/losses/lpips.py
diff --git a/taming/modules/losses/segmentation.py → ...GAN/taming/modules/losses/segmentation.py b/taming/modules/losses/segmentation.py → ...GAN/taming/modules/losses/segmentation.py
diff --git a/taming/modules/losses/vqperceptual.py → ...GAN/taming/modules/losses/vqperceptual.py b/taming/modules/losses/vqperceptual.py → ...GAN/taming/modules/losses/vqperceptual.py
diff --git a/taming/modules/misc/coord.py → Finetuned-VQGAN/taming/modules/misc/coord.py b/taming/modules/misc/coord.py → Finetuned-VQGAN/taming/modules/misc/coord.py
diff --git a/...sformer/__pycache__/mingpt.cpython-38.pyc → ...sformer/__pycache__/mingpt.cpython-38.pyc b/...sformer/__pycache__/mingpt.cpython-38.pyc → ...sformer/__pycache__/mingpt.cpython-38.pyc
diff --git a/...ormer/__pycache__/permuter.cpython-38.pyc → ...ormer/__pycache__/permuter.cpython-38.pyc b/...ormer/__pycache__/permuter.cpython-38.pyc → ...ormer/__pycache__/permuter.cpython-38.pyc
diff --git a/taming/modules/transformer/mingpt.py → ...QGAN/taming/modules/transformer/mingpt.py b/taming/modules/transformer/mingpt.py → ...QGAN/taming/modules/transformer/mingpt.py
diff --git a/taming/modules/transformer/permuter.py → ...AN/taming/modules/transformer/permuter.py b/taming/modules/transformer/permuter.py → ...AN/taming/modules/transformer/permuter.py
diff --git a/taming/modules/util.py → Finetuned-VQGAN/taming/modules/util.py b/taming/modules/util.py → Finetuned-VQGAN/taming/modules/util.py
diff --git a/...vqvae/__pycache__/quantize.cpython-38.pyc → ...vqvae/__pycache__/quantize.cpython-38.pyc b/...vqvae/__pycache__/quantize.cpython-38.pyc → ...vqvae/__pycache__/quantize.cpython-38.pyc
diff --git a/taming/modules/vqvae/quantize.py → ...ed-VQGAN/taming/modules/vqvae/quantize.py b/taming/modules/vqvae/quantize.py → ...ed-VQGAN/taming/modules/vqvae/quantize.py
diff --git a/taming/util.py → Finetuned-VQGAN/taming/util.py b/taming/util.py → Finetuned-VQGAN/taming/util.py
diff --git a/test.py → Finetuned-VQGAN/test.py b/test.py → Finetuned-VQGAN/test.py
diff --git a/train.py → Finetuned-VQGAN/train.py b/train.py → Finetuned-VQGAN/train.py
diff --git a/License.txt b/License.txt
diff --git a/README.md b/README.md
@@ -1,65 +1,30 @@
-# Extreme Image Compression using Fine-tuned VQGAN Models
-This repository is the official implementation of VQGAN-Comporession.
+# Introduction
+Official Pytorch implementation for image compression based on VQGAN model includes:
+* Finetuned-GAN:[Extreme Image Compression using Fine-tuned VQGAN Models](https://ieeexplore.ieee.org/document/10533792), DCC 2024, in [this folder](./Finetuned-VQGAN)
+* UIGC:[Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer](https://ieeexplore.ieee.org/abstract/document/10687549), ICME 2024， in [this floder](./UIGC)
 
-[Qi Mao](https://sites.google.com/view/qi-mao/), [Tinghan Yang](), [Yinuo Zhang](), [Zijian Wang](), [Meng Wang](https://scholar.google.com/citations?user=6vnhEIgAAAAJ&hl=zh-TW&oi=sra), [Shiqi Wang](), [Libiao Jin](), [Siwei Ma](https://scholar.google.com/citations?user=y3YqlaUAAAAJ&hl=zh-TW&oi=sra)
+# :heart: Acknowledgement
+The implementation is based on [VQGAN](https://github.com/CompVis/taming-transformers).
 
-<p align="center">
-<img src="assets/Framework.png"width="1060px"/>  
-<br>
-<em> Figure:  Overview of the proposed VQGAN-based image coding framework.</em>
-</p>
+# :clipboard: Citation
+If you find this work useful for your research, please cite:
 
-## Acknowledgement
-The framework is based on [VQGAN](https://github.com/CompVis/taming-transformers). We modify taming.modules.vqgan and add train.py and reconstruction.py for usage.
-
-## Introduction
-In this work, we propose a simple yet effective coding framework by introducing vector quantization (VQ)--based generative models into the image compression domain.
-
-[[Paper](https://arxiv.org/abs/2108.03690)] 
-
-<p align="center">
-<img src="assets/sub.jpg"width="1060px"/>  
-<br>
-<em> Figure: Our Results. </em> </p>
-
-## Installation
-Our method is tested using cuda11.3 on a single A100. The preparation work mainly includes configuring the environment.
-```bash
-conda env create -f environment.yaml
-conda activate vqgan
-```
-### Reconstruction
-If you want reconstruction imge with pretrained model, please download [Google driver](https://drive.google.com/drive/folders/14I_RnQ3cA6etdKGPVMFdmmVgMtBTB5rn?usp=sharing) from [Baidu cloud](https://pan.baidu.com/s/1zBeWKh6vgof13iTBwtA65A?pwd=kfl7) (code: kfl7) and put in `logs/`
-
-Some evaluation dataset can be downloaded from 
-[kodak dataset](http://r0k.us/graphics/kodak/) and [CLIC](http://challenge.compression.cc/tasks/) and put in `data/`
-```bash
-python reconstruction.py --logs_path $model_dir --dataset $dataset_name
-```
-An example: After evaluation on the Kodak dataset, fine tune the pre trained model of [vqgan_imagenet_f16_16384](https://heibox.uni-heidelberg.de/d/a7530b09fed84f80a887/) to a codebook size of 1024.
-```bash
-python reconstruction.py --logs_path logs/kmeans_tune/16384_kmeans_1024_epoch/epoch1/ --dataset Kodak/
 ```
-The result is saved at `rec/Kodak/`
-
-### Train
-Prepare the dataset according to the instructions of the original [VQGAN](https://github.com/CompVis/taming-transformers?tab=readme-ov-file#data-preparation) project, but our training involves freezing the codec and only updating the codebook for fine-tuning. You can use the following code to achieve this:
-```bash
-python train.py --base configs/custom_vqgan.yaml -t True --gpus 0, --is_frozen
-```
-The fine-tune model is saved at `logs/`
-
-## Citation
-```
-@inproceedings{wang2023extreme,
-  title={Extreme Generative Human-Oriented Video Coding via Motion Representation Compression},
-  author={Wang, Ruofan and Mao, Qi and Jia, Chuanmin and Wang, Ronggang and Ma, Siwei},
-  booktitle={2023 IEEE International Symposium on Circuits and Systems (ISCAS)},
-  pages={1--5},
-  year={2023},
+@inproceedings{mao2024extreme,
+  title={Extreme image compression using fine-tuned vqgans},
+  author={Mao, Qi and Yang, Tinghan and Zhang, Yinuo and Wang, Zijian and Wang, Meng and Wang, Shiqi and Jin, Libiao and Ma, Siwei},
+  booktitle={2024 Data Compression Conference (DCC)},
+  pages={203--212},
+  year={2024},
   organization={IEEE}
 }
-``` 
 
-## Contact
-Feel free to contact us if there is any question. (Qi Mao, qimao@cuc.edu.cn; Tinghan Yang, yangtinghan@cuc.edu.cn)
+@inproceedings{xue2024unifying,
+  title={Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer},
+  author={Xue, Naifu and Mao, Qi and Wang, Zijian and Zhang, Yuan and Ma, Siwei},
+  booktitle={2024 IEEE International Conference on Multimedia and Expo (ICME)}, 
+  pages={1-6},
+  year={2024}，
+  organization={IEEE}
+}
+```
diff --git a/Read.md b/Read.md
diff --git a/UIGC/.idea/.gitignore b/UIGC/.idea/.gitignore
diff --git a/UIGC/.idea/UIGC.iml b/UIGC/.idea/UIGC.iml
diff --git a/UIGC/.idea/inspectionProfiles/profiles_settings.xml b/UIGC/.idea/inspectionProfiles/profiles_settings.xml
diff --git a/UIGC/.idea/misc.xml b/UIGC/.idea/misc.xml
diff --git a/UIGC/.idea/modules.xml b/UIGC/.idea/modules.xml
diff --git a/UIGC/.idea/vcs.xml b/UIGC/.idea/vcs.xml