We are seeking collaborators in image/video compression. If you are interested, please contact me so we can work together to advance the field.
E-mail: haowei@stu.xjtu.edu.cn.
- 2025-11-04: Our paper ICISP has been accepted by Neural Networks. Our previous works VQIR, DiffEIC, and RDEIC have been accepted by TCSVT.
AAAI 2025, ICLR 2025, CVPR 2025, ICML 2025, ACMMM 2025, ICCV 2025, NeurIPS 2025, IJCAI 2025
ICLR 2024, CVPR 2024, ICML 2024, ECCV 2024, ACMMM 2024, NeurIPS 2024, AAAI 2024
ICLR 2023, ICML 2023, NeurIPS 2023, AAAI 2023, ICCV 2023, CVPR 2023, ACMMM 2023
IEEE TPAMI, IEEE TCSVT, IEEE TIP, IEEE TMM, IF
A Tri-Dynamic Preprocessing Framework for UGC Video Compression. ICASSP 2024. Paper
A Preprocessing Framework for Video Machine Vision under Compression. DCC. Paper
Generative Preprocessing for Image Compression with Pre-trained Diffusion Models. DCC 2026. Paper
Content Adaptive based Motion Alignment Framework for Learned Video Compression. DCC 2026. Paper
Lightweight 3D Gaussian Splatting Compression via Video Codec. DCC 2026. Paper
A Lightweight Model for Perceptual Image Compression via Implicit Priors. Neural Networks 2025. Paper; Code
Turbo principles meet compression: Rethinking nonlinear transformations in learned image compression. JVCIR 2025. Paper
DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework. VCIP 2025. Paper
Neural Image Compression With Multi-Type Feature Fusion and Multi-Distribution Mixture Likelihood. IEEE Transactions on Broadcasting. Paper
FlashGMM: Fast Gaussian Mixture Entropy Model for Learned Image Compression. VCIP 2025. Paper
Semantics-Guided Generative Image Compression. ICIP 2025. Paper
Learned Compression for Compressed Learning. DCC 2025. Paper
DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression. DCC 2024. Paper
Hybrid Local-Global Context Learning for Neural Video Compression. DCC 2024. Paper
LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression. VCIP 2024. Paper; Code
Efficient Progressive Image Compression with Variance-aware Masking. WACV 2025. Paper
L-LBVC: Long-Term Motion Estimation and Prediction for Learned Bi-Directional Video Compression. DCC 2025. Paper
Extreme Image Compression Using Fine-tuned VQGANs. DCC 2024. Paper; Code
Progressive Learned Image Compression for Machine Perception. Paper
SLIM: Semantic-based Low-bitrate Image compression for Machines by leveraging diffusion. Paper
L-STEC: Learned Video Compression with Long-term Spatio-Temporal Enhanced Context. Paper Siwei Ma
Content Adaptive based Motion Alignment Framework for Learned Video Compression. Paper Siwei Ma
TreeNet: A Light Weight Model for Low Bitrate Image Compression. Paper
VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression. Paper
Ultra-Low Bitrate Perceptual Image Compression with Shallow Encoder. Paper
Embodied Image Compression. Paper
Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment. Paper
HVQ-CGIC: Enabling Hyperprior Entropy Modeling for VQ-Based Controllable Generative Image Compression. Paper
Beyond Hallucinations: A Multimodal-Guided Task-Aware Generative Image Compression for Ultra-Low Bitrate. Paper
Generative Neural Video Compression via Video Diffusion Prior. Paper
2025.10.14 Generative Latent Video Compression. Paper
UniComp: Rethinking Video Compression Through Informational Uniqueness. Paper
Ultra-lightweight Neural Video Representation Compression. Paper
Joint Multi-scale Gated Transformer and Prior-guided Convolutional Network for Learned Image Compression. Paper
EV-NVC: Efficient Variable bitrate Neural Video Compression. Paper
2025.11.26 Hybrid Convolution and Frequency State Space Network for Image Compression. Paper
2025.11.25 CoD: A Diffusion Foundation Model for Image Compression. Paper
A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs. Paper
Generative Coding: Promise and Challenges. Paper
2025.11.17 Rethinking Autoregressive Models for Lossless Image Compression via Hierarchical Parallelism and Progressive Adaptation. Paper
2025.11.13 ROI-based Deep Image Compression with Implicit Bit Allocation. Paper
2025.11.13 Machines Serve Human: A Novel Variable Human-machine Collaborative Compression Framework. Paper
2025.11.13 Neural B-frame Video Compression with Bi-directional Reference Harmonization. Paper
2025.11.13 GRACE: Designing Generative Face Video Codec via Agile Hardware-Centric Workflow. Paper
2025.11.12 From Noise to Latent: Generating Gaussian Latents for INR-Based Image Compression. Paper
Variable-rate learned image compression with integer-arithmetic-only inference. Paper
2025.11.11 Turbo-DDCM: Fast and Flexible Zero-Shot Diffusion-Based Image Compression. Paper
2025.11.11 Training-Free Adaptive Quantization for Variable Rate Image Coding for Machines. Paper
2025.11.11 GFix: Perceptually Enhanced Gaussian Splatting Video Compression. Paper
2025.11.04 T-MLA: A Targeted Multiscale Log-Exponential Attack Framework for Neural Image Compression. Paper
2025.11.03 Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis. Paper
2025.10.22 Leveraging Learned Image Prior for 3D Gaussian Compression. Paper
2025.10.20 SANR: Scene-Aware Neural Representation for Light Field Image Compression with Rate-Distortion Optimization. Paper
2025.10.18 MH-LVC: Multi-Hypothesis Temporal Prediction for Learned Conditional Residual Video Coding. Paper
2025.10.14 JND-Guided Light-Weight Neural Pre-Filter for Perceptual Image Coding. Paper
2025.09.25 SEEC: Segmentation-Assisted Multi-Entropy Models for Learned Lossless Image Compression. Paper
2025.09.25 PORT: Perception-Oriented Image Compression with Real-Time Decoding. Paper
2025.09.23 ISCS: Parameter-Guided Channel Ordering and Grouping for Learned Image Compression. Paper
2025.09.18 Generative Image Coding with Diffusion Prior. Paper
2025.09.15 Efficient Learned Image Compression Through Knowledge Distillation. Paper
2025.09.12 In-Loop Filtering Using Learned Look-Up Tables for Video Coding. Paper
2025.09.09 Histogram Driven Amplitude Embedding for Qubit Efficient Quantum Image Compression. Paper
2025.09.05 LMVC: An End-to-End Learned Multiview Video Coding Framework. Paper
2025.08.13 Adaptive High-Frequency Preprocessing for Video Coding. Paper
2025.08.13 Scaling Learned Image Compression Models up to 1 Billion. Paper
Conditional Residual Coding with Explicit-Implicit Temporal Buffering for Learned Video Compression. Paper
HyTIP: Hybrid Temporal Information Propagation for Masked Conditional Residual Video Coding. Paper
Multimodal-Guided Perceptual Image Compression via Joint Text and Audio. Paper
2025.08.06 HCF: Hierarchical Cascade Framework for Distributed Multi-Stage Image Compression. Paper
2025.08.06 CMIC: Content-Adaptive Mamba for Learned Image Compression. Paper
2025.08.01 JPEG Processing Neural Operator for Backward-Compatible Coding. Paper
2025.07.22 Conditional Video Generation for High-Efficiency Video Compression. Paper
T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates. Paper
Customizable ROI-Based Deep Image Compression. Paper
How to Design and Train Your Implicit Neural Representation for Video Compression. Paper
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs. Paper
End-to-End RGB-IR Joint Image Compression With Channel-wise Cross-modality Entropy Model. Paper
Video Compression for Spatiotemporal Earth System Data. Paper
DiffO: Single-step Diffusion for Image Compression at Ultra-Low Bitrates. Paper; Code
Fast Training-free Perceptual Image Compression. Paper
Gumbel-max List Sampling for Distribution Coupling with Multiple Samples. Paper
2025.06.03 Fourier-Modulated Implicit Neural Representation for Multispectral Satellite Image Compression. Paper
2025.05.30 Towards Facial Image Compression with Consistency Preserving Diffusion Prior. Paper
2025.05.30 High Quality Underwater Image Compression with Adaptive Correction and Codebook-based Augmentation. Paper
2025.05.30 3DGS Compression with Sparsity-guided Hierarchical Transform Coding. Paper
2025.05.28 Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution. Paper; Code
2025.05.23 DualComp: End-to-End Learning of a Unified Dual-Modality Lossless Compressor. Paper
2025.05.16 High Quality Underwater Image Compression with Adaptive Correction and Codebook-based Augmentation. Paper
2025.05.15 Neural Video Compression using 2D Gaussian Splatting. Paper
2025.05.15 BiECVC: Gated Diversification of Bidirectional Contexts for Learned Video Compression. Paper
2025.04.08 3DM-WeConvene: Learned Image Compression with 3D Multi-Level Wavelet-Domain Convolution and Entropy Model. Paper
2025.04.04 Image Coding for Machines via Feature-Preserving Rate-Distortion Optimization. Paper
2025.03.30 Embedding Compression Distortion in Video Coding for Machines. Paper
2025.03.26 GIViC: Generative Implicit Video Compression. Paper
2025.03.22 Leveraging Diffusion Knowledge for Generative Image Compression with Fractal Frequency-Aware Band Learning. Paper
PerCoV2: Improved Ultra-Low Bit-Rate Perceptual Image Compression with Implicit Hierarchical Masked Image Modeling. Paper; Code
Residual Learning and Filtering Networks for End-to-End Lossless Video Compression. Paper
Integrating Adaptive Sampling for Optimal Learned Video Compression. Paper
FEDS: Feature and Entropy-Based Distillation Strategy for Efficient Learned Image Compression. Paper
UAR-NVC: A Unified AutoRegressive Framework for Memory-Efficient Neural Video Compression. Paper
Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence. Paper
Continuous Patch Stitching for Block-wise Image Compression. Paper
S2CFormer: Reorienting Learned Image Compression from Spatial Interaction to Channel Aggregation. Paper
HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates. Paper
CANeRV: Content Adaptive Neural Representation for Video Compression. Paper
CMamba: Learned Image Compression with State Space Models. Paper
Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse. Paper
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression. Paper
GaussianVideo: Efficient Video Representation Through 2D Gaussian Splatting. Paper
Locality-aware Gaussian Compression for Fast and High-quality Rendering. Paper
CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation. Paper; Code
Conditional Hallucinations for Image Compression. Paper
Deeply-Conditioned Image Compression via Self-Generated Priors. Paper
Resi-VidTok: An Efficient and Decomposed Progressive Tokenization Framework for Ultra-Low-Rate and Lightweight Video Transmission. Paper
Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression. Paper; Supplementary File; Code
DynaQuant: Dynamic Mixed-Precision Quantization for Learned Image Compression. Paper; Code
MRT: Learning Compact Representations with Mixed RWKV-Transformer for Extreme Image Compression. Paper; Code
Emerging Advances in Learned Video Compression: Models, Systems and Beyond. Paper
FIPER: Factorized Features for Robust Image Super-Resolution and Compression. Paper
MoRIC: A Modular Region-based Implicit Codec for Image Compression. Paper
Optimal Neural Compressors for the Rate-Distortion-Perception Tradeoff. Paper
Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior. Paper
Switchable Token-Specific Codebook Quantization For Face Image Compression. Paper
OSCAR: One-Step Diffusion Codec Across Multiple Bit-rates. Paper; Code
One-Step Diffusion-Based Image Compression with Semantic Distillation. Paper; Code
Beyond Perspective: Neural 360-Degree Video Compression. Paper
An Information-Theoretic Regularizer for Lossy Neural Image Compression. Paper
GIViC: Generative Implicit Video Compression. Paper
LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression. Paper
Knowledge Distillation for Learned Image Compression. Paper
Diffusion-based Compression Quality Tradeoffs without Retraining. Paper
Cross-Granularity Online Optimization with Masked Compensated Information for Learned Image Compression. Paper
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression. Paper
DLF: Extreme Image Compression with Dual-generative Latent Fusion. Paper; Code
Context Guided Transformer Entropy Modeling for Video Compression. Paper; Code
StableCodec: Taming One-Step Diffusion for Extreme Image Compression. Paper; Code
Dataset Distillation as Data Compression: A Rate-Utility Perspective. Paper
Learned Image Compression with Hierarchical Progressive Context Modeling. Paper; Code
Reproducibility Companion Paper - NIF: A Fast Implicit Image Compression with Bottleneck Layers and Modulated Sinusoidal Activations. Paper
SCID-Compress900: A Multi-Scene Dataset of 4K and 1080P Screen Content Images for Image Compression Research. Paper
OpenMVC: An Open-Source Library for Learning-based Multi-view Compression. Paper
Challenging Cases of Neural Image Compression: A Dataset of Visually Compelling Yet Semantically Incorrect Reconstructions. Paper
Towards a New Paradigm of Visual Signal Compression. Paper
VidIQ: Inference-Aware Neural Codecs for Quality-Enhanced, Real-Time Video Analytics. Paper
Neural Video Compression with In-Loop Contextual Filtering and Out-of-Loop Reconstruction Enhancement. Paper
How2Compress: Scalable and Efficient Edge Video Analytics via Adaptive Granular Video Compression. Paper
Unicorn: Unified Neural Image Compression with One Number Reconstruction. Paper
3D Gaussian Splatting Data Compression with Mixture of Priors. Paper
Activation and Weight Distribution Balancing for Optimal Post-Training Quantization in Learned Image Compression. Paper
BiECVC: Gated Diversification of Bidirectional Contexts for Learned Video Compression. Paper
MuCodec: Ultra Low-Bitrate Music Codec for Music Generation. Paper
EHVC: Efficient Hierarchical Reference and Quality Structure for Neural Video Coding. Paper; Code
NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness Analysis. Paper; Code
Compressed Image Generation with Denoising Diffusion Codebook Models. Paper
3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors. Paper
Privacy-Shielded Image Compression: Defending Against Exploitation from Vision-Language Pretrained Models. Paper
LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression. Paper
Synonymous Variational Inference for Perceptual Image Compression. Paper
Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion. Paper; Code
Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression. Paper
Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression. Paper
Fitted Neural Lossless Image Compression. Paper
Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers. Paper
Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression. Paper
CASP: Compression of Large Multimodal Models Based on Attention Sparsity. Paper
Generalized Gaussian Entropy Model for Point Cloud Attribute Compression with Dynamic Likelihood Intervals. Paper
RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds. Paper
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models. Paper
Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion. Paper
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression. Paper
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Model. Paper
FLAVC: Learned Video Compression with Feature Level Attention. Paper; Code
WISE: A Framework for Gigapixel Whole-Slide-Image Lossless Compression. Paper
VoCo-LLaMA: Towards Vision Compression with Large Language Models. Paper
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression. Paper
4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video. Paper
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models. Paper
Perceptual Video Compression with Neural Wrapping. Paper
TopNet: Transformer-Efficient Occupancy Prediction Network for Octree-Structured Point Cloud Geometry Compression. Paper
Frequency-Biased Synergistic Design for Image Compression and Compensation. Paper
Decouple Distortion from Perception: Region Adaptive Diffusion for Extreme-low Bitrate Perception Image Compression. Paper; Code
Neural Video Compression with Context Modulation. Paper
High Dynamic Range Video Compression: A Large-Scale Benchmark Dataset and A Learned Bit-depth Scalable Compression Algorithm. Paper
PICD: Versatile Perceptual Image Compression with Diffusion Rendering. Paper
Learned Image Compression with Dictionary-based Entropy Model. Paper; Code
Multirate Neural Image Compression with Adaptive Lattice Vector Quantization. Paper
Test-Time Fine-Tuning of Image Compression Models for Multi-Task Adaptability. Paper
MambaIC: State Space Models for High-Performance Learned Image Compression. Paper; Code
Linear Attention Modeling for Learned Image Compression. Paper; Code
ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression. Paper; Code
Balanced Rate-Distortion Optimization in Learned Image Compression. Paper
Extremely low-bitrate Image Compression Semantically Disentangled by LMMs from a Human Perception Perspective. Paper
Towards Practical Real-Time Neural Video Compression. Paper
Adaptive length image tokenization via recurrent allocation. Paper; Code
DiffPC: Diffusion-based High Perceptual Fidelity Image Compression with Semantic Refinement. Paper
Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption. Paper; Code
Fast Feedforward 3D Gaussian Splatting Compression. Paper
Locality-aware Gaussian Compression for Fast and High-quality Rendering. Paper
An Exploration with Entropy Constrained 3D Gaussians for 2D Video Compression. Paper; Code
Scaling Transformers for Low-bitrate High-quality Speech Coding. Paper
Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks. Paper
FlowDec: A Flow-based Full-band General Audio Codec with High Perceptual Quality. Paper
Lossy Compression with Pretrained Diffusion Models. Paper
On Quantizing Neural Representation For Variable-Rate Video Coding. Paper
Progressive Compression with Universally Quantized Diffusion Models. Paper
CAT-3DGS: A Context-Adaptive Triplane Approach to Rate-Distortion-Optimized 3DGS Compression. Paper
On Disentangled Training for Nonlinear Transform in Learned Image Compression. Paper; Code
Few-Shot Domain Adaptation for Learned Image Compression. Paper
PNVC: Towards Practical INR-based Video Compression. Paper
Conditional Latent Coding with Learnable Synthesized Reference for Deep Image Compression (Oral). Paper; Code
Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting. Paper
Towards Loss-Resilient Image Coding for Unstable Satellite Networks. Paper
CALLIC: Content Adaptive Learning for Lossless Image Compression. Paper
Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression. Paper
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression.
Deep Hierarchical Video Compression. Paper; Code; Journal Version
Offline and Online Optical Flow Enhancement for Deep Video Compression. Paper
NVRC: Neural Video Representation Compression. Paper
Robustly overfitting latents for flexible neural image compression. Paper
Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression. Paper
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS. Paper
Diversify, Contextualize, and Adapt: Efficient Entropy Modeling for Neural Image Codec. Paper
Causal Context Adjustment Loss for Learned Image Compression. Paper; Code
COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation. Paper
Rate-aware Compression for NeRF-based Volumetric Video. Paper
Task-Oriented Multi-Bitstream Optimization for Image Compression and Transmission via Optimal Transport. Paper
Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression. Paper
Spatial-Temporal Context Model for Remote Sensing Imagery Compression. Paper
QS-NeRV: Real-Time Quality-Scalable Decoding with Neural Representation for Videos. Paper
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression. Paper
Consistency Guided Diffusion Model with Neural Syntax for Perceptual Image Compression. Paper
SNeRV: Spectra-preserving Neural Representation for Video. Paper
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics. Paper
Neural Graphics Texture Compression Supporting Random Access. Paper
MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation. Paper
Lossy Image Compression with Foundation Diffusion Models. Paper
Long-term Temporal Context Gathering for Neural Video Compression. Paper
Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression. Paper
Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement. Paper
A Unified Image Compression Method for Human Perception and Multiple Vision Tasks. Paper
BaSIC: BayesNet Structure Learning for Computational Scalable Neural Image Compression. Paper; Code
Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network. Paper
Fast Encoding and Decoding for Implicit Video Representation. Paper
GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting. Paper; Code
Learned HDR Image Compression for Perceptually Optimal Storage and Display. Paper; Code
Region Adaptive Transform with Segmentation Prior for Image Compression. Paper; Code
WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model. Paper; Code
Rate-Distortion-Cognition Controllable Versatile Neural Image Compression. Paper
EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation. Paper
Image Compression for Machine and Human Vision With Spatial-Frequency Adaptation. Paper; Code
HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression. Paper; Code
Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model. Paper
Compress Clean Signal from Noisy Raw Image: A Self-Supervised Approach. Paper; Code
Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder. Paper
Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity. Paper; Code
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens. Paper
DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic Codes. Paper
JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients. Paper
Combining Frame and GOP Embeddings for Neural Video Representation. Paper
C3: High-performance and Low-complexity Neural Compression from A Single Image or Video. Paper
Generative Latent Coding for Ultra-Low Bitrate Image Compression. Paper
Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis. Paper
Learned Lossless Image Compression based on Bit Plane Slicing. Paper
Towards Backward-Compatible Continual Learning of Image Compression. Paper
Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain. Paper
Boosting Neural Representations for Videos with a Conditional Decoder. Paper; Code
Neural Video Compression with Feature Modulation. Paper
Task-Aware Encoder Control for Deep Video Compression. Paper
NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation. Paper
Frequency-Aware Transformer for Learned Image Compression. Paper; Code
Idempotence and Perceptual Image Compression. Paper; Code
Towards Image Compression with Perfect Realism At Ultra-Low Bitrates. Paper
Neural Rate Control for Learned Video Compression. Paper
Finite-State Autoregressive Entropy Coding for Efficient Learned Lossless Compression. Paper
EVC: Towards Real-Time Neural Image Compression with Mask Decay. Paper
MIMT: Masked Image Modeling Transformer for Video Compression. Paper
Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models. Paper
Modality-Agnostic Variational Compression of Implicit Neural Representations. Paper
Is Overfitting Necessary for Implicit Video Representation? Paper
Learned Distributed Image Compression with Multi-Scale Patch Matching in Feature Domain. Paper
Self-Asymmetric Invertible Network for Compression-Aware Image Rescaling. Paper
Multi-Modality Deep Network for Extreme Learned Image Compression. Paper
Neural Image Compression: Generalization, Robustness, and Spectral Biases. Paper
Compression with Bayesian Implicit Neural Representations. Paper
Towards Efficient Image Compression Without Autoregressive Models. Paper
Lossy Image Compression with Conditional Diffusion Models. Paper
Task-aware Distributed Source Coding Under Dynamic Bandwidth. Paper
On the Choice of Perception Loss Function for Learned Video Compression. Paper
HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation. Paper
Compact Neural Volumetric Video Representations with Dynamic Codebooks. Paper
Computationally-Efficient Neural Image Compression with Shallow Decoders. Paper
SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations. Paper
COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability. Paper
Dec-Adapter: Exploring Efficient Decoder-Side Adapter for Bridging Screen Content and Natural Image Compression. Paper
Semantically Structured Image Compression via Irregular Group-Based Decoupling. Paper
TransTIC: Transferring Transformer-based Image Compression from Human Perception to Machine Perception. Paper
RFD-ECNet: Extreme Underwater Image Compression with Reference to Feature Dictionary. Paper
AdaNIC: Towards Practical Neural Image Compression via Dynamic Transform Routing. Paper
Non-Semantics Suppressed Mask Learning for Unsupervised Video Semantic Compression. Paper
Scene Matters: Model-based Deep Video Compression. Paper
Learned Image Compression With Mixed Transformer-CNN Architectures. Paper; Code
Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger. Paper
TINC: Tree-structured Implicit Neural Compression. Paper
Context-Based Trit-Plane Coding for Progressive Image Compression. Paper
LVQAC: Lattice Vector Quantization Coupled With Spatially Adaptive Companding for Efficient Learned Image Compression. Paper
Multi-Realism Image Compression With a Conditional Generator. Paper; Code
AccelIR: Task-aware Image Compression for Accelerating Neural Restoration. Paper
Real-Time 6K Image Rescaling With Rate-Distortion Optimization. Paper
Complexity-Guided Slimmable Decoder for Efficient Deep Video Compression. Paper
Video Compression With Entropy-Constrained Neural Representations. Paper
Neural Video Compression with Diverse Contexts. Paper
Motion Information Propagation for Neural Video Compression. Paper
MMVC: Learned Multi-Mode Video Compression With Block-Based Prediction Mode Selection and Density-Adaptive Entropy Coding. Paper
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos. Paper
HNeRV: A Hybrid Neural Representation for Videos. Paper; Code
Hierarchical B-frame Video Coding Using Two-Layer CANF without Motion Coding. Paper
Towards Scalable Neural Representation for Diverse Videos. Paper
NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks and Autoregressive Patch-wise Modeling. Paper
Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach. Paper
MLIC: Multi-Reference Entropy Model for Learned Image Compression. Paper; Code
Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior. 2024. Paper
Deep Lossy Plus Residual Coding for Lossless and Near-lossless Image Compression. 2024. Paper
I2C: Invertible Continuous Codec for High-Fidelity Variable-Rate Image Compression. 2024. Paper
Insights from Generative Modeling for Neural Video Compression. 2023. Paper
Generative Latent Coding for Ultra-Low Bitrate Image and Video Compression. 2025. Paper
High Accuracy Rate Control for Neural Video Coding Based on Rate-Distortion Modeling. Paper
RDEIC: Accelerating Diffusion-Based Extreme Image Compression with Relay Residual Diffusion. Paper
Rethinking the Functionality of Latent Representation: A Logarithmic Rate-Distortion Model for Learned Image Compression. Paper
Sparse Point Clouds Assisted Learned Image Compression. 2024. Paper
Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior. 2024. Paper; Code
FDNet: Frequency Decomposition Network for Learned Image Compression. 2024. Paper
Conditional Residual Coding: A Remedy for Bottleneck Problems in Conditional Inter Frame Coding. 2024. Paper
Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression. 2024. Paper
Exploring Resolution Fields for Scalable Image Compression with Uncertainty Guidance. 2023. Paper
Extremely Low Bit-rate Image Compression via Invertible Image Generation. 2023. Paper
DMVC: Decomposed Motion Modeling for Learned Video Compression. 2023. Paper
Saliency Segmentation Oriented Deep Image Compression With Novel Bit Allocation. 2025. Paper
Exploring Multimodal Knowledge for Image Compression via Large Foundation Models. 2025. Paper
MISC: Ultra-Low Bitrate Image Semantic Compression Driven by Large Multimodal Model. 2025. Paper; Code
Linearly transformed color guide for low-bitrate diffusion based image compression. 2024. Paper
Exploiting Latent Properties to Optimize Neural Codecs. Paper
Exploring Long- and Short-Range Temporal Information for Learned Video Compression. 2024. Paper
Learned Video Compression With Efficient Temporal Context Learning. 2023. Paper
Scalable Face Image Coding via StyleGAN Prior: Toward Compression for Human-Machine Collaborative Vision. 2023. Paper
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules. 2023. Paper
Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression. 2023. Paper
One is All: A Unified Rate-Distortion-Complexity Framework for Learned Image Compression Under Energy Concentration Criteria. 2025. Paper
Uncertainty-Aware Deep Video Compression with Ensembles. 2024. Paper
Visual fidelity-oriented compression via representation fusion and diffusion reconstruction. 2025. Paper