Abstract
The learned image compression (LIC) methods have already surpassed traditional techniques in compressing natural scene (NS) images. However, directly applying these methods to screen content (SC) images, which possess distinct characteristics such as sharp edges, repetitive patterns, embedded text and graphics, yields suboptimal results. This paper addresses three key challenges in SC image compression: learning compact latent features, adapting quantization step sizes, and the lack of large SC datasets. To overcome these challenges, we propose a novel compression method that employs a multi-frequency two-stage octave residual block (MToRB) for feature extraction, a cascaded triple-scale feature fusion residual block (CTSFRB) for multi-scale feature integration and a multi-frequency context interaction module (MFCIM) to reduce inter-frequency correlations. Additionally, we introduce an adaptive quantization module that learns scaled uniform noise for each frequency component, enabling flexible control over quantization granularity. Furthermore, we construct a large SC image compression dataset (SDU-SCICD10K), which includes over 10,000 images spanning basic SC images, computer-rendered images, and mixed NS and SC images from both PC and mobile platforms. Experimental results demonstrate that our approach significantly improves SC image compression performance, outperforming traditional standards and state-of-the-art learning-based methods in terms of peak signal-to-noise ratio (PSNR) and multi-scale structural similarity (MS-SSIM).
| Original language | English |
|---|---|
| Pages (from-to) | 1034-1047 |
| Number of pages | 14 |
| Journal | IEEE Transactions on Broadcasting |
| Volume | 71 |
| Issue number | 4 |
| Early online date | 19 Sept 2025 |
| DOIs | |
| Publication status | Published - Dec 2025 |
Bibliographical note
Publisher Copyright:© 1963-12012 IEEE.
Funding
This work was supported in part by the National Natural Science Foundation of China under Grant 62222110, Grant 62571303, and Grant 62172259; in part by the Tn Scholar Project of Shandong Province under Grant tsqn202103001; in part by Shandong Provincial Natural Science Foundation under Grant ZR2022ZD38; and in part by the OPPO Research Fund.
Keywords
- Screen content
- adaptive quantization
- feature decomposition
- image compression
- multi-frequency