Abstract
When transmission medium and compression degradation are intertwined, new challenges emerge. This study addresses the problem of raindrop removal from compressed images, where raindrops obscure large areas of the background and compression leads to the loss of high-frequency (HF) information. The restoration of the former requires global contextual information, while the latter necessitates guidance for high-frequency details, resulting in a conflict in utilizing these two types of information when designing existing methods. To address this issue, we propose a novel transformer architecture that leverages the advantages of attention mechanism and HF-friendly design to effectively restore the compressed raindrop images at the framework, component, and module levels. Specifically, at the framework level, we integrate relative position multi-head self-attention and convolutional layers into the proposed low-high-frequency transformer (LHFT), where the former captures global contextual information and the latter focuses on high-frequency information. Their combination effectively resolves the issue of mixed degradation. At the component level, we utilize high-frequency depth-wise convolution (HFDC) with zero-mean kernels to improve the capability to extract high-frequency features, drawing inspiration from typical high-frequency filters like Prewitt and Sobel operators. Finally, at the module level, we introduce a low-high-attention module (LHAM) to adaptively allocate the importance of low and high frequencies along channels for effective fusion. We establish the JPEG-compressed raindrop image dataset and conduct extensive experiments on different compression rates. Experimental results demonstrate that the proposed method outperforms state-of-the-art methods without increasing computational costs.
Original language | English |
---|---|
Pages (from-to) | 2070-2082 |
Number of pages | 13 |
Journal | IEEE Transactions on Multimedia |
Volume | 27 |
Early online date | 24 Jan 2024 |
DOIs | |
Publication status | Published - 2025 |
Bibliographical note
Publisher Copyright:© 1999-2012 IEEE.
Funding
Thisworkwas supported in part byRGCGeneral Research Funds underGrant 11203820, in part by ITF Project underGrant GHP/044/21SZ, in part byRGCGeneral Research Fund underGrant 11203220/11200323, in part by the National Natural Science Foundation of China under Grant 62401214, and in part by Guangdong Basic and Applied Basic Research Foundation under Grant 2024A1515010454.
Keywords
- Compressed image
- convolution
- high frequency
- low frequency
- raindrop removal
- transformer
- zero-mean