Hero-Mamba: Mamba-based Dual Domain Learning for Underwater Image Enhancement

2026-04-17, Computer Vision and Pattern Recognition

AI summary

The authors address degradation in underwater images, such as color distortion and blur caused by light absorption and scattering in water. They propose Hero-Mamba, a network that processes each image in two domains in parallel, the spatial domain (RGB pixels) and the spectral domain (FFT components), so that color and detail degradation can be handled separately. Its Mamba-based blocks model the whole image with linear complexity, avoiding the quadratic cost of Transformers. On the LSUI and UIEB benchmarks, Hero-Mamba outperforms earlier methods.

underwater image enhancement, color distortion, contrast, Fourier transform (FFT), CNN (Convolutional Neural Network), Transformer, PSNR (Peak Signal-to-Noise Ratio), SSIM (Structural Similarity Index), long-range dependencies, Mamba-based network
Authors
Tejeswar Pokuri, Shivarth Rai
Abstract
Underwater images often suffer from severe degradation, such as color distortion, low contrast, and blurred details, due to light absorption and scattering in water. While learning-based methods like CNNs and Transformers have shown promise, they face critical limitations: CNNs struggle to model the long-range dependencies needed for non-uniform degradation, and Transformers incur quadratic computational complexity, making them inefficient for high-resolution images. To address these challenges, we propose Hero-Mamba, a novel Mamba-based network that achieves efficient dual-domain learning for underwater image enhancement. Our approach uniquely processes information from both the spatial domain (RGB image) and the spectral domain (FFT components) in parallel. This dual-domain input allows the network to decouple degradation factors, separating color/brightness information from texture/noise. The core of our network utilizes Mamba-based SS2D blocks to capture global receptive fields and long-range dependencies with linear complexity, overcoming the limitations of both CNNs and Transformers. Furthermore, we introduce a ColorFusion block, guided by a background light prior, to restore color information with high fidelity. Extensive experiments on the LSUI and UIEB benchmark datasets demonstrate that Hero-Mamba outperforms state-of-the-art methods. Notably, our model achieves a PSNR of 25.802 dB and an SSIM of 0.913 on LSUI, validating its superior performance and generalization capabilities.
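
To make the dual-domain input and the reported PSNR metric more concrete, the following is a minimal NumPy sketch and not the authors' implementation. The abstract does not say which FFT components the network consumes, so the amplitude/phase split used here is an assumption, and the function names `spectral_components`, `reconstruct`, and `psnr` are hypothetical. The PSNR function uses the standard definition, 10 * log10(peak^2 / MSE), with images assumed to be scaled to [0, 1].

```python
import numpy as np


def spectral_components(rgb):
    """Split an (H, W, 3) image into per-channel FFT amplitude and phase.

    Assumption: amplitude and phase are the "FFT components"; the abstract
    does not specify this.
    """
    spectrum = np.fft.fft2(rgb, axes=(0, 1))  # 2D FFT over height and width
    return np.abs(spectrum), np.angle(spectrum)


def reconstruct(amplitude, phase):
    """Invert the split: rebuild the spatial image from amplitude and phase."""
    spectrum = amplitude * np.exp(1j * phase)
    return np.fft.ifft2(spectrum, axes=(0, 1)).real


def psnr(reference, estimate, peak=1.0):
    """Peak signal-to-noise ratio in dB for images with maximum value `peak`."""
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(peak ** 2 / mse)


# Stand-in for an underwater RGB image: random values in [0, 1).
rng = np.random.default_rng(0)
image = rng.random((32, 32, 3))

# Spatial branch input: the image itself. Spectral branch input: amplitude and phase.
amplitude, phase = spectral_components(image)

# The split loses no information: the round trip recovers the image up to
# floating-point error.
restored = reconstruct(amplitude, phase)
round_trip_error = float(np.max(np.abs(restored - image)))
```

Because the amplitude/phase split is invertible, a network that receives both the RGB image and these components sees the same content in two representations; how Hero-Mamba fuses the two branches is not described in the abstract and is not modeled here.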