Enhancing Underwater Image Segmentation with Deep Learning: A Novel Approach to Dataset Expansion and Preprocessing Techniques

Underwater image processing combined with machine learning offers significant potential for enhancing the capabilities of underwater robots across various marine exploration tasks. Image segmentation, a key aspect of machine vision, is crucial for identifying and isolating objects of interest within underwater images. Traditional segmentation methods, such as threshold-based and morphology-based algorithms, have been employed but need help accurately delineating objects in the complex underwater environment where image degradation is common.

Researchers increasingly use deep learning techniques for underwater image segmentation to address these challenges. Deep learning methods, including semantic and instance segmentation, provide more precise analysis by enabling pixel-level and object-level segmentation. Recent advancements, such as FCN-DenseNet and Mask R-CNN, promise to improve segmentation accuracy and speed. However, further research is needed to overcome challenges like limited dataset availability and image quality degradation, ensuring robust performance in underwater exploration scenarios.

To deal with the challenges posed by limited underwater image datasets and image quality degradation, a research team from China recently published a new paper proposing innovative solutions.

The proposed method is based on the following steps: Firstly, they expanded the size of the underwater image dataset by employing techniques such as image rotation, flipping, and a Generative Adversarial Network (GAN) to generate additional images. Secondly, they applied an underwater image enhancement algorithm to preprocess the dataset, addressing issues related to image quality degradation. Thirdly, the researchers reconstructed the deep learning network by removing the last layer of the feature map with the largest receptive field in the Feature Pyramid Network (FPN) and replacing the original backbone network with a lightweight feature extraction network.

Using image transformations and a ConSinGan network, they enhanced the initial images from the Underwater Robot Picking Contest (URPC2020) to create an underwater image dataset, for instance, segmentation. This network uses three convolutional layers to expand the dataset by producing higher-resolution images after several training cycles. They also labeled target positions and categories using a Mask R-CNN network for image annotation, building a fully labeled dataset in Visual Object Classes (VOC) format. Creating new datasets increases their diversity and unpredictability, which is important for developing strong segmentation models that can adapt to various undersea conditions.

The experimental study assessed the effectiveness of the proposed approach in enhancing underwater image quality and refining instance segmentation accuracy. Quantitative metrics, including information entropy, root mean square contrast, average gradient, and underwater color image quality evaluation, were utilized to evaluate image enhancement algorithms, where the combination algorithm, notably WAC, exhibited superior performance. Validation experiments confirmed the efficacy of data augmentation techniques in refining segmentation accuracy and underscored the effectiveness of image preprocessing algorithms, with WAC surpassing alternative methods. Modifications to the Mask R-CNN network, particularly the Feature Pyramid Network (FPN), improved segmentation accuracy and processing speed. Integrating image preprocessing with network enhancements further bolstered recognition and segmentation accuracy, validating the approach’s efficacy in underwater image analysis and segmentation tasks.

In summary, integrating underwater image processing with machine learning holds promise for enhancing underwater robot capabilities in marine exploration. Deep learning techniques, including semantic and instance segmentation, offer precise analysis despite the challenges of the underwater environment. Recent advancements like FCN-DenseNet and Mask R-CNN show potential for improving segmentation accuracy. A recent study proposed a comprehensive approach involving dataset expansion, image enhancement algorithms, and network modifications, demonstrating effectiveness in enhancing image quality and refining segmentation accuracy. This approach has significant implications for underwater image analysis and segmentation tasks.

Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and Google News. Join our 37k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our Telegram Channel

Mahmoud is a PhD researcher in machine learning. He also holds a
bachelor's degree in physical science and a master's degree in
telecommunications and networking systems. His current areas of
research concern computer vision, stock market prediction and deep
learning. He produced several scientific articles about person re-
identification and the study of the robustness and stability of deep

🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and many others...