Journal of Southern Medical University ›› 2025, Vol. 45 ›› Issue (6): 1317-1326.doi: 10.12122/j.issn.1673-4254.2025.06.21

Previous Articles    

SG-UNet: a melanoma segmentation model enhanced with global attention and self-calibrated convolution

Huanyu JI1(), Rui WANG2, Shengxiang GAO1,4, Wengang CHE1,3()   

  1. 1.Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
    2.School of Data Science and Engineering, Kunming City College, Kunming 650106, China
    3.Yunnan Key Laboratory of Computer Technology Applications, Kunming University of Science and Technology, Kunming 650500, China
    4.Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China
  • Received:2024-11-13 Online:2025-06-20 Published:2025-06-27
  • Contact: Wengang CHE E-mail:2285873874@qq.com;goooglethink@gmail.com
  • Supported by:
    National Natural Science Foundation of China(U23A20388)

Abstract:

Objective We propose a new melanoma segmentation model, SG-UNet, to enhance the precision of melanoma segmentation in dermascopy images to facilitate early melanoma detection. Methods We utilized a U-shaped convolutional neural network, UNet, and made improvements to its backbone, skip connections, and downsampling pooling sections. In the backbone, with reference to the structure of VGG, we increased the number of convolutions from 10 to 13 in the downsampling part of UNet to achieve a deepened network hierarchy that allowed capture of more refined feature representations. To further enhance feature extraction and detail recognition, we replaced the traditional convolution the backbone section with self-calibrated convolution to enhance the model's ability to capture both spatial and channel dimensional features. In the pooling part, the original pooling layer was replaced by Haar wavelet downsampling to achieve more effective multi-scale feature fusion and reduce the spatial resolution of the feature map. The global attention mechanism was then incorporated into the skip connections at each layer to enhance the understanding of contextual information of the image. Results The experimental results showed that the SG-UNet model achieved significantly improved segmentation accuracy on ISIC 2017 and ISIC 2018 datasets as compared with other current state-of-the-art segmentation models, with Dice reached 92.41% and 86.62% and IoU reaching 92.31% and 86.48% on the two datasets, respectively. Conclusion The proposed model is capable of effective and accurate segmentation of melanoma from dermoscopy images.

Key words: image segmentation, global attention mechanism, melanoma, UNet, self-calibrated convolution, Haar wavelet downsampling, SG-UNet