Journal of Southern Medical University ›› 2025, Vol. 45 ›› Issue (2): 409-421.doi: 10.12122/j.issn.1673-4254.2025.02.22

Previous Articles    

An efficient and lightweight skin pathology detection method based on multi-scale feature fusion using an improved RT-DETR model

Yuying REN(), Lingxiao HUANG(), Fang DU, Xinbo YAO   

  1. School of Information Engineering, Ningxia University// Ningxia Key Laboratory of Artificial Intelligence and Information Security for Channeling Computing Resources from the East to the West//Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Yinchuan 750021, China
  • Received:2024-10-30 Online:2025-02-20 Published:2025-03-03
  • Contact: Lingxiao HUANG E-mail:ran96822@stu.nxu.edu.cn;huanglx@nxu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62062058)

Abstract:

Objective The presence of multi-scale skin lesion regions and image noise interference and limited resources of auxiliary diagnostic equipment affect the accuracy of skin disease detection in skin disease detection tasks. To solve these problems, we propose a highly efficient and lightweight skin disease detection model using an improved RT-DETR model. Method A lightweight FasterNet was introduced as the backbone network and the FasterNetBlock module was parametrically refined. A Convolutional and Attention Fusion Module (CAFM) was used to replace the multi-head self-attention mechanism in the neck network to enhance the ability of the AIFI-CAFM module for capturing global dependencies and local detail information. The DRB-HSFPN feature pyramid network was designed to replace the Cross-Scale Feature Fusion Module (CCFM) to allow the integration of contextual information across different scales to improve the semantic feature expression capacity of the neck network. Finally, combining the advantages of Inner-IoU and EIoU, the Inner-EIoU was used to replace the original loss function GIOU to further enhance the model's inference accuracy and convergence speed. Results The experimental results on the HAM10000 dataset showed that the improved RT-DETR model, as compared with the original model, had increased mAP@50 and mAP@50:95 by 4.5% and 2.8%, respectively, with a detection speed of 59.1 frames per second (FPS). The improved model had a parameter count of 10.9 M and a computational load of 19.3 GFLOPs, which were reduced by 46.0% and 67.2% compared to those of the original model, validating the effectiveness of the improved model. Conclusion The proposed SD-DETR model significantly improves the performance of skin disease detection tasks by effectively extracting and integrating multi-scale features while reducing both parameter count and computational load.

Key words: skin disease, lightweight network, multi-feature fusion, attention mechanism, RT-DETR