Journal of Southern Medical University ›› 2026, Vol. 46 ›› Issue (1): 231-238.doi: 10.12122/j.issn.1673-4254.2026.01.25

Previous Articles    

Research progress of large language models in tumor diagnosis: applications in textual reports and medical imaging

Haoran CHENG1,2(), Hongbin YAN3, Ziyun YUAN4, Zehong ZHUANG1,2, Xuegang SUN5, Xueqing YAO1,2,6   

  1. 1.Department of Gastrointestinal Surgery, Department of General Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou 510080, China
    2.Department of General Surgery, Guangdong Provincial People's Hospital Ganzhou Hospital (Ganzhou Municipal Hospital), Ganzhou 341099, China
    3.First Clinical Medical School
    4.Department of General Surgery, Zhongshan People's Hospital, Zhongshan 528400, China
    5.School of Chinese Medicine, Southern Medical University, Guangzhou 510515, China
    6.School of Medicine, South China University of Technology, Guangzhou 510641, China
  • Received:2025-09-02 Online:2026-01-20 Published:2026-01-16
  • Contact: Xueqing YAO E-mail:chenghaoran@gdph.org.cn
  • Supported by:
    National Natural Science Foundation of China(82260501)

Abstract:

Large language models (LLMs) are emerging artificial intelligence technologies with strong text and image processing capabilities, offering critical support for the intelligent transformation of healthcare and improving clinical efficiency and quality. This review summarizes the current applications, technical features, and future directions of LLMs in cancer diagnosis, focusing on two key scenarios: automated analysis of textual reports (e.g., imaging, pathology, and case summaries) and multimodal diagnosis combining text and medical images. Findings show that LLMs now perform at a level comparable to general resident physicians in cancer diagnosis but are still incapable of making specialized and precise judgments. They also exhibit application-specific traits, such as parameter-efficient models adapted for grassroots-level scenario and divergent versatility in multilingual report analysis. Future efforts should prioritize developing specialized, practical medical LLMs through optimized fine-tuning strategies, construction of high-quality Chinese medical datasets, and integration with vision-language models to promote the clinical application of these models and increase the accessibility of healthcare resources.

Key words: large language models, artificial intelligence, cancer diagnosis, pathology, medical imaging