Journal of Southern Medical University ›› 2026, Vol. 46 ›› Issue (2): 466-472.doi: 10.12122/j.issn.1673-4254.2026.02.24

Previous Articles    

Comparison of missing data handling methods for AC1 coefficient estimation

Keke LI(), Lishan XU, Milai YU, Shengli AN()   

  1. Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou 510515, China
  • Received:2025-07-22 Online:2026-02-20 Published:2026-03-10
  • Contact: Shengli AN E-mail:kk20001205@163.com;1069766473@qq.com

Abstract:

Objective To explore the impact of different missing data handling methods on AC1 coefficient estimation through simulation studies. Methods Monte Carlo simulation was used to generate evaluation data under different missing mechanisms. The parameters generated included the number of raters, categories, sample size, disease prevalence, random rating probability, and missing proportion. Four missing data handling methods, by excluding subjects with zero ratings, excluding subjects with incomplete ratings, rater mode imputation, and subject mode imputation, were compared using bias and mean squared error (MSE) as metrics. Results When disease prevalence was balanced or the missing data mechanism was missing completely at random (MCAR) or at random (MAR), excluding subjects with zero ratings showed the best performance, with bias and MSE close to zero at a missing proportion below 30%. Under skewed prevalence and missing not at random (MNAR), subject mode imputation was superior for AC1 coefficient estimation, resulting in a bias within ±0.10 and an MSE below 0.09; for a sufficient sample size and a missing proportion ≤30%, the MSE of this method was nearly zero. Rater mode imputation showed the worst performance across all these scenarios. Excluding subjects with incomplete ratings resulted in an acceptable error only in relatively simple settings (two raters and two categories) with low a missing proportion under MCAR/MAR, but showed a poor stability in other scenarios. Conclusion No universally optimal method exists for handling missing data in AC1 estimation. We recommend excluding subjects with zero ratings for balanced prevalence or MCAR/MAR, and subject mode imputation for skewed prevalence under MNAR. Researchers should report AC1 estimates from multiple methods to allow assessment of result sensitivity.

Key words: agreement evaluation, nominal ratings, AC1 coefficient, missing data