基于堆叠学习的18F-FDG PET/CT影像组学模型在弥漫大B细胞淋巴瘤疗效预测中的价值

Value of 18F-FDG PET/CT radiomics model based on stack learning in predicting the efficacy of diffuse large B-cell lymphoma

  • 摘要:
    目的 探讨基于堆叠学习的18F-氟脱氧葡萄糖(FDG) PET/CT影像组学模型对弥漫大B细胞淋巴瘤(DLBCL)的疗效预测价值。
    方法  回顾性分析2021年3月至2024年6月于长治市人民医院行18F-FDG PET/CT检查的82例DLBCL患者的临床资料,其中男性44例、女性38例,年龄(57.9±12.8)岁。采用简单随机抽样方法按3∶1的比例将所有患者分为训练集(n=61)和测试集(n=21)。根据Lugano 2014分类法的多维尔评分将患者分为完全缓解组和未完全缓解组。根据标准化摄取值(SUV)≥4.0并且体积临界值≥3 ml对治疗前18F-FDG PET图像进行感兴趣区(ROI)的预勾画,手动调整ROI后得到297个影像组学特征,使用最小绝对收缩和选择算子(LASSO)计算影像组学评分(RS)。结合DLBCL的临床特征及RS,基于4种基础机器学习模型(梯度提升机、随机森林、朴素贝叶斯和神经网络)开发堆叠学习模型用于预测DLBCL疗效。计量资料采用Studen′s t检验或Mann-Whitney U检验进行组间比较;计数资料采用χ2检验进行比较;采用受试者工作特征曲线的曲线下面积(AUC)、准确率、F1分数、精准率和召回率评估各模型的预测效能;采用Delong检验比较不同模型AUC的差异。
    结果  82例患者中,32例获得完全缓解,50例为未完全缓解。训练集中完全缓解组与未完全缓解组患者伴有B症状、国际预后指数(IPI)评分和巨大肿块的差异均有统计学意义(χ2=4.462~8.509,均P<0.05)。通过LASSO分析筛选出9个与DLBCL训练集患者疗效相关的影像组学特征。结合临床特征与RS构建的5种预测模型中,基于堆叠学习的模型的预测效能最佳,其在测试集中的AUC和准确率分别为0.85和0.86。
    结论 基于18F-FDG PET/CT影像组学与临床特征的堆叠学习模型可以改善DLBCL患者的风险分层。

     

    Abstract:
    Objective  To develop a 18F-fluorodeoxyglucose (FDG) PET/CT radiomics model based on stack learning for predicting the treatment response of patients with diffuse large B-cell lymphoma (DLBCL).
    Methods  Retrospective analysis was conducted on 82 patients with DLBCL (44 males and 38 females, with a mean age of (57.9±12.8) years) who underwent 18F-FDG PET/CT at Changzhi People′s Hospital between March 2021 and June 2024. All patients were randomly divided into the training set (n=61) and the test set (n=21) in a 3∶1 ratio. Based on the Lugano 2014 classification system′s Deauville score, patients were categorized into the complete response group and non-complete response group. Regions of interest (ROI) on pre-treatment 18F-FDG PET images were pre-segmented using the thresholds standardized uptake value (SUV)≥4.0 and volume≥3 ml, followed by manual adjustments to obtain 297 radiomic features. Radiomic scores (RS) were calculated using the least absolute shrinkage and selection operator (LASSO). By integrating clinical characteristics and RS of DLBCL, a stacking learning model was developed using four base machine learning models (gradient boosting machines, random forest, naive Bayes, and neural network) to predict the treatment response of patients with DLBCL. Measurement data were compared between the two groups using either Student′s t-test or Mann-Whitney U test, whereas counting data were analyzed using the χ2 test. The predictive performance of the models was evaluated by metrics, including the area under curve (AUC) of the receiver operating characteristic curve, accuracy, F1 score, precision, and recall. Differences in AUC among the models were compared using the Delong test.
    Results  Of the 82 patients, 32 patients achieved complete response, whereas 50 patients were classified as non-complete response. In the training set, significant differences were observed in the presence of B symptoms, international prognostic index (IPI) scores, and bulky disease between the complete response and non-complete response groups (χ2=4.462–8.509, all P<0.05). LASSO analysis identified nine radiomic features related to treatment response of DLBCL training set patients for model construction. Among the five predictive models combining clinical features and RS, the stacking learning model demonstrated the best performance, achieving an AUC of 0.85 and an accuracy of 0.86 in the test set.
    Conclusion  A stacking learning model integrating 18F-FDG PET/CT radiomics and clinical features improves risk stratification in patients with DLBCL.

     

/

返回文章
返回