A computed tomography-based lung radiomics nomogram to identify acute exacerbation of chronic obstructive pulmonary disease: a multi-institutional validation study
Highlight box
Key findings
• The extraction of radiomic features from the whole lung volume using computed tomography (CT) and machine learning algorithms can effectively construct a model to identify acute exacerbation of chronic obstructive pulmonary disease (AECOPD), where the logistic regression (LR) model exhibited the best diagnostic performance with high accuracy in discriminating the status of acute exacerbation of chronic obstructive pulmonary disease (COPD), showing area under the curve (AUC) values of 0.974, 0.836, and 0.944 in the training, internal, and external validation cohorts.
• The radiomic signature, constructed via least absolute shrinkage and selection operator LR after feature selection, demonstrated better predictive ability compared to the clinical model for distinguishing AECOPD, with AUCs in all three datasets exceeding those of the clinical-only model.
What is known and what is new?
• It is known that objective biomarkers for quantifying acute exacerbation severity in COPD remain limited, with current assessments relying heavily on clinical symptoms and spirometry. While radiomics has emerged as a tool to extract sub-visual imaging features reflecting disease heterogeneity, its integration with functional parameters for exacerbation assessment is underexplored.
• This study demonstrates a novel framework combining CT radiomic features with pulmonary function tests to diagnose and stratify AECOPD. This approach aligns with advancing computational methods in respiratory medicine 2 and addresses the unmet need for multimodal biomarkers highlighted in recent COPD research.
What is the implication, and what should change now?
• The CT-based whole-lung radiomic nomogram accurately identifies AECOPD, furnishing a robust foundation for clinical diagnosis and treatment planning.
Introduction
Chronic obstructive pulmonary disease (COPD) is a chronic, progressive, and irreversible chronic respiratory disease characterized by abnormalities in airway (such as bronchitis and bronchiolitis) and/or alveoli (like emphysema), accompanied by persistent respiratory symptoms including dyspnea, cough, expectoration, and/or acute exacerbations (1). COPD constitutes a major global public-health challenge, marked by high morbidity and mortality (2,3).
In the course of COPD, patients experience worsening respiratory symptoms, a gradual decline in lung function, and even suffer from respiratory failure and systemic inflammatory response syndrome. An acute worsening of respiratory symptoms requiring additional therapy defines an acute exacerbation of chronic obstructive pulmonary disease (AECOPD) (4). AECOPD plays a critical role in the natural history of COPD, accelerating lung-function decline, aggravating comorbidities, eroding quality of life, and driving high morbidity and mortality. Early identification and proper assessment of AECOPD can facilitate timely treatment and rehabilitation, thereby improving quality of life and reducing the risk of hospitalization. Currently, clinical diagnosis and evaluation of AECOPD rely mainly on the level of serum inflammatory factors and pulmonary function indices, and there is a lack of clinically effective quantitative and objective data to assess the degree of acute exacerbation. Although spirometry remains the diagnostic cornerstone for COPD, its limited capacity to characterize structural lung damage hampers precise AECOPD risk stratification. High-resolution computed tomography (CT) uniquely quantifies three-dimensional patterns of disease progression inaccessible to pulmonary function tests, including airway remodeling, emphysema heterogeneity (5), and vascular pruning (6). The “Global Strategy for the Diagnosis, Management, and Prevention of Chronic Obstructive Lung Disease 2025” (GOLD 2025) has updated the importance of CT in evaluating COPD, as CT imaging provides critical structural insights into COPD phenotypes. Our previous studies (7-9) have likewise demonstrated the high value of chest CT qualitative, quantitative, and functional parameters in diagnosing COPD, grading its severity, and classifying its phenotypes. However, the pronounced heterogeneity of COPD still precludes effective, comprehensive assessment of lung status, particularly in AECOPD patients. Radiomics represents an emerging methodology that enables high-throughput extraction of quantitative features from medical imaging (e.g., CT), clarifying that while predominantly used in pulmonary oncology (10). This approach captures complex, visually imperceptible patterns within images, demonstrating significant potential for advancing clinical decision-making. Current applications in COPD research are rapidly expanding, with mounting evidence indicating distinct advantages of radiomic analysis for characterizing disease manifestations in this population (11,12). Radiomics utilizes computational machine learning (ML) to extract and quantify sub-visual imaging biomarkers that capture disease heterogeneity associated with the pathogenesis of AECOPD (13), while lung function parameters can objectively quantify ventilation injury. The integration of these two approaches enables a comprehensive view of both structural and functional alterations in AECOPD patients. It is hypothesized that this combined strategy will enhance diagnostic accuracy and clinical utility. Accordingly, this study aims to diagnose and assess AECOPD by combining CT radiomic features with pulmonary function parameters, thereby improving the understanding of disease severity and guiding personalized therapeutic strategies. We present this article in accordance with the TRIPOD reporting checklist (available at https://jtd.amegroups.com/article/view/10.21037/jtd-2025-972/rc).
Methods
Patient selection
The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. The study was approved by the ethics committee of the Second Affiliated Hospital of PLA Naval Medical University (approval No. 2022SL068) and individual consent for this retrospective analysis was waived. All participating hospitals were informed and agreed the study. A retrospective analysis was conducted on 343 COPD patients admitted to Zhejiang Provincial People’s Hospital from January 2013 to December 2022, including 158 patients with AECOPD and 185 patients who were not necessarily hospitalized patients. The 343 COPD patients were partitioned into a 240-member training cohort (107 AECOPD/133 non-AECOPD) and a 103-member internal validation cohort (51 AECOPD/52 non-AECOPD). Additionally, 132 patients were included from the Second Affiliated Hospital of PLA Naval Medical University and Harbin Chest Hospital, including 60 patients with AECOPD and 72 patients with non-AECOPD as an external validation set. The inclusion criteria were as follows: (I) the diagnostic criteria of COPD are the forced expiratory volume in one second (FEV1)/forced vital capacity (FVC) being less than 70%; (II) underwent pulmonary function testing and chest CT scanning at an interval of less than 7 days; (III) had complete routine clinical data. The exclusion criteria were as follows: (I) CT images have artifacts that affect quantitative analysis; (II) history of other lung diseases, such as lung cancer, sarcoidosis, lymphoma, etc.; (III) history of lung surgery. AECOPD is independently diagnosed by two respiratory physicians, using the Rome proposal criteria (14), incorporating quantitative severity grading through symptom scores, inflammatory biomarkers, and airflow limitation progression, while the non-AECOPD group was defined as stable COPD patients without acute exacerbation, who are admitted for follow-up or routine medication, and undergo chest CT scans to assess lung conditions (such as emphysema). The routine clinical data of the patients included age, gender and body mass index (BMI). Based on the 2020 Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines (15), patients were divided into four stages (I–IV) based on their FEV1/FVC ratio and ratio of FEV1 to predicted value. Pulmonary function tests included vital capacity (VC), FVC, FEV1, ratio of FEV1 to predicted value (FEV1%pred), FEV1/FVC ratio, ratio of residual volume (RV) to total lung capacity (TLC), peak expiratory flow (PEF), and maximum ventilatory volume (MVV).
CT examination method and CT image preprocessing
All patients underwent chest CT scans following respiratory training. They were positioned in a supine position with their arms raised above their heads and underwent CT scanning after taking a deep inhalation. The scanning range covered the area from the level of the thoracic inlet to the diaphragm. The scanning parameters were as follows: tube voltage of 120 kV, tube current of 200 mA, collimation of 0.564, matrix of 512×512, layer thickness and spacing of 1 or 2 mm, window width and level of 1,350/−600 HU, standard reconstruction algorithm.
Before extracting the radiomic features, the images were preprocessed, which consisted of three steps. First, spatial standardization via trilinear interpolation to isotropic 1 mm3 × 1 mm3 × 1 mm3 resolution harmonized voxel dimensions across different scanners. Second, intensity discretization with fixed-bin-width quantization ensured gray-level consistency. Third, hybrid multiscale filtering combined Laplacian of Gaussian (LoG) denoising and 3-level Symlet wavelet decomposition separated frequency-specific signatures.
CT image segmentation and radiomics feature extraction
The ITK-Snap software (Version 3.8.0; http://www.itksnap.org/) was employed for semi-automatic whole-lung volume of interest (VOI) segmentation: its 3D Segment tool first auto-detected lung parenchyma, after which manual refinements finalized the VOI. Then, in Python software (Version 3.11.1; https://www.python.org/), the open-source “Pyradiomics” software package was used to extract radiomics features for the included patients. A total of 100 original image features and 1,118 derived features were extracted, with the latter generated using Wavelet and LoG filtering. All features comply with the Image Biomarker Standardization Initiative and were Z-score normalized to remove scale differences.
Features selection and radiomics score construction
Feature selection proceeded in two steps. First, Pearson’s rank correlation coefficient was also used to assess the correlation between features, and one of the features with a correlation coefficient greater than 0.9 between any two features was retained. Second, the least absolute shrinkage and selection operator (LASSO) regression algorithm, with penalty parameter tuning, performed ten-fold cross-validation. The optimal feature dataset with the smallest cross-validation binomial deviation was selected, and the non-zero coefficients were defined as the weight of the selected feature, representing the correlation between the feature and AECOPD. Each patient’s Rad-score was computed as the linear combination of selected features and their coefficients, forming the radiomic model.
Model construction and evaluation
Univariate and multivariate analyses were performed to compare the clinical characteristics between the non-AECOPD and the AECOPD groups. Clinical parameters with P<0.05 were selected to construct a clinical model.
The final retained radiomics features were input into 11 ML classifiers, including logistic regression (LR), NaiveBayes, support vector machine (SVM), K-nearest neighbor (KNN), random forest (RF), extra tree (ET), extreme gradient enhancer (XGBoost), light gradient boosting machine (LightGBM), GradientBoosting, AdaBoost and multi-layer perception (MLP), to develop the Rad-signature for identifying AECOPD patients. The receiver operating characteristic (ROC) curve and Hosmer-Lemeshow (HL) test were employed to select the best-performing model. LR was then used to combine clinical and radiomics features, resulting in an optimal clinical radiomic model. A clinical-radiomics nomogram was subsequently constructed for bedside use. To evaluate the predictive capabilities of the three models, ROC curves were drawn for both the training and validation cohorts, and the average area under the curve (AUC), accuracy, sensitivity, and specificity were calculated. The clinical practicability of the models was evaluated using a calibration curve and decision curve analysis (DCA).
Statistical analysis
Data were analyzed using Python (Version 3.11.1) and R (Version 4.2.3 software). For continuous variables, such as clinical and pulmonary function indicators were expressed by the mean ± standard deviation. The Shapiro-Wilk method was used to test for normality. For normally distributed data, an independent sample t-test was used, whereas for non-normally distributed data, a Mann-Whitney U test was used. Categorical variables were presented as frequencies (percentages) and were analyzed using a chi-square test. P<0.05 was considered statistically significant.
Results
Clinical data and pulmonary function results of patients
This study retrospectively enrolled 532 COPD patients; 475 met the inclusion and exclusion criteria. Among them, 257 non-AECOPD patients (41 female) averaged 70.89±9.92 years, and 218 AECOPD patients (52 female) averaged 69.56±9.85 years. The patient selection flowchart is shown in Figure 1. In the training cohort, there was a significant difference in age (P=0.03) between the non-AECOPD and AECOPD groups. Gender and GOLD grade showed no significant differences between groups in the training or internal-validation cohorts (all P>0.05). In the external validation cohort, there are statistical differences in gender between the non-AECOPD and AECOPD groups (P<0.001). BMI did not differ significantly between non-AECOPD and AECOPD groups across any cohort (all P>0.05). Additionally, significant differences were found in pulmonary function parameters, including FVC, FEV1, FEV1/FVC%, PEF and MVV between the non-AECOPD and AECOPD groups in the training cohort (P<0.05) (Table 1).
Table 1
| Clinical data | Training cohort (n=240) | Internal validation cohort (n=103) | External validation cohort (n=132) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Non-AECOPD (n=133) | AECOPD (n=107) | P | Non-AECOPD (n=52) | AECOPD (n=51) | P | Non-AECOPD (n=72) | AECOPD (n=60) | P | |||
| Age (years) | 69.65±10.38 | 72.43±9.14 | 0.03 | 67.71±10.25 | 72.67±9.67 | 0.01 | 67.61±8.46 | 65.58±9.29 | 0.19 | ||
| BMI (kg/m2) | 22.07±3.29 | 22.70±3.67 | 0.20 | 22.28±3.68 | 21.82±3.85 | 0.53 | 23.83±3.74 | 22.62±3.266 | 0.052 | ||
| Gender | 0.97 | 0.76 | <0.001 | ||||||||
| Male | 108 (81.20) | 88 (82.24) | 44 (84.62) | 41 (80.39) | 64 (88.89) | 37 (61.67) | |||||
| Female | 25 (18.80) | 19 (17.76) | 8 (15.38) | 10 (19.61) | 8 (11.11) | 23 (38.33) | |||||
| GOLD | 0.07 | 0.42 | <0.001 | ||||||||
| GOLD I | 8 (6.02) | 9 (8.41) | 1 (1.92) | 0 (0) | 0 (0) | 0 (0) | |||||
| GOLD II | 58 (43.61) | 29 (27.10) | 21 (40.38) | 15 (29.41) | 15 (20.83) | 11 (18.33) | |||||
| GOLD III | 47 (35.34) | 50 (46.73) | 21 (40.38) | 23 (45.10) | 57 (79.17) | 21 (35.00) | |||||
| GOLD IV | 20 (15.04) | 19 (17.76) | 9 (17.31) | 13 (25.49) | 0 (0) | 28 (46.67) | |||||
| VC (L) | 2.34±0.85 | 2.15±0.80 | 0.06 | 2.29±0.63 | 1.91±0.62 | 0.002 | 2.40±0.63 | 1.39±0.58 | <0.001 | ||
| FVC (L) | 2.28±0.84 | 2.09±0.81 | 0.04 | 2.22±0.62 | 1.85±0.62 | 0.002 | 2.34±0.63 | 1.39±0.58 | <0.001 | ||
| FEV1 (L) | 1.34±0.60 | 1.17±0.58 | 0.006 | 1.26±0.47 | 1.01±0.42 | 0.007 | 1.25±0.44 | 0.86±0.40 | <0.001 | ||
| FEV1%pred (%) | 51.82±19.18 | 47.47±20.07 | 0.051 | 46.99±15.27 | 42.48±15.44 | 0.15 | 46.24±12.39 | 33.89±14.48 | <0.001 | ||
| FEV1/FVC (%) | 57.89±9.65 | 54.75±9.47 | 0.008 | 55.53±10.17 | 53.77±8.84 | 0.35 | 53.19±8.58 | 61.25±8.01 | <0.001 | ||
| PEF (L/s) | 3.11±1.42 | 2.78±1.47 | 0.03 | 2.95±1.25 | 2.18±0.99 | 0.002 | 3.88±1.39 | 1.41±0.94 | <0.001 | ||
| MVV (L) | 48.45±22.12 | 42.82±22.61 | 0.02 | 46.27±19.58 | 35.06±17.19 | 0.002 | 50.93±20.81 | 29.77±16.93 | <0.001 | ||
Data are presented as n (%) or mean ± standard deviation. AECOPD, acute exacerbation of chronic obstructive pulmonary disease; BMI, body mass index; FEV1, forced expiratory volume in one second; FEV1/FVC, ratio of the first second forced expiratory volume to forced vital capacity; FEV1%pred, percentage of forced expiratory volume in the one second predicted; FVC, forced vital capacity; GOLD I, patients with FEV1/FVC <0.7 and the FEV1% predicted ≥80% after bronchodilation; GOLD II, patients with FEV1/FVC <0.7 and 50%≤ the FEV1% predicted <80% after bronchodilation; GOLD III, 30%≤ the FEV1% predicted <50% after bronchodilation; GOLD IV, the FEV1% predicted <30%; GOLD, global initiative for chronic obstructive lung disease; MVV, maximum ventilatory volume; PEF, peak expiratory flow; VC, vital capacity.
Construction of Radscore and radiomics model
A total of 1,218 lung radiomic features were extracted from each VOI. After LASSO regression, 61 features with non-zero coefficients were retained (Figure 2). These features and their corresponding coefficients were linearly combined to compute the Radscore, whose formula is provided in the Appendix 1. The 61 selected radiomics features were then utilized to construct a radiomics model for each classifier, and subsequently, the performance of each model was evaluated in the training, internal, and external validation cohorts. Table 2 shows the comparison of AUC values of 11 ML models for identifying AECOPD, and presents statistics on the diagnostic efficacy of the radiomics models constructed by the various classifiers. Notably, the LR radiomics model achieved excellent performance, with AUCs of 0.974 (95% CI: 0.957–0.990), 0.836 (95% CI: 0.759–0.913) and 0.944 (95% CI: 0.902–0.986) in the training, internal and external validation cohorts, respectively. ROC analyses for the different radiomics models across the three cohorts are illustrated in Figure 3.
Table 2
| ML | Group | Accuracy | AUC (95% CI) | Sensitivity | Specificity | PPV | NPV | Precision | Recall | F1 |
|---|---|---|---|---|---|---|---|---|---|---|
| LR | Train | 0.933 | 0.974 (0.957–0.990) | 0.944 | 0.925 | 0.910 | 0.953 | 0.910 | 0.944 | 0.927 |
| Internal | 0.786 | 0.836 (0.759–0.913) | 0.667 | 0.904 | 0.872 | 0.734 | 0.872 | 0.667 | 0.756 | |
| External | 0.894 | 0.944 (0.902–0.986) | 0.783 | 0.986 | 0.979 | 0.845 | 0.979 | 0.783 | 0.870 | |
| NaiveBayes | Train | 0.717 | 0.751 (0.689–0.813) | 0.664 | 0.759 | 0.689 | 0.737 | 0.689 | 0.664 | 0.676 |
| Internal | 0.689 | 0.702 (0.601–0.804) | 0.471 | 0.904 | 0.828 | 0.635 | 0.828 | 0.471 | 0.600 | |
| External | 0.606 | 0.568 (0.468–0.668) | 0.400 | 0.778 | 0.600 | 0.609 | 0.600 | 0.400 | 0.480 | |
| SVM | Train | 0.917 | 0.964 (0.940–0.988) | 0.916 | 0.917 | 0.899 | 0.931 | 0.899 | 0.916 | 0.907 |
| Internal | 0.748 | 0.819 (0.740–0.898) | 0.588 | 0.904 | 0.857 | 0.691 | 0.857 | 0.588 | 0.698 | |
| External | 0.758 | 0.805 (0.729–0.880) | 0.683 | 0.819 | 0.759 | 0.756 | 0.759 | 0.683 | 0.719 | |
| KNN | Train | 0.775 | 0.85 (0.804–0.896) | 0.579 | 0.932 | 0.873 | 0.734 | 0.873 | 0.579 | 0.697 |
| Internal | 0.631 | 0.736 (0.643–0.828) | 0.431 | 0.827 | 0.710 | 0.597 | 0.710 | 0.431 | 0.537 | |
| External | 0.591 | 0.624 (0.530–0.717) | 0.250 | 0.875 | 0.625 | 0.583 | 0.625 | 0.250 | 0.357 | |
| RandomForest | Train | 0.983 | 0.999 (0.998–1.000) | 0.963 | 1.000 | 1.000 | 0.971 | 1.000 | 0.963 | 0.981 |
| Internal | 0.583 | 0.648 (0.544–0.752) | 0.510 | 0.654 | 0.591 | 0.576 | 0.591 | 0.510 | 0.547 | |
| External | 0.576 | 0.573 (0.477–0.670) | 0.233 | 0.861 | 0.583 | 0.574 | 0.583 | 0.233 | 0.333 | |
| ExtraTrees | Train | 0.554 | 1 (1.000–1.000) | 0.000 | 1.000 | 0.000 | 0.554 | 0.000 | 0.000 | |
| Internal | 0.621 | 0.708 (0.611–0.806) | 0.451 | 0.788 | 0.676 | 0.594 | 0.676 | 0.451 | 0.541 | |
| External | 0.591 | 0.57 (0.472–0.668) | 0.167 | 0.944 | 0.714 | 0.576 | 0.714 | 0.167 | 0.270 | |
| XGBoost | Train | 0.996 | 1 (1.000–1.000) | 0.991 | 1.000 | 1.000 | 0.993 | 1.000 | 0.991 | 0.995 |
| Internal | 0.767 | 0.794 (0.703–0.884) | 0.706 | 0.827 | 0.800 | 0.741 | 0.800 | 0.706 | 0.750 | |
| External | 0.500 | 0.46 (0.361–0.560) | 0.783 | 0.264 | 0.470 | 0.594 | 0.470 | 0.783 | 0.587 | |
| LightGBM | Train | 0.950 | 0.989 (0.980–0.998) | 0.963 | 0.940 | 0.928 | 0.969 | 0.928 | 0.963 | 0.945 |
| Internal | 0.728 | 0.728 (0.627–0.829) | 0.745 | 0.712 | 0.717 | 0.740 | 0.717 | 0.745 | 0.731 | |
| External | 0.515 | 0.316 (0.219–0.412) | 0.150 | 0.819 | 0.409 | 0.536 | 0.409 | 0.150 | 0.220 | |
| GradientBoosting | Train | 0.933 | 0.97 (0.948–0.993) | 0.907 | 0.955 | 0.942 | 0.927 | 0.942 | 0.907 | 0.924 |
| Internal | 0.699 | 0.754 (0.660–0.847) | 0.549 | 0.846 | 0.778 | 0.657 | 0.778 | 0.549 | 0.644 | |
| External | 0.765 | 0.706 (0.609–0.802) | 0.767 | 0.764 | 0.730 | 0.797 | 0.730 | 0.767 | 0.748 | |
| AdaBoost | Train | 0.829 | 0.896 (0.857–0.935) | 0.766 | 0.880 | 0.837 | 0.824 | 0.837 | 0.766 | 0.800 |
| Internal | 0.650 | 0.699 (0.598–0.800) | 0.608 | 0.692 | 0.660 | 0.643 | 0.660 | 0.608 | 0.633 | |
| External | 0.394 | 0.43 (0.328–0.532) | 0.283 | 0.486 | 0.315 | 0.449 | 0.315 | 0.283 | 0.298 | |
| MLP | Train | 0.900 | 0.953 (0.928–0.979) | 0.841 | 0.947 | 0.928 | 0.881 | 0.928 | 0.841 | 0.882 |
| Internal | 0.748 | 0.798 (0.710–0.886) | 0.745 | 0.750 | 0.745 | 0.750 | 0.745 | 0.745 | 0.745 | |
| External | 0.629 | 0.644 (0.548–0.740) | 0.517 | 0.722 | 0.608 | 0.642 | 0.608 | 0.517 | 0.559 |
AECOPD, acute exacerbation of chronic obstructive pulmonary disease; AUC, area under the curve; CI, confidence interval; KNN, K-nearest neighbor; LightGBM, light gradient boosting machine; LR, logistic regression; ML, machine learning; MLP, multi-layer perceptron; NPV, negative predictive value; PPV, positive predictive value; SVM, support vector machine; XGBoost, extreme gradient enhancer.
Establishment and performance of the clinical model and the Radiomics-Clinical model
Given the small sample size, univariate analysis of clinical parameters was performed across all patients. The results revealed that there were significant differences in FEV1, FVC, VC, FEV1%pred, FEV1/FVC%, PEF and MVV between the non-AECOPD and AECOPD cohorts (all P<0.05). Subsequently, multivariate analysis of these six clinical parameters in the training cohort showed that FEV1 was an independent predictor of AECOPD (P=0.04). Table 3 shows the results of univariate and multivariate analysis.
Table 3
| Clinical parameters | Univariate analysis | Multivariate analysis | |||
|---|---|---|---|---|---|
| OR (95% CI) | P | OR (95% CI) | P | ||
| Gender | 0.841 (0.708–0.999) | 0.10 | – | – | |
| Age (year) | 0.998 (0.995–1.001) | 0.17 | – | – | |
| BMI (kg/m2) | 0.992 (0.982–1.001) | 0.15 | – | – | |
| FEV1 (L) | 0.792 (0.678–0.926) | 0.01 | 0.115 (0.021–0.631) | 0.04 | |
| FVC (L) | 0.885 (0.807–0.97) | 0.03 | 0.693 (0.047–10.105) | 0.82 | |
| VC (L) | 0.889 (0.813–0.973) | 0.03 | 2.806 (0.216–36.416) | 0.51 | |
| FEV1%pred | 0.995 (0.991–0.999) | 0.03 | 1.015 (0.993–1.039) | 0.26 | |
| FEV1/FVC% | 0.995 (0.992–0.999) | 0.04 | – | – | |
| GOLD | 0.946 (0.875–1.021) | 0.23 | – | – | |
| PEF (L/s) | 0.913 (0.855–0.975) | 0.02 | 1.014 (0.769–1.336) | 0.94 | |
| MVV (L) | 0.994 (0.99–0.998) | 0.02 | 1.012 (0.98–1.044) | 0.54 | |
BMI, body mass index; CI, confidence interval; FEV1, forced expiratory volume in one second; FEV1/FVC, ratio of the first second forced expiratory volume to forced vital capacity; FEV1%pred, percentage of forced expiratory volume in the one second predicted; FVC, forced vital capacity; GOLD, global initiative for chronic obstructive lung disease; LR, logistic regression; MVV, maximum ventilatory volume; OR, odds ratios; PEF, peak expiratory flow; VC, vital capacity.
Clinical predictor FEV1 was finally combined with the LR-based radscore with high stability and reliability to construct the integrated nomogram in Figure 4A. The HL test indicated good agreement (P>0.05) for the calibration curves of the nomograms identifying AECOPD across the training, internal, and external validation cohorts (Figure 4B-4D). The clinical model performed poorly in the external validation cohort, likely owing to the limited number of independent predictive factors. In all three cohorts, the calibration curves of the clinical radiomics nomogram and the radiomics model fit the diagonal better than the clinical model, reflecting more accurate AECOPD predictions. The DCA (Figure 5A-5C) of the training, internal and external validation models were above the two reference lines, confirming their clinical net benefit.
The performance of the three models was compared from the aspects of accuracy, AUC, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), precision, recall and F1. All statistics are listed in Table 4. The clinical model showed an AUC of 0.630 (95% CI: 0.559–0.701) with balanced sensitivity and specificity of 0.393 and 0.850, respectively, in the training cohort. In the internal and external validation cohorts, the AUC was 0.636 (95% CI: 0.526–0.746) and 0.734 (95% CI: 0.645–0.823), respectively. The AUC of radiomics was 0.974 (95% CI: 0.957–0.991) in the training cohort and 0.836 (95% CI: 0.758–0.913) in the internal validation cohort and 0.944 (95% CI: 0.902–0.986) in the external validation cohort, respectively. The AUC of the clinical radiomics nomogram was 0.974 (95% CI: 0.957–0.991) in the training cohort, 0.849 (95% CI: 0.775–0.922) in the internal validation cohort, and 0.957 (95% CI: 0.921–0.993) in the external validation cohort, respectively (Figure 6). The clinical radiomics nomogram delivered the highest AUC in both training and validation cohort and is a reliable tool for clinical identification of AECOPD. An application example of a nomogram is shown in Figure 7. Similar to the points scoring system, we assigned points for each predictor of AECOPD and then equated these predictors with the risk of AECOPD. We can read the top score scale upward from the predictors to determine the points score associated with FEV1 and the Radscore (LR). Once a score has been assigned to each predictor, an overall score is calculated. Then, the total score is converted to the probability of AECOPD by reading the associated prob ability of AECOPD from the total point scale.
Table 4
| Cohort | Model | Accuracy | AUC (95% CI) | Sensitivity | Specificity | PPV | NPV | Precision | Recall | F1 |
|---|---|---|---|---|---|---|---|---|---|---|
| Training cohort | Clinic signature | 0.646 | 0.630 (0.559–0.701) | 0.393 | 0.850 | 0.677 | 0.635 | 0.677 | 0.393 | 0.497 |
| Rad signature | 0.933 | 0.974 (0.957–0.991) | 0.944 | 0.925 | 0.910 | 0.953 | 0.910 | 0.944 | 0.927 | |
| Nomogram | 0.933 | 0.974 (0.957–0.991) | 0.944 | 0.925 | 0.910 | 0.953 | 0.910 | 0.944 | 0.927 | |
| Internal validation cohort | Clinic signature | 0.650 | 0.636 (0.526–0.746) | 0.765 | 0.538 | 0.619 | 0.700 | 0.619 | 0.765 | 0.684 |
| Rad signature | 0.786 | 0.836 (0.758–0.913) | 0.667 | 0.904 | 0.872 | 0.734 | 0.872 | 0.667 | 0.756 | |
| Nomogram | 0.796 | 0.849 (0.775–0.922) | 0.686 | 0.904 | 0.875 | 0.746 | 0.875 | 0.686 | 0.769 | |
| External validation cohort | Clinic signature | 0.553 | 0.734 (0.645–0.823) | 0.278 | 0.883 | 0.505 | 0.741 | 0.455 | 0.083 | 0.141 |
| Rad signature | 0.894 | 0.944 (0.902–0.986) | 0.783 | 0.903 | 0.877 | 0.867 | 0.877 | 0.833 | 0.855 | |
| Nomogram | 0.902 | 0.957 (0.921–0.993) | 0.800 | 0.931 | 0.907 | 0.859 | 0.907 | 0.817 | 0.860 |
AUC, area under the curve; CI, confidence interval; NPV, negative predictive value; PPV, positive predictive value.
The Delong test was used to compare the AUC of the models. Significant differences were observed between the clinical model and the nomogram, and between the clinical model and the radiomics model, across all three cohorts (all P<0.05). However, no significant difference was detected between the radiomic model and the nomogram model in the training [AUC =0.974 (95% CI: 0.967–0.991) vs. AUC =0.974 (95% CI: 0.956–0.991); P=0.74], internal [AUC =0.836 (95% CI: 0.758–0.913) vs. AUC =0.849 (95% CI: 0.775–0.922); P=0.08] and external validation cohort [AUC =0.944 (95% CI: 0.902–0.986) vs. AUC =0.957 (95% CI 0.921–0.993); P=0.06].
Discussion
This study demonstrated that radiomics features extracted from chest CT images can effectively identify AECOPD, and that the model integrating clinical data offers superior performance, facilitating timely and effective COPD management in clinical practice.
At present, AECOPD diagnosis mainly relies on clinical manifestations, which are highly subjective and lack objective biomarkers (16). While pulmonary function test (PFT) remains the gold standard for diagnosing COPD, it is often challenging to perform satisfactorily in AECOPD patients due to compromised lung function, particularly a reduction in FEV1. In this study, univariate and multivariate analyses of the training cohort showed that FEV1 was an independent predictor of AECOPD (P=0.04), a finding consistent with findings from the Korean COPD Subgroup Study (KOCOSS) queue in South Korea (17).
AECOPD is a highly heterogeneous disorder marked by small airway remodeling, small vessel remodeling, and emphysema formation (18). CT scanning is considered to be the most effective method to characterize and quantify the features of COPD (19). Chest CT can visually depict structural changes, delineate their anatomical distribution, and enable individualised phenotyping of airways, vessels, and lung density, which is crucial for early COPD diagnosis and longitudinal evaluation (20). A large number of studies have demonstrated the diagnostic potential of imaging features for AECOPD (21-23). Through lung CT imaging, it has been found that CT imaging measurements can help predict the utilization of subsequent medical services by COPD patients (24). Furthermore, artificial intelligence (AI) has been widely used in the research of COPD (25). CT texture-based radiomics has proven effective in predicting COPD occurrence and severity. The performance of CT images was found to be effective in evaluating the severity classification of COPD. ML is a subfield of AI that utilizes algorithms and statistical models to identify patterns in data, capable of handling complex nonlinear relationships between predictor variables and generating novel results (26). Compared with traditional models, ML handles large, heterogeneous datasets more robustly, delivering accurate risk predictions even in the presence of noise or missing data, and obviates stringent preprocessing (27). While traditional early screening and risk assessment for AECOPD rely on direct clinical observation of respiratory symptoms and pulmonary function tests, along with evaluations of known risk factors, these methods are limited in handling multi-source data and capturing complex disease patterns, necessitating the use of ML to construct early screening models for AECOPD. Li et al. applied CT-based radiomics to the identification of COPD, SVM and LR ML algorithms were used, the AUCs of SVM and LR models in the training set were 0.992 and 0.993, respectively (28). Numerous machine-learning algorithms exist, yet which yields optimal performance remains unclear (29). In this study, 61 radiomics features were analyzed to construct a diagnostic ML model for AECOPD. Across eleven algorithms, the LR model achieved the highest diagnostic efficiency, with the training cohort AUC of 0.974 (95% CI: 0.957–0.990), internal validation cohort AUC of 0.836 (95% CI: 0.759–0.913), and an external validation cohort AUC of 0.944 (95% CI: 0.902–0.986). Consequently, the LR ML algorithm was selected and the corresponding Rad-score calculated from CT images for further analysis.
Based on clinical data and pulmonary function parameters, regression analysis can be used to evaluate the risk of AECOPD with moderate power. A study indicated that texture features based on chest CT images are more effective in predicting AECOPD patients than clinical features such as BMI, airflow Obstruction, Dyspnea and Exercise capacity (BODE), biomechanics, or acute exacerbation history (30). To compare effectiveness, a clinical model was constructed from meaningful parameters. Its AUCs for identifying AECOPD were 0.630 (95% CI: 0.559–0.701) in the training cohort, 0.636 (95% CI: 0.526–0.746) in the internal validation cohort, and 0.734 (95% CI: 0.645–0.823) in the external validation cohort, all lower than the corresponding radiomics model. Preliminary research showed that the COPD identification model combining CT with meaningful clinical parameters outperformed the model built solely on CT imaging radiomics. A study found that a multi-modal data combination strategy based on chest high-resolution CT (HRCT) images and pulmonary function parameters can intelligently identify respiratory distress complications in COPD patients (31). Therefore, we also explored the combination of radiomics and integrating statistically significant pulmonary function parameters and Rad-score values to construct a comprehensive discrimination model, and found that the AUC values of the training cohort (AUC is 0.974), internal validation cohort (AUC is 0.849), and external validation (AUC is 0.957) cohort of this model were relatively higher than those of the rad-score and radiomics model of 0.974 (95% CI: 0.957–0.991), 0.836 (95% CI: 0.758–0.913), and 0.944 (95% CI: 0.902–0.986), respectively. However, the diagnostic performance of the Radiomics-Clinical model in this study still has room for improvement, possibly because routine blood examination indicators and cardiac function parameters were not included. Accumulating clinical evidence shows that specific indicators derived from routine complete blood count, such as the neutrophil-to-lymphocyte ratio and platelet-to-lymphocyte ratio, can significantly predict adverse outcomes in AECOPD patients (32). A neutrophil-to-lymphocyte ratio greater than 6.90 is considered an important biomarker for predicting in-hospital mortality in AECOPD patients (33).
There are some limitations in this study. Firstly, this study enrolled 475 patients, and after calculation, the sample size was sufficient. The gender imbalance in our external validation cohort (76.5% male), while reflective of real-world COPD epidemiology, may affect generalizability to female populations. However, in order to further enhance the generalizability of the model among different genders and ethnic groups, we will further validate it in public databases. Secondly, at present, this study focuses on independent validation of radiomics and does not include blood inflammatory indicators, aiming to lay the foundation for its mechanism specificity and avoid scientific ambiguity caused by premature integration. In future research, we will systematically evaluate the synergistic potential of combining radiomics with biomarkers to optimize the identification of high-risk populations for AECOPD. Thirdly, this study analyzed CT images of COPD patients during acute exacerbation to assist in diagnosis, but further exploration of stable baseline CT or inter-stage comparisons is needed. Fourthly, quantitative CT analysis is important for COPD, yet this study did not compare CT quantitative data with image-based radiomics models. Therefore, additional laboratory tests, stable baseline CT examinations, and CT quantitative parameters should be incorporated in future work to predict AECOPD risk. Lastly, it is crucial to seamlessly integrate the column chart into existing electronic health record systems and validate the generality and robustness of the model in clinical settings. We will also conduct further research and validation in the future.
Conclusions
In summary, the combination strategy of clinical data, pulmonary function parameters, and CT radiomics facilitates AECOPD diagnosis and furnishes objective, quantitative evidence for formulating clinical treatment strategies.
Acknowledgments
We thank the patients who participated in this study.
Footnote
Reporting Checklist: The authors have completed the TRIPOD reporting checklist. Available at https://jtd.amegroups.com/article/view/10.21037/jtd-2025-972/rc
Data Sharing Statement: Available at https://jtd.amegroups.com/article/view/10.21037/jtd-2025-972/dss
Peer Review File: Available at https://jtd.amegroups.com/article/view/10.21037/jtd-2025-972/prf
Funding: This work was supported by
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://jtd.amegroups.com/article/view/10.21037/jtd-2025-972/coif). X.Z. reports receiving funding from the Naval Medical University Campus Level Project (No. 2023QN074) and National Key R&D Program of China (No. 2022YFC2010005). Y.M. reports receiving funding from the Zhejiang Province Medical and Health Technology Plan Project (Nos. 2022KY040, 2023KY472) and Zhejiang Provincial Natural Science Foundation of China (No. LTGY24H180017). S.L. reports receiving funding from the National Natural Science Foundation of China (No. 81930049) and National Key R&D Program of China (No. 2022YFC2010000). L.F. reports receiving funding from the National Natural Science Foundation of China (No. 82171926), National Key R&D Program of China (No. 2022YFC2010002), the Program of Science and Technology Commission of Shanghai Municipality (No. 21DZ2202600), Construction of CT Standardized Database for Chronic Obstructive Pulmonary Disease (No. YXFSC2022JJSJ002), and Clinical Innovation Project of Shanghai Changzheng Hospital (No. 2020YLCYJ-Y24). The other authors have no conflicts of interest to declare.
Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. The study was approved by the ethics committee of The Second Affiliated Hospital of PLA Naval Medical University (approval No. 2022SL068) and individual consent for this retrospective analysis was waived. All participating hospitals were informed and agreed the study.
Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.
References
- Agustí A, Celli BR, Criner GJ, et al. Global Initiative for Chronic Obstructive Lung Disease 2023 Report: GOLD Executive Summary. Am J Respir Crit Care Med 2023;207:819-37. [Crossref] [PubMed]
- Deolmi M, Decarolis NM, Motta M, et al. Early Origins of Chronic Obstructive Pulmonary Disease: Prenatal and Early Life Risk Factors. Int J Environ Res Public Health 2023;20:2294. [Crossref] [PubMed]
- Kwok WC, Tam TCC, Ho JCM, et al. Hospitalized acute exacerbation in chronic obstructive pulmonary disease - impact on long-term renal outcomes. Respir Res 2024;25:36. [Crossref] [PubMed]
- Sharma M, Joshi S, Banjade P, et al. Global Initiative for Chronic Obstructive Lung Disease (GOLD) 2023 Guidelines Reviewed. Open Respir Med J 2024;18:e18743064279064. [Crossref] [PubMed]
- Yuan NF, Hasenstab K, Retson T, et al. Unsupervised Learning Identifies Computed Tomographic Measurements as Primary Drivers of Progression, Exacerbation, and Mortality in Chronic Obstructive Pulmonary Disease. Ann Am Thorac Soc 2022;19:1993-2002. [Crossref] [PubMed]
- Yang T, Chen C, Chen Z. The CT pulmonary vascular parameters and disease severity in COPD patients on acute exacerbation: a correlation analysis. BMC Pulm Med 2021;21:34. [Crossref] [PubMed]
- Zhang D, Zhao M, Zhou X, et al. Single Inspiratory Chest CT-based Generative Deep Learning Models to Evaluate Functional Small Airways Disease. Radiol Artif Intell 2025;7:e240680. [Crossref] [PubMed]
- Pu Y, Zhou X, Zhang D, et al. Quantitative Assessment Characteristics of Small Pulmonary Vessel Remodelling in Populations at High Risk for COPD and Smokers Using Low-Dose CT. Int J Chron Obstruct Pulmon Dis 2024;19:51-62. [Crossref] [PubMed]
- Zhou X, Pu Y, Zhang D, et al. Development of machine learning model to predict pulmonary function with low-dose CT-derived parameter response mapping in a community-based chest screening cohort. J Appl Clin Med Phys 2023;24:e14171. [Crossref] [PubMed]
- Russo L, Charles-Davies D, Bottazzi S, et al. Radiomics for clinical decision support in radiation oncology. Clin Oncol (R Coll Radiol) 2024;36:e269-81. [Crossref] [PubMed]
- Zhou T, Guan Y, Lin X, et al. CT-based whole lung radiomics nomogram for identification of PRISm from non-COPD subjects. Respir Res 2024;25:329. [Crossref] [PubMed]
- Koo MC, Au R, Hague CJ, et al. Expiration CT Gas Trapping Measures with Texture-Based Radiomics Improves Association with Lung Function and Lung Function Decline in COPD. Acad Radiol 2025;32:3711-21. [Crossref] [PubMed]
- Zhou TH, Zhou XX, Ni J, et al. CT whole lung radiomic nomogram: a potential biomarker for lung function evaluation and identification of COPD. Mil Med Res 2024;11:14. [Crossref] [PubMed]
- Celli BR, Fabbri LM, Aaron SD, et al. An Updated Definition and Severity Classification of Chronic Obstructive Pulmonary Disease Exacerbations: The Rome Proposal. Am J Respir Crit Care Med 2021;204:1251-8. [Crossref] [PubMed]
- Global Initiative for Chronic Obstructive Lung Disease. Global Strategy for the Diagnosis, Management, and Prevention of Chronic Obstructive Pulmonary Disease: 2020 Report. (2019-11-05) [2019-11-30]. Available online: https://goldcopd.org/gold-reports/
- Pratt AJ, Purssell A, Zhang T, et al. Complexity in clinical diagnoses of acute exacerbation of chronic obstructive pulmonary disease. BMC Pulm Med 2023;23:298. [Crossref] [PubMed]
- Choi JY, Lee CH, Joo H, et al. Implications of Global Lung Function Initiative Spirometry Reference Equations in Northeast Asian Patients With COPD. Chest 2025;167:414-24. [Crossref] [PubMed]
- Ritchie AI, Wedzicha JA. Definition, Causes, Pathogenesis, and Consequences of Chronic Obstructive Pulmonary Disease Exacerbations. Clin Chest Med 2020;41:421-38. [Crossref] [PubMed]
- Baraghoshi D, Strand M, Humphries SM, et al. Quantitative CT Evaluation of Emphysema Progression over 10 Years in the COPDGene Study. Radiology 2023;307:e222786. [Crossref] [PubMed]
- Iwasawa T, Matsushita S, Hirayama M, et al. Quantitative Analysis for Lung Disease on Thin-Section CT. Diagnostics (Basel) 2023;13:2988. [Crossref] [PubMed]
- Chaudhary MFA, Hoffman EA, Guo J, et al. Predicting severe chronic obstructive pulmonary disease exacerbations using quantitative CT: a retrospective model development and external validation study. Lancet Digit Health 2023;5:e83-92. [Crossref] [PubMed]
- Rangelov BA, Young AL, Jacob J, et al. Thoracic Imaging at Exacerbation of Chronic Obstructive Pulmonary Disease: A Systematic Review. Int J Chron Obstruct Pulmon Dis 2020;15:1751-87. [Crossref] [PubMed]
- Hua Q, Chen G, Yang Y, et al. Quantitative Evaluation of Chronic Obstructive Pulmonary Disease and Risk Prediction of Acute Exacerbation by High-Resolution Computed Tomography. Evid Based Complement Alternat Med 2022;2022:6015766. [Crossref] [PubMed]
- Moslemi A, Makimoto K, Tan WC, et al. Quantitative CT Lung Imaging and Machine Learning Improves Prediction of Emergency Room Visits and Hospitalizations in COPD. Acad Radiol 2023;30:707-16. [Crossref] [PubMed]
- Kaplan A, Cao H, FitzGerald JM, et al. Artificial Intelligence/Machine Learning in Respiratory Medicine and Potential Role in Asthma and COPD Diagnosis. J Allergy Clin Immunol Pract 2021;9:2255-61. [Crossref] [PubMed]
- Sarker IH. Machine Learning: Algorithms, Real-World Applications and Research Directions. SN Comput Sci 2021;2:160. [Crossref] [PubMed]
- Coates JT, de Koning C. Machine learning-driven critical care decision making. J R Soc Med 2022;115:236-8. [Crossref] [PubMed]
- Li Z, Liu L, Zhang Z, et al. A Novel CT-Based Radiomics Features Analysis for Identification and Severity Staging of COPD. Acad Radiol 2022;29:663-73. [Crossref] [PubMed]
- Makimoto K, Au R, Moslemi A, et al. Comparison of Feature Selection Methods and Machine Learning Classifiers for Predicting Chronic Obstructive Pulmonary Disease Using Texture-Based CT Lung Radiomic Features. Acad Radiol 2023;30:900-10. [Crossref] [PubMed]
- Chaudhary MFA, Hoffman EA, Comellas AP, et al. CT Texture Features Predict Severe COPD Exacerbations in SPIROMICS. C5 C005 Diagn. Progn. COPD, American Thoracic Society; 2021:A1122. Available online: https://www.atsjournals.org/doi/pdf/10.1164/ajrccm-conference.2021.203.1_MeetingAbstracts.A1122
- Yang Y, Chen Z, Li W, et al. Multi-modal data combination strategy based on chest HRCT images and PFT parameters for intelligent dyspnea identification in COPD. Front Med (Lausanne) 2022;9:980950. [Crossref] [PubMed]
- Zinellu A, Zinellu E, Mangoni AA, et al. Clinical significance of the neutrophil-to-lymphocyte ratio and platelet-to-lymphocyte ratio in acute exacerbations of COPD: present and future. Eur Respir Rev 2022;31:220095. [Crossref] [PubMed]
- Emami Ardestani M, Alavi-Naeini N. Evaluation of the relationship of neutrophil-to lymphocyte ratio and platelet-to-lymphocyte ratio with in-hospital mortality in patients with acute exacerbation of chronic obstructive pulmonary disease. Clin Respir J 2021;15:382-8. [Crossref] [PubMed]

