Validation of the American College of Surgeons surgical risk calculator for thoracic surgery

Nikolay Tsvetkov; Makhmudbek Mallaev; Brigitta Gahl; Aljaz Hojski; Michael Tamm; Luzius A. Steiner; Didier Lardinois

doi:10.21037/jtd-24-611

Original Article

Validation of the American College of Surgeons surgical risk calculator for thoracic surgery

Nikolay Tsvetkov¹ , Makhmudbek Mallaev¹, Brigitta Gahl², Aljaz Hojski¹, Michael Tamm³, Luzius A. Steiner⁴, Didier Lardinois¹

¹Department of Thoracic Surgery, University Hospital Basel, Basel, Switzerland; ²Surgical Outcome Research Centre, Department of Clinical Research, University Hospital Basel and University of Basel, Basel, Switzerland; ³University of Basel, Basel, Switzerland; ⁴Department of Anaesthesiology, University Hospital Basel, Basel, Switzerland

Contributions: (I) Conception and design: N Tsvetkov, B Gahl, D Lardinois; (II) Administrative support: N Tsvetkov, M Tamm, D Lardinois, LA Steiner; (III) Provision of study materials or patients: N Tsvetkov, M Mallaev, A Hojski; (IV) Collection and assembly of data: N Tsvetkov, M Mallaev, A Hojski; (V) Data analysis and interpretation: N Tsvetkov, B Gahl; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

Correspondence to: Prof. Didier Lardinois, MD. Department of Thoracic Surgery, University Hospital Basel, Spitalstrasse 21, 4001 Basel, Switzerland. Email: didier.lardinois@usb.ch.

Background: Advances in medicine and surgical techniques make it possible to operate on selected comorbid elderly patients for whom risk assessment is essential. We aimed to validate the American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP) surgical risk calculator specifically for thoracic surgery.

Methods: This study retrospectively included 283 consecutive patients who all underwent various thoracic surgeries at our center. Considering “serious complication” as the most important outcome, we compared the predicted risk scores with the observed incidence of 30-day morbidity and mortality. We calculated the area under the receiver operating characteristic curve (AUROC) with 95% confidence intervals for each outcome and utilized the Brier score to check the calibration and complication odds ratios above vs. below average risk in all score outcomes with the number of occurred events.

Results: In our study population, most patients were <65 years old (48%), predominantly male (63%), and overweight or obese (48%). In addition, 13% had severe chronic obstructive pulmonary disease (COPD), and 75% were categorized as American Society of Anesthesiologists (ASA) class III or higher. For “serious complication”, AUROC was 59%, and events were equal in patients with above or below average risk scores (P=0.96). AUROC was 67% for “any complication” and 58% for “return to OR”, expressing no useful predictive value. The Brier score and odds ratios were low for all outcomes. Dyspnea, ASA class, COPD, and body mass index as single postoperative risk predictors significantly improved the basic model consisting of the logit of the risk calculator alone. Thus, the calculator alone did not perform as well as these single variables did.

Conclusions: The ACS NSQIP surgical risk calculator exhibited low sensitivity, specificity, and low AUROC for postoperative 30-day morbidity and mortality in our study cohort. Therefore, we think it cannot be considered as valid risk estimation tool for general thoracic surgery.

Keywords: Surgical management; risk assessment; outcomes evaluation; thoracic surgery

Submitted Apr 24, 2024. Accepted for publication Jul 19, 2024. Published online Sep 26, 2024.

doi: 10.21037/jtd-24-611

Highlight box

Key findings

• The American College of Surgeons (ACS) surgical risk calculator showed low sensitivity and specificity in our study cohort.

What is known and what is new?

• ACS surgical risk calculator has been validated in various surgical populations with controversial results.

• ACS surgical risk calculator has not been validated for general thoracic surgery.

What is the implication, and what should change now?

• The ACS surgical risk calculator cannot be considered valid for general thoracic surgery. There is an obvious need to further develop a risk prediction tool specifically for thoracic surgery.

Introduction

Background

Due to advances in medical and surgical treatments, high-risk patients are now being considered for surgery, which may result in higher rates of postoperative complications and death. Therefore, it is crucial to assess the risk of morbidity and mortality after medical or surgical procedures, which can be affected by patient age, body mass index (BMI), lung function, cancer stage, concomitant diseases, and other risk factors (1-3).

Rationale and knowledge gap

The reported overall complication rate after thoracic surgery under general anesthesia ranges from 15% to 37.5% (4). Pulmonary and cardiac morbidity are the main causes of mortality, with reported mortality rates of 1.2–2.9% after a lobectomy and 3.2–6.8% after pneumonectomy (2,5,6). Traditionally, surgeons rely on their own clinical experience to identify patients at “high risk” for postoperative complications, but this judgment is not always based on precise risk assessments (7). Today, indices can be used to identify “high-risk” patients and to assess surgical risks. Two of the most common predictive tools for surgical risk assessment are the Revised Cardiac Risk Index (RCRI) (8), which estimates the risk of cardiac complications after non-cardiac surgery, and the American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP) surgical risk calculator (SRC) (7). The ACS NSQIP database was developed to predict risk-adjusted surgical outcomes and improve the quality of surgical care (9). In 2013, the ACS NSQIP presented a web-based SRC that is currently based on clinical data (patient demographics, comorbidities, and complications observed up to 30-days after surgery) continuously collected from more than 500 hospitals (7). It estimates the 30-day postoperative mortality and morbidity risk for a variety of surgical procedures, including thoracic surgery.

To date, the suitability of the universal ACS NSQIP SRC in thoracic surgery has only been evaluated for patients undergoing elective lung resection, with inconsistent results (10,11). There is no data how the SRC will perform in the general thoracic surgery population (emergency chest trauma, surgery for empyema, pleural and mediastinal diseases, etc.). Annually, approximately 700 thoracic surgeries of various types are performed in our center. These patients are also at risk for unfavorable postoperative events.

Objective

The aim of this study was to validate the ACS NSQIP SRC and explore its suitability for general thoracic surgery. We compared the predicted ACS NSQIP risk outcomes with the observed incidence of 30-day morbidity and mortality after any thoracic procedure conducted in our study cohort. We present this article in accordance with the STARD reporting checklist (available at https://jtd.amegroups.com/article/view/10.21037/jtd-24-611/rc).

Methods

This single-center, blinded (regardless of precalculated ACS NSQIP risk score), retrospective study included a consecutive series of patients, who underwent a thoracic surgery at the University Hospital Basel from May 1, 2021 to March 1, 2022 and who had signed the general research consent form of our institution. Patients who clearly stated that they would not agree in providing their clinical data for scientific purposes were excluded. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by the local ethics committee Ethikkommission Nordwest-und Zentralschweiz (EKNZ) (No. BASEC 2022-02163, granted on December 15, 2022). All included patients signed the general research consent form of our institution.

Study design

For all patients included, we collected data on 20 preoperative risk factors [including surgical procedure, age group, sex, functional status, emergency of the surgery, American Society of Anesthesiologists (ASA) class, use of steroids, ascites, systemic sepsis, ventilator dependent, disseminated cancer, diabetes, hypertension, congestive heart failure, dyspnoea, current smoker, history of chronic obstructive pulmonary disease (COPD), dialysis, renal failure, and BMI] according to the ACS NSQIP SRC from the hospital’s internal electronic medical records system. Based on these risk factors and prior to the surgical procedure, we generated a perioperative risk profile including “average risk” for the outcomes “serious complication”, “any complication”, “pneumonia”, “cardiac complication”, “urinary tract infection”, “surgical site infection”, “venous thromboembolism”, “renal failure”, “readmission”, “return to OR”, “discharge to nursing or rehabilitation facility”, “sepsis”, and “death” on the SRC website (Figure S1). The predicted risks for all outcomes generated by the SRC were then copied into the data file for analysis. The resulting postoperative 30-day morbidity and mortality was graded based on the Clavien-Dindo classification (12) for surgical complications, which is used to record and classify postoperative complications, then blindly documented the complications (regardless of the precalculated ACS NSQIP risk score). The complications were subsequently categorized based on the SRC outcomes, where any complications of Grade IIIa and above was considered as a “serious complication”. The outcome “serious complications” is defined according to the ACS NSQIP SRC as combination of various outcomes.

Statistical methods and analysis

Considering a “serious complication” as primary study outcome, we calculated the sample size needed to achieve a 95% confidence interval (CI) width of ≤20% of the area under the receiver operating characteristic curve (AUROC) as a measure of discrimination. For an incidence of 15% serious complications (e.g., primary outcome and AUROC 60%), 262 patients are sufficient to achieve this precision. This sample size would yield a narrower CI if the AUROC was 80% or higher. We would not consider the ACS NSQIP SRC to be a useful risk estimation tool if discrimination was markedly below an AUROC of 80%.

We calculated the AUROC with CI for each outcome to assess discriminative ability. An AUROC between 50% and 70% suggested no discrimination (i.e., ability to distinguish patients with and without disease or condition based on the test), 70% to 80% was considered acceptable, 80% to 90% was considered excellent, and >90% was considered outstanding (13). In addition, we calculated the Brier score to check the calibration and used nonparametric AUROCs and scatterplots of the ACS NSQIP scores for visualization.

We conducted three sensitivity analyses for all SRC outcomes, except for “discharge to a nursing or rehabilitation facility”, as patients in Swiss health care system can choose to either be discharged at home or to a rehabilitation center. First, we repeated the analyses described above, including only lung resections. Second, we investigated whether single risk factors of the SRC model provided a better prediction of the two outcomes: “serious complication” and “any complication” compared to the entire score (see Appendix 1 for details). Third, we assessed the risk for the occurrence of a specific event associated with an exceedance of the “average risk” provided by the SRC for the respective outcome by calculating odds ratios and 95% CI.

We performed descriptive statistics of the study cohort; patient characteristics of the SRC are provided in numbers and percents. This included a comparison of patients with and without events using Fisher’s exact test. Surgical procedures were divided into four main groups with their respective Current Procedural Terminology (CPT) codes—lung resection, pleural empyema, chest wall interventions, and various (Table S1). Unspecific results were interpreted as false-positive or false-negative and reported in the analysis.

All analyses were conducted using Stata 16.0 (StataCorp LLC, College Station, TX, USA).

Results

Our study cohort included 283 patients (Figure S2 Flow diagram) who underwent lung resections (n=174), pleural empyema surgery (n=16), chest-wall interventions (n=39), and other thoracic surgeries (n=54) (CPT codes see Table S1). Nearly half of the patients (48%) were younger than 65 years, 75% were classified as ≥ ASA III, 48% as overweight and obese, 39% required hypertensive medication, 27% were current smokers, 25% had dyspnea, and 13% had severe COPD. “Serious complications” occurred in 41 patients. Of these, 56% were younger than 65 years, 75% were classified as ≥ ASA III, 32% were overweight and obese, 34% had dyspnea, and 29% had severe COPD (Table 1).

Table 1

Patient demographic characteristics according to ACS NSQIP risk factors and occurred “serious complication” events

Characteristic	Total (N=283)	No serious complication (N=242)	Serious complication (N=41)	P
Age (years)				0.70
<65	135 (48%)	112 (46%)	23 (56%)
65–74	81 (29%)	72 (30%)	9 (22%)
75–84	58 (20%)	50 (21%)	8 (20%)
≥85	9 (3.2%)	8 (3.3%)	1 (2.4%)
Sex				0.08
Male	178 (63%)	147 (61%)	31 (76%)
Female	105 (37%)	95 (39%)	10 (24%)
Functional status				0.10
Independent	257 (91%)	218 (90%)	39 (95%)
Partially dependent	23 (8.1%)	22 (9.1%)	1 (2.4%)
Totally dependent	3 (0.9%)	2 (0.83%)	1 (2.4%)
Emergency case	30 (11%)	24 (10%)	6 (15%)	0.41
ASA				0.007
I	20 (7.1%)	13 (5.4%)	7 (17%)
II: mild systemic disease	49 (17%)	46 (19%)	3 (7.3%)
III: severe systemic disease	196 (69%)	170 (70%)	26 (63%)
IV: severe systemic disease/constant threat to life	18 (6.4%)	13 (5.4%)	5 (12%)
Steroid use	16 (5.7%)	13 (5.4%)	3 (7.3%)	0.71
Ascites	0	0	0
Systemic sepsis	4 (1.4%)	3 (1.2%)	1 (2.4%)	0.47
Ventilator dependent	1 (0.35%)	1 (0.41%)	0 (0.0%)	>0.99
Disseminated cancer	42 (15%)	39 (16%)	3 (7.3%)	0.23
Diabetes				0.46
None	255 (90%)	216 (89%)	39 (95%)
Oral medication	21 (7.4%)	20 (8.3%)	1 (2.4%)
Insulin	7 (2.5%)	6 (2.5%)	1 (2.4%)
Hypertension requiring medication	111 (39%)	100 (41%)	11 (27%)	0.09
Congestive heart failure	3 (1.1%)	3 (1.2%)	0 (0.0%)	>0.99
Dyspnea	72 (25%)	58 (24%)	14 (34%)	0.18
Current smoker	76 (27%)	60 (25%)	16 (39%)	0.08
Severe COPD	38 (13%)	26 (11%)	12 (29%)	0.005
Dialysis	2 (0.71%)	1 (0.41%)	1 (2.4%)	0.27
Acute renal failure	3 (1.1%)	3 (1.2%)	0 (0.0%)	>0.99
BMI				0.003
Underweight (<18 kg/m²)	13 (4.6%)	7 (2.9%)	6 (15%)
Normal weight (18–25 kg/m²)	135 (48%)	113 (47%)	22 (54%)
Overweight & obese (>25 kg/m²)	135 (48%)	122 (50%)	13 (32%)

For each category, the number of patients, and its percentage of total [n (%)] is reported; ACS NSQIP SRC, American College of Surgeons National Surgical Quality Improvement Program surgical risk calculator; ASA, American Society of Anesthesiologists; COPD, chronic obstructive pulmonary disease; BMI, body mass index.

Association of predicted and observed “serious complications”

The observed frequency of “serious complications” within 30 days was 14.5%, whereas the predicted mean was 10.9%. The analysis of the discriminative ability of ACS NSQIP SRC in our study cohort yielded an AUROC of 59% (95% CI: 49–68%) for “serious complication”, indicating no useful discrimination (Figure 1). The Brier score was moderate at 0.13 (Table 2). Restricting the analysis to lung resections increased the AUROC to 63% (95% CI: 53–73%), suggesting low discrimination (Table 3). “Serious complications” were equally frequent among patients above and below the predicted ACS NSQIP SRC average risk (odds ratio, 1.02; P=0.96) (Table 4).

Figure 1 Left hand side: AUROC for predicted risk of “serious complication”. Right hand side: scatterplot of predicted risks of patients with [1] and without [0] “serious complication”. AUROC, area under the receiver operating characteristic.

Table 2

ACS NSQIP SRC’s discriminative ability and comparison between observed events and predicted probabilities

Outcome	Events	Predicted probability (%)	AUROC (%)	Brier score (P)
Serious complication	41 (14.5%)	10.9 (6.3)	59 (49 to 68)	0.13
Any complication	74 (26.1%)	12.0 (6.9)	67 (60 to 74)	0.20
Pneumonia	10 (3.5%)	3.2 (3.2)	76 (61 to 92)	0.03
Cardiac complication	6 (2.1%)	1.1 (1.4)	73 (43 to 100)	0.02
Surgical site infection	2 (0.7%)	1.7 (1.0)	4 (2 to 6)	0.007
Urinary tract infection	8 (2.8%)	0.8 (0.5)	62 (45 to 80)	0.03
Venous thromboembolism	4 (1.4%)	1.2 (0.8)	81 (73 to 88)	0.01
Renal failure	2 (0.7%)	0.6 (0.9)	99 (98 to 100)	0.006
Readmission	9 (3.2%)	7.3 (3.7)	49 (26 to 73)	0.03
Return to OR	33 (11.7%)	3.6 (2.4)	58 (49 to 68)	0.11
Death	0 (0.0%)	2.4 (5.1)	Not calculable	–
Sepsis	0 (0.0%)	1.5 (1.3)	Not calculable	–

Number and percentage [N (%)] of events for each outcome. Mean and standard deviation of predicted probabilities (%) (according to the ACS NSQIP score) among patients with the event. AUROC with 95% confidence interval. Brier score checking calibration by distinguishing between predicted probability (0≤P≤1) and occurrence of an event (0 or 1). ACS NSQIP SRC, American College of Surgeons National Surgical Quality Improvement Program surgical risk calculator; AUROC, area under the receiver operation characteristics curve; OR, operating room.

Table 3

ACS NSQIP SRC’s discriminative ability and comparison between observed events and predicted outcome probabilities for lung resection only

Outcome	Events	Predicted probability (%)	AUROC (%)	Brier score (P)
Serious complication	26 (14.9%)	11.7 (6.2)	63 (53 to 73)	0.13
Any complication	46 (26.4%)	12.6 (6.6)	68 (59 to 76)	0.20
Pneumonia	8 (4.6%)	4.0 (3.4)	79 (65 to 93)	0.04
Cardiac complications	2 (1.1%)	1.1 (1.4)	19 (0 to 42)	0.01
Surgical site infection	0 (0.0%)	1.5 (0.7)	Not calculable	–
Urinary tract infection	6 (3.4%)	1.0 (0.6)	56 (30 to 82)	0.03
Venous thromboembolism	4 (2.3%)	1.4 (0.8)	75 (65 to 86)	0.01
Renal failure	0 (0.0%)	0.6 (0.5)	Not calculable	–
Readmission	5 (2.9%)	7.7 (3.3)	66 (40 to 91)	0.03
Return to OR	19 (10.9%)	3.9 (2.5)	61 (49 to 74)	0.10
Death	0 (0.0%)	2.4 (5.0)	Not calculable	0.003
Sepsis	0 (0.0%)	1.7 (1.4)	Not calculable	–

Number and percentage [N (%)] of events for each outcome. Mean and standard deviation of predicted probabilities (%) (according to the ACS NSQIP score) among patients with the event. AUROC with 95% confidence interval. Brier Score checking calibration by distinguishing between predicted probability (0≤P≤1) and occurrence of an event (0 or 1). ACS NSQIP SRC, American College of Surgeons National Surgical Quality Improvement Program surgical risk calculator; AUROC, area under the receiver operation characteristics curve; OR, operating room.

Table 4

Associations between observed event frequencies and a risk score above average

Outcome	> average		≤ average		Odds ratio	P value
Outcome	Patients	Events	Patients	Events	Odds ratio	P value
Serious complication	137	20	146	21	1.02 (0.50 to 2.08)	0.96
Any complication	130	42	153	32	1.80 (1.02 to 3.20)	0.03
Pneumonia	147	8	136	2	3.86 (0.75 to 37.8)	0.07
Cardiac complications	119	4	164	2	2.82 (0.40 to 31.6)	0.22
Surgical site infection	102	0	181	2	0.00 (0.00 to 3.41)	0.29
Urinary tract infection	127	6	156	2	3.82 (0.66 to 39.2)	0.08
Venous thromboembolism	118	3	165	1	4.28 (0.34 to 226)	0.17
Renal failure	94	2	185	0	Not calculable	–
Readmission	149	4	134	5	0.71 (0.14 to 3.39)	0.62
Return to OR	116	17	167	16	1.62 (0.73 to 3.60)	0.19
Death	139	0	144	0	Not calculable	–
Sepsis	133	0	149	0	Not calculable	–

Number of patients (N); number of patients with events (N); odds ratio between respective outcome and a risk score above average. OR, operating room.

Prediction of other outcomes

The observed postoperative frequencies of “any complication” and “return to OR” within 30 days were 26.1% and 11.7% compared to the mean predicted values of 12% and 3.6%, respectively. The AUROC was 67% for “any complication” (Figure 2) and 58% for “return to OR”, expressing no useful prediction. Excellent discrimination was only observed for “renal failure” (Table 2), with an AUROC of 99%. However, this was based on only two events. Discrimination with respect to “pneumonia” (AUROC, 76%) was fair (Figure 3), with a 95% CI ranging from poor to excellent discrimination (61–92%). Limiting the analysis exclusively to lung resection showed comparable results. The score’s predictive ability for “any complication” showed similar results when compared to the entire cohort (AUROC, 67% vs. 68%), yet the predictive value remained unsatisfactory (Tables 2,3). For all outcomes, the Brier score was low particularly for outcomes with very few events.

Figure 2 Left hand side: AUROC for predicted risk of “any complication”. Right hand side: scatterplot of predicted risks of patients with [1] and without [0] “any complication”. AUROC, area under the receiver operating characteristic.

Figure 3 Left hand side: AUROC for predicted risk of “pneumonia”. Right hand side: scatterplot of predicted risks of patients with [1] and without [0] “pneumonia”. AUROC, area under the receiver operating characteristic.

The likelihood of experiencing “any complication” was greater for patients with above average risk scores (odds ratio, 1.8; P=0.03). This was also observed with “return to OR”, although it did not reach statistical significance (Table 4).

Including selected, statistically significant risk factors (dyspnea, ASA, severe COPD, and BMI) as single postoperative risk predictors for “serious complication” and “any complication” showed that each of the selected variables significantly improved the basic model, consisting of the logit of the ACS NSQIP SRC alone. Therefore, the SRC alone did not perform as well as these single variables did (Tables 1,5).

Table 5

Prediction improvements obtained by adding statistically significant ACS NSQIP SRC variables to the basic logistic regression models for “serious complication” and “any complication” including only logit (NSQIP SRC/100) as predictor variable

Outcome	Model fit		Model fit improvement*
Outcome	Chi²	P	Chi²	P
Serious complication
Logit (NSQIP SRC/100)	3.21	0.073
+ ASA	7.13	0.028	3.92	0.048
+ Severe COPD	9.54	0.008	6.33	0.01
+ BMI	12.26	0.002	9.05	0.003
+ ASA, severe COPD, BMI	21.55	<0.001	18.34	<0.001
Any complication
Logit (NSQIP SRC/100)	20.25	<0.001
+ Dyspnea	25.56	<0.001	5.30	0.02
+ Severe COPD	24.23	<0.001	3.98	0.046
+ Dyspnea, severe COPD	27.97	<0.001	7.72	0.02

*, if variable(s) at the left-hand side are added to the basic model consisting of logit (NSQIP SRC/100) alone. ACS NSQIP SRC, American College of Surgeons National Surgical Quality Improvement Program surgical risk calculator; ASA, American Society of Anesthesiologists; COPD, chronic obstructive pulmonary disease; BMI, body mass index.

Discussion

Key findings

The results of our study show that the ACS NSQIP SRC has a low discriminative ability for our primary and secondary outcomes. These findings suggest an overall low predictive ability of the SRC for 30-day mortality and morbidity risk after general thoracic surgery in our cohort.

Strengths and limitations

In terms of limitations, we must acknowledge that our study was conducted retrospectively. In addition our cohort was relatively small (n=283), although a meta-analysis of six studies evaluating the accuracy of the ACS NSQIP score in emergency abdominal surgery showed a median cohort size of 300 patients (ranging from 69 to 758 patients) (14). Nevertheless, our pre-calculated patient count was sufficient to validate the score as predictor of “serious complication”, “any complication”, and “return to OR”, but this was not the case for outcomes with frequencies lower than 10%.

Comparison with similar research

The ACS NSQIP SRC has been validated in various surgical specialities. Some studies have reported excellent prognostic ability of the ACS NSQIP SRC, while others have found it to be an inadequate predictive tool (10,11,14-16).

A 2015 study from Washington University, analysing the calculated ACS NSQIP risk for patients undergoing surgery (n=277) or stereotactic body radiation therapy (SBRT) (n=195) for stage I lung cancer published results similar to ours (10). The observed rate for “serious complications” in patients undergoing lung resection was twice as high as predicted by the score (16.6% vs. 8.8%), showing its tendency to underestimate the occurrence of serious complications. Their reported AUROC for wedge resections was 65% (95% CI: 56–74%). This is consistent with our results (63%) and suggests a poor discriminative ability of the score (Table 3). In the lobectomy group, they observed an acceptable AUROC of 78% (95% CI: 73–82%). Patients in this group had a lower ASA, a median forced expiratory volume in 1 second (FEV1) of 82.7%, a diffusing capacity of the lung for carbon monoxide (DLCO) of 73.6% (which speaks against COPD), and were less likely to have history of congestive heart failure. Our study cohort included 13% patients with severe COPD, more than twice as many with congestive heart failure, and 17% with ASA II.

On the other hand, Chudgar et al. reported results comparable to the risk score estimations in their analysis in a cohort of patients undergoing lung resection (11). Observed complication rates were generally lower than estimated (“serious complication” 7.4% vs. 9.2%) with almost identical results for “any complication”, pneumonia, and cardiac complications. Despite examining more than 2,000 recruited cases and a fair number of events, only the discriminative ability for cardiac complications was excellent (AUROC, 82%; 95% CI: 70–95%). For most of the outcomes, the AUROC was merely acceptable (95% CI: 70–79%). The Department of Thoracic Surgery, who conducted the present study, participates in the Society of Thoracic Surgeon’s General Thoracic Surgery Database (STS GTSD), which provides preoperative prediction models comparable to the ACS NSQIP reports (17). Thus, the score may perform better for participating hospitals in the USA.

None of the above-mentioned studies compared observed morbidity and mortality in patients with a risk score above and below average risk, which we think is a main function of the score, allowing practitioners to consider a predicted risk as acceptable. A 2018 Filipino study validated the score’s ability to predict below and above average risk, with major adverse cardiac events (MACE) as a primary outcome for non-cardiac surgeries and compared it to RCRI risk estimations (15). According to their results, the SRC provided a low sensitivity and an excellent discriminative ability for MACE (AUROC, 93%) for above and below average risk, where we observed acceptable results (AUROC, 73%). However, they do not report on “serious complication” and “any complication”. In these outcomes, we observed no significant differences (a small event prevalence for above average risk), suggesting poor discriminative ability. The patient cohort in this study was 84% ASA I, predominantly female (63%), and had a mean age of 54.58 years, all of which are associated with low morbidity and mortality (13).

Studies we found validating the ACS NSQIP SRC in the field of thoracic surgery considered cohorts limited to patients undergoing lung resection (10,11). Although, pulmonary resection may be the most common procedure performed in thoracic surgery, operations on the chest-wall, mediastinum, diaphragm, esophagus, pleura and pleura space, trachea, and traumatic injuries of chest-wall or organs within the chest are part of the daily work of any thoracic surgeon (6,18). We think that the ACS NSQIP SRC cannot be validated as a universal risk stratification tool in thoracic surgery based only on a specific range of surgical procedures.

A 2016 analysis of the European Society of Thoracic Surgeons database by Brunelli et al., suggested two models (i.e., EuroLung1 for morbidity and EuroLung2 for mortality) to predict outcomes following anatomic lung resections. Variables such as male sex, age, decreased predicted postoperative FEV1 (reduced in patients with COPD), BMI, and others were associated with 30-day morbidity and mortality risk (6). This corresponds to our analysis of the ACS NSQIP SRC risk factors. We observed that male sex, ASA, COPD, and BMI as individual risk factors predicted complications better than the ACS NSQIP SRC alone (Tables 1,4).

Explanations of findings

Our study evaluated the suitability of the ACS NSQIP SRC in predicting postoperative complications after general thoracic surgery. The AUROC for “serious complications”, “any complications”, and “return to OR” yielded values below the threshold for useful prediction of postoperative 30-day morbidity and mortality in our study cohort. As observed in the results, discrimination for “pneumonia” was fair, with a 95% CI ranging from poor to excellent. This might indicate that the SRC score was generally low in our study cohort, rather than a good calibration (Table 2). Moreover, restricting the analysis to lung resections, as suggested by other colleagues, did not improve the discriminative ability of the SRC (10,11).

Implications and actions needed

Although, the ACS NSQIP SRC is considered comparable to procedure-specific calculators (7), we think that the lack of specific risk factors for thoracic surgery led to a low sensitivity and specificity in our study cohort, and therefore, cannot consider it as a valid tool for routine clinical practice. As reported in the results, individual risk factors such as dyspnea, ASA, severe COPD, and BMI significantly improved the predictive accuracy of the model over the ACS NSQIP SRC alone. Our findings highlight the obvious need to further develop a risk prediction tool specifically for thoracic surgery.

Conclusions

The ACS NSQIP SRC exhibited low sensitivity, specificity, and low AUROC for postoperative 30-day morbidity and mortality in our study cohort. Such outcomes can mislead patients and surgeons in shared decision-making. Therefore, we think it cannot be considered as valid risk estimation tool for general thoracic surgery.

Acknowledgments

The authors would like to thank Cecile Buenter for editorial and language assistance and Dr. Christian Schindler, Swiss Tropical and Public Health Institute, University of Basel, for statistical assistance and advice.

Funding: This work was supported by the Department of Thoracic Surgery, University Hospital Basel (No. FO 126 102).

Footnote

Reporting Checklist: The authors have completed the STARD reporting checklist. Available at https://jtd.amegroups.com/article/view/10.21037/jtd-24-611/rc

Data Sharing Statement: Available at https://jtd.amegroups.com/article/view/10.21037/jtd-24-611/dss

Peer Review File: Available at https://jtd.amegroups.com/article/view/10.21037/jtd-24-611/prf

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://jtd.amegroups.com/article/view/10.21037/jtd-24-611/coif). The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by the local ethics committee Ethikkommission Nordwest-und Zentralschweiz (EKNZ) (No. BASEC 2022-02163, granted on December 15, 2022). All included patients signed the general research consent form of our institution.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.

References

García-Miguel FJ, Serrano-Aguilar PG, López-Bastida J. Preoperative assessment. Lancet 2003;362:1749-57. [Crossref] [PubMed]
Agostini P, Cieslik H, Rathinam S, et al. Postoperative pulmonary complications following thoracic surgery: are there any modifiable risk factors? Thorax 2010;65:815-8. [Crossref] [PubMed]
Cohen MM, Duncan PG, Tate RB. Does anesthesia contribute to operative mortality? JAMA 1988;260:2859-63.
Iyer AP, Yadav S. Postoperative Care and Complications After Thoracic Surgery. In: Firstenberg MS (ed). Principles and Practice of Cardiothoracic Surgery. InTech; 2013. Available online: http://dx.doi.org/10.5772/55351
Ginsberg RJ. Lung cancer surgery: acceptable morbidity and mortality, expected results and quality control. Surg Oncol 2002;11:263-6. [Crossref] [PubMed]
Brunelli A, Salati M, Rocco G, et al. European risk models for morbidity (EuroLung1) and mortality (EuroLung2) to predict outcome following anatomic lung resections: an analysis from the European Society of Thoracic Surgeons database. Eur J Cardiothorac Surg 2017;51:490-7. [Crossref] [PubMed]
Bilimoria KY, Liu Y, Paruch JL, et al. Development and evaluation of the universal ACS NSQIP surgical risk calculator: a decision aid and informed consent tool for patients and surgeons. J Am Coll Surg 2013;217:833-42.e1-3.
Fleisher LA, Fleischmann KE, Auerbach AD, et al. 2014 ACC/AHA guideline on perioperative cardiovascular evaluation and management of patients undergoing noncardiac surgery: executive summary: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation 2014;130:2215-45. [Crossref] [PubMed]
Ingraham AM, Richards KE, Hall BL, et al. Quality improvement in surgery: the American College of Surgeons National Surgical Quality Improvement Program approach. Adv Surg 2010;44:251-67. [Crossref] [PubMed]
Samson P, Robinson CG, Bradley J, et al. The National Surgical Quality Improvement Program risk calculator does not adequately stratify risk for patients with clinical stage I non-small cell lung cancer. J Thorac Cardiovasc Surg 2016;151:697-705.e1. [Crossref] [PubMed]
Chudgar N, Yan S, Hsu M, et al. The American College of Surgeons Surgical Risk Calculator performs well for pulmonary resection: A validation study. J Thorac Cardiovasc Surg 2022;163:1509-1516.e1. [Crossref] [PubMed]
Dindo D, Demartines N, Clavien PA. Classification of surgical complications: a new proposal with evaluation in a cohort of 6336 patients and results of a survey. Ann Surg 2004;240:205-13. [Crossref] [PubMed]
Hosmer DW, Lemeshow S (2000) Applied Logistic Regression. Ed Ke-2. John Wiley and Sons, New York. Available online: https://doi.org/10.1002/0471722146
Parkin CJ, Moritz P, Kirkland O, et al. What is the Accuracy of the ACS-NSQIP Surgical Risk Calculator in Emergency Abdominal Surgery? A Meta-Analysis. J Surg Res 2021;268:300-7. [Crossref] [PubMed]
Yap MKC, Ang KF, Gonzales-Porciuncula LA, et al. Validation of the American College of Surgeons Risk Calculator for preoperative risk stratification. Heart Asia 2018;10:e010993. [Crossref] [PubMed]
Hers TM, Van Schaik J, Keekstra N, et al. Inaccurate Risk Assessment by the ACS NSQIP Risk Calculator in Aortic Surgery. J Clin Med 2021;10:5426. [Crossref] [PubMed]
Dyas AR, Bronsert MR, Henderson WG, et al. A comparison of the National Surgical Quality Improvement Program and the Society of Thoracic Surgery Cardiac Surgery preoperative risk models: a cohort study. Int J Surg 2023;109:2334-43. [Crossref] [PubMed]
Klepetko W, Aberg TH, Lerut AE, et al. Structure of general thoracic surgery in Europe. Eur J Cardiothorac Surg 2001;20:663-8. [Crossref] [PubMed]

Cite this article as: Tsvetkov N, Mallaev M, Gahl B, Hojski A, Tamm M, Steiner LA, Lardinois D. Validation of the American College of Surgeons surgical risk calculator for thoracic surgery. J Thorac Dis 2024;16(9):5698-5708. doi: 10.21037/jtd-24-611

Validation of the American College of Surgeons surgical risk calculator for thoracic surgery

Highlight box

Introduction

Background

Rationale and knowledge gap

Objective

Methods

Study design

Statistical methods and analysis

Results

Table 1

Association of predicted and observed “serious complications”

Table 2

Table 3

Table 4

Prediction of other outcomes

Table 5

Discussion

Key findings

Strengths and limitations

Comparison with similar research

Explanations of findings

Implications and actions needed

Conclusions

Acknowledgments

Footnote

References

Article Options

Download Citation

Share