In the 8th edition of the TNM classification of lung cancer, the N category is determined exclusively by the anatomic location of metastatic lymph nodes (LNs). However, this simple classification yields heterogeneous prognostic subgroups among patients within the same N category. Therefore, efforts have been made to incorporate the overall metastatic LN burden into lung cancer LN staging assessment. For this purpose, studies have proposed the use of various metrics, including the number, station, zone, and size of metastatic LNs, as well as the ratio of the number of metastatic LNs to the total number of removed nodes at surgery [
1–
3]. In addition, the International Association for the Study of Lung Cancer (IASLC) proposed further subdividing N1 and N2 disease on the basis of the presence of skip nodal metastases and the number of metastatic LN stations [
4]. Although these efforts have been promising, opportunity for substantial improvement remains, because results to date either have been inconsistent or have indicated prognostic differences only between certain N subcategories [
5–
7].
Extranodal extension (ENE) of nodal metastasis is defined as the extension of metastatic cells beyond the capsule of the metastatic node. ENE has received increasing attention in oncology, now being recognized as a prognostic factor for several malignant tumors [
8–
10]. In addition, the 8th edition TNM staging system introduced the use of ENE in determining the N category for head and neck cancers and for vulvar cancer [
11,
12]. After that change, the clinical N category of head and neck cancers now contains detailed subcategories based not only on the number and size of metastatic LNs but also on the clinical assessment of ENE. Specifically, patients with head and neck cancer may be judged as clinically positive for ENE on the basis of unambiguous findings of ENE on physical examination (e.g., fixation of the nodal mass in the neck or evidence of nerve dysfunction), supported by radiologic evidence of ENE (e.g., indistinct nodal margin or direct nodal invasion of surrounding structures) [
11].
In 2018, the Staging and Prognostic Factors Committee of the IASLC highlighted the prognostic impact of ENE as a research goal in preparing the 9th edition of the N descriptor [
13]. Several studies in patients with non–small cell lung cancer (NSCLC) have shown pathologic ENE in metastatic LNs to be associated with unfavorable prognosis [
14–
16]. If evidence of ENE on imaging studies were to predict pathologic ENE and likewise have prognostic impact, then radiologic ENE could play a role in determining treatment strategy. However, to our knowledge, the prognostic implications of radiologic ENE in the clinical staging of lung cancer and the diagnostic performance of radiologic ENE in predicting pathologic ENE have not been studied. Moreover, studies that have assessed the prognostic impact of ENE solely on the basis of pathologic assessment from surgical specimens are limited because up-front surgery is generally not performed in patients with clinical N2 disease if preoperative imaging shows evidence of ENE [
17]; thus, studies including only patients who underwent up-front surgical resection would have captured a small fraction of all patients with ENE (i.e., primarily those patients without radiologic ENE). The aims of this study were to evaluate the prognostic utility of radiologic ENE and the diagnostic performance of radiologic ENE in predicting pathologic ENE in patients with NSCLC.
Methods
The institutional review board of the authors' institution approved this retrospective study and waived the requirement for informed consent.
Study Patients
The institution's clinical data warehouse, housed within the EMR, was searched for consecutive patients who underwent a staging workup, including chest CT, for histologically confirmed NSCLC that was initially diagnosed between January 2010 and December 2016 at Seoul National University Bundang Hospital (a tertiary care center). Patients were then excluded if they did not have clinical T (of any category), N1 or N2, and M0 disease, based on a review of the staging chest CT examinations and additional available staging-related imaging tests and pathology reports (as described later in the Methods). Additional patients were excluded if they had LN enlargement owing to concomitant lymphoma or previous malignancy with evidence of disease in the past 5 years (aside from inclusion of the latter patients in a sensitivity analysis, as explained later in the Methods). Of included patients, 242 were also included in a prior study that evaluated the prognostic performance of the clinical N descriptors proposed by the IASLC for lung cancer staging [
18]; however, that prior study did not assess radiologic or pathologic ENE.
Baseline Characteristics
Two board-certified subspecialty-trained thoracic radiologists (S.J. and K.H.L. with 2 and 10 years of posttraining experience, respectively) in consensus determined patients' clinical T, N, and M categories according to the 8th edition of the TNM staging system, on the basis of a review that included assessment of the staging chest CT examinations and their associated clinical reports within the EMR. The clinical T category was classified as Tis or T1 (hereafter, Tis/T1), T2, T3, or T4. The clinical N category was classified on the basis of the IASLC system as N1a, N1b, N2a1, N2a2, or N2b. The clinical N category was determined on the basis of the results of all available staging tests, including imaging studies (e.g., chest CT and FDG PET/CT examinations), endobronchial ultrasound-guided transbronchial LN needle aspiration, and percutaneous cervical LN biopsy [
19,
20]. For patients who did not undergo LN aspiration or biopsy, the clinical N category was determined on the basis of the results of imaging studies only (e.g., CT and FDG PET/CT). Contrast-enhanced CT, when available, was preferred over noncontrast CT for purposes of LN assessment. The two reviewers jointly measured LNs during this review. LNs were considered metastatic if having a short-axis diameter of at least 1.0 cm; however, peripheral LNs (i.e., LNs located at the lobar level or more distally) were considered metastatic if having a short-axis diameter of at least 0.8 cm, a location in the primary tumor's lymphatic drainage pathway, and either round morphology or contrast enhancement. The short-axis diameter of the largest metastatic LN was recorded. The M category was determined on the basis of the results of all available staging tests, including imaging studies (e.g., chest CT, brain MRI, and FDG PET/CT examinations) and pathology reports and was classified as M1 in the presence of clinical or microscopic evidence of distant metastasis and as M0 otherwise.
Information on age, sex, smoking status, and family history of lung cancer was obtained from the EMR. Treatment of lung cancer was classified as surgery without neoadjuvant chemotherapy, surgery with neoadjuvant chemotherapy, or nonsurgical management. The nodule type (subsolid vs solid) of the primary tumor and location (upper lobe, lower lobe, or both upper and lower lobes) of the primary tumor were recorded after review of the images from the staging chest CT examinations by the two previously noted radiologists (S.J. and K.H.L.) in consensus. Finally, the histologic subtype of lung cancer (adenocarcinoma, squamous cell carcinoma, or other) was recorded after review of pathologic reports.
Assessment of Radiologic Extranodal Extension
Six weeks after completion of the described steps, the previously noted two thoracic radiologists (S.J. and K.H.L.) performed an additional review of the staging chest CT in each patient, blinded to pathologic findings of ENE and to clinical outcomes. The examinations from the first 30 patients in the study sample (according to the date of staging chest CT, in chronologic order) were reviewed in consensus; the remaining examinations were reviewed independently.
Figure 1 provides illustrations of the conceptual framework of ENE on which the selection of imaging features was based [
21] (
Fig. 1A: no ENE;
Fig. 1B: microscopic ENE [breaching of LN capsule by tumor];
Fig. 1C: macroscopic ENE [grossly visible extranodal soft-tissue involvement]). The radiologists assessed the CT images for four possible findings of ENE (indistinct LN margin, coalescent LNs, direct invasion of adjacent structures, and central necrosis), each evaluated in a binary fashion.
Figure 2 provides a representative example of each of the four findings, which in prior studies have shown varying degrees of association with pathologic ENE in head and neck cancer [
22–
24]. Indistinct margin was selected as a possible correlate of either microscopic or macroscopic ENE and was defined as a poorly defined or irregular nodal margin or infiltration of the fat plane around the LN. Coalescent LNs and direct invasion of adjacent structures were selected as possible correlates of macroscopic ENE; coalescent LNs were defined as juxtaposed LNs with no definable intervening fat plane, including a confluent mass of LNs that were indistinguishable from one other [
24]. Although necrosis does not have a direct pathologic correlate relating to ENE, it was selected because of its previously observed strong association with pathologic ENE in head and neck cancer [
23,
25]; necrosis was defined as central low attenuation within the LN, qualitatively similar to the attenuation of fluid. The four features were assessed on a patient level and considered positive if any intrathoracic LN showed the given feature. The radiologists also assessed overall confidence in the presence of ENE, scored on a subjective 3-point scale on the basis of gestalt impression at the patient level: no evidence of ENE, possible or probable ENE (hereafter, possible/probable ENE), and unambiguous ENE.
Figure 3 provides a representative example of each of these three tiers.
Assessment of Pathologic Extranodal Extension
In patients who underwent surgery, the pathologic reports were reviewed to record the presence of pathologic ENE; the pathologic slides were not reviewed for this purpose. During the study period, attending pathologists at the study institution routinely commented on the presence of pathologic ENE within the reports as part of the standard of care. The pathologists defined pathologic ENE as the extension of tumor cells beyond the capsule of a metastatic LN and into perinodal tissue. The pathology reports did not differentiate ENE involving LNs in the primary lung specimen versus in separately submitted LNs. No attempt was made to correlate the specific location of LNs between radiologic and pathologic assessments. In addition, in patients who underwent surgery, the pathologic T category and pathologic N category were recorded based on the pathologic reports.
Overall Survival
A database of the Ministry of the Interior and Safety, Republic of Korea, was reviewed to determine information regarding survival status and date of death for each patient. Overall survival (OS) was determined for each patient, defined as the interval from the date of staging CT to the date of death from any cause. The time of censoring was the earliest of the date of death, the last follow-up date, or February 17, 2022 (reflecting the date of final database review).
Statistical Analysis
The study's primary outcome was OS. Secondary outcomes were interreader agreement for radiologic ENE and the diagnostic performance of radiologic ENE in predicting pathologic ENE.
Survival curves and 5-year OS rates were derived and compared using the Kaplan-Meier method and log-rank test. Univariable Cox regression analysis was performed using clinical and radiologic variables, including age, sex, smoking history, family history, primary tumor nodule type, primary tumor lobar location, tumor histology (classified as adenocarcinoma vs other), clinical T category, clinical N category, LN size (classified as < 2.0 cm vs ≥ 2.0 cm), and radiologic ENE. Variables with p values less than .10 in the univariable analysis were entered into the multivariable analyses. Multivariable Cox proportional hazards models were constructed to estimate the multivariable-adjusted HRs and 95% CIs of radiologic ENE. The assumption of proportionality was checked by visually inspecting Schoenfeld residual plots. The prognostic significance of radiologic ENE was analyzed separately for each reader.
Two sensitivity analyses were performed. The first sensitivity analysis assessed clinical N category as a three-tiered classification (N1, N2a, N2b), reflecting a prior study that showed significant differences in survival between these three groups [
18]. The second sensitivity analysis included patients with a history of malignancy and evidence of disease within the prior 5 years who otherwise were excluded from the study. The Kaplan-Meier method was also used to assess the association of survival rates with pathologic ENE in the subset of patients who underwent surgery (adjusting for pathologic T and N categories rather than clinical T and N categories). An exploratory analysis was also performed using the previously noted three-tiered classification of nodal disease, with further stratification of clinical N2a disease (i.e., a single N2 metastatic LN) by the presence versus absence of radiologic unambiguous ENE; this analysis used the Holm method to adjust for multiple comparisons [
26].
Interreader agreement between the two readers was evaluated on the basis of Cohen kappa coefficients for the four binary features and weighted kappa coefficients for radiologic ENE. Kappa coefficients were classified as follows: 0.00–0.20, slight; 0.21–0.40, fair; 0.41–0.60, moderate; 0.61–0.80, substantial; 0.81–1.00, almost perfect [
27]. Agreement was also calculated for the four binary features and three-tiered radiologic ENE using percentage agreements.
For each reader, the sensitivity, specificity, PPV, NPV, and accuracy of CT findings were calculated in patients who underwent surgery, using as reference standard pathologic ENE as documented in the pathologic reports. These performance measures were assessed for each of the four binary CT features, for possible/probable ENE or unambiguous ENE, and for unambiguous ENE. These assessments were also performed in patients who underwent surgery without neoadjuvant chemotherapy. The sensitivity, specificity, PPV, and NPV of the four binary CT findings were also assessed, stratified by histologic type.
In both the survival and diagnostic performance assessments, the analysis for each reader used the results of the consensus assessments for the first 30 patients and the results of the given reader's independent assessments for the remaining patients. The assessment of interreader agreement was performed only in the patients evaluated independently by the two readers (e.g., excluding the initial 30 patients used in the consensus reading). A p value less than .05 was considered to be statistically significant. All statistical analyses were performed using R software, version 4.1.2.
Discussion
In this retrospective study, we evaluated the prognostic implications of radiologic ENE on staging chest CT in patients with NSCLC. The 5-year OS rate was significantly associated with radiologic ENE, being progressively worse in patients with, in order, no ENE, possible/probable ENE, and unambiguous ENE. In addition, for both readers, unambiguous ENE was an independent poor prognostic predictor of OS in multivariable analysis incorporating a range of clinical variables. Interreader agreement was moderate or substantial for four binary CT features that were evaluated as potential predictors of ENE and substantial for overall impression of radiologic ENE. Unambiguous radiologic ENE had low sensitivity (9–23%) but high specificity (87–96%) for predicting pathologic ENE.
The findings support the use of radiologic ENE in prognostic assessment in patients with NSCLC. Although radiologic ENE is an imperfect predictor of pathologic ENE, radiologic ENE can be applied in patients who are not candidates for surgery. Radiologic unambiguous ENE but not radiologic possible/probable ENE was an independent poor prognostic factor. This result is consistent with the current clinical staging guideline for head and neck cancers, in which only unambiguous ENE is considered to represent clinical ENE [
28,
29]. Previous studies of pathologic ENE in head and neck cancers also support the present findings in that only macroscopic ENE is associated with poor outcomes, whereas microscopic ENE has a similar prognosis as no ENE [
21,
30]. In the current study, the CT features with highest specificity were those selected to correlate with macroscopic ENE (coalescent LNs and direct invasion of adjacent structures). Given these considerations, we believe that only radiologic unambiguous ENE should be considered to represent clinical ENE in lung cancer staging.
In patients with NSCLC and clinical N2 disease, surgery may be deferred in those with unambiguous radiologic ENE, in whom the disease may be deemed unresectable [
17]. Although debate continues regarding the role of surgery in patients with N2 disease, international guidelines [
31,
32] suggest that surgical resection may be appropriate in the subset of patients with N2a disease (i.e., single-station N2 disease). Radiologic ENE could potentially be used to help refine prognosis assessments and surgical decision-making in patients with clinical N2a disease. The exploratory analysis in patients with clinical N2a disease showed a nonsignificant difference in the 5-year OS rate between those with and without radiologic unambiguous ENE. Identification of radiologic ENE may also be important in patients with clinical N1 disease in that pathologic ENE results in reclassification of such patients from R0 to R1 status (in terms of presence of residual tumor) [
33]. Thus, patients with clinical N1 disease and radiologic unambiguous ENE may be selected to undergo neoadjuvant chemotherapy or to undergo surgery with planned postoperative adjuvant therapy given anticipated R1 resection.
The interreader agreement for assessment of radiologic ENE, when considering binary features and overall impression, was moderate to substantial. Previous studies investigating radiologic ENE in head and neck cancers found wide interreader agreement across features (κ, 0.18–0.86) [
22,
23,
25,
34]. Diagnostic performance of radiologic ENE for predicting pathologic ENE was also suboptimal. Therefore, more reliable diagnostic CT criteria for ENE should be established and radiologists should be trained in the application of such criteria. In particular, the sensitivity of the various features and of overall impression was low. In comparison with cervical and axillary LNs, evaluation of the margins of mediastinal LNs is difficult given the scarcity of surrounding fat and adjacent bronchovascular structures. In addition, necrosis showed relatively low accuracy compared with the other binary findings; this finding was selected as a potential CT feature of ENE on the basis of its association with pathologic ENE in head and neck cancers, rather than on the basis of an anticipated direct correlation with ENE. Despite these various concerns, overall impression for radiologic ENE had high specificity for pathologic ENE; the high specificity is important for radiologic ENE to maintain prognostic impact during staging and risk stratification.
There were limitations to this study. First, generalizability remains uncertain, given the study's retrospective single-institution design. Second, the study sample had a strong male predominance, also introducing bias. Third, only two fellowship-trained thoracic radiologists participated in the image assessment. Further research should study interreader agreement among readers with greater variation in background and experience and assess intrareader agreement. Fourth, the evaluation of intrathoracic LN metastasis may have been confounded by the high prevalence of Mycobacterium tuberculosis in the Republic of Korea. Fifth, the association of radiologic ENE with pathologic ENE was evaluated on a per-patient basis, not on a per-station basis. Finally, ENE involving LNs within the main lung specimen and involving separately submitted LNs were both counted as pathologic ENE.
In conclusion, radiologic unambiguous ENE was an independent predictor of worse OS in patients with NSCLC in multivariable analysis including numerous clinical prognostic factors. Radiologists' overall impression for ENE showed substantial interreader agreement and low sensitivity but high specificity for pathologic ENE. Although the findings require validation in further studies, the results support the use of radiologic ENE in staging workup and treatment selection in patients with NSCLC and clinical N1 or N2 disease.