 Original article
 Open Access
 Published:
Modelling the probability of erroneous negative lymph node staging in patients with colon cancer
Cancer Communications volume 39, Article number: 31 (2019)
Abstract
Background
Patients in who with insufficient number of analysed lymph nodes (LNs) are more likely to receive an incorrect LN staging. The ability to calculate the overall probability of undiagnosed LN involvement errors in these patients could be very useful for approximating the real patient prognosis and for giving possible indications for adjuvant treatments. The objective of this work was to establish the predictive capacity and prognostic discriminative ability of the final error probability (FEP) among patients with colon cancer and with a potentially incorrectlystaged LNnegative disease.
Methods
This was a retrospective multicentric population study carried out between January 2004 and December 2007. We used a mathematical model based on Bayes’ theorem to calculate the probability of LN involvement given a FEP test result. Cumulative sum graphs were used to calculate risk groups and the survival rates were calculated, by month, using the Kaplan–Meier method.
Results
A total of 548 patients were analysed and classified into three risk groups according to their FEP score: lowrisk (FEP < 2%), intermediaterisk (FEP 2%–15%), and highrisk (FEP > 15%). Patients with LN involvement had the lowest overall survival rate when compared to the three risk groups. This difference was statistically significant for the low and intermediaterisk groups (P = 0.002 and P = 0.004, respectively), but highrisk group presented similar survival curves to pN+ group (P = 0.505). In terms of diseasefree survival, the highrisk group presented similar curves to the intermediaterisk group until approximately 60 months’ followup (P = 0.906). After 80 months’ followup, the curve of highrisk group coincided with that of the pN+ group (P = 0.172). Finally, we summarized the FEP according to the number of analysed LNs and accompanied by a contour plot which represents its calculation graphically.
Conclusions
The application of Bayes’ theorem in the calculation of FEP is useful to delimit risk subgroups from among patients without LN involvement.
Introduction
Colon cancer is the most frequent malignancy in both sexes in Western countries, with an incidence of approximately 471,000 cases per year and a mortality of 228,000 cases per year in Europe [1]. Lymph node (LN) involvement is the prognostic factor most directly related to the survival and diseasefree interval of cancer patients. Thus, patients with stages I and II cancers have a 5year overall survival (OS) rate higher than 75% compared to 30%–60% in patients with stages III and IV cancers [2,3,4]. Tumornodemetastasis (TNM) classification is the gold standard method for staging colon cancer, however, this system recommends collecting at least 12 LNs for correct staging. Despite the multidisciplinary approach to LN analysis in colon cancer, for various reasons related to patients, surgeons, and pathologists, the number of LNs analysed is very variable between patients [5], and significantly fewer than 12 are usually analysed [6, 7]. Without a doubt, patients with a pN0 LN staging have the highest risk (in terms of their therapeutic management) of suffering the most harmful consequences of being given an incorrect classification and prognosis. Calculation of the final error probability (FEP), i.e. the probability that the patients will present undiagnosed LN involvement, would be very useful for predicting the real prognosis and possible indications for adjuvant treatment among these patients.
The Bayes’ theorem statistical method can be used to calculate the probability of presenting affected LNs even when a patient presents negative anatomopathological study results. It considers the anatomopathological study of the surgical specimen as a diagnostic test with a binary result in this case: positive LNs (presence of disease) or negative LNs (absence of disease) [8]. The probability of a patient having the disease, even when given a negative test result (i.e., the probability that a patient with a negative histological study result has an unidentified lymphnode metastasis) is represented by the complementary value of the final negative predictive value. The objective of this work was to establish the predictive capacity and prognostic discriminative ability of the FEP among patients with colon cancer and with a potentially incorrectlystaged LNnegative disease.
Materials and methods
Patients
This is a multicentric population study using data from a highquality tumour sample registry included in the European cancer registrybased study on survival and care of cancer patients (EUROCARE study) [1]. The data used from this registry corresponded to the period between January 2004 and December 2007. Patients with colon cancer treated with surgery with curative intent and lymphadenectomy, a complete anatomopathological report, and a clear clinical status at their last follow up were included. Patients with cancer of the rectum or caecal appendix, with metastases at diagnosis, scheduled surgery with palliative intention without lymphadenectomy, scheduled surgery without resection, incomplete anatomopathological reports, a dubious vital status at the last followup control, and those with insufficient or no monitoring were excluded. The study was approved by the institutional review board of the Hospital General de Castellon (PIC: 2013/2/CIR). All participating patients provided their written informed consent.
Variables
The study variables were age, sex, tumour location, histology, differentiation grade, and the size, number of analysed LNs, number of positive LNs, TNM classification, condensed T and N stages, chemotherapy, FEP, OS, diseasefree survival (DFS), overall recurrence, locoregional recurrence, metastasis, and followup time.
Because all the data in the tumour registry is coded according to the sixth edition of the Union for International Cancer Control (UICC) TNM classification, we had to adapt them to the new guidelines for the seventh edition. Thus, although the N category was easily adapted, the T category could not be adapted to the new classification because the tumour registry contained insufficient data. As in other population studies, to minimise the effects of possible misclassifications, we used condensed TNM stages. The recurrence variable included patients who presented locoregional recurrence and those who presented distant metastases.
FEP
We used a wellknown mathematical model based on Bayes’ theorem to calculate the various diagnostic test parameters (sensitivity, specificity, and predictive values). According to Bayes’ theorem, the FEP is the probability of LN involvement (N+) given a negative test result (n−), in other words, p(N+/n−), can be deduced from the following mathematical formula [8]:
In the Bayes’ theorem formula: p(N+) is the prevalence of pN1 cases in the series; p(N−) is the complement to p(N+); p(n−/N+) is probability of a false negative (1 − Sensitivity) and is calculated by obtaining the hypergeometric probability resulting from the consideration of (1) the total LNs analysed from all the patients in the series; (2) the total number of positive LNs obtained in the series; (3) the number of positive LNs in a specific patient (equal to 0 for pN0 cases); and (4) the number of LNs analysed in a specific patient; the Specificity is p(n−/N+) and equals 1 because the presence of false ganglionic positives is considered impossible.
Given that there is a substantially greater probability of patients misclassified as pN0 (because they had an insufficient number of LNs analysed) having pN1 rather than pN2 or pN3 tumours, we decided to calculate the FEP of pN1 incorrectly being classified as pN0. Thus, all the FEP calculations refer to this adjusted FEP, set to pN1. Once the FEP was obtained, Cumulative Sum (CUSUM) [9] curves were used to calculate the optimal cutoff points following the method described by Barrio et al. [10], to obtain three incorrect pN0 classification risk groups.
Statistical analysis
Quantitative variables are expressed as the mean ± standard deviation (SD). Categorical variables are reported as frequencies and percentages. For the univariate analysis, the Chi square test (or exact Fisher test in small samples) was used to compare two qualitative samples; the Student ttest was used to compare two quantitative samples; and the ANOVA test was used to compare more than two quantitative samples.
The followup time we considered was from the date of surgery until the day of death, or the last day of followup in patients who did not die. This was because the tumour registry did not contain any clear definition of the date of diagnosis. Survival analysis was performed using the Kaplan–Meier method and the logrank test was implemented to estimate the differences between groups in terms of OS and DFS. Probability values of P < 0.05 were accepted as the statistical significance cutoff level. Statistical analysis was carried out with the IBM SPSS Statistics^{®} program version 22 (IBM^{®}, Armonk, New York, USA). The CUSUM curves were calculated using the STATA^{®} program version 14 (StataCorp LP^{®}, College Station, Texas, USA).
Results
During the period from January 2004 to December 2007, 944 patients were diagnosed with colon cancer in Castellon province (Spain), 140 of which were not operated on because they had contraindications for anaesthesia or because they had unresectable neoplasms. Eightythree patients were operated on with palliative intention and did not have an accurate lymphadenectomy record and so were not included in this work. We also excluded 116 cases with colon neoplasms that did receive an intervention but who also presented synchronous distant metastases. Insufficient data were obtained regarding the number of LNs analysed and affected in 49 cases and for 8 of the patients, there were no followup data recorded. Thus, here we eventually analysed 548 patients (Fig. 1).
According to the adjusted FEP and the optimal cutoff points, the lowrisk (FEP < 2%) category corresponded to patients who had > 20 analysed LNs; the intermediaterisk (FEP 2%–15%) corresponded to cases with 6–20 analysed LNs; and highrisk (FEP > 15%) corresponded to patients with ≤ 5 analysed LNs.
The clinical and histopathological characteristics of the 548 patients, and of the three riskgroups, are shown in Table 1. Of note, (1) younger patients had a significantly lower risk (P = 0.002); (2) more tumours were located in the right colon in lowrisk patients (P < 0.001); (3) the highrisk group contains more well differentiated tumours (P = 0.013); (4) the lowrisk group had more cases of pT3–T4 TNM classifications (P = 0.044); (5) as the patient risk increased, the tumour sizes tended to decrease (P < 0.001); and (6) as fewer LNs were analysed the patient risk and overall mortality increased (P < 0.001 and P = 0.019, respectively). The following factors were identified as being related to OS (Table 2): age [hazard ratio (HR) = 1.05; 95% confidence intervals (CI) 1.03–1.07; P < 0.001], pT (HR = 1.79; 95% CI 1.44–2.22; P < 0.001), pN (HR = 1.22; 95% CI 1.10–1.35; P < 0.001), condensed TNM (HR = 1.55; 95% CI 1.27–1.90; P < 0.001), number of analysed LNs < 12 (HR = 0.74; 95% CI 0.56–0.98; P = 0.025), FEP (HR = 1.51; 95% CI 1.09–2.10; P = 0.019), chemotherapy (HR = 0.62; 95% CI 0.45–0.84; P = 0.014), locoregional recurrence (HR = 2.28; 95% CI 1.54–3.38; P < 0.001), and the presence of metastasis (HR = 3.51; 95% CI 2.66–4.65; P < 0.001). On the other hand, the following factors were identified as being related to DFS (Table 2): pT (HR = 1.87; 95% CI 1.38–2.53; P = 0.006), pN (HR = 1.89; 95% CI 1.49–2.41; P < 0.001), condensed TNM (HR = 1.89; 95% CI 1.41–2.52; P = 0.001), chemotherapy (HR = 1.93; 95% CI 1.33–2.81; P < 0.001), locoregional recurrence (HR = 12.28; 95% CI 8.13–15.54; P < 0.001), and the presence of metastasis (HR = 67.95; 95% CI 39.68–116.33; P < 0.001).
Table 3 shows the FEP results according to the number of analysed LNs and is accompanied by a contour plot (Fig. 2) which represents its calculation graphically.
The OS charts (Fig. 3a) show that patients with LN involvement had the lowest OS rate than the three risk groups. This difference was statistically significant for the low and intermediaterisk groups (P = 0.002 and P = 0.004, respectively), but not with respect to the highrisk group (P = 0.505). In other words, N0 patients with a highrisk FEP (> 15%) had an OS rate like that of pN+ patients.
We subsequently carried out a similar analysis in which we divided patients with LN involvement into pN1 and pN2 groups. As shown in Fig. 3b, the OS rates differed between the low and highrisk groups (P = 0.014). The pN1 group had statistically significant difference to the lowrisk and pN2 group (P = 0.008 and P = 0.034, respectively). On the other hand, pN2 group had a significantly worse prognosis than low and intermediaterisk groups (all P < 0.001), but there were no statistically significant differences between highrisk group (P = 0.066). Interestingly, the survival curves of the highrisk and pN1 patient were very similar (P = 0.980). In terms of DFS (Fig. 4a), the curves for the intermediate and highrisk groups were similar until approximately 60 months’ followup (P = 0.906). After 80 months’ followup, the curve of highrisk group coincided with that of the pN+ group (P = 0.172). The curves for the intermediate and highrisk group and pN1 groups were all similarly (Fig. 4b; intermediaterisk vs. highrisk, P = 0.441; intermediaterisk vs. pN1, P = 0.289; highrisk vs. pN1, P = 0.978).
Discussion
LN involvement is one of the most important prognostic factors in colon cancer. Given its enormous importance, especially in terms of prognostic and therapeutic decisions, gaining a detailed picture of the true LN status of patients with colon cancer should be a priority for clinicians involved in the diagnostic–therapeutic process of these patients. In this sense, the use of FEP may be useful, especially for patient groups which more frequently see staging errors. Our group has extensive experience in using FEP to assess LN involvement in the contexts of colon [11], gastric [12], and breast cancers [13]. This method aims to calculate the probability that a pN+ patient in whom with insufficient analysed LNs will be erroneously classified as pN0. This has obvious clinical and prognostic consequences, especially in the light that stage III patients (i.e., those with LN involvement) can significantly benefit from chemotherapy [2, 14,15,16]. Correctly staging patients with colon cancer is very important given that around 60% of them do not currently present LN involvement at diagnosis [6, 7], a figure similar to the 63.3% staged as pN0 in this study.
FEP is particularly important for the patients in whom with insufficient analysed LNs because their TNM classification would not necessarily be accurate. This was the case in the series used in this study, in which the threshold 12 LNs were not analysed in 56% of the cases. Similar results have been reported in several previous studies, especially in population analyses like ours, in which this lymphnode threshold goal was not reached [2, 6, 17, 18]. There are likely several reasons why it is common for so few LNs to be analysed, likely because of factors related to the patient and surgeons, and to the type of anatomopathological study undertaken. Our results clearly show that FEP is strongly negatively related to lower OS rates (Fig. 3a). Moreover, patients with LN metastases have a poorer prognosis than those classified into the three risk groups. Most importantly, there were no statistically significant differences between the pN+ and highrisk patients. In other words, patients categorised into the highrisk FEP group had similar OS rates to those with LN involvement.
To try to increase the accuracy of our analysis, we further categorised pN1 and pN2 patients with LN infiltration and, as expected, pN2 patients had the poorest prognosis of the three groups whereas the lowrisk group had the best OS rate (Fig. 3b). It is also important to highlight that the highrisk and pN1 patients had very similar OS curves. Therefore, the pN1 patients had a similar OS rate to the pN0 patients with a high staging error risk. In terms of DFS, patients with an intermediate and high risk had a DFS rate like that of pN1 patients (Fig. 4b). Thus, our data confirm that a minimum of 12 LNs must be analysed to reduce TNM classification staging errors. It should also be noted that lowrisk patients had more rightcolon neoplasms, were younger, and had larger tumours, i.e., their tumour characteristics favoured easier LN analysis [5, 19]. Similarly, when younger patients were included in this group and more of LNs were analysed, their OS rate was better.
The use of this mathematical model, which is based on the Bayes’ theorem, has been previously described in the literature in the identification of groups with similar prognoses from among patients with different cancers but with similar characteristics to those of our patient cohort. This model was first described by Kiricuta et al. [8] in 1992 and was applied in breast cancer to calculate the probability of tumour persistence after an incomplete axillary dissection, staging the patients according to the T category of the TNM classification. Later, Okamoto et al. [20] studied the probability of LN involvement in patients with negative sentinelLN breast cancer using a Bayesian model. Iyer et al. [21] further demonstrated the usefulness of this method in 1652 patients with breast cancer in a study which aimed to evaluate the probability of LN involvement by staging it into T1 and T2. Following on from this work, Joseph et al. [22] studied 1585 patients with colorectal cancer and used this method to demonstrate the probability of LN involvement according to the number of analysed LNs using the same staging as Kiricuta et al. [8]. In a study of 480 patients with colon cancer, Martínez et al. [11] also showed that the risk of an erroneous negative ganglionic classification in colon cancer can be individualised by calculating its probability according to Bayes’ theorem.
According to the European Society for Medical Oncology guidelines, in our speciality a specific adjuvant treatment regimen is recommended for stage II patients with poor prognosis factors, including the analysis of fewer than 12 LNs. However, this regimen is not routinely applied, and the patient must sign their express informed consent to undergo this treatment [23].
It is useful to know the final probability of error in this field because it allows pN0 patients to be divided into several risk categories. This work shows that highrisk patients (patients without LN involvement but with five or fewer analysed LNs), have a similar prognosis to pN1 patients. The main limitations of this work lie in its retrospective nonrandomised population study design which resulted in a loss of evidence. Another limitation was our use of the T category from the sixth rather than the seventh edition of the TNM classification; this was because this version was in effect during the period in which the study data was collected. However, to help reduce this bias, we allocated the patients either into a group with earlierstage tumours (T1 and T2) or into one with locally more advanced tumours (stages T3 and T4). This minimised the effect of any modifications to the T factor classification made between these two TNM editions.
Conclusions
The application of Bayes’ theorem in the calculation of FEP is useful to delimit risk subgroups from among patients without LN involvement. Therefore, the FEP system complements the TNM LN staging classification well by improving the discrimination of the colon cancer prognosis in pN0 patients with a highrisk of having been misclassified as free of LN involvement because too few of their LNs had been analysed.
Availability of data and materials
The primary dataset will not be shared because it is subject to confidentiality restrictions.
Abbreviations
 FEP:

final error probability
 OS:

overall survival
 TNM:

tumornodemetastasis
 DFS:

diseasefree survival
 HR:

hazard ratio
 CI:

confidence intervals
 SD:

standard deviation
 CUSUM:

cumulative sum
References
 1.
Holleczek B, Rossi S, Domenic A, Innos K, Minicozzi P, Francisci S, et al. Ongoing improvement and persistent differences in the survival for patients with colon and rectum cancer across Europe 1999–2007—results from the EUROCARE5 study. Eur J Cancer. 2015;51:2158–68.
 2.
Johnson PM, Porter GA, Ricciardi R, Baxter NN. Increasing negative LN count is independently associated with improved longterm survival in stage IIIB and IIIC colon cancer. J Clin Oncol. 2006;24:3570–5.
 3.
Swanson RS, Compton CC, Stewart AK, Bland KI. The prognosis of T3N0 colon cancer is dependent on the number of LNs examined. Ann Surg Oncol Off J Soc Surg Oncol. 2003;10:65–71.
 4.
Le Voyer TE, Sigurdson ER, Hanlon AL, Mayer RJ, Macdonald JS, Catalano PJ, et al. Colon cancer survival is associated with increasing number of LNs analyzed: a secondary survey of intergroup trial INT0089. J Clin Oncol. 2003;21:2912–9.
 5.
Chou JF, Row D, Gonen M, Liu YH, Schrag D, Weiser MR. Clinical and pathologic factors that predict LN yield from surgical specimens in colorectal cancer: a populationbased study. Cancer. 2010;116:2560–70.
 6.
Ricciardi R, Madoff RD, Rothenberger DA, Baxter NN. Populationbased analyses of LN metastases in colorectal cancer. Clin Gastroenterol Hepatol. 2006;4:1522–7.
 7.
Persiani R, Cananzi FCM, Biondi A, Paliani G, Tufo A, Ferrara F, et al. Log odds of positive LNs in colon cancer: a meaningful ratiobased LN classification system. World J Surg. 2012;36:667–74.
 8.
Kiricuta CI, Tausch J. A mathematical model of axillary LN involvement based on 1446 complete axillary dissections in patients with breast carcinoma. Cancer. 1992;69:2496–501.
 9.
Noyez L. Control charts, cusum techniques and funnel plots. A review of methods for monitoring performance in healthcare. Interact Cardiovasc Thorac Surg. 2009;9:494–9.
 10.
Barrio I, Arostegui I, RodríguezÁlvarez MX, Quintana JM. A new approach to categorising continuous variables in prediction models: proposal and validation. Stat Methods Med Res. 2017;26:2586–602.
 11.
MartínezRamos D, EscrigSos J, Hoashi JS, RivadullaSerrano I, SalvadorSanchís JL, Ruiz del Castillo J. Un método para individualizar el riesgo de una errónea clasificación ganglionar negativa en el cáncer de colon. Cir Esp. 2010;88:383–9.
 12.
MirallesTena JM, EscrigSos J, MartínezRamos D, ÁngelYepes V, VillegasCánovas C, SenentVizcaíno V, et al. Cáncer gástrico: valoración probabilística ante un estadio ganglionar negativo y sus consecuencias. Cir Esp. 2006;80:32–7.
 13.
MartinezRamos D, EscrigSos J, AlcaldeSanchez M, TorrellaRamos A, SalvadorSanchis JL. Diseasefree survival and prognostic significance of metastatic LN ratio in T1–T2N positive breast cancer patients. A population registrybased study in a European Country. World J Surg. 2009;33:1659–64.
 14.
Shia J, Wang H, Nash GM, Klimstra DS. LN staging in colorectal cancer: revisiting the benchmark of at least 12 LNs in R0 resection. J Am Coll Surg. 2012;214:348–55.
 15.
Fernández SP, Díaz SP, Calviño BL, Santamaría PG, Pillado TS, Monreal FA, et al. Diagnosis delay and followup strategies in colorectal cancer. Prognosis implications: a study protocol. BMC Cancer. 2010;10:1–6.
 16.
Topham C, Zaninelli M, Clingan P, Bridgewater J, Tabahfisch I. Oxaliplatin, fluorouracil, and leucovorin as adjuvant treatment for colon cancer. N Engl J Med. 2010;23:43–51.
 17.
Edler D, Öhrling K, Hallström M, Karlberg M, Ragnhammar P. The number of analyzed LNs—a prognostic factor in colorectal cancer. Acta Oncol (Madr). 2007;46:975–81.
 18.
Wright FC, Law CHL, Last L, Khalifa M, Arnaout A, Naseer Z, et al. LN retrieval and assessment in stage II colorectal cancer: a populationbased study. Ann Surg Oncol. 2003;10:903–9.
 19.
Jessup JM, McGinnis LS, Steele GD Jr, Menck HR, Winchester DP. The National Cancer Data Base. Report on colon cancer. Cancer. 1996;78:918–26.
 20.
Okamoto T, Yamazaki K, Kanbe M, Kodama H, Omi Y, Kawamata A, et al. Probability of axillary LN metastasis when sentinel LN biopsy is negative in women with clinically node negative breast cancer: a Bayesian approach. Breast Cancer. 2005;12:203–10.
 21.
Iyer RV, Hanlon A, Fowble B, Freedman G, Nicolaou N, Anderson P, et al. Accuracy of the extent of axillary nodal positivity related to primary tumor size, number of involved nodes, and number of nodes examined. Int J Radiat Oncol Biol Phys. 2000;47:1177–83.
 22.
Joseph NE, Sigurdson ER, Hanlon AL, Wang H, Mayer RJ, MacDonald JS, et al. Accuracy of determining nodal negativity in colorectal cancer on the basis of the number of nodes retrieved on resection. Ann Surg Oncol. 2003;10:213–8.
 23.
Labianca R, Nordlinger B, Beretta GD, Mosconi S, Mandalà M, Cervantes A, et al. Early colon cancer: ESMO clinical practice guidelines for diagnosis, treatment and followup. Ann Oncol. 2013;24:64.
Acknowledgements
Not applicable.
Funding
The paper was funded by Fundación Hospital Provincial de Castellón.
Author information
Affiliations
Contributions
CFS, EFC and JES planned and designed the study. CFS is the principal investigator and performed the data analysis. JES conducted the statistical analyses. CFS, EFC and DMR wrote the manuscript. JES critically reviewed the article. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The patients’ right to data confidentiality and privacy were specifically preserved and bioethical approval for the study was obtained from the cancer registry in the Castellon province (Spain). The study was approved by the institutional review board of the Hospital General de Castellon (PIC: 2013/2/CIR). All participating patients provided their written informed consent.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
ForteaSanchis, C., ForcadellComes, E., MartínezRamos, D. et al. Modelling the probability of erroneous negative lymph node staging in patients with colon cancer. Cancer Commun 39, 31 (2019). https://doi.org/10.1186/s4088001903775
Received:
Accepted:
Published:
Keywords
 Colon cancer
 Lymph node staging
 Bayes’ theorem
 Final error probability
 Prognosis