Tu-Nan Yu, Ying-Ying Mao, Fang-Qiang Wei, Hui Liu
Abstract BACKGROUND According to the latest American Joint Committee on Cancer and Union for International Cancer Control manuals, cystic duct cancer (CC) is categorized as a type of gallbladder cancer (GC), which has the worst prognosis among all types of biliary cancers. We hypothesized that this categorization could be verified by using taxonomic methods.AIM To investigate the categorization of CC based on population-level data.METHODS Cases of biliary cancers were identified from the Surveillance, Epidemiology, and End Results 18 registries database. Together with routinely used statistical methods, three taxonomic methods, including Fisher’s discriminant, binary logistics and artificial neuron network (ANN) models, were used to clarify the categorizing problem of CC.RESULTS The T staging system of perihilar cholangiocarcinoma [a type of extrahepatic cholangiocarcinoma (EC)] better discriminated CC prognosis than that of GC.After adjusting other covariates, the hazard ratio of CC tended to be closer to that of EC, although not reaching statistical significance. To differentiate EC from GC,three taxonomic models were built and all showed good accuracies. The ANN model had an area under the receiver operating characteristic curve of 0.902.Using the three models, the majority (75.0%-77.8%) of CC cases were categorized as EC.CONCLUSION Our study suggested that CC should be categorized as a type of EC, not GC.Aggressive surgical attitude might be considered in CC cases, to see whether long-term prognosis could be immensely improved like the situation in EC.
Key words: Cystic duct cancer; Gallbladder cancer; Extrahepatic cholangiocarcinoma;Surgical management; Taxonomy; Categorization
Cystic duct cancer (CC) is a rare type of biliary cancer which arises in the conjunction between the gallbladder and the extraheptic bile duct[1]. By Farrar[2]in 1951, CC was defined based on the following three criteria: “(1) The growth must be restricted to the cystic duct; (2) There must be no neoplastic process in the gallbladder, hepatic or common bile ducts; and (3) A histological examination of the growth must confirm the presence of carcinoma cells”. Unlike other biliary neoplasms, the incidence of CC is extremely low (0.03%-0.05%[3]), and previous studies on this disease were focusing on single cases[4-7]or very small-volume series (less than 10 cases)[8,9]. As a consequence, the clinical characteristics of CC remain largely unknown. According to the 8thAmerican Joint Committee on Cancer (AJCC)[10]and 9thUnion for International Cancer Control (UICC)[11]manuals, CC is categorized as a type of gallbladder cancer(GC), which has the worst prognosis among all types of biliary tumor[12,13]. However,this categorization lacks verification by direct epidemiological evidence until now.
In the field of botany, a well-known statistical tool is the Fisher’s linear discriminant analysis, which was invented by Sir Ronald Aylmer Fisher[14]and initially used to make optimum categorization for three types of iris flowers, namely,Setosa, Versicolour and Virginica. In the current study, besides routinely used statistical tools, we applied to use this taxonomic method together with binary logistics and artificial neuron network (ANN) models, to clarify the categorizing problem of CC. To our knowledge, this is the first study attempting to re-categorize CC using population based data.
Eligible cases of CC, GC and extrahepatic cholangiocarcinoma (EC) diagnosed between 2006 and 2015 were obtained from the Surveillance, Epidemiology, and End Results (SEER) 18 registries database (last submission in November, 2017). CC or GC cases were retrieved with their terms in “CS schema v0204+”; while EC cases were retrieved with a combined term of “BileductsPerihilar” and “BileductsDistal”. All cases reviewed had their pathologies microscopically confirmed, their ages known,and their biological behavior classified as malignant.
For each case, ten variables were evaluated as follows: Sex, age at diagnosis, year of diagnosis, marital status at diagnosis, race recode, SEER historical stage A, tumor size,Rx Summ-Surg Prim Site (1998+), CHDSA region and histology recode-broad grouping.
For marital status, the values of single, separated, divorced, widowed and unmarried or domestic partner were defined as “unmarried”. For Rx Summ-Surg Prim Site (1998+)[15], which means surgical procedure, a value of 0 was defined as “no operation”; values from 10 to 30, and of 50 were defined as “biopsy/ partial resection”; a value of 40 was defined as “total resection”; and a value of 60 was defined as “radical resection”. For histology recode, a value of 8140-8389 was defined as “adenocarcinoma”, while other values were defined as “others”.
Routine statistical methods:The Pearson’s χ2test or Fisher’s exact test was used for categorical data to compare the distributions of variables between CC, GC and EC.The Mann-Whitney U test was used for continuous variables. The 1, 3 and 5-year overall survival (OS) and cancer specific survival (CSS) rates were calculated by the method of life table. CC cases were divided into T1-2 group and T3-4 group by the current T staging system for GC or EC, respectively (Supplementary table 1), and the prognosis of different groups was compared by the log-rank test. The COX proportional model was utilized to analyze the whole population of CC, GC and EC,and identify the independent prognostic factors for OS and CSS.
Methods to categorize CC:Three steps were required to categorize CC. First, three models were built. All cases of GC and EC were randomly partitioned, and then divided into either a training (70%) or testing group (30%). Three models were constructed to differentiate between GC and EC. In Fisher’s linear discriminant model, the importance of variables was evaluated by the absolute values of their coefficients in a structure matrix. In binary logistic model, multivariate analysis was performed, and P-value was calculated for each variable. In ANN model, a multilayer perceptron neural network was utilized, and normalized importance was calculated for each variable.
In the next step, the models were evaluated. Accuracies were calculated for the training, testing and whole groups, respectively, and good models generally had these three values very close. The power of a model was also evaluated by the area under the receiver operating characteristic curve (AUROC). Generally, AUROC indicated a model of “good” accuracy at a value between 0.75 and 1.00, of “fair” accuracy at a value between 0.60 and 0.75, and of “poor” accuracy at a value under 0.60.
Finally, CC cases were categorized. Each case of CC would be categorized as GC or EC using the three models. Overall agreement of results of the three models was evaluated by an online Kappa test calculator[16]as well as the McNemar’s test.
Between 2006 and 2015, the SEER database recognized a total of 71 cases of CC.However, a certain proportion of cases had missing values and could not be utilized for analysis for all aims. Regarding the analysis of 1, 3 and 5-year survival and T stage, the included cases should have valid data for the variables of “CS extension”,“Survival months” and “Vital status recode” (for OS, or “SEER cause-specific classification” for CSS). Consequently, 65 CC cases were analyzed for OS and 56 were analyzed for CSS. In other analyses (comparison of clinical characteristics between different diseases, COX analysis, building models and categorization of CC), all cases involved should have valid data for 10 variables from sex to histology, and cases with missing data could not be produced. Consequently, a total of 36 CC cases were analyzed for the aim of categorization, together with 4878 GC cases and 3295 EC cases. P < 0.05 was considered statistically significant, and the IBM SPSS version 22.0 software (SPSS Inc., Chicago, IL, United States) was utilized for all analyses.
Table 1 shows the comparison of the clinical characteristics between CC, GC and EC.Five variables were found to be statistically different between CC and GC, including sex, tumor size, SEER historical stage, surgical procedure and histology (P < 0.05).However, only one variable (surgical procedure) was found to be different between CC and EC (P < 0.05).
Table 1 Comparison of clinical characteristics between cystic duct cancer, gallbladder cancer and extrahepatic cholangiocarcinoma, n (%)
The 1, 3 and 5-year survival rates of CC were 50%, 27% and 12% for OS, and 51%, 30%and 20% for CSS, respectively. The median survival time was 13.0 months for both OS and CSS.
All CC cases were divided into T1-2 and T3-4 groups according to the T staging system of GC or EC (Figure 1). Based on the GC system, the prognosis was similar between the T1-2 and T3-4 groups (P = 0.961 for OS, and 0.568 for CSS).Comparatively, the EC system (perihilar cholangiocarcinoma) showed statistically different prognoses between the T1-2 and T3-4 groups (P < 0.05 for both OS and CSS).Therefore, the T staging system of EC better discriminated CC prognosis than that of GC.
In COX analysis for the whole population of biliary cancer cases, nine variables were found as independent prognostic factors for OS and CSS, including sex, age at diagnosis, year of diagnosis, marital status, SEER historical stage, tumor size, surgical procedure, histology and disease (Table 2, P < 0.05). By using CC as the reference(1.000), the hazard ratio (HR) for OS was 1.238 [95% confidence interval (CI): 0.813 to 1.884, P = 0.320] for EC but 1.494 (95%CI: 0.980 to 2.279, P = 0.062) for GC. Regarding CSS, the HR was 1.124 (95%CI: 0.706 to 1.789, P = 0.623) for EC but 1.348 (95%CI: 0.845 to 2.151, P = 0.210) for GC. Compared with GC, the HR of EC tended to be closer to that of CC, although not reaching statistical significance (P > 0.05).
Figure 1 Survival curves of cystic duct cancer based on T staging system of gallbladder cancer or extrahepatic cholangiocarcinoma.
Three models were constructed to differentiate GC from EC (Table 3). In the Fisher’s linear discriminant model, the most important three variables (highest absolute values) were sex (0.479), SEER historical stage (0.665) and surgical procedure (-0.238).In the binary logistic model, six variables had statistical significance (P < 0.05),including sex, age at diagnosis, tumor size, SEER historical stage, surgical procedure and histology. In the ANN model, the variables with the highest value of normalized importance were tumor size (71.8%), SEER historical stage (80.4%) and surgical procedure (100.0%).
The accuracies of the Fisher’s linear discriminant, binary logistic and ANN models were 69.9%, 73.0% and 82.1%, respectively. And for each model, the accuracies in the training, testing and whole populations were close. The AUROCs of the aforementioned three models were 0.781 (95%CI: 0.771-0.791), 0.785 (95%CI: 0.775-0.795) and 0.902 (95%CI: 0.895-0.908), all of which suggested “good” accuracies(Figure 2). The ANN model had the best categorizing power.
Based on the ANN model, the majority (77.8%, 28/36) of CC cases were categorized as EC, the percentage of which was similar to those obtained based on the Fisher’s(77.8%) and binary logistic (75.0%) models (Table 3). The three categorizing results had an overall agreement of 85.19%, and Mcnemer tests between each two models showed no statistical differences (P = 1.000).
Table 2 The COX multivariate analysis for cystic duct cancer, gallbladder cancer, and extrahepatic cholangiocarcinoma
CC was categorized as a type of GC in the latest AJCC[10]and UICC[11]manuals.However, our study argued with their practice, mainly based on two statisticalfindings: (1) Compared with the staging system of GC, the T staging system of perihilar cholangiocarcinoma (a type of EC) better discriminated the prognosis of CC.Unlike the staging system of lymphatic or distant metastasis (N or M), the T staging system was obviously different in EC and GC. That is, a lesion of GC would be staged as T3-4 when it involved the serosa of the biliary wall, while a lesion of EC would not be staged as T3-4 only when it invaded the portal vein, artery, or the second-order biliary system[10,11]. In this study, when the GC staging system was used, no prognostic differences were observed between the T1-2 and T3-4 groups. However, the difference was of statistical significance when the EC system was used (P < 0.05); and (2) The majority of CC cases (75.0%-77.8%) were categorized as EC by using the three taxonomic models. The three models used, including the Fisher’s discriminant, binary logistics and ANN models, were constructed based on big data of over 8000 biliary cancer cases. Based on the AUROC, all models had “good” categorizing power, and the ANN model even had a value of over 0.90.
Table 3 Three models to differentiate between gallbladder cancer and extrahepatic cholangiocarcinoma, and categorize cystic duct cancer cases
To our knowledge, this study provided the first population-level evidence for the categorizing problem of CC. In the 6thAJCC manual, CC was included in the EC chapter, but was classified as a type of GC since the 7thedition[17]. This practice was agreed by a recent single-center study by Nakanishi et al[18]. However, in that study,the definition of CC seemed to be not strict, because the incidence of CC was too high and even exceeded that of GC (47 vs 43) during the follow-up period. Besides, a large proportion of cases were at very advanced stage (with invasion to the duodenum,artery or portal vein), possibly attributed to the fact that some GC cases were misjudged as CC due to difficulty in detecting primary lesions.
Interestingly, in previous publications, there were also some clues to suggest the difference between CC and ordinary GC. In the study of Nishio[19], “GC cases” with common bile duct invasion (most of which were CC) had a favorable 5-year survival rate of over 20%, which was much better than the commonly expected prognosis in T3 stage GC cases. In the study of Ozden[20], “GC cases” with their lesion centers located in the cystic ducts (actually CC) had some unique clinical features, including their approximately equal proportion of sex, advanced T stage, and a lower than expected frequency of lymph node metastasis, which more resembled EC.
Figure 2 Receiver operating characteristic curves to discriminate between gallbladder cancer and extrahepatic cholangiocarcinoma.
However, the surgical attitude for EC and GC was different in clinical practice. EC(especially for perihiliar cholangiocarcinoma) was considered as being worthy of extensive resection even at the expense of increased complication rates (biliary fistula,etc.), since a successful R0 resection could drastically improve long-term prognosis[21],and transplantation could be considered in well-selected patients[22]. Comparatively,in advanced GC cases, aggressive surgical resection did not show concrete survival benefits, including combined common bile duct resection[23]and extended regional lymphadenectomy[24]. According to a recent United States population-level study[25],extended resection alone in GC provided a worse prognosis than simple removal of the gallbladder plus adjuvant chemotherapy. The similarity between CC and EC suggested the rationality that CC should not be treated like GC. In other words,extended surgical resection could be attempted in CC even at advanced stage, to see whether this treatment could immensely improve the prognosis like the situation in EC.
There are several limitations to this study. First, the variable of lymphatic status was not analyzed. In the SEER database, most of biliary cases had their lymphatic status unknown, and few cases had their harvested lymph nodes more than three,which was the minimum number to accurately evaluate lymphatic metastasis[26].Therefore, although the pattern of lymphatic metastasis was different among CC, GC and EC[20,27,28], inclusion of this variable had modest value to increase performance of models. Second, whole genome sequencing also could help to clarify the categorization of CC. However, due to the extreme rarity of CC specimens, this molecular evidence is currently not available.
In conclusion, our study argued with categorization of CC in the current AJCC and UICC manuals, and suggested that CC should be categorized as a type of EC.Extended surgical resection might be considered in CC cases, to see whether longterm prognosis could be immensely improved like the situation in EC.
According to the current guidelines, cystic duct cancer (CC) is categorized as a type of gallbladder cancer (GC), which has the worst prognosis among all types of biliary cancers.
In previous studies, no direct epidemiological evidence verified that CC was more similar to GC,rather than extrahepatic cholangiocarcinoma (EC).
This study aimed to investigate the categorization of CC based on population-level data.
We used three taxonomic methods for analysis, including Fisher’s discriminant, binary logistics and artificial neuron network models.
The T staging system of perihilar cholangiocarcinoma (a type of EC) better discriminated CC prognosis than that of GC. By using the three taxonomic models, the majority (75.0%-77.8%) of CC cases were categorized as EC.
Our study suggested that CC should be categorized as a type of EC, not GC.
Aggressive surgical attitude might be considered in CC cases, to see whether long-term prognosis could be immensely improved like the situation in EC. Future studies with larger sample size are needed.
World Journal of Gastroenterology2019年44期