Xiao-Na Liu, Dan-Ni Cui, Yu-Fang Li, Yun-He Liu, Gang Liu, Lei Liu
Abstract The huge prognostic difference between early and late stage hepatocellular carcinoma (HCC) is a challenging diagnostic problem. Alpha-fetoprotein is the mostly widely used biomarker for HCC used in the clinic, however it’s sensitivity and specificity of is not optimal. The development and application of multiple biotechnologies, including next generation sequencing, multiple “omics” data,that include genomics, epigenomics, transcriptomics, proteomics, metabolomics,metagenomics has been used for HCC diagnostic biomarker screening. Effective biomarkers/panels/models have been identified and validated at different clinical levels. A large proportion of these have a good diagnostic performance for HCC, especially for early HCC. In this article, we reviewed the various HCC biomarkers derived from “omics” data and discussed the advantages and disadvantages for diagnosis HCC.
Key words: Hepatocellular carcinoma; Diagnosis; Circulating tumor cells; Exosomes;Circulating tumor DNA; RNA; Metabolomics; Protein
In 2018, hepatocellular carcinoma (HCC) was the sixth most common cancer and the fourth leading cause of deaths worldwide. It is estimated that about 841000 new cases and 78000 deaths will occur each year. The incidence of HCC varies regionally but is the most common cancer in thirteen geographically countries[1].
HCC accounts for 75% to 85% of all liver cancers[1]. HCC is a multifactorial disease due to a variety of risk factors. Chronic infection with hepatitis B virus (HBV) or hepatitis C virus, aflatoxin-contaminated food, heavy alcohol intake, obesity,smoking, and type 2 diabetes are the main risk factors for HCC[2,3]. The occurrence of HCC results from a combination of virus-specific factors, immune mechanisms,environmental factors and genetics[4]. Due to the complicated etiology of liver cancer and the different molecular subtypes observed in individuals, early diagnosis is difficult[5]. As a result, the majority of patients are diagnosed with late-stage HCC,with a five-year survival rate of only 10.1%[6]. Thus, early diagnosis is necessary to improve HCC patient survival.
Early screening is essential for the early detection and treatment of HCC.Diagnostic methods for HCC include clinical, imaging, molecular marker and omics diagnosis[7]. Clinical symptom, Imaging diagnosis[8], and molecular biomarkers are the most com-monly used evidences for HCC detection. The molecular markers commonly used for the diagnosis of HCC are alpha-fetoprotein (AFP)[9], desaturationγ-lock-up-thrombin (DCP)[10], and phosphatidylinositol proteoglycan-3 (GPC-3)[10-12].HCC heterogeneity significantly limits the diagnosis and treatment of HCC patients[13-15], making the efficiency of diagnosis and treatment very low. With the advent of next generation sequencing and the development of precision medicine,individual heterogeneity-based histological diagnosis makes it possible for individualized diagnosis and targeted HCC therapy[16,17]. Omics data, including but not limited to genomic, epigenomic, transcriptomic, proteomic and metabolomic,identifies biological heterogeneity and has provided novel insights for HCC diagnosis, as shown in Figure 1. With regards to genomics, circulating tumor DNA(ctDNA)[18]has been recognized as omics diagnostic markers. Currently, ctDNA can be extracted using a small amount of peripheral blood from patients and is minimally invasive. ctDNA can reveal genetic and epigenetic changes associated with specific cancers and their metastatic potential, and provides unique insights for the continuous monitoring of tumor genomes in a non-invasive, convenient, and accurate way[18]. With regards to epigenomics, methylation patterns in ctDNA could be used for the diagnosis of HCC[19]. Using transcriptomics, circulating miRNAs has been demonstrated to have unique expression patterns for various tumors including HCC[10,20,21]. In addition, circulating miRNAs combined with conventional AFP and ultrasound screening tools have great application prospects for the prediction and prevention of HCC in high-risk populations[5,22]. Proteomic markers such as AFP, AFPL3, DCP, GPC3, Golgi protein-73 (GP73)[9]etc. are considered diagnostic markers for HCC, however they have poor sensitivity and specificity[9,23]. In addition to the diagnostic tools mentioned above, metabolomics and metagenomics has also contributed to the diagnosis of HCC. This review will focus on the omics based HCC diagnosis.
Circulating tumor cells (CTCs) are generally considered to be the “seeds” of tumors that migrate from carcinomas in situ to the peripheral blood or lymphatic system[24].This process takes place at every stage of tumor development. Hence, CTCs may be a good tool for HCC diagnosis.
CTCs can be separated and enriched using methods based on their physical and biological properties. The physical characteristics including size, density, migration capacity, and charge and are independent of the expression of established cancer epitopes. However, the separation and enrichment of CTCs are less cancer-specific[25].Various bio-based technologies rely on specific antibodies that bind to CTC surface markers, including but not limited to epithelial cell adhesion molecule (EpCAM),human epidermal growth factor receptor, members of the cytokeratin (CK) family(CK8, CK18 and CK19) and mesenchymal markers (N-cadherin and vimentin)[26].Various technologies based on the physical or biological properties of CTCs have been used for CTC detection. These include, the CellSearch®system (CSS: Veridex LLC, NJ,United States)[27], RT-PCR method, the CanPatrol CTC analysis platform (SurExam,China)[28], the ISET technology (segregation by the size of epithelial tumor cells)[29],flow cytometry[30], and CTC-Chip[31,32].
Figure 1 Multiple “Omics” data-based diagnostic biomarker for hepatocellular carcinoma. Circulating tumor cells, circulating tumor DNA, exsomes, transcriptomics, proteomics, metabolomics and intestinal microorganisms provide new non-invasive candidates diagnostic biomarkers for hepatocellular carcinoma. HCC: Hepatocellular carcinoma; CTCs: Circulating tumor cells; ctDNA: Circulating tumor DNA.
Numerous studies have been performed to investigate the diagnostic value of CTCs using these approaches. Guo et al[33]found that the rate of detectable EpCAMmRNA+CTCs using their optimized qRT-PCR–based platform was significantly higher in HCC patients compared to healthy controls (P < 0.05), while the area under the curve(AUC) for EpCAMmRNA+CTCs combined with AFP levels for discriminating HCC vs controls was 0.857, with a sensitivity of 73.0% and specificity of 93.4%. In addition, it was accurate for detecting early stage and AFP-negative HCC. Guo et al[34]demonstrated the clinical significance of CTC with stem-like phenotypes for diagnosing HBV–related HCC using an optimized qPCR-based detection platform. They found that using a panel containing four putative stem cell biomarkers (EpCAM, CD90,CD133, and CK19) outperformed EpCAM alone for the diagnosis of HCC and had an AUC of 0.88 and 0.93 in the training set and the validation set, respectively. CTCs may induce metastases in distal organs and hence may play a significant role in prognosis[35-37]. Qi et al[37]demonstrated that the percentages of CTC and mesenchymal-CTC were significantly associated with early recurrence, multi-intrahepatic recurrence, and lung metastasis.
Exosomes are cell-derived vesicles about 30-100 nm in size. They are intraluminal vesicles formed by the inward budding of the endosomal membrane, and secreted out of the cell by the fusion of MVEs with the cell surface[25]. Exosomes are released into the extracellular space by multiple cell types, and are present in many eukaryotic fluids including blood, urine, cerebrospinal fluid and cultured cell culture media[38,39].Exosomes are currently a good tool for the HCC diagnosis. Several exosomes isolation technologies are used clinically, such as ultracentrifugation, sucrose-gradient centrifugation, and immune-magnetic isolation[40].
mRNA, microRNA, and lncRNA in exosomes are prevented from being degraded by RNases due to the exosome lipid layer. Several studies have recommended serum exosomes and exosomes enclosed RNAs as HCC screening biomarkers[21,40]. Xu et al[41]using 301 patient samples demonstrated that the combination of exosomal ENSG00000258332.1, LINC00635 and serum AFP has an AUC of 0.885-0.894 for HCC diagnosis. The combination of exosomal miR-122, miR-148a, and serum AFP increased the AUC to 0.931 for distinguishing early HCC from liver cirrhosis.Additionally, exosomal miR-122 was the best for differentiating HCC from NC (AUC= 0.990)[42]. Furthermore, serum exosomal heterogeneous nuclear ribonucleoprotein H1, LINC00161, and miRNA224 were also able to distinguish HCC patients from healthy controls (AUC = 0.865; AUC = 0.794; AUC = 0.91, respectively)[43-45]. Studies have also demonstrated that exosome enclosed RNAs had the potential for HCC prognosis[46,47].
Circulating cell-free DNA is defined as extracellular DNA present in plasma or serum samples. ctDNA is specifically released from tumor cells undergoing metabolic secretion, apoptosis or necrosis. ctDNA carries with it tumor-specific genetic or epigenetic changes, such as DNA methylation, point mutations, copy number variation, etc[18]. A recent study using minimally invasive examination of ctDNA harvested from a small amount of patient peripheral blood demonstrated genetic and epigenetic changes associated with specific cancers and their metastatic status. This provides a unique insight for the continuous monitoring of tumor genomes in a noninvasive and precise way.
The methylation pattern of ctDNAs is applicable for the early detection of HCC.Methylation changes occur in many genes that are involved in the initiation and progression of HCC. Several studies have demonstrated changes in DNA methylation in tumor tissues of HCC patients, including the aberrant methylation of the glutathione S-transferase P1 (GSTP1) promoter region[48,49]and the cyclin-dependent kinase inhibitor p15 and p16[50,51]. Hypermethylated GSTP1[52], p15[53]and p16[54]in ctDNA from HCC patients have been observed. The study conducted by Wang et al[52]demonstrated GSTP1 promoter CpG island hypermethylation in 23 of 26 (88.5%)tumor tissues and 18 of 26 (69%) corresponding non-tumor tissues. Huang et al[19]analyzed the methylation status of four genes (APC, GSTP1, RASSF1A and SFRP1) in plasma and demonstrated sufficient diagnostic value of ctDNAs. The combination of these four genes increased the AUC to 0.933 with 92.7% sensitivity and 81.9%specificity for distinguishing HCC patients from normal healthy controls. MS-PCR has been widely used in methylation studies and provides a quick and easy method with high sensitivity and accuracy. The tissue specificity of methylation patterns in ctDNA may help determine original tumor location. The above results indicate that comprehensive assessment of circulating methylated DNA may be a promising tool for HCC diagnosis and management.
In addition to changes in methylation patterns, other genetic or epigenetic changes have been demonstrated in ctDNAs. These include microsatellite changes, point mutations, chromosomal rearrangements, and viral DNA which may contribute to the early diagnosis of HCC. Comparative genomic hybridization technology has enabled scientists to study microsatellite changes in HCC, such as chromosome 8p, 17p and 19p deletions[55]. The loss of 8p has been reported in eight metastatic tumors, but only in three corresponding primary tumors (P = 0.03)[56]. Droplet digital PCR[57]and wholegenome high-throughput sequencing[58]has been used to accurately detect rare and multiple types of mutations in circulating DNA. Point mutations that inactivated tumor suppressor genes or activate proto-oncogenes can be detected in ctDNA using these technologies. Ser249 of TP53 is the most frequently reported mutation hotspot in HCC patients, that results in a loss of specific DNA binding capacity[59]. Recent studies have demonstrated that TP53 Ser249 mutations are highly correlated with cirrhosis and HCC in China and Africa[60]. In addition, high recurring hotspot mutations have been detected for TP53 R249S, CTNNB1 amino acids D32, S33, S37, T41 and S45, and TERT c.-124C> T promoter mutations[61-63]and may be regarded as potential markers for HCC diagnosis. Additionally, genomic sequencing revealed several recurrent chromosomal rearrangements in HCC, including deletions, insertions, amplifications,translocations, and more complex rearrangements. Detection of these chromosomal rearrangements in ctDNA requires highly sensitive PCR. This has only been successfully applied in a small group of patients, mostly with hematological malignancies[18]. However, whole-genome sequencing of ctDNA provides an opportunity to identify changes in chromosomal rearrangements or copy number in HCC patients and will ultimately become a reliable and robust method for HCC detection. Although ctDNA has been associated with disease stages and is easy to measure, it lacks the sensitivity and specificity[64]. Additional case-control and clinical studies should be performed to validate its utility, especially for diagnosis of early stage HCC.
Among the diagnostic biomarkers of HCC, serum RNAs, including mRNAs, circular RNAs, long non-coding RNAs and micro-RNAs have garnered recent attention, while studies using urine RNAs have been seldomly reported. In this section, we will mainly focus on long non-coding RNAs and micro-RNAs.
Micro-RNA are non-coding RNAs consisting of approximately 22 nucleotides.miRNAs are not chemically stable, however circulating miRNAs in plasma makes them ideal biomarkers to be studied for HCC. Using TLDA Chips, Yang et al. reported that eight miRNAs were dysregulated in HCC. The AUC of the eight-miRNAs for HCC diagnosis reached 0.802 for their patient cohort[65]in their phase three study. In addition, four lncRNAs (miR-20a-5p, miR-320a, miR-324-3p and miR-375) have been used as preclinical biomarkers for HCC. Another study demonstrated that plasma miR-224 was significantly expressed in HCC patients. They found that the AUC for diagnosis was 0.888-0.899 for early HCC patients[66]. Similar studies on miR-106b[67],miRNAs 21, 199-a[68]and microRNA-21[69]have also reported their diagnostic value for HCC, especially early HCC, with AUCs of 0.885, 0.865, and 0.943 respectively, with some having prognostic value. However, the diagnostic and prognostic value of these miRNA have not been replicated in other studies. The performance of these miRNAs needs further validation before clinical use.
Long non-coding RNAs were usually more than 200 nucleotides. They function in RNA stability, protein and DNA binding. Several circulating lncRNAs have demonstrated diagnostic values for HCC. For example, Li et al[70]reported that by combining circulating lncRNAs HULC and Linc00152, the AUC for HCC diagnosis was 0.87, while combining them with AFP, the AUC could reach 0.89. Another study combined three plasma lncRNAs (LINC00152, RP11-160H22.5 and XLOC014172) as a model to discriminate healthy/chronic hepatitis patients from HCC patients and had an AUC of 0.985-0.986[71]. Additional lncRNAs, including p34822, have also been reported to have diagnostic value[72].
Metabolomics is an “omics” method for the high-throughput identification, quantification and characterization of small metabolites (metabolites with an atomic mass <1.5 kDa)[73]. Metabolomics provides direct information on metabolites produced by both endogenously and exogenously[74]. Two major analytical platforms have been used in HCC metabolomics studies: Nuclear magnetic resonance spectroscopy and mass spectrometry (MS)[75]. The latter is generally equipped with different separation instruments, that comprise liquid chromatography (LC), gas chromatography (GC),and capillary electrophoresis[63,74]capabilities.
Blood, urine, and feces can be easily obtained from individuals and are generally used in human metabolomic studies. Several metabolomics studies using blood and urine have been reported for HCC diagnosis. Di Poto et al[76]performed GC coupled MS (GC-MS)-based metabolomics on plasma samples from 128 individuals (63 HCC cases and 65 cirrhotic controls). They demonstrated that a panel consisting of 11 metabolites and three clinical factors (AFP, Child–Pugh score, and etiologic factors)had a higher area under the ROC curve (AUC = 0.985) compared to AFP alone. Luo et al[77]performed LC-MS-based metabolomics on serum samples from 1448 individuals(healthy controls and patients with chronic HBV infection, liver cirrhosis, and HCC),and identified a biomarker panel (phenylalanyl-tryptophan and glycocholate) that had a better diagnostic value (AUC = 0.807) compared to AFP (AUC = 0.650) for distinguishing HCC patients from a high-risk cohort of cirrhosis patients. In addition,Lu et al[78]found that serum acetylcarnitine was able to discriminate HCC patients from patients with cirrhosis (AUC = 0.808-0.887). In addition, other biomarker panels have been identified using different metabolites to discriminate HCC patients from non-HCC patients[79-81].
AFP, AFP-L3, DCP, Glypican-3 (GPC3), Osteopontin (OPN), Midkine (MDK), GP73,Annexin A2, squamous cell carcinoma antigen, soluble urokinase plasminogen activator receptor, and thioredoxin have been shown to have value for the early diagnosis of liver cancer. An introduction for each protein and its sensitivity and specificity for the early diagnosis of HCC have been compiled from literature and is shown in Table 1. It is worth mentioning that AFP is regarded as the most useful biomarker for HCC diagnosis, and the only biomarker evaluated in randomized controlled trials[82]. Most of the biomarkers were evaluated in case-control studies for the detection of early stage HCC[83].
Early and accurate diagnosis of HCC patients is critical for patient prognosis.Several studies have successfully identified promising biomarkers for the diagnosis of HCC. However, current studies suggest that a single biomarker alone may not have the best sensitivity and specificity for detecting HCC, especially for detecting early stage HCC. Several studies have reported that combining several biomarkers complements to improve the early diagnosis rate[84]. A study demonstrated that the sensitivity and specificity of serum GP73 for HCC was 74.6% and 97.4% respectively,while for AFP it was 58.2% and 85.3%. Combining both GP73 and AFP increased the sensitivity to 89.2% (95%CI: 86.7-91.5%) and its specificity to 85.2% (95%CI: 83.4%-86.4%)[84]. Adding clinical variables (such as age and gender) into the model based on biomarker combinations can further increase the predictive performance for HCC detection. It is worth noting that GALAD score, which is the combination of clinical factors (gender, age) and biomarkers (AFP, AFP-L3 and Des-carboxyprothrombin),has been validated to improve the performance of discerning between HCC and cirrhosis[85]. More randomized control trials to validate the optimal combination of biomarkers will contribute to better detection of early stage HCC.
About 80 percent of normal human microbes are concentrated in the intestinal tract,with a number of more than 100 trillion[86]. The symbiotic microbial flora in the human body is closely related to the health of the host[87]. including carcinogenesis. Intestinal microorganisms cause tumorigenesis by integrating carcinogenic genes into the host genome, affecting the stability of the host genome, breaking the balance between the host immune systems, and inhibiting the host immune system. Liver inflammation,increase and inhibition of anti-tumor immunity are affected by intestinal leakage,intestinal flora imbalance, microbe-associated molecular patterns and microbial metabolites, which promotes the occurrence of HCC[88-90].
Therefore, intestinal microorganisms are powerful biomarkers for early diagnosis of HCC. Bacteroides, Clostridium cluster XVIII,Prevotella and Oscillibacter were increased in HCC group compared to control group, while Prevotella, Streptococcus and Bifidobacterium were decreased in animal studies as Yamada et al[91], Liet al[92]reported. Escherichia coli was regarded as microbiome factor in clinical trials as Grat et al[93,94]reported, which was enriched in HCC patients. Escherichia-Shigella, Enterococcus,Proteus and Veillonella were increased in HBV-related HCC versus HBV-related HCC.On the other hand, Faecalibacterium, Ruminococcus, Ruminoclostridium,Pseudobutyrivibrio, Lachnoclostridium and Phascolarctobacterium were decreased as Liu et al[95]reported. Loomba et al[96]set a Random Forest model found that Proteobacteria,Escherichia coli and Firmicutes were microbial diagnostic markers. This model had a robust and statistically significant diagnostic accuracy, with AUC 93.60%. Currently,Ren et al[97]reported that faecal microbial diversity was increased from cirrhosis to early HCC. Phylum Actinobacteria was increased in early HCC vs cirrhosis. And 13 genera including Gemmiger and Parabacteroides were enriched in early HCC versus cirrhosis. Butyrate-producing genera were decreased, while genera producinglipopolysaccharide were increased in early HCC versus controls. The AUC of this diagnostic model was 80.64%.
Some biomarkers mentioned above have been validated their ability of HCC diagnosis in different phases of clinical trials. Pepe et al[83]summarized five phases of biomarker development and the only biomarker that has undergone five phases of HCC biomarker development is AFP[98]. Three biomarkers, OPN[99], MDK[100]and GALAD score[85], have passed phase III validation. More biomarkers are under phase II biomarker development, such as AFP-L3[9], Dickkopf-1[101], and Glypican-3[102], etc.The description of the phases (I, II, III) of biomarker development and the corresponding candidate biomarkers are shown in Table 2.
Historically verification has been limited due to the lack of large-scale vertical cohorts and available test samples. Fortunately, serial samples from several longitudinal cohorts in the United States are being collected to allow for extensive validation of these biomarkers[103]. In the next few years, through the improved big data techniques and appropriate validation, the biomarker-based precision screeningmethod may become the main method for early detection of HCC.
Table 1 The sensitivity and specificity of biomarkers in the diagnosis of hepatocellular carcinoma
AFP: α-fetoprotein; AFP-L3: α-fetoprotein-L3 (binding fraction); DCP: Des-γ-carboxyprothrombin; HCC: Hepatocellular carcinoma; GPC3: Glypican-3;GP73: Golgi protein-73; OPN: Osteopontin; MDK: Midkine; SCCA: Squamous cell carcinoma antigen; suPAR: Soluble urokinase plasminogen activator receptor; TRXs: Thioredoxins.
Theoretically, the human body is an exquisitely organized system, while blood is responsible for carrying the substances from tissues to tissues, to maintain the complex functions. During carcinogenesis, the cells, organelles, macromolecules and metabolites are inevitably altered to different degrees. Due to the rapid development of high-throughput technologies, HCC biomarker screening is now feasible and necessary, by detecting these cells, organelles, macromolecules and metabolites.However, the genetic and environmental heterogeneity makes the carcinogenesis of individuals unique. Therefore, the alteration on these biological levels varies, which makes it difficult to develop an almighty biomarker for diagnosis. Technically, due to the limitation of detection methods and the nature of targets, each method has its own drawbacks and advantages. Plus, it is taken for granted that early-staged HCC has less huge impact on the body, which makes the early HCC diagnosis difficult. Thus,HCC diagnosis on different platforms is worth trying.
Protein biomarkers for HCC is mostly widely recognized in clinical practice,However, the diagnostic value of several biomarkers has shown inconsistent results.For example, the sensitivity and specificity of DCP reported by Sawada et al[23]was 77% and 82%, while it was 61% and 70% as reported by Marrero et al[9], which may result from different parameter selection or system errors.
ctDNA analysis is convenient, non-invasive and has a precise diagnostic capability.But serious technical hurdles in the detection and analysis of ctDNA, there has been limited progress in determining its clinical applicability for HCC diagnosis. The low levels of ctDNA, HCC heterogeneity and poorly characterized DNA alterations in HCC are the three main challenges for using ctDNA technology for the early detection of HCC[18].
CTCs are effective tools for diagnosing HCC as they are the “seeds” of tumors. One study[37]detected low levels of CTCs in two of 12 HBV patients, and subsequently detected small HCC tumors during the next 5 mo. This suggested that CTCs may be a valuable tool for HCC early-diagnosis. But the precision of HCC diagnosis using CTCs varies across cohorts and studies[33,34,104]. Exosomal cargo is protected from degradation by its lipid bilayer which makes it a good HCC diagnostic tool. Although several technology platforms have been developed for exosome enrichment, there is still no standard separation technology that is widely used[105]. Plus, the expensive extraction methods hinders the study of exosomes[40]. Cheaper and faster extraction methods are needed to fully explore its utility. Although several studies have used paneled metabolites for HCC screening, there have been no new markers or panels that have been clinically utilized. This is because of the absence of standard sample processing and analytical procedures that has led to metabolomics studies being unable to be replicated[106]. Secondly the metabolomics detection platforms are not mature enough to robustly and accurately detect metabolites.
Table 2 Phases (I, II, III) of biomarker development and corresponding candidate biomarkers
Rapid high-throughput sequencing makes HCC diagnosis, especially early HCC diagnosis more feasible, and studies concerning omics data-based HCC diagnosis is still increasing. The most studies collected samples with a small sample size from single or few cohorts. The sample processing steps are different in cohorts, and the analyzing method even in same platforms still varies. These studies limited the clinical utilization of biomarkers. It is believed that with the development and maturity of techniques, omics data-based biomarker screening will finally become clinically utilized tool for cancers, including but not limited to HCC.
World Journal of Gastroenterology2019年30期