J Cancer 2020; 11(17):4933-4946. doi:10.7150/jca.45378 This issue Cite

Research Paper

The prognostic value of COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19 associated ceRNA network in Gastric Cancer through bioinformatic exploration

Hongyu Shen1 Corresponding address*, Lin Wang1*, Qinnan Chen2*, Juqing Xu1, Jin Zhang3, Leping Fang1, Jun Wang1, Weifei Fan1 Corresponding address

1. Department of Hematology and Oncology, Department of Geriatric Lung Cancer Laboratory, Geriatric Hospital of Nanjing Medical University, Jiangsu Province Geriatric Hospital, Nanjing, Jiangsu, China.
2. Department of Clinical Medicine, Jiangsu Health Vocational College, Nanjing, Jiangsu, China.
3. Department of General Practice, Geriatric Hospital of Nanjing Medical University, Jiangsu Province Geriatric Hospital, Nanjing, Jiangsu, China.
*These authors contributed equally to this work.

Shen H, Wang L, Chen Q, Xu J, Zhang J, Fang L, Wang J, Fan W. The prognostic value of COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19 associated ceRNA network in Gastric Cancer through bioinformatic exploration. J Cancer 2020; 11(17):4933-4946. doi:10.7150/jca.45378. https://www.jcancer.org/v11p4933.htm
Other styles

File import instruction


Increasing studies on malignant tumors have proposed a new competing endogenous RNA (ceRNA) regulatory mechanism that mRNA, miRNA and lncRNA interact with each other. However, the mRNA-miRNA-lncRNA associated ceRNA network in gastric cancer remains unknown. We used online bioinformatic softwares to predict the hub genes and their upstream miRNAs and lncRNAs in gastric cancer, and then performed survival analyses. After collecting gastric cancer tissue samples and performing PCR experiments, the correlations among predicted mRNA, miRNA and lncRNA were further verified. A total of 101 up-regulated significant differentially expressed genes (DEGs) and 219 down-regulated significant DEGs in gastric cancer were confirmed. Functional enrichment analyses of these significant DEGs indicated that they were potentially enriched in some pathways involved in tumor malignant biological processes or metabolism. Then, we identified 20 hub genes in the PPI networks. Combined with expression and survival analyses, 8 up-regulated genes and 1 down-regulated gene were identified as central genes and acted as important prognostic roles in gastric cancer. 17 miRNAs were confirmed that might potentially regulate the expressions of these central genes. But only 8 out of them indicated better outcome in gastric cancer. Further, 79 lncRNAs were predicted that might have the potence to combine with the 8 central miRNAs. The lncRNA H19 was eventually defined as a central lncRNA by survival analyses. Stimultaneously, we found that there were certain interactions among lncRNA, miRNA and mRNAs in 50 gastric cancer tissues by qRT-PCR. Moreover, the high expression of H19 is associated with advanced TNM stage, primary tumor and lymph nodes, indicating a poor prognosis. In summary, we uncovered the prognostic value of COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19 ceRNA network in gastric cancer.

Keywords: gastric cancer, bioinformatic analysis, competing endogenous RNA (ceRNA), prognosis, long noncoding RNA (lncRNA), microRNA (miRNA)


Gastric cancer (GC) remains the fifth most frequently malignancy and the third leading cause of cancer death worldwide especially in East Asia, and ranks as a growing health-threatening disease, causing over 1,000,000 new cases and approximately 783,000 deaths in 2018 worldwide [1]. Due to the lack of typical early symptoms, the majority of GC patients are diagnosed at advanced stages accompanied with malignant proliferation, extensive invasion and distant metastasis, with a median survival of only 3-5 months [2, 3]. Despite great efforts such as surgery and adjuvant chemotherapy have been offered, the prognosis of GC patients has only slightly improved [4]. To date, however, there has been little exploration on promising molecular biomarkers that could explain the risk of cancer recurrence and metastasis. Therefore, a comprehensive understanding of the mechanisms involved in the development of GC is urgently needed for improving the diagnosis, treatment strategy and prognosis of this disease.

Recently, long noncoding RNAs (lncRNAs) have received great attention in elucidating the complex mechanisms in malignant biological processes such as carcinogenesis, recurrence, metastases and drug resistance [5]. Increasing studies proposed a new regulatory mechanism in which lncRNAs may act as competing endogenous RNAs (ceRNAs) and engage in crosstalk with mRNAs by using miRNA response elements (MREs) to competitively sponge their miRNAs [6-8]. Intriguingly, more and more scholars and researchers are beginning to conduct a seris of researches on lncRNA-miRNA-mRNA ceRNA network in their respective fields, including cancer. For instance, FOXP4-AS1 post-transcriptionally regulated FOXP4 by acting as ceRNA to sponge to miR-3184-5p in prostate cancer [9], while FBXL19-AS1 influenced breat cancer cells progression via a ceRNA-mediated regulatory network to bind to miR-718 [10]. Other studies have investigated that ZEB1-AS1 can promote malignant behaviors of cancer cells through miRNA-mediated mechanisms [11-13]. However, our current understanding on lncRNA-miRNA-mRNA is dificient.

In our study, firstly we obtained differentially expressed genes (DEGs) by screening four GEO datasets. Then, pathway enrichment analyses of these differentially expressed genes were performed. Besides, we also conducted protein-protein interaction (PPI) analyses and identified the hub genes. Considering the expression and survival analyses of hub genes in GC, 8 up-regulated genes and 1 down-regulated gene were selected for subsequent study. Immediately after, upstream miRNAs and lncRNAs were predicted based on online tools. In addition, we further assessed the prognostic values of these miRNAs and lncRNAs in GC. The correlation analyses of mRNAs, miRNAs and lncRNAs were also accomplished. Eventually, a split-new ceRNA regulatory network related to the prognosis of patients with GC was successfully inaugurated. Enthusiastically, based on our current analysis, all RNAs in the ceRNA network can be used to indicate the prognostic role of GC. In the future, they may also be promising as novel treatment strategies for GC.

Material and Methods

Data collection

The microarray datasets that compared gene expression between GC tissues and nontumorous/normal tissues were downloaded on the Gene Expression Omnibus database (http://www.ncbi.nlm.nih.gov/geo/). Only datasets containing more than 10 cancer/normal samples were included. We finally chose four datasets (GSE13911, GSE79973, GSE19826 and GSE54129) through screening the titles and abstracts of these datasets. The datasets only provided gene expression data instead of other non-coding RNA expression data. Based on Affymetrix Human Genome U133 Plus 2.0 Array (GPL570) platform, GSE13911 dataset was submitted from Italy group and other three datasets were all from China groups.

Differential gene expression analysis

We employed the differential gene expression analyses on an online analytic tool GEO2R (https://www.ncbi.nlm.nih.gov/geo/geo2r) from the GEO database to retrieve DEGs among the four datasets. DEGs were considered as significantly different if they meet the conditions of |log2FC|>1 and p-value<0.05. In addition, VENNY 2.1.0 (http://bioinfogp.cnb.csic.es/tools/venny/index.html) was performed to draw the Venn diagrams. The DEGs that were concentrated in all four datasets were considered as the significant DEGs, including up-regulated significant DEGs and down-regulated significant DEGs.

Gene ontology and KEGG pathway enrichment analysis

Gene Ontology (GO) functional analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis were submitted to ClueGO+CluePedia in Cytoscape 3.6.1. The GO classification includes biological process (GO-BP), cell component (GO-CC), and molecular function (GO-MF). We chose the BP for further exploration. For GO-BP analysis, the number of enriched genes was required to be greater than or equal to 10 and the proportion of genes was greater than 4%. For both GO and KEGG pathway, p-value < 0.05 was considered as statistically significant.

Establishment of PPI networks and identification of hub genes

Based on the DEGs identified, the Search Tool for the Retrieval of Interacting Genes (STRING) database (http://stringdb.org/) was adopted to construct the PPI networks [14]. Interactors with combined confidence score ≥ 0.4 were selected and high-resolution bitmaps were downloaded from the webpage. According to the degree of connectivity, the CytoHubba app was used to screen modules of hub genes in the PPI networks [15]. Based on node degree, the top 30 hub genes were visualized using the Cytoscape 3.6.1 software.

Gene expression analysis

In The Cancer Genome Atlas Program (TCGA) project, there are only 36 normal gastric samples, and the sample size is insufficient to compare the gastric cancer with normal controls. Gene Expression Profiling Interactive Analysis (GEPIA) (http://gepia.cancer-pku.cn/about.html) is a newly developed interactive web server for analyzing the RNA sequencing expression data of 9736 tumors and 8587 normal samples from the TCGA and the GTEx projects, using a standard processing pipeline [16]. GEPIA2 is an updated version [17]. In our study, the expression levels of central genes and lncRNAs in GC was analyzed by GEPIA2 database, which contains 408 gastric cancer samples and 211 normal controls. Besides, the |log2FC| threshold was set to be 1 and p-value was less than 0.05.

Survival analysis

Prognostic values of genes in GC were analyzed using Kaplan-Meier plotter database, which is capable to assess the effect of 54k genes on survival in 21 cancer types [18]. The mRNA RNA-seq data of GC were obtained from “Pan-cancer” item in Kaplan-Meier plotter database. After entering these genes into the database, the hazard ratio (HR) with 95% confidence interval (CI) and logrank p-value were automatically calculated. Survival curves were also directly shown on the webpage. Logrank p-value < 0.01 was used to narrow the range of DEGs. The miRNA subsystems include 11k samples from 20 different cancer types and miRNA survival data were acquired using the similar method. Logrank p-value < 0.05 was recognized as statistically significant. Prognostic values of lncRNAs in GC were performed through UALCAN (http://ualcan.path.uab.edu/index.html), which is a comprehensive, user-friendly, and interactive web resource for analyzing cancer OMICS data [19]. UALCAN is designed to provide patient survival information based on gene expression including lncRNAs and p-value < 0.05 was regarded as statistically significant.

Prediction of miRNA

In our study, miRTarbase (http://mirtarbase.mbc.nctu.edu.tw/php/search.php), an experimentally validated microRNA-target gene interactions database, was used to predict the upstream miRNAs of hub genes [20]. As a database, miRTarBase has collected more than three hundred and sixty thousand miRNA-target interactions (MTIs). Generally, the accumulated MTIs were validated experimentally by reporter assay, western blot, microarray and next-generation sequencing experiments. In order to obtain more precise results, only MTIs that were validated by reporter assay included. Prognostic values of these predicted miRNAs were further evaluated by Kaplan-Meier plotter database as described above.

Prediction of lncRNA

Upstream lncRNAs of miRNAs were predicted using miRNet database (https://www.mirnet.ca/), which is a user-friendly, integrated tool suite designed for comprehensive analysis and functional interpretation of miRNAs and xeno-miRNAs [21, 22]. 'Organism-H.sapiens (human)', 'ID type-miRBase ID', 'Tissue-Gastric' and 'Target type-lncRNAs' were set as the selection criteria.

Correlation analysis

StarBase database (http://starbase.sysu.edu.cn/index.php) is an open-source platform for studying the miRNA-ncRNA, miRNA-mRNA, ncRNA-RNA, RNA-RNA, RBP-ncRNA and RBP-mRNA interactions from CLIP-seq, degradome-seq and RNA-RNA interactome data [23, 24]. On the basis of this platform, the correlation analyses in our study were conducted and p-value < 0.05 was considered as statistically significant.

Tissue samples and clinical data collection

A total of 50 tumor tissues and adjacent normal tissues were collected from GC patients who underwent surgery at Geriatric Hospital of Nanjing Medical University. According to the 8th edition of the American Joint Committee on Cancer (AJCC) TNM Staging System for GC and histopathological evaluation, all patients were confirmed. Clinicopathological characteristics of GC patients are shown in Table 3. No neoadjuvant therapy was performed on these patients before surgery. Prior to real-time quantitative PCR (RT-qPCR) analysis, all tissues were stored at -80°C. All human tissue samples were obtained with written informed consent from all subjects, and this project was approved by the Research Ethics Committee of Geriatric Hospital of Nanjing Medical University.

RT-qPCR Assay

Total RNA was isolated using TRIzol reagent (Invitrogen) from tissues. Total RNA (1 μg) was reverse transcribed by using BuSuperScript RT Kit (Biouniquer Technology, Nanjing, China) following the manufacturer's instruction. For detection of lncRNA, miRNA and mRNA expression levels, RT-qPCR was then performed on LightCycler®480 (Roche, Switzerland) with SYBR Green PCR Master Mix (Roche, Australia). Before calculation using the ΔΔCt method, the levels of GAPDH were used to normalize the relative expression levels of lncRNA and mRNA, and the levels of small nuclear U6 were used to normalize the miRNA expression levels. The primers are provided in Table S5.

Statistical analysis

Statistical analysis was performed using SPSS 21.0 statistical package. All experiments were carried out at least three times independently. The relationships between H19 expression and clinicopathologic parameters were analyzed by chi-square test and the Mann-Whitney U test. The Kaplan-Meier method was employed to demonstrate the relationship between H19 expression and the prognosis of GC patients. Statistical comparisons among expressions of lncRNA, miRNA and mRNAs were undertaken using the Spearman correlation. P < 0.05 was considered statistically significant. Graphs were generated on GraphPad.


Screening for significant DEGs in gastric cancer

Firstly, through screening gene expression microarrays related to GC from the GEO database, four datasets (GSE13911, GSE79973, GSE19826 and GSE54129) were ultimately selected. Secondly, differential gene expression analysis was performed by GEO2R, with the criteria of |log2FC| > 1 and p-value < 0.05. The DEGs in each dataset were separately displayed in Figure 1A, Figure 1B, Figure 1C and Figure 1D. Thirdly, we further searched some significant DEGs which were simultaneously appeared in the four datasets. As shown in Figure 1E and Figure 1F, a total of 101 up-regulated significant DEGs and 219 down-regulated significant DEGs in GC were confirmed. These up-regulated and down-regulated significant DEGs were completely listed in Table S1 and Table S2, respectively. Subsequently, these significant DEGs were subject to in-depth analysis.

Function analysis of the significant DEGs

GO and KEGG pathway analyses were performed to further reveal the biological processes and relevant pathways of these significant DEGs (Table S6). ClueGO, a plugin of the Cytoscape software, was involved in subsequent analyses.

For up-regulated significant DEGs, the enriched GO-BP functions included extracellular structure organization, extracellular matrix organization, collagen fibril organization, cartilage development, connective tissue development, regulation of cell-substrate adhesion and bone development (Figure 2A). Additionally, Figure 2C indicated that these up-regulated significant DEGs were significantly enriched in some cancer-associated pathways, such as protein digestion and absorption, pathways in cancer, PI3K-Akt signaling pathway, focal adhesion and ECM-receptor interaction.

As shown in Figure 2B, the enriched GO-BP functions for down-regulated significant DEGs included primary alcohol metabolic process, xenobiotic metabolic process, long-chain fatty acid metabolic process, oxidoreductase activity (acting on CH-OH group of donors), cellular response to xenobiotic stimulus, potassium ion transmembrane transport, antibiotic metabolic process, cellular lipid catabolic process, monooxygenase activity and potassium ion transmembrane transporter activity. Similarly, some enriched KEGG pathways were also discovered, among which gastric acid secretion, chemical carcinogenesis, drug metabolism and metabolism of xenobiotics by cytochrome P450 were the most highly enriched pathways (Figure 2D).

Construction and deep analysis of PPI network

Based on STRING database analysis, PPI networks with the up-regulated significant DEGs and down-regulated significant DEGs were established, as shown in Figure 3A and Figure 3C, respectively. By calculating the node degree, some hub genes among these significant DEGs were determined. For a better visual experience, the interactions of top 30 up-regulated (Figure 3B) and down-regulated (Figure 3D) hub genes were reconstructed through Cytoscape software. Besides, the top 30 hub genes with relevant node degrees were listed in Table 1. The top 10 up-regulated hub genes were FN1, COL1A1, COL3A1, COL1A2, FBN1, BGN, COL5A2, THBS2, COL5A1 and SPARC, and top 10 down-regulated hub genes were TP53, TLR3, MYD88, SMAD3, ISG15, HSP90AA1, TRAF6, IRF7, ATP4A and TBK1. The 20 hub genes were selected for following analyses.

Ascertainment of central genes in gastric cancer

To further ascertain central genes in GC, we used the GEPIA and Kaplan-Meier plotter databases to determine the expression and prognosis of the top 10 up- and down-regulated central genes, respectively. Taken the results of expression analysis and survival analysis into consideration, 8 up-regulated hub genes (FN1, COL3A1, FBN1, BGN, COL5A2, THBS2, COL5A1 and SPARC) were discovered not only significantly up-regulated in gastric cancer groups but also predicted poor prognosis in patients with gastric cancer (Figure 4A and Figure 4C-J). Only 1 down-regulated hub gene (ATP4A) had a low expression and a good prognosis (Figure 4B and Figure 4K). In the following analyses, we are impassioned to further explore the 9 central genes, including 8 up-regulated hub genes and 1 down-regulated hub gene.

 Table 1 

The top 30 hub genes in PPI networks

Up-regulated genesDown-regulated genes

Prediction and identification of upstream central miRNAs of central genes

Subsequently, upstream miRNAs of the 9 central genes were predicted relying on miRTarBase. As noted above, we only included microRNA-target gene interactions verified by reporter assay in our study. Eventually, a total of 17 miRNAs were confirmed that might potentially regulate the expression of 8 central genes (FN1, COL3A1, FBN1, COL5A2, THBS2, COL5A1, SPARC and ATP4A), as presented in Figure 5A and Table S3. No potential upstream miRNAs were observed for the BGN gene. Focusing on the recognized opposite relationship between target genes and miRNAs, we hypothesized that the upstream miRNAs of the 7 up-regulated central genes should theoretically show good prognosis, and the down-regulated gene ATP4A should act as an unfavorable prognostic role. Thence, we further explored the prognostic values of the 17 predicted miRNAs by Kaplan-Meier plotter database. Survival analysis displayed that 8 (miR-200c-3p, miR-200b-3p, let-7g-5p, miR-140-3p, miR-29a-3p, let-7b-5p, miR-27b-3p and miR-135b-5p) out of 17 miRNAs were considered as better outcome biomarkers in GC patients as shown in Figure 5B-I. We sensed the 8 miRNAs as the central miRNAs.

 Table 2 

The correlations of mRNA-miRNA, mRNA-lncRNA and miRNA-lncRNA pairs in the established network by starBase database

 Table 3 

Correlation between expression of lncRNA H19 and clinicopathological characteristics of patients with gastric cancer

CharacteristicsCases (n)H19p value
High (n)Low (n)
Age (years)
Smoking history
Drinking history
TNM stage
Primary tumor
Lymph nodes

Prediction and identification of upstream central lncRNAs of central miRNAs

It is well known that lncRNA can interact with miRNA as a competitive endogenous RNA (ceRNA) to regulate the expression of target genes, which play an important role in the occurrence and development of tumors [25, 26]. In light of this theory, the lncRNAs that might have the potence to combine with the 8 central miRNAs (miR-200c-3p, miR-200b-3p, let-7g-5p, miR-140-3p, miR-29a-3p, let-7b-5p, miR-27b-3p and miR-135b-5p) were predicted by an online miRNet database. A total of 79 lncRNAs were confirmed as displayed in Table S4. Based on the classical ceRNA hypothesis, there is an inverse correlation between lncRNA and miRNA. Therefore, we used the GEPIA database to analyze the expression of these lncRNAs in GC. Only 6 (H19, HCP5, LINC00511, MALAT1, OIP5-AS1 and RP11-618G20.1) out of 79 lncRNAs were significantly up-regulated in gastric cancer tissues compared to normal controls. Subsequently, survival analysis on an online UALCAN database of the 6 up-regulated lncRNAs indicated that patients with high expression of H19 had an unfavorable prognosis. After comprehensive consideration, the lncRNA H19 was defined as a central lncRNA (Figure 6A-B).

Establishment of mRNA-miRNA-lncRNA network in gastric cancer

Through a series of online analyses of bioinformatics, we established an essential mRNA-miRNA-lncRNA competitive endogenous RNA network associated with prognosis in GC. The network contained a total of 9 pairs, including 4 mRNA-miRNA pairs (COL3A1-mir-29a-3p, FBN1-mir-29a-3p, COL5A2-mir-29a-3p and SPARC-mir-29a-3p), 4 mRNA-lncRNA pairs (COL3A1-H19, FBN1-H19, COL5A2-H19 and SPARC-H19) and only one miRNA-lncRNA pair (mir-29a-3p-H19). This network was drawn in Figure 6C.

 Figure 1 

Screening for significant DEGs in gastric cancer. (A-D) Volcano plots from four datasets (GSE13911, GSE79973, GSE19826 and GSE54129). X axis represents the p-value of the logarithmic transformation, and Y axis represents the average gene expression differences between gastric cancer samples and normal samples. The four volcano plots showed all DEGs; red dots and green dots represent the up-regulated and down-regulated genes in gastric cancer samples, respectively; black dots represent genes that are not differentially expressed between gastric cancer samples and controls. |log2FC| >1 and p-value < 0.05 were set as the cut-off criteria. (E-F) Venn diagrams of up-regulated and down-regulated DEGs from four GSE datasets. The genes taking the intersection are considered to be significant DEGs.

J Cancer Image
 Figure 2 

Function analysis of the significant DEGs. (A) The enriched GO biological processes (BP) of the up-regulated significant DEGs. (B) The enriched GOBP of the down-regulated significant DEGs. (C) The enriched KEGG pathways of the up-regulated significant DEGs. (D) The enriched KEGG pathways of the down-regulated significant DEGs.

J Cancer Image

As we all known, lncRNA can attenuate the inhibition of mRNA expression by competitively binding to miRNA. In view of this hypothesis, there are negative correlations between miRNAs and lncRNAs or mRNAs and positive relationships between mRNAs and lncRNAs. Furthermore, the gastric cancer associated TCGA data from StarBase database were introduced to profoundly determine the correlations of mRNA-miRNA, mRNA-lncRNA and miRNA-lncRNA pairs in the established network. As shown in Table 2, all pairs as described above were consistent with the ceRNA mechanism (p<0.05). Taken together, we established a brand new mRNA-miRNA-lncRNA network, COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19, which is correlated to the prognostic value of patients with GC. This network is also promising as a novel biomarker for diagnosis or a therapeutic strategy for GC.

Preliminary validation of the mRNA-miRNA-lncRNA network in gastric cancer tissue samples

Quantitative PCR experiments showed that lncRNA H19 expression was increased in 50 GC tissue samples compared with paired adjacent normal tissues (Figure 7A). To assess the significance of H19 overexpression in GC, we evaluated the correlation between H19 expression and patient clinicopathological characteristics. 50 GC patients were classified into two groups according to the median ratio (4.24) of relative H19 expression in tumor tissues: high H19 group (n=25, H19 expression ratio≥median ratio) and low H19 group (n=25, H19 expression ratio≤median ratio). As shown in Table 3, higher levels of H19 were significantly associated with advanced TNM stage (p=0.012), primary tumor (p=0.015), and lymph nodes (p=0.018). However, there was no obvious relationship between H19 expression and other clinical parameters such as age (p=0.569), gender (p=0.564), smoking history (p=0.771), drinking history (p=0.777) or differentiation (p=0.395). Kaplan-Meier survival analysis was used to examine the association between H19 expression and GC patient prognosis. Notably, patients with higher H19 expression levels had significantly shorter overall survival time than those expressing lower levels of H19 (p=0.018) (Figure 7B). We also find that patients with TNM stage III-IV, primary tumor T3-T4 and lymph nodes N2-N3 showed higher H19 expression than patients with TNM stage I-II, primary tumor T1-T2 and lymph nodes N0-N1 (Figure 7C). These results suggested that H19 might play an essential biological role in GC tumorigenesis and progression. Spearman correlation analysis showed that the expression between H19 and miR-29a-3p and the expression between miR-29a-3p and mRNAs (COL3A1/FBN1/COL5A2/SPARC) were inversely proportional, while the expression between H19 and mRNAs (COL3A1/FBN1/COL5A2/SPARC) was positively correlated. The p-value were all statistically significant (Figure 7D).

 Figure 3 

The top 30 hub genes determined in PPI networks. (A) The PPI networks of the up-regulated significant DEGs. (B) The top 30 hub genes of the up-regulated significant DEGs. (C) The PPI networks of down-regulated significant DEGs. (D) The top 30 hub genes of the down-regulated significant DEGs.

J Cancer Image
 Figure 4 

Ascertainment of central genes in gastric cancer. (A-B) Ascertainment of central genes among the top 10 hub genes of the significant up-regulated and down-regulated DEGs by integrating expression and prognosis analyses base on GEPIA and Kaplan Meier-plotter databases, respectively. (C-K) Expressions and prognostic values of the central genes (FN1, COL3A1, FBN1, BGN, COL5A2, THBS2, COL5A1, SPARC and ATP4A) in gastric cancer.

J Cancer Image


The incidence and mortality of gastric cancer have reduced substantially since great progress in surgery, chemotherapy and targeted therapy over the past 50 years. However, there are still a large number of gastric cancer patients suffered recurrence and metastasis after surgical resection, particularly those with diffuse-type in Asia. It's a great challenge for us to search for efficient biomarkers of prognosis [27]. The biological function of miRNAs have been studied extensively in a variety of cancers over the past few decades, including GC [28, 29]. At present, as important members in cell processes and human diseases, the newly discovered lncRNAs have attracted more and more researchers' attention [30]. After the ceRNA hypothesis first proposed by Salmena et al. [6], more and more experiments have been conducted regarding ceRNAs in human cancers. For instance, lncRNA CASC11 was overexpressed in hepatocellular carcinoma and promotes cell proliferation by suppressing miRNA-188-5p [31]; Bai et al. found that lncRNA LOXL1-AS1/miR-let-7a-5p/EGFR axis obviously changed proliferation, migration and apoptosis of drug-resistant DU-145 Cells, providing a promising treatment approach for drug-resistant prostate cancer patients [32]; Similar studies have been reported in gastric cancer. Lu et al. indicated that lncRNA BC032469 acted as a novel ceRNA to modulate hTERT expression by sponging miR-1207-5p and promoted proliferation in gastric cancer [33]; Li et al. found that MIAT competitively binded to miR-29a-3p and consequently upregulated the expression of HDAC4, highlighting the involvement of the MIAT/miR-29a-3p/HDAC4 axis in the cell biological behaviors of GC [34]; LincHOTAIR epigenetically repressed miR34a by binding to PRC2 and contributed to the epithelial-to-mesenchymal transition (EMT), finally accelerating metastasis in human gastric cancer [35]. However, there's a shortage in comprehensive analyses of ceRNAs and mRNAs in gastric cancer. Herein, it aims to study a specific prognosis-related ceRNA network in gastric cancer via the “mRNA-miRNA-lncRNA” model rather than the “lncRNA miRNA-mRNA” model. Encouragingly, we have established a new mRNA-miRNA-lncRNA network in which each RNA play a significant prognostic role in gastric cancer.

 Figure 5 

Identification of upstream central miRNAs of central genes. (A) Establishment of miRNA-gene network based on Cytoscape software. (B-I) Prognostic values of central miRNAs (miR-200c-3p, miR-200b-3p, let-7g-5p, miR-140-3p, miR-29a-3p, let-7b-5p, miR-27b-3p and miR-135b-5p) in gastric cancer.

J Cancer Image
 Figure 6 

Identification of upstream central lncRNAs of central miRNAs. (A) Identification of central lncRNAs among the predicted lncRNAs by integrating expression and prognosis analyses using GEPIA and UALCAN databases, respectively. (B) Expression and prognostic value of H19 in gastric cancer. (C) The map of the novel mRNA-miRNA-lncRNA ceRNA network associated with prognosis of gastric cancer.

J Cancer Image

In our current study, we have totally selected 320 significant DEGs including 101 up-regulated and 219 down-regulated DEGs through extraction from four GEO datasets. GO and KEGG are widely used as functional enrichment analysis for numerous genes [36, 37]. The results of these significant DEGs indicated that they were potentially enriched in some pathways involved in tumor malignant biological processes or metabolism, including cell-substrate adhesion, PI3K-Akt signaling pathway, focal adhesion and ECM-receptor interaction, antibiotic metabolic process, chemical carcinogenesis, drug metabolism and so on. Plenty studies have shown that focal adhesion and cell adhesion own in dispensable positions in cancer invasion and metastasis, consequently leading to cancer progression [38, 39]. In addition, the PI3K-Akt pathway has been considered as one of the essential pathways for tumor growth, invasion and even drug resistance [40]. Thus, these significant DEGs may participate in the regulation of malignant biological behaviors of gastric cancer.

In order to systematically analyze the interactions and functions of these significant DEGs in gastric cancer, we obtained PPI networks on STRING database. As we all know, the more node degrees present in the PPI network, the greater the role that genes will play. Thus, we identified 20 hub genes in the two PPI networks according to the degree of node. To further determine central genes in gastric cancer, the top ten up- and down-regulated central genes were conducted by expression and survival analyses. The results showed that 8 up-regulated genes (FN1, COL3A1, FBN1, BGN, COL5A2, THBS2, COL5A1 and SPARC) and 1 down-regulated gene (ATP4A) may be the central genes in gastric cancer. Further searching for literatures, most of these central genes have been well investigated in gastric cancer. Before, it has been verified low expression of gastric cancer-associated fibroblasts-derived SPARC could lead to cancer stem cell transformation and 5-FU resistance [41]; FN1 was significantly up-regulated in GC tissues and down-regulation of FN1 could inhibit the proliferation, migration and invasion of GC cells [42]; Patients with low expression of THBS2 had a positive prognosis [43]. These findings may support the accuracy of our bioinformatics analysis to a certain extent.

 Figure 7 

Preliminary validation of the mRNA-miRNA-lncRNA network in gastric cancer tissue samples. (A) H19 expression in GC tissues (n = 50) compared with normal tissues (n = 50) was examined by qRT-PCR and normalized to GAPDH expression. (B) Kaplan-Meier overall survival curves according to H19 expression levels. (C) The relationships between H19 expression and clinicopathological characteristics (TNM stage, primary tumor and lymph nodes). (D) Spearman correlation analysis among the expression of H19, miR-29a-3p and mRNAs (COL3A1/FBN1/COL5A2/SPARC).

J Cancer Image

As mentioned above, miRNAs and lncRNAs can participate in the regulation of gene expression through the ceRNA mechanism. The upstream miRNAs of the central genes were first predicted. Survival analysis showed that patients with higher expression of 8 miRNAs (miR-200c-3p, miR-200b-3p, let-7g-5p, miR-140-3p, miR-29a-3p, let-7b-5p, miR-27b-3p and miR-135b-5p) had better overall survival in gastric cancer. Some of the 8 miRNAs have been reported for their tumor suppressive effects in gastric cancer. For example, miR-29a-3p showed a significant down-regulated expression in GC tumor tissues and high expression of miR-29a-3p inhibited cell proliferation and metastasis by targeting CDK2, CDK4, and CDK6 [44]; Tao et al. indicated that miR-27b-3p exerts tumor-suppressive effects in GC via the inhibition of oncogene ROR1 expression [45]. Next, we further predicted 79 upstream lncRNAs of these central miRNAs. Based on TCGA data, integrating expression analysis and survival analysis of these lncRNAs, only 1 lncRNA (H19) was considered as the central lncRNA. Moreover, overexpression of lncRNA H19 has been determined to enhance the proliferation, invasion of gastric cancer and contributed to poor prognosis in GC patients in multiple studies [46-49]. Naturally, a prognosis-related mRNA-miRNA-lncRNA ceRNA network in gastric cancer was successfully constructed. Correlation analysis of the RNA pairs in the established network further demonstrated that this network was absolutely compliant with the ceRNA hypothesis. Without any doubt, we have gained exciting discoveries through a series of bioinformatic analyses. Supplementally, we proved the ceRNA network through preliminary RT-qPCR validation in gastric cancer tissue samples; however, more experimental and clinical studies are needed in the future to fully elucidate this network, which will help to develop better strategies for the diagnosis and therapy of GC.


In conclusion, we have established a novel mRNA-miRNA-lncRNA ceRNA network through integrating bioinformatic analysis and experiments on clinical samples. This network is significantly associated with the prognosis of patients with GC. In addition, it may also provide some new directions for studying the molecular mechanism of GC. However, further researches are still needed to validate this network.


ceRNA: competing endogenous RNA; lncRNA: long noncoding RNA; miRNA: microRNA; DEGs: differentially expressed genes; PPI: Protein-protein interaction; GC: Gastric cancer; MREs: miRNA response elements; GO: Gene Ontology; KEGG: Kyoto Encyclopedia of Genes and Genomes; BP: biological process; CC: cell component; MF: molecular function; STRING: the Search Tool for the Retrieval of Interacting Genes; GEPIA: Gene Expression Profiling Interactive Analysis; HR: hazard ratio; MTIs: miRNA-target interactions; EMT: epithelial-to-mesenchymal transition; TCGA: The Cancer Genome Atlas Program; AJCC: American Joint Committee on Cancer.

Supplementary Material

Supplementary tables.



This work was supported by the National Natural Science Foundation for Youth of China (No. 81802902), and the grant from the Six Talent Peaks Project in Jiangsu Province (2018-WSW-040).

Competing Interests

The authors have declared that no competing interest exists.


1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394-424

2. Katsuragi S, Tanaka H, Hasegawa J, Kanayama N, Nakata M, Murakoshi T. et al. Analysis of preventability of malignancy-related maternal death from the nationwide registration system of maternal deaths in Japan. J Matern Fetal Neonatal Med. 2019:1-7

3. Kang W, Zheng Q, Lei J, Chen C, Yu C. Prognostic Value of Long Noncoding RNAs in Patients with Gastrointestinal Cancer: A Systematic Review and Meta-Analysis. Dis Markers. 2018;2018:5340894

4. Japanese Gastric Cancer A. Japanese gastric cancer treatment guidelines 2014 (ver. 4). Gastric Cancer. 2017;20:1-19

5. Yue B, Sun B, Liu C, Zhao S, Zhang D, Yu F. et al. Long non-coding RNA Fer-1-like protein 4 suppresses oncogenesis and exhibits prognostic value by associating with miR-106a-5p in colon cancer. Cancer Sci. 2015;106:1323-32

6. Salmena L, Poliseno L, Tay Y, Kats L, Pandolfi PP. A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language?. Cell. 2011;146:353-8

7. Kotake Y, Nakagawa T, Kitagawa K, Suzuki S, Liu N, Kitagawa M. et al. Long non-coding RNA ANRIL is required for the PRC2 recruitment to and silencing of p15(INK4B) tumor suppressor gene. Oncogene. 2011;30:1956-62

8. Song X, Cao G, Jing L, Lin S, Wang X, Zhang J. et al. Analysing the relationship between lncRNA and protein-coding gene and the role of lncRNA as ceRNA in pulmonary fibrosis. J Cell Mol Med. 2014;18:991-1003

9. Wu X, Xiao Y, Zhou Y, Zhou Z, Yan W. LncRNA FOXP4-AS1 is activated by PAX5 and promotes the growth of prostate cancer by sequestering miR-3184-5p to upregulate FOXP4. Cell Death Dis. 2019;10:472

10. Ding Z, Ye P, Yang X, Cai H. LncRNA FBXL19-AS1 promotes breast cancer cells proliferation and invasion via acting as a molecular sponge to miR-718. Biosci Rep. 2019 39

11. Li M, Guan H, Liu Y, Gan X. LncRNA ZEB1-AS1 reduces liver cancer cell proliferation by targeting miR-365a-3p. Exp Ther Med. 2019;17:3539-47

12. Qu R, Chen X, Zhang C. LncRNA ZEB1-AS1/miR-409-3p/ZEB1 feedback loop is involved in the progression of non-small cell lung cancer. Biochemical and biophysical research communications. 2018;507:450-6

13. Gao R, Zhang N, Yang J, Zhu Y, Zhang Z, Wang J. et al. Long non-coding RNA ZEB1-AS1 regulates miR-200b/FSCN1 signaling and enhances migration and invasion induced by TGF-beta1 in bladder cancer cells. J Exp Clin Cancer Res. 2019;38:111

14. Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J. et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47:D607-D13

15. Lou W, Liu J, Ding B, Xu L, Fan W. Identification of chemoresistance-associated miRNAs in breast cancer. Cancer Manag Res. 2018;10:4747-57

16. Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. 2017;45:W98-W102

17. Tang Z, Kang B, Li C, Chen T, Zhang Z. GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res. 2019;47:W556-W60

18. Nagy A, Lanczky A, Menyhart O, Gyorffy B. Validation of miRNA prognostic power in hepatocellular carcinoma using expression data of independent datasets. Scientific reports. 2018;8:9227

19. Chandrashekar DS, Bashel B, Balasubramanya SAH, Creighton CJ, Ponce-Rodriguez I, Chakravarthi B. et al. UALCAN: A Portal for Facilitating Tumor Subgroup Gene Expression and Survival Analyses. Neoplasia. 2017;19:649-58

20. Chou CH, Shrestha S, Yang CD, Chang NW, Lin YL, Liao KW. et al. miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions. Nucleic Acids Res. 2018;46:D296-D302

21. Fan Y, Xia J. miRNet-Functional Analysis and Visual Exploration of miRNA-Target Interactions in a Network Context. Methods Mol Biol. 2018;1819:215-33

22. Fan Y, Siklenka K, Arora SK, Ribeiro P, Kimmins S, Xia J. miRNet - dissecting miRNA-target interactions and functional associations through network-based visual analysis. Nucleic Acids Res. 2016;44:W135-41

23. Li JH, Liu S, Zhou H, Qu LH, Yang JH. starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42:D92-7

24. Yang JH, Li JH, Shao P, Zhou H, Chen YQ, Qu LH. starBase: a database for exploring microRNA-mRNA interaction maps from Argonaute CLIP-Seq and Degradome-Seq data. Nucleic Acids Res. 2011;39:D202-9

25. Lei F, Zhang H, Xie X. Comprehensive analysis of an lncRNA-miRNA-mRNA competing endogenous RNA network in pulpitis. PeerJ. 2019;7:e7135

26. Tuersong T, Li L, Abulaiti Z, Feng S. Comprehensive analysis of the aberrantly expressed lncRNAassociated ceRNA network in breast cancer. Molecular medicine reports. 2019;19:4697-710

27. Henson DE, Dittus C, Younes M, Nguyen H, Albores-Saavedra J. Differential trends in the intestinal and diffuse types of gastric carcinoma in the United States, 1973-2000: increase in the signet ring cell type. Arch Pathol Lab Med. 2004;128:765-70

28. Lu M, Huang Y, Sun W, Li P, Li L, Li L. miR-135b-5p promotes gastric cancer progression by targeting CMTM3. International journal of oncology. 2018;52:589-98

29. Dong Y, Fu C, Guan H, Zhang Z, Zhou T, Li B. Prognostic significance of miR-126 in various cancers: a meta-analysis. Onco Targets Ther. 2016;9:2547-55

30. Pan X, Zheng G, Gao C. LncRNA PVT1: a Novel Therapeutic Target for Cancers. Clin Lab. 2018;64:655-62

31. Cheng N, Wu J, Yin M, Xu J, Wang Y, Chen X. et al. LncRNA CASC11 promotes cancer cell proliferation in hepatocellular carcinoma by inhibiting miRNA-188-5p. Biosci Rep. 2019 39

32. Bai T, Liu Y, Li B. LncRNA LOXL1-AS1/miR-let-7a-5p/EGFR-related pathway regulates the doxorubicin resistance of prostate cancer DU-145 cells. IUBMB Life. 2019;71:1537-51

33. Lu MH, Tang B, Zeng S, Hu CJ, Xie R, Wu YY. et al. Long noncoding RNA BC032469, a novel competing endogenous RNA, upregulates hTERT expression by sponging miR-1207-5p and promotes proliferation in gastric cancer. Oncogene. 2016;35:3524-34

34. Li Y, Wang K, Wei Y, Yao Q, Zhang Q, Qu H. et al. lncRNA-MIAT regulates cell biological behaviors in gastric cancer through a mechanism involving the miR-29a-3p/HDAC4 axis. Oncol Rep. 2017;38:3465-72

35. Liu YW, Sun M, Xia R, Zhang EB, Liu XH, Zhang ZH. et al. LincHOTAIR epigenetically silences miR34a by binding to PRC2 to promote the epithelial-to-mesenchymal transition in human gastric cancer. Cell Death Dis. 2015;6:e1802

36. Zhao Z, Bai J, Wu A, Wang Y, Zhang J, Wang Z. et al. Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data. Database (Oxford). 2015. 2015

37. Zhang T, Jiang M, Chen L, Niu B, Cai Y. Prediction of gene phenotypes based on GO and KEGG pathway enrichment scores. BioMed research international. 2013;2013:870795

38. Ntantie E, Allen MJ, Fletcher J, Nkembo AT, Lamango NS, Ikpatt OF. Suppression of focal adhesion formation may account for the suppression of cell migration, invasion and growth of non-small cell lung cancer cells following treatment with polyisoprenylated cysteinyl amide inhibitors. Oncotarget. 2018;9:25781-95

39. Wu JD, Hong CQ, Huang WH, Wei XL, Zhang F, Zhuang YX. et al. L1 Cell Adhesion Molecule and Its Soluble Form sL1 Exhibit Poor Prognosis in Primary Breast Cancer Patients. Clin Breast Cancer. 2018;18:e851-e61

40. Shen H, Wang D, Li L, Yang S, Chen X, Zhou S. et al. MiR-222 promotes drug-resistance of breast cancer cells to adriamycin via modulation of PTEN/Akt/FOXO1 pathway. Gene. 2017;596:110-8

41. Ma Y, Zhu J, Chen S, Ma J, Zhang X, Huang S. et al. Low expression of SPARC in gastric cancer-associated fibroblasts leads to stemness transformation and 5-fluorouracil resistance in gastric cancer. Cancer Cell Int. 2019;19:137

42. Zhang H, Sun Z, Li Y, Fan D, Jiang H. MicroRNA-200c binding to FN1 suppresses the proliferation, migration and invasion of gastric cancer cells. Biomed Pharmacother. 2017;88:285-92

43. Zhuo C, Li X, Zhuang H, Tian S, Cui H, Jiang R. et al. Elevated THBS2, COL1A2, and SPP1 Expression Levels as Predictors of Gastric Cancer Prognosis. Cellular physiology and biochemistry: international journal of experimental cellular physiology, biochemistry, and pharmacology. 2016;40:1316-24

44. Zhao Z, Wang L, Song W, Cui H, Chen G, Qiao F. et al. Reduced miR-29a-3p expression is linked to the cell proliferation and cell migration in gastric cancer. World J Surg Oncol. 2015;13:101

45. Tao J, Zhi X, Zhang X, Fu M, Huang H, Fan Y. et al. miR-27b-3p suppresses cell proliferation through targeting receptor tyrosine kinase like orphan receptor 1 in gastric cancer. J Exp Clin Cancer Res. 2015;34:139

46. Chen JS, Wang YF, Zhang XQ, Lv JM, Li Y, Liu XX. et al. H19 serves as a diagnostic biomarker and up-regulation of H19 expression contributes to poor prognosis in patients with gastric cancer. Neoplasma. 2016;63:223-30

47. Li H, Yu B, Li J, Su L, Yan M, Zhu Z. et al. Overexpression of lncRNA H19 enhances carcinogenesis and metastasis of gastric cancer. Oncotarget. 2014;5:2318-29

48. Hashad D, Elbanna A, Ibrahim A, Khedr G. Evaluation of the Role of Circulating Long Non-Coding RNA H19 as a Promising Novel Biomarker in Plasma of Patients with Gastric Cancer. Journal of clinical laboratory analysis. 2016;30:1100-5

49. Mohamed WA, Schaalan MF, Ramadan B. The expression profiling of circulating miR-204, miR-182, and lncRNA H19 as novel potential biomarkers for the progression of peptic ulcer to gastric cancer. Journal of cellular biochemistry. 2019;120:13464-77

Author contact

Corresponding address Corresponding author: E-mail addresses: njmushycom (Hongyu Shen), fwfei1974edu.cn (Weifei Fan).

Received 2020-2-27
Accepted 2020-5-25
Published 2020-6-16

Citation styles

Shen, H., Wang, L., Chen, Q., Xu, J., Zhang, J., Fang, L., Wang, J., Fan, W. (2020). The prognostic value of COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19 associated ceRNA network in Gastric Cancer through bioinformatic exploration. Journal of Cancer, 11(17), 4933-4946. https://doi.org/10.7150/jca.45378.

Shen, H.; Wang, L.; Chen, Q.; Xu, J.; Zhang, J.; Fang, L.; Wang, J.; Fan, W. The prognostic value of COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19 associated ceRNA network in Gastric Cancer through bioinformatic exploration. J. Cancer 2020, 11 (17), 4933-4946. DOI: 10.7150/jca.45378.

Shen H, Wang L, Chen Q, Xu J, Zhang J, Fang L, Wang J, Fan W. The prognostic value of COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19 associated ceRNA network in Gastric Cancer through bioinformatic exploration. J Cancer 2020; 11(17):4933-4946. doi:10.7150/jca.45378. https://www.jcancer.org/v11p4933.htm

Shen H, Wang L, Chen Q, Xu J, Zhang J, Fang L, Wang J, Fan W. 2020. The prognostic value of COL3A1/FBN1/COL5A2/SPARC-mir-29a-3p-H19 associated ceRNA network in Gastric Cancer through bioinformatic exploration. J Cancer. 11(17):4933-4946.

This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/). See http://ivyspring.com/terms for full terms and conditions.
Popup Image