Open-access A novel lncRNA-mRNA-miRNA signature predicts recurrence and disease-free survival in cervical cancer

Abstract

Cervical cancer (CC) patients have a poor prognosis due to the high recurrence rate. However, there are still no effective molecular signatures to predict the recurrence and survival rates for CC patients. Here, we aimed to identify a novel signature based on three types of RNAs [messenger RNA (mRNAs), microRNA (miRNAs), and long non-coding RNAs (lncRNAs)]. A total of 763 differentially expressed mRNAs (DEMs), 46 lncRNAs (DELs), and 22 miRNAs (DEMis) were identified between recurrent and non-recurrent CC patients using the datasets collected from the Gene Expression Omnibus (GSE44001; training) and The Cancer Genome Atlas (RNA- and miRNA-sequencing; testing) databases. A competing endogenous RNA network was constructed based on 23 DELs, 15 DEMis, and 426 DEMs, in which 15 DELs, 13 DEMis, and 390 DEMs were significantly associated with disease-free survival (DFS). A prognostic signature, containing two DELs (CD27-AS1, LINC00683), three DEMis (hsa-miR-146b, hsa-miR-1238, hsa-miR-4648), and seven DEMs (ARMC7, ATRX, FBLN5, GHR, MYLIP, OXCT1, RAB39A), was developed after LASSO analysis. The built risk score could effectively separate the recurrence rate and DFS of patients in the high- and low-risk groups. The accuracy of this risk score model for DFS prediction was better than that of the FIGO (International Federation of Gynecology and Obstetrics) staging (the area under receiver operating characteristic curve: training, 0.954 vs 0.501; testing, 0.882 vs 0.656; and C-index: training, 0.855 vs 0.539; testing, 0.711 vs 0.508). In conclusion, the high predictive accuracy of our signature for DFS indicated its potential clinical application value for CC patients.

Cervical cancer; Recurrence; Competing endogenous RNA; Molecular signature; FIGO staging


Introduction

Cervical cancer (CC) is one of the most common gynecological cancers worldwide. According to statistics using the Global Cancer Observatory database, there were approximately 570,000 cases of CC in 2018 (1). Among all countries, China contributed with the highest incidence burden (106,430 cases) (1). Although great advances have been made in the therapeutic options (such as surgery, radiotherapy, and chemotherapy), a considerable proportion of patients can develop relapse or metastasis, which may be the possible reason associated with a high mortality-to-incidence ratio in CC (about 30-50%) (1,2). Hence, how to early screen out patients at a high risk of poor prognosis may be a significant issue for gynecologists.

Accumulating evidence has demonstrated that the advanced International Federation of Gynecology and Obstetrics (FIGO) stage correlates with a high risk of recurrence and shorter length of 5-year survival (3). Thus, the FIGO staging system has been well recognized as a prognostic biomarker for CC in clinically. However, some studies indicate that the prognostic effectiveness of the FIGO staging system should be improved because survival differences could also be observed in patients within the same stage (4). Thus, identification of a more effective prognostic biomarker has become the main research focus. With the recent developments in sequencing technology and bioinformatics, identification of molecular biomarkers has gained much attention. Some molecular biomarkers were proven to have better predictive abilities than the FIGO staging system for survival in CC patients. For example, Zhao et al. (5) identified a prognostic signature that consisted of five protein-encoding messenger RNAs (mRNAs) (ITM2A, DSG2, SPP1, EFNA1, and MMP1) and found that only the molecular signature (P<0.05), but not the FIGO staging (P>0.05), was an independent indicator for the prediction of survival in CC patients. This conclusion was also seen in a study by Ju et al. (6) in which a five-mRNA signature (GALNTL6, ARSE, DPAGT1, GANAB, and FURIN) was shown to predict disease-free survival (DFS). Both univariate and multivariate Cox regression analyses revealed that a three-microRNA (miRNA) signature (miR-145, miR-200c, and miR-218-1) was significantly associated with overall survival (OS) of CC patients, while the FIGO staging was reported to be a significant prognostic factor only in univariate analysis (7). Mao et al. (8) screened nine long non-coding RNAs (lncRNAs) (ATXN8OS, C5orf60, DIO3OS, EMX2OS, INE1, KCNQ1DN, KCNQ1OT1, LOH12CR2, and RFPL1S) together as a single prognostic signature and demonstrated the predictive accuracy of this lncRNA signature for DFS was higher than that of the FIGO staging (area under the ROC (receiver operating characteristic) curve (AUC): training dataset, 0.793 vs 0.537; validation dataset, 0.780 vs 0.529; testing dataset, 0.742 vs 0.589). Based on the Harrell's Concordance Index (C-index), Cheng et al. (9) also showed the prediction ability of the six-lncRNA signature (LINC00619, FGF13-AS1, EMX2OS, WT1-AS, C9orf147, and LINC00908) for survival was higher than that of the FIGO staging (0.648 vs 0.516). However, the prediction power of the known signature remains relatively low (AUC<0.8 in most of studies) (6,), suggesting other candidate prognostic predictors are necessary for CC.

In recent studies on other cancers, we noticed that some scholars recommended integrating various RNA types to identify prognostic biomarkers. Also, they demonstrated that the multi-RNA-based classifier model was more effective to separate the survival of patients with different risks than lncRNA, miRNA, or mRNA alone (11,12), which has not been reported for CC patients. Therefore, the goal of this study was to develop a novel lncRNA-miRNA-mRNA prognostic signature to predict recurrence and DFS for CC patients.

Material and Methods

Data resource

One microarray dataset was downloaded from the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) database under accession number GSE44001 (13) on April 8, 2019 (ID seen in Supplementary Table S1). GSE44001 dataset was run on the platform of Illumina HumanHT-12 WG-DASL V4.0 R2 expression beadchip (GPL14951, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL14951). GSE44001 included 300 CC samples with recurrence (n=38, recurrent; n=262, non-recurrent) and DFS information and thus was set as the training dataset. During the corresponding period, matched RNA sequencing (RNA-seq; level 3, fragments per kilobase of exon per million fragments mapped value), miRNA-seq (level 3), and clinical data (recurrence status: n=28, recurrent; n=211, non-recurrent; and DFS time) of CC samples (ID seen in Supplementary Table S1) were also obtained from The Cancer Genome Atlas (TCGA, https://portal.gdc.cancer.gov/) database. TCGA data were obtained on the platform of Illumina HiSeq 2000 RNA Sequencing (https://tcga-xena-hub.s3.us-east-1.amazonaws.com/download/probeMap%2Fhugo_gencode_good_hg19_V24lift37_probemap) and selected as the testing dataset.

Identification of differentially expressed RNAs

Gene symbol annotation was performed using HUGO Gene Nomenclature Committee (HGNC; http://www.genenames.org/) to identify gene classes (mRNAs, lncRNAs, and miRNAs). The RNAs with the median expression level of zero or only annotated in one dataset were deleted. The Linear Models for Microarray Data (LIMMA) package (v3.34.7; https://bioconductor.org/packages/release/bioc/html/limma.html) for R was used for differential analysis. |log2FC(fold change)|>0.5 and false discovery rate (FDR) <0.05 were defined as the threshold value to identify significantly differentially expressed mRNAs (DEMs), lncRNAs (DELs), and miRNAs (DEMis) between recurrent and non-recurrent samples of the GSE44001 dataset. The heatmap of DEMs, DELs, and DEMis was generated using the R packages of “pheatmap” (v1.0.8; https://cran.r-project.org/web/packages/pheatmap).

Construction of a competing endogenous RNA (ceRNA) regulatory network

The ceRNA network was constructed based on the theory that lncRNAs can act as miRNA sponges and compete for miRNA binding to protein-coding mRNAs, influencing the negative regulation of miRNAs on the expression of mRNAs (14). Based on this hypothesis, the lncRNA-miRNA-mRNA ceRNA network was constructed following the steps below: 1) DIANA-LncBase (v2.0; http://carolina.imis.athenainnovation.gr/diana_tools/web/index.php?r=lncbasev2/index-predicted) was used to screen interactions between DELs and DEMis. Only the pairs with the opposite expression in DELs and DEMis were left; 2) starBase (v2.0; http://starbase.sysu.edu.cn/) database was used to retrieve the interactions between DEMis and DEMs. The starBase database included the prediction results from five databases (targetScan, picTar, RNA22, PITA, and miRanda). The interaction pairs predicted by any one database were included. Only the pairs with the opposite expression in DEMis and DEMs were retained; 3) lncRNA-miRNA-mRNA interaction axes were selected according to the intersected miRNAs interacted with DEMs and DELs; and 4) the ceRNA network was visualized using Cytoscape (v3.6.1; www.cytoscape.org/).

Functional enrichment analysis

To understand the underlying functions of DEMs in the ceRNA network, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway and Gene Ontology (GO) biological process enrichment analyses were performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID) (v6.8; http://david.abcc.ncifcrf.gov). A P-value <0.05 served as the cut-off point.

Establishment of a prognostic signature based on the ceRNA network genes

Univariate Cox regression analysis was performed to analyze the association between DEMs/DELs/DEMis in the ceRNA network and DFS using “survival” package in R (v2.41-1; http://bioconductor.org/packages/survivalr/). DEMs/DELs/DEMis that were significantly related with DFS (log-rank P-value <0.05) were entered into the multivariate Cox regression analysis to confirm their independence. The selected feature genes by the multivariate analysis (log-rank P-value <0.05) were used to fit a least absolute shrinkage and selection operator (LASSO)-Cox proportional hazards model (Cox-PH) model to further obtain an optimal gene panel using the Penalized package (v0.9-5; http://bioconductor.org/packages/penalized/). The risk score was built according to the expression levels of the RNAs (ExpRNAs) and their corresponding LASSO coefficients (ΣβRNAs): Prognostic risk score=ΣβRNAs × ExpRNAs.

Based on the median risk score, CC patients in the training dataset (GSE44001) were divided into two groups: the “low-risk” group and the “high-risk” group. Kaplan-Meier (K-M) survival curves were conducted using the “survival” package in R to identify the DFS differences between two groups. The ROC curve within 5 years was plotted and the AUC was computed to estimate the predictive ability of the risk score. To demonstrate the predictive accuracy of the risk score better than the FIGO staging for DFS prediction, several analyses were performed, including stratification with K-M curves, estimation of AUC for time-dependent ROC curves, and calculation of C-index for various survival models using survcomp package in R (http://www.bioconductor.org/packages/release/bioc/html/survcomp.html). The robustness of the prognostic signature for predicting DFS in CC patients was subsequently assessed in the testing dataset (TCGA).

Results

Identification of DEMs, DELs, and DEMis

By HGNC annotation, 11,187 mRNAs, 249 lncRNAs, and 229 miRNAs were found in the GSE44001 dataset. According to the cut-off threshold (FDR<0.05 and |log2FC|>0.5), 763 DEMs (409 upregulated and 354 downregulated), 46 DELs (3 upregulated and 43 downregulated), and 22 DEMis (16 upregulated and 6 downregulated) were identified by comparing recurrent and non-recurrent CC samples (Figure 1A and B; Supplementary Table S2). The heat map showed that these DEMs, DELs, and DEMis can provide a clear separation for the recurrent and non-recurrent samples (Figure 1C).

Figure 1
Identification of differentially expressed RNAs between recurrent and non-recurrent cervical cancer samples. A, Volcano plot to show the distribution of differentially expressed RNAs (green dots, downregulated; red dots, upregulated). FC: fold change; FDR: false discovery rate. B, Upregulated and downregulated number of differentially expressed mRNAs, lncRNAs, and miRNAs. C, Heat map of differentially expressed mRNAs, lncRNAs, and miRNAs. Red: high expression; green: low expression.

Construction of a ceRNA network

A total of 63 interaction pairs between 23 DELs and 15 DEMis were predicted by the DIANA-LncBase database (such as CD27-AS1-hsa-miR-1275, LINC00683-hsa-miR-146b, EMX2OS-hsa-miR-1238/4648, LINC00265-has-miR-128-1/604/1304, EMX2OS-hsa-miR-29b-1), while 1,038 paired interactions between 15 DEMis and 426 DEMs were predicted by the starBase database [such as hsa-miR-1275-ANKRD6 (ankyrin repeat domain 6), hsa-miR-595/4648-GHR (growth hormone receptor), hsa-miR-1238-FBLN5 (fibulin 5), hsa-miR-146b-ACAP2 (ArfGAP with coiled-coil, ankyrin repeat and PH domains 2), has-miR-128-1-ATRX (ATRX chromatin remodeler), hsa-miR-29b-1-MYLIP (myosin regulatory light chain interacting protein), hsa-miR-604-ARMC7 (armadillo repeat containing 7)/OXCT1 (3-oxoacid CoA-transferase 1), and hsa-miR-1304-RAB39A (RAB39A, member RAS oncogene family)]. According to the intersected miRNAs, an lncRNA-miRNA-mRNA ceRNA network was constructed (Figure 2).

Figure 2
Construction of a competing endogenous RNA network among differentially expressed lncRNAs, miRNAs, and genes. A, Relationship pairs among upregulated lncRNA, downregulated miRNAs, and upregulated mRNAs; B, Relationship pairs among downregulated lncRNAs, upregulated miRNAs, and downregulated mRNAs. Red: upregulated; green: downregulated.

Functional enrichment analysis

To further understand the functions of DEMs in the ceRNA network, functional enrichment analysis was performed. The results showed that 16 GO biological process terms were enriched, including GO:0033554∼cellular response to stress (ATRX), GO:0009967∼positive regulation of signal transduction (GHR), GO:0006259∼DNA metabolic process (ATRX), GO:0006974∼response to DNA damage stimulus (ATRX), GO:0010647∼positive regulation of cell communication (GHR), GO:0051336∼regulation of hydrolase activity (ACAP2), and GO:0006928∼cell motion (MYLIP) (Figure 3; Table 1). Also, four KEGG pathways were enriched, such as hsa04144:Endocytosis (ACAP2) (Figure 3; Table 1).

Figure 3
Functional enrichment results for genes in the competing endogenous RNA network. GO: Gene Ontology.
Table 1
Functional enrichment for genes in the ceRNA network.

Construction of a prognostic signature

Univariate Cox regression analysis detected that among 464 RNAs in the ceRNA network, 418 were significantly associated with DFS, including 390 DEMs (i.e., ANKRD6, WWP1, ACAP2), 15 DELs (i.e., EMX2OS, LINC00265), and 13 DEMis (i.e., hsa-miR-1275, has-miR-128-1, hsa-miR-29b-1, hsa-miR-604) (Supplementary Table S3). Of them, 29 (22 DEMs, two DELs, and five DEMis) were screened as the independent predictors of DFS by the multivariate Cox regression analysis. LASSO Cox regression model analysis further showed that 12 of these 29 RNAs were the optimal prognostic panel (2 DELs: CD27-AS1, LINC00683; 3 DEMis: hsa-miR-146b, hsa-miR-1238, hsa-miR-4648; 7 DEMs: ARMC7, ATRX, FBLN5, GHR, MYLIP, OXCT1, RAB39A) (Table 2). Hence, the risk score was built based on the expression of these 12 signature genes and their LASSO coefficients: risk score=(-0.2223297) × ExpCD27-AS1 + (0.403561456) × ExpLINC00683 + (1.275309117) × Exphsa-miR-146b + (-2.408082103) × Exphsa-miR-1238 + (-0.574392794) × Exphsa-miR-4648 + (0.848557903) × ExpARMC7 + (0.868981201) ×ExpATRX + (-0.573439111) × ExpFBLN5 + (-0.457116812) × ExpGHR + (-0.56842456) × ExpMYLIP + (0.872944103) × ExpOXCT1 + (0.777716643) × ExpRAB39A.

Table 2
The 12-gene-based signature.

The risk score was calculated for each sample in the training set. According to the median risk score, patients were divided into high- (n=150) and low-risk (n=150) groups (Supplementary Table S1). The recurrence rate of the patients in the high-risk group was significantly higher than that in the low-risk group (34/150, 22.7% vs 4/150, 2.7%; chi-squared=27.029, P<0.001). K-M survival curve analysis showed that the DFS time was significantly shorter in the high-risk group relative to the low-risk group [hazard ratio (HR)=9.356; 95%CI: 3.319-26.37; P=4.239e-08] (Figure 4A). ROC survival curve analysis revealed that this 12-gene signature exhibited a high accuracy to predict the 5-year DFS of CC patients, with the AUC of 0.954 (Figure 4B). Stratification analysis demonstrated that the DFS of patients with the same stage (1 or 2) could be significantly distinguished when they were divided into high- and low-risk groups by the risk score (Figure 5A and B). Time-dependent ROC curve (AUC=0.954 vs 0.501) and C-index (0.855 vs 0.539) analyses further confirmed that the predictive accuracy of the risk score was better than the FIGO staging for DFS prediction (Figure 6A and C). Furthermore, we also evaluated the prognostic robustness of this 12-gene signature by using the TCGA as the testing dataset. In line with the results of the training dataset, K-M survival curve analysis showed the high-risk group was correlated with poorer DFS compared to that of the low-risk group (HR=2.545; 95%CI: 1.172-5.526; P=1.461e-02) (Figure 4C). The AUC was 0.882 in the ROC survival curve analysis (Figure 4D). Stratification analysis showed a statistical difference in DFS between patients at high and low risk, although they all belonged to stage 4 (Figure 5F). The DFS in patients with stage 1 to 3 were not significantly different (Figure 5C-E). Time-dependent ROC curve (AUC=0.882 vs 0.656) and C-index (0.711 vs 0.508) analyses further confirmed the higher predictive accuracy of risk score for DFS compared with the FIGO staging (Figure 6B and C).

Figure 4
Prognostic evaluation of the 12-multi-RNA-based risk score for cervical cancer patients. A, Kaplan-Meier curve of the training dataset (GSE44001). B, Receiver operator characteristic curve of the training dataset (GSE44001). C, Kaplan-Meier curve of the testing dataset (TCGA). D, Receiver operator characteristic curve of the testing dataset (TCGA). AUC: area under the receiver operator characteristic curve; HR: hazard ratio.
Figure 5
Stratification analysis according to the FIGO staging. Kaplan-Meier survival curve for A, patients with stage 1 (GSE44001); B, patients with stage 2 (GSE44001); C, patients with stage 1 (TCGA); D, patients with stage 2 (TCGA); E, patients with stage 3 (TCGA); F, patients with stage 4 (TCGA). HR: hazard ratio.
Figure 6
Comparison of the prognostic power among the risk score model, the FIGO staging, and mRNA, miRNA, lncRNA alone. A, Time-dependent receiver operator characteristic curves (ROC) using the training dataset (GSE44001). B, Time-dependent ROC curves using the testing dataset (TCGA). C, Calculated results for ROC and C-index.

Discussion

Although studies have investigated a molecular signature to predict the recurrence and DFS (or recurrence free survival, RFS) for CC patients (6,8), the known signature was mainly composed of 5 mRNAs (6), 9 lncRNAs (8), or 9 miRNAs (15). No studies focused on identification of a combined-RNA signature. In the present study, we attempted to develop a novel signature to distinguish the CC patients having a high risk to recurrence and poor DFS from those with a low risk based on the lncRNAs, miRNAs, and mRNAs in the ceRNA network. This analysis flow may be beneficial to identify some mechanism-clear, prognostic associative biomarkers. As expected, our identified prognostic signature included two lncRNAs (CD27-AS1, LINC00683), three miRNAs (hsa-miR-146b, hsa-miR-1238, hsa-miR-4648), and seven mRNAs (ARMC7, ATRX, FBLN5, GHR, MYLIP, OXCT1, RAB39A). Almost all of their related ceRNA interactive axis genes (CD27-AS1-hsa-miR-1275-ANKRD6, LINC00683-hsa-miR-146b-ACAP2, EMX2OS-hsa-miR-1238-FBLN5, EMX2OS-hsa-miR-4648-GHR, EMX2OS-hsa-miR-29b-1-MYLIP, LINC00265-has-miR-128-1-ATRX, LINC00265-hsa-miR-604-ARMC7/OXCT1, LINC00265-hsa-miR-1304-RAB39A) were associated with DFS except hsa-miR-1304. The risk score built by these multiple-type RNAs showed good performance to predict DFS, with an AUC of 0.954 and 0.882 in the training dataset and the testing dataset, respectively. The predictive accuracy of our signature was obviously higher than that of previously identified signatures for CC, such as 9-lncRNA (training GSE44001 dataset, AUC=0.793; testing TCGA dataset, AUC=0.742) (8) and 5-mRNA (TCGA dataset, AUC=0.792) (6). The predictive superiority of multiple-type RNA to single RNA type was also confirmed by our analysis (training: AUC=0.954 vs 0.659 for lncRNAs, 0.63 for miRNAs, 0.939 for mRNAs; testing: AUC=0.882 vs 0.602 for lncRNAs, 0.608 for miRNAs, 0.808 for mRNAs; training: C-index=0.855 vs 0.646 for lncRNAs, 0.628 for miRNAs, 0.806 for mRNAs; testing: C-index=0.711 vs 0.469 for lncRNAs, 0.606 for miRNAs, 0.703 for mRNAs). These findings were in agreement with the study of Zhang et al. (11) and Wang et al. (12). More importantly, we also found that the risk score had the ability for predicting the DFS within each FIGO stage and the AUC (training: 0.954 vs 0.501; testing: 0.882 vs 0.656) and C-index (training: 0.855 vs 0.539; testing: 0.711 vs 0.508) of the risk score were higher than those of the FIGO staging. Combination of the FIGO staging with the risk score did not obviously improve the predictive power, it even slightly decreased due to the fact that the FIGO staging was not an independent prognostic factor (data not shown). These conclusions were also observed in the study of Mao et al. (8). These findings indicated our risk score may be a more promising, effective, independent predictor for DFS in CC patients.

All our identified ceRNA axes were not reported previously, indicating they were novel insights to explain the mechanisms of recurrence and unfavorable prognosis in CC. However, the individual studies on these lncRNAs, miRNAs, and mRNAs may indirectly suggest their functions for CC. For example, Roychowdhury et al. (16) identified CD27-AS1 as a downregulated lncRNA in cervical carcinoma by microarray expression profile analysis. MiR-1275 was found to be upregulated in cancer cell lines and tissues of lung adenocarcinoma (17,18). High expression of miR-1275 was associated with shorter OS and RFS in lung cancer patients (17). Overexpression of miR-1275 promoted cancer cell migration, invasion, and proliferation (18), which may be related to its negative regulation on the downstream target genes (such as leucine zipper putative tumor suppressor 3) and then to enhance the stemness of cancer cells (17). Knockdown of ANKRD6 was revealed to increase melanoma cell proliferation and migration (19). Consistent with these studies, we also identified CD27-AS1 and ANKRD6 were downregulated, while miR-1275 was upregulated in recurrent CC tissues compared with non-recurrent controls. The HRs were less than 1 for CD27-AS1 and ANKRD6 and larger than 1 for miR-1275 in the DFS curve analysis (Supplementary Table S3), further indicating their tumor suppressor and oncogenic roles, respectively. Thus, theoretically, their related-ceRNA theory (that is, downregulated CD27-AS1 may be insufficient to sponge hsa-miR-1275 and then lead to the inhibitory effects of hsa-miR-1275 on ANKRD6) may be believable.

By analysis of the TCGA data, Liu et al. (20) showed that LINC00683 was significantly downregulated in prostate cancer samples, and high expression of LINC00683 was correlated with a more favorable OS compared with lower levels. MiR-146b expression was increased in CC tissues compared with paracancerous tissues (21). Down-regulation of miR-146b strongly suppressed proliferation, migration, and anchorage-independent growth of CC cells (21). The study of Sullivan et al. (22) reported that ACAP2 expression was downregulated in esophageal cancer, leukemias, and lymphoma. Knockdown of ACAP2 inhibited cancer cell apoptosis. These results implied that LINC00683 and ACAP2 exert tumor suppressor roles, while miR-146b was pro-oncogenic. This conclusion was also proven in our study, showing LINC00683 and ACAP2 were lower expressed, while miR-146 was higher expressed in recurrent CC tissues than those in the non-recurrent samples. Therefore, LINC00683-miR-146-ACAP2 ceRNA axis may also be a verifiable mechanism for CC.

Using the data from TCGA database, Zheng et al. (23) observed EMX2OS was significantly downregulated in the CC tissues compared with normal controls. K-M survival curve analysis showed CC patients with high expression of EMX2OS had better OS compared with the low-expression group. Univariate Cox analysis further validated that highly expressed EMX2OS was a protective factor for poor OS (HR=0.91). Upregulated expression of miR-1238 was indicated to confer chemoresistance for glioblastoma cells, while loss of miR-1238 sensitized resistant glioblastoma cells to temozolomide (24). The expression of miR-4648 was shown to be higher in the relapse cases of small cell carcinoma of the esophagus compared with non-relapse cases (25). The oncogenic activity of miR-29b-1-5p may result from the activation of epithelial-mesenchymal transition in cancer cells (26). Highly expressed miR-29b-1 was also proven to be closely correlated with shorter OS time in non-small cell lung cancer patients (27). High expression of FBLN5 was significantly correlated with better OS and DFS in hepatocellular carcinoma patients (28). A recent case report showed that GHR was very lowly expressed in patients with adamantinomatous craniopharyngioma and deficiency in growth hormone led to the development of recurrence at 18 months after surgery (29). Inhibition of MYLIP facilitated the growth and metastasis of CC cells (30). In accordance with these reports, we also identified that EMX2OS and FBLN5/GHR/MYLIP were downregulated, while hsa-miR-1238/4648/29b-1 was upregulated in recurrent CC tissues compared to those in the non-recurrent samples. Also, they were all DFS-related biomarkers. Accordingly, their-related ceRNA axes (EMX2OS-hsa-miR-1238-FBLN5, EMX2OS-hsa-miR-4648-GHR, EMX2OS-hsa-miR-29b-1-MYLIP) may be important targets for preventing the occurrence of recurrence in CC patients.

Several studies demonstrated that LINC00265 was significantly highly expressed in cancer patients and associated with poorer prognosis (31). Functional analysis revealed that depletion of LINC00265 impaired cell proliferation and invasion, promoted cell cycle distribution and apoptosis in vitro, and attenuated tumorigenesis in vivo(32). miR128-1 was suggested to function as a tumor suppressor because it was downregulated in glioblastoma multiforme and glioma stem-like cells. Forced expression of miR128-1 inhibited tumor cell proliferation, migration, and invasion in vitro and blocked the growth of transplant tumors in vivo (33). miR-604 also seemed to be a tumor suppressor miRNA due to low expression in cisplatin-resistant gastric cancer cells (34). miR-1304 was an independent risk factor for recurrence of esophageal carcinoma, showing that patients with high expression of miR-1304 had a relatively lower survival rate (35). Patients with a loss of ATRX expression were found to have longer survival times (36). ARMC7 was found to be amplified across several cancer tissues and cell lines (37). OXCT1, which encodes a 3-oxoacid CoA transferase 1 enzyme necessary for ketone catabolism and then involving in the tricarboxylic acid cycle, was also expressed at higher levels in metastatic colorectal cancer cell lines (38). Similarly, RAB39A was highly expressed in sarcomas and in malignancies of lymphoid, adrenal, and testicular tissues. RAB39A may facilitate tumorigenesis by interaction with its downstream RXRB (39). In agreement with these findings, we also identified LINC00265 and ATRX/ARMC7/OXCT1 were upregulated in recurrent CC tissues compared with those of the non-recurrent samples and were risk factors for poor DFS (HR>1), while hsa-miR128-1/604/1304 were downregulated and protective factors for DFS (HR<1). Hence, we predicted that LINC00265 may be responsible for the recurrence and the malignant phenotype of CC patients by functioning as a ceRNA via sponging hsa-miR128-1/604/1304 to elevate the expressions of ATRX/ARMC7-OXCT1/RAB39A.

There were some limitations in this study. First, the prognostic signature was identified and validated based on bioinformatics analyses of retrospective public data. The prognostic value of our molecular model needs to be further validated using larger clinical CC samples prospectively collected from multiple hospitals. Second, the identified ceRNA interactive axes of prognostic genes should be demonstrated by in vitro or in vivo experiments. Third, the roles of other genes that were not significantly associated with RFS but were differentially expressed in our study and previous reports (such as hsa-miR-196a) (40) require further investigation. Fourth, in addition to ceRNA, lncRNAs also could function by directly binding with mRNAs (14). mRNAs did not independently play important key roles in cancer, but by interaction with each other (10). In subsequent studies, the lncRNA-mRNA co-expression network or protein-protein interaction network should be established to find other underlying prognostic signatures for CC.

Conclusion

In this study, we developed a novel risk score model based on two-lncRNA-three-miRNA-seven-mRNA signature for CC patients. This signature not only discriminated patients at a high risk of recurrence from patients at a low risk of recurrence, but also accurately predicted the outcomes of DFS for CC. Its predictive power for DFS was higher than that of the FIGO staging and single RNA-type model. This model was derived from genes in the ceRNA network and, thus, this study sheds new light on the molecular mechanisms of tumorigenesis and provided promising therapeutic targets for CC patients.

Acknowledgments

This work was supported by Shenzhen Health and Family Planning System Research Project (No. SZFZ2018003).

References

  • 1 Arbyn M, Weiderpass E, Bruni L, de Sanjosé S, Saraiya M, Ferlay J, et al. Estimates of incidence and mortality of cervical cancer in 2018: a worldwide analysis. Lancet Glob Health 2020; 8: e191-e203, doi: 10.1016/S2214-109X(19)30482-6.
    » https://doi.org/10.1016/S2214-109X(19)30482-6
  • 2 Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin 2019; 69: 7-34, doi: 10.3322/caac.21551.
    » https://doi.org/10.3322/caac.21551
  • 3 Jonska-Gmyrek J, Gmyrek L, Zolciak-Siwinska A, Kowalska M, Kotowicz B. Adenocarcinoma histology is a poor prognostic factor in locally advanced cervical cancer. Curr Med Res Opin 2019; 35: 595-601, doi: 10.1080/03007995.2018.1502166.
    » https://doi.org/10.1080/03007995.2018.1502166
  • 4 Li W, Li L, Wu M, Lang J, Bi Y. The prognosis of stage IA mixed endometrial carcinoma. Am J Clin Pathol 2019; 152: 616-624, doi: 10.1093/ajcp/aqz083.
    » https://doi.org/10.1093/ajcp/aqz083
  • 5 Zhao M, Huang W, Zou S, Shen Q, Zhu X. A five-genes-based prognostic signature for cervical cancer overall survival prediction. Int J Genomics 2020; 2020: 8347639, doi: 10.1155/2020/8347639.
    » https://doi.org/10.1155/2020/8347639
  • 6 Ju M, Qi A, Bi J, Zhao L, Jiang L, Zhang Q, et al. A five-mRNA signature associated with post-translational modifications can better predict recurrence and survival in cervical cancer. J Cell Mol Med 2020; 24: 6283-6297, doi: 10.1111/jcmm.15270.
    » https://doi.org/10.1111/jcmm.15270
  • 7 Liang B, Li Y, Wang T. A three miRNAs signature predicts survival in cervical cancer using bioinformatics analysis. Sci Rep 2017; 7: 5624, doi: 10.1038/s41598-017-06032-2.
    » https://doi.org/10.1038/s41598-017-06032-2
  • 8 Mao Y, Dong L, Zheng Y, Dong J, Li X. Prediction of recurrence in cervical cancer using a nine-lncRNA signature. Front Genet 2019;10: 284, doi: 10.3389/fgene.2019.00284.
    » https://doi.org/10.3389/fgene.2019.00284
  • 9 Cheng Y, Yang S, Shen Y, Ding B, Wu W, Zhang Y, Liang G. The role of high-risk human papillomavirus-related long non-coding RNAs in the prognosis of cervical squamous cell carcinoma. DNA Cell Biol 2020; 39: 645-653, doi: 10.1089/dna.2019.5167.
    » https://doi.org/10.1089/dna.2019.5167
  • 10 Meng H, Liu J, Qiu J, Nie S, Jiang Y, Wan Y, et al. Identification of key genes in association with progression and prognosis in cervical squamous cell carcinoma. DNA Cell Biol 2020; 39: 848-863, doi: 10.1089/dna.2019.5202.
    » https://doi.org/10.1089/dna.2019.5202
  • 11 Zhang Y, Ye Q, He J, Chen P, Wan J, Li J, et al. Recurrence-associated multi-RNA signature to predict disease-free survival for ovarian cancer patients. Biomed Res Int 2020; 2020: 1618527, doi: 10.1155/2020/1618527.
    » https://doi.org/10.1155/2020/1618527
  • 12 Wang P, Zeng Z, Shen X, Tian X, Ye Q. Identification of a multi-RNA-type-based signature for recurrence-free survival prediction in patients with uterine corpus endometrial carcinoma. DNA Cell Biol 2020; 39: 615-630, doi: 10.1089/dna.2019.5148.
    » https://doi.org/10.1089/dna.2019.5148
  • 13 Lee YY, Kim TJ, Kim JY, Choi CH, Do IG, Song SY, et al. Genetic profiling to predict recurrence of early cervical cancer. Gynecol Oncol 2013; 131: 650-654, doi: 10.1016/j.ygyno.2013.10.003.
    » https://doi.org/10.1016/j.ygyno.2013.10.003
  • 14 Zhang XZ, Liu H, Chen SR. Mechanisms of long non-coding RNAs in cancers and their dynamic regulations. Cancers (Basel) 2020; 12: 1245, doi: 10.3390/cancers12051245.
    » https://doi.org/10.3390/cancers12051245
  • 15 How C, Pintilie M, Bruce JP, Hui AB, Clarke BA, Wong P, et al. Developing a prognostic micro-RNA signature for human cervical carcinoma. Plos One 2015; 10: e0123946, doi: 10.1371/journal.pone.0123946.
    » https://doi.org/10.1371/journal.pone.0123946
  • 16 Roychowdhury A, Samadder S, Das P, Mazumder DI, Panda CK. Deregulation of H19 is associated with cervical carcinoma. Genomics 2020; 112: 961-970, doi: 10.1016/j.ygeno.2019.06.012.
    » https://doi.org/10.1016/j.ygeno.2019.06.012
  • 17 Jiang N, Zou C, Zhu Y, Luo Y, Ke Z. HIF-1α-regulated miR-1275 maintains stem cell-like phenotypes and promotes the progression of LUAD by simultaneously activating Wnt/β-catenin and Notch signaling. Theranostics 2020; 10: 2553-2570, doi: 10.7150/thno.41120.
    » https://doi.org/10.7150/thno.41120
  • 18 He J, Yu L, Wang CM, Zhou XF. MiR-1275 promotes non-small cell lung cancer cell proliferation and metastasis by regulating LZTS3 expression. Eur Rev Med Pharmacol Sci 2018; 22: 2680-2687.
  • 19 Prabhakar K, Rodríguez CI, Jayanthy AS, Mikheil DM, Bhasker AI, Perera RJ, et al. Role of miR-214 in regulation of β-catenin and the malignant phenotype of melanoma. Mol Carcinog 2019; 58: 1974-1984, doi: 10.1002/mc.23089.
    » https://doi.org/10.1002/mc.23089
  • 20 Liu Y, Yang B, Su Y, Xiang Q, Li Q. Downregulation of long noncoding RNA LINC00683 associated with unfavorable prognosis in prostate cancer based on TCGA. J Cell Biochem 2019; 120: 14165-14174, doi: 10.1002/jcb.28691.
    » https://doi.org/10.1002/jcb.28691
  • 21 Yao S, Xu J, Zhao K, Song P, Yan Q, Fan W, et al. Down-regulation of HPGD by miR-146b-3p promotes cervical cancer cell proliferation, migration and anchorage-independent growth through activation of STAT3 and AKT pathways. Cell Death Dis 2018; 9: 1055, doi: 10.1038/s41419-018-1059-y.
    » https://doi.org/10.1038/s41419-018-1059-y
  • 22 Sullivan KD, Nakagawa A, Xue D, Espinosa JM. Human ACAP2 is a homolog of C. elegans CNT-1 that promotes apoptosis in cancer cells. Cell Cycle 2015; 14: 1771-1778, doi: 10.1080/15384101.2015.1026518.
    » https://doi.org/10.1080/15384101.2015.1026518
  • 23 Zheng J, Cao B, Zhang X, Niu Z, Tong J. Immune-related four-lncRNA signature for patients with cervical cancer. Biomed Res Int 2020; 2020: 3641231, doi: 10.1155/2020/3641231.
    » https://doi.org/10.1155/2020/3641231
  • 24 Yin J, Zeng A, Zhang Z, Shi Z, Yan W, You Y. Exosomal transfer of miR-1238 contributes to temozolomide-resistance in glioblastoma. Ebiomedicine 2019; 42: 238-251, doi: 10.1016/j.ebiom.2019.03.016.
    » https://doi.org/10.1016/j.ebiom.2019.03.016
  • 25 Okumura T, Shimada Y, Omura T, Hirano K, Tsukada K. MicroRNA profiles to predict postoperative prognosis in patients with small cell carcinoma of the esophagus. Anticancer Res 2015; 35: 719-727.
  • 26 Kurihara-Shimomura M, Sasahira T, Shimomura H, Nakashima C, Kirita T. The oncogenic activity of miR-29b-1-5p induces the epithelial-mesenchymal transition in oral squamous cell carcinoma. J Clin Med 2019; 8: 273, doi: 10.3390/jcm8020273.
    » https://doi.org/10.3390/jcm8020273
  • 27 Chen B, Gao T, Yuan W, Zhao W, Wang TH, Wu J. Prognostic value of survival of microRNAs signatures in non-small cell lung cancer. J Cancer 2019; 10: 5793-5804, doi: 10.7150/jca.30336.
    » https://doi.org/10.7150/jca.30336
  • 28 Tu K, Dou C, Zheng X, Li C, Yang W, Yao Y, et al. Fibulin-5 inhibits hepatocellular carcinoma cell migration and invasion by down-regulating matrix metalloproteinase-7 expression. BMC Cancer 2014;14: 938, doi: 10.1186/1471-2407-14-938.
    » https://doi.org/10.1186/1471-2407-14-938
  • 29 Ogawa Y, Kudo M, Watanabe M, Tominaga T. Heterogeneity of growth hormone receptor expression in craniopharyngioma-implications for surgical strategy. World Neurosurg 2020; 138: 89-92, doi: 10.1016/j.wneu.2020.02.022.
    » https://doi.org/10.1016/j.wneu.2020.02.022
  • 30 Ni M, Yan Q, Xue H, Du Y, Zhao S, Zhao Z. Identification of MYLIP gene and miRNA-802 involved in the growth and metastasis of cervical cancer cells. Cancer Biomark 2021; 30: 287-298, doi: 10.3233/CBM-201523.
    » https://doi.org/10.3233/CBM-201523
  • 31 Zhu Y, Gu L, Lin X, Cui K, Liu C, Lu B, et al. LINC00265 promotes colorectal tumorigenesis via ZMIZ2 and USP7-mediated stabilization of β-catenin. Cell Death Differ 2020; 27: 1316-1327, doi: 10.1038/s41418-019-0417-3.
    » https://doi.org/10.1038/s41418-019-0417-3
  • 32 Ge H, Yan Y, Yue C, Liang C, Wu J. Long noncoding RNA LINC00265 targets EGFR and promotes deterioration of colorectal cancer: a comprehensive study based on data mining and in vitro validation. Onco Targets Ther 2019;12: 10681-10692, doi: 10.2147/OTT.S227482.
    » https://doi.org/10.2147/OTT.S227482
  • 33 Shan ZN, Tian R, Zhang M, Gui ZH, Wu J, Ding M, et al. miR128-1 inhibits the growth of glioblastoma multiforme and glioma stem-like cells via targeting BMI1 and E2F3. Oncotarget 2016; 7: 78813-78826, doi: 10.18632/oncotarget.12385.
    » https://doi.org/10.18632/oncotarget.12385
  • 34 Zhou D, Li X, Zhao H, Sun B, Liu A, Han X, et al. Combining multi-dimensional data to identify a key signature (gene and miRNA) of cisplatin-resistant gastric cancer. J Cell Biochem 2018; 119: 6997-7008, doi: 10.1002/jcb.26908.
    » https://doi.org/10.1002/jcb.26908
  • 35 Luo YG, Duan LW, Ji X, Jia WY, Liu Y, Sun ML, et al. Expression of miR-1304 in patients with esophageal carcinoma and risk factors for recurrence. World J Gastroenterol 2020; 26: 670-685, doi: 10.3748/wjg.v26.i6.670.
    » https://doi.org/10.3748/wjg.v26.i6.670
  • 36 Wang X, Li F, Wang D, Zeng Q. Diffusion kurtosis imaging combined with molecular markers as a comprehensive approach to predict overall survival in patients with gliomas. Eur J Radiol 2020;128: 108985, doi: 10.1016/j.ejrad.2020.108985.
    » https://doi.org/10.1016/j.ejrad.2020.108985
  • 37 Hart T, Chandrashekhar M, Aregger M, Steinhart Z, Brown KR, Angers S, et al. Systematic discovery and classification of human cell line essential genes. Bio Rxiv 2015, doi: 10.1101/015412.
    » https://doi.org/10.1101/015412
  • 38 Lee CL, Huang CJ, Yang SH, Chang CC, Huang CC, Chien CC, et al. Discovery of genes from feces correlated with colorectal cancer progression. Oncol Lett 2016; 12: 3378-3384, doi: 10.3892/ol.2016.5069.
    » https://doi.org/10.3892/ol.2016.5069
  • 39 Chano T, Kita H, Avnet S, Lemma S, Baldini N. Prominent role of RAB39A-RXRB axis in cancer development and stemness. Oncotarget 2018; 9: 9852-9866, doi: 10.18632/oncotarget.23955.
    » https://doi.org/10.18632/oncotarget.23955
  • 40 Tornesello ML, Faraonio R, Buonaguro L, Annunziata C, Starita N, Cerasuolo A, et al. The role of microRNAs, long non-coding RNAs, and circular RNAs in cervical cancer. Front Oncol 2020; 10: 150, doi: 10.3389/fonc.2020.00150.
    » https://doi.org/10.3389/fonc.2020.00150

Supplementary Material

Click to view [zip].

Publication Dates

  • Publication in this collection
    20 Sept 2021
  • Date of issue
    2021

History

  • Received
    24 May 2021
  • Accepted
    17 June 2021
location_on
Associação Brasileira de Divulgação Científica Av. Bandeirantes, 3900, Cep: 14049-900 , Tel: +55 16 3630-2778, +55 16 3315-3173 - Ribeirão Preto - SP - Brazil
E-mail: bjournal@terra.com.br
rss_feed Acompanhe os números deste periódico no seu leitor de RSS
Acessibilidade / Reportar erro