Abstract Purpose Clear cell renal cell carcinoma (ccRCC) is one of the most common cancers with high mortality worldwide. However, biomarkers for predicting prognosis in ccRCC are limited. In this study, we attempted to identify potential prognostic biomarkers of ccRCC. Methods Clinical information and the preprocessed ccRCC mature miRNA expression profiles in The Cancer Genome Atlas database were downloaded from UCSC Xena. The miRNAs differentially expressed between ccRCCs and matched normal tissues were analyzed using the “limma” package. A miRNA-based signature was constructed using the multivariate Cox regression model with prognosis index (PI) formula. Patients with ccRCC were divided into low-risk and high-risk subgroups according to median PI. The survival times were compared between the two groups using Kaplan–Meier analysis with log-rank test. The training set was used to construct a miRNA-based signature for predicting prognosis. The test set was used to verify the signature. Target gene prediction and functional enrichment analysis of the four miRNAs were performed using miRNet. Results We identified four miRNAs, miRNA-21-5p, miRNA-9-5p, miR-149-5p, and miRNA-30b-5p, as independent prognostic indicators. Next, we used these four miRNAs to construct a four-miRNA PI for each patient. Results revealed that patients in the high-risk group (n=119) had significantly shorter survival time than those in the low-risk group (n=118) (high-risk/low-risk group log-rank P=0.000). This four-miRNA signature is an independent prognostic factor compared with routine clinicopathological features in the test set. These miRNAs targeted 1,634 genes, and a miRNA-target gene network was constructed using miRNet. The target genes of these four miRNAs were involved in various pathways related to cancer. Conclusion Our observations suggest that the four-miRNA signature correlated with the survival of patients with ccRCC and can be used as a prognostic biomarker of ccRCC. Keywords: ccRCC, miRNA signature, overall survival, prognostic biomarkers Introduction Renal cell carcinoma (RCC) is a common malignant tumor of the urinary system, accounting for 2%–3% of adult malignancies,[41]^1 and more than 100,000 people die of kidney cancer every year worldwide.[42]^2 The most common subtype of RCC is clear cell renal cell carcinoma (ccRCC), which is associated with high morbidity and poor prognosis.[43]^3 miRNAs were first identified by Lee et al[44]^4 in Caenorhabditis elegans in 1993 as 19- to 24-nucleotide-long ncRNAs. It is estimated that miRNAs regulate the expression of >60% protein-coding genes. miRNAs are involved in various biological processes, such as cell growth, proliferation, differentiation, and apoptosis.[45]^5 Owing to the tissue-specific expression of miRNAs, their expression profile has been associated with various diseases. Currently, abnormally expressed miRNAs have been detected in many human tumors, such as bladder cancer,[46]^6 lung cancer,[47]^7 prostate cancer,[48]^8 pancreatic cancer,[49]^9 gastric cancer,[50]^10 liver cancer,[51]^11 and other malignancies. Several recent studies have suggested that miRNA expression profiling can be used to predict the clinical outcome of patients with malignant tumors.[52]^12^,[53]^13 Specific miRNAs have been used as potential diagnostic tools to distinguish the four subtypes of RCC (clear cell RCC, papillary RCC, chromophobe RCC, and benign oncocytomas).[54]^14 However, studies on the association of miRNAs with ccRCC prognosis are limited. Currently, The Cancer Genome Atlas (TCGA) database ([55]https://cancergenome.nih.gov/) can be used to analyze complicated clinical characteristics and cancer genomics. In this study, we screened the differentially expressed mature miRNAs between ccRCC tissues and matched normal tissues, and determined the association between these miRNAs and overall survival (OS). We constructed a four-miRNA signature that may be used as a potential prognostic biomarker of ccRCC. Materials and methods Data processing The preprocessed ccRCC mature miRNA expression profiles in TCGA database, displayed as log[2] converted reads per million (log[2] (RPM + 1)), and clinical information, were downloaded from the UCSC Xena ([56]https://xenabrowser.net/datapages/, version 09-08-2017). It contains miRNA expression data from two different platforms, including 311 samples (241 ccRCC tissues and 70 matched normal kidney tissues) based on the IlluminaHiSeq_miRNASeq platform (Illumina Inc., San Diego, CA, USA) and 259 ccRCC tissues based on the IlluminaGA_miRNASeq platform. The samples based on the IlluminaHiSeq_miRNASeq platform were used as the training set to identify differentially expressed miRNAs and to construct a miRNA-based signature for predicting prognosis. The samples based on IlluminaGA_miRNASeq platform were used as the test set to verify the signature. The mature miRNA sequencing data were processed using R language. Screening of differentially expressed miRNAs In the training set, miRNAs that were not expressed in >10% samples were removed. The differentially expressed miRNAs between ccRCCs and matched normal tissues were analyzed using the “limma” package[57]^15 in R. The fold changes (FCs) in the expression of individual miRNAs were calculated, and differentially expressed miRNAs with |log2FC|>1.0 and P<0.05 were considered to be significant. We applied bidirectional hierarchical clustering to the differentially expressed miR-NAs based on Euclidean distance and displayed the results as a heat map. Construction and validation of the miRNA-based prognostic signature for ccRCC In the training set, the patients were separated into high- and low-level groups based on the median value of the differential expression of miRNAs, followed by univariate and multivariate Cox proportional hazards analyses. Finally, a miRNA signature-based prognosis index (PI) score was constructed on the basis of a linear combination of the expression level multiplied by a regression coefficient derived from the multivariate Cox regression model (β) using the following formula. [MATH: PI=M1*β< mn>1+M2*β< mn>2+M3*β< mn>3+ :MATH] The “β” value is the estimated regression coefficient of miRNAs and is derived from the multivariate Cox regression analysis, and “M” indicates the expression profiles of the miRNAs. Patients with ccRCC were divided into low- and high-risk groups based on median PI. The survival times were compared between the two groups using Kaplan– Meier analysis with log-rank test at P-value<0.05. The test set was used to confirm the robustness and transferability of the miRNA-based prognostic signature. We conducted the univariate and multivariate Cox proportional hazards analyses in the training and test sets to compare the relative prognostic value of this four-miRNA signature with that of routine clinicopathological features. Target gene prediction and functional enrichment analysis Target gene prediction and functional enrichment analysis of the four miRNAs were performed using miRNet ([58]http://www.mirnet.ca/).[59]^16 miRNet is an easy-to-use web-based tool that offers statistical, visual, and network-based approaches to assist researchers understand miRNA function and regulatory mechanisms and construct a miRNA-target gene network. The Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was subsequently performed for the target genes. P-value <0.05 was set as the cutoff criteria. Statistical analysis The chi-squared test was used for categorical data, and the unpaired Student’s t-test was used to screen differentially expressed miRNAs. Univariate/multivariate Cox proportional hazards analyses and Kaplan–Meier survival analysis were used to compare the two groups of patients. The chi-squared test and survival analysis were performed using IBM SPSS statistics software program version 22.0 (IBM, Armonk, NY, USA). All tests were two-sided, and P<0.05 was considered statistically significant. Results Differential expression of miRNAs between ccRCC and matched normal kidney tissues The detailed clinical characteristics of patients with ccRCC, including gender, age at diagnosis, histological grade, and TNM stage, are shown in [60]Table 1. The training set contained more patients with metastasis than the test set (16.88% vs 14.67%, chi-squared test, P=0.000). This also partially assisted us to test the prognostic value of this miRNA-based signature in different patients. According to the cutoff criteria (P<0.05 and |log2FC|>1), 138 miRNAs were differentially expressed between ccRCC and matched normal kidney tissues in the training set. These included 54 upregulated miRNAs and 84 downregulated miRNAs in ccRCC tissues. miRNA-21-5p was upregulated, whereas miRNA-9-5p, miR-149-5p, and miRNA-30b-5p were downregulated in ccRCC tissues. The results of the expression analysis are presented as a heat map ([61]Figure 1), and the results of hierarchical clustering showed that the expression patterns of these differentially expressed miRNAs can correctly distinguish ccRCC from normal kidney tissues. Table 1. Summary of patient cohort information Factors Training set __________________________________________________________________ Test set __________________________________________________________________ P N=237 (%) N=259 (%) __________________________________________________________________ Gender Male 158 (66.67) 162 (62.55) 0.338 Female 79 (33.33) 97 (37.45) Age <65 153 (64.56) 162 (62.58) 0.711 ≥65 84 (35.44) 97 (62.59) Histologic G1–2 99 (41.77) 119 (45.95) 0.227 G3–4 136 (57.38) 134 (51.74) Gx 2 (0.84) 6 (2.32) T T1–2 151 (63.71) 156 (60.23) 0.481 T3–4 86 (36.29) 103 (39.77) M M0 166 (70.04) 221 (85.33) 0.000 M1 40 (16.88) 38 (14.67) Mx 31 (13.08) 0 (0.00) N N0 97 (40.93) 123 (47.49) 0.323 N1 8 (3.38) 9 (3.47) Nx 132 (55.70) 127 (49.03) Stage I–II 143 (60.34) 146 (56.37) 0.507 III–IV 92 (38.82) 112 (43.24) Not reported 2 (0.84) 1 (0.39) [62]Open in a new tab Notes: Bold figure indicates statistically significant, P<0.05. Figure 1. [63]Figure 1 [64]Open in a new tab Hierarchical clustering dendrograms of expression patterns of differentially expressed miRNAs that can distinguish between normal kidney tissue and ccRCC tissue. Abbreviation: ccRCC, clear cell renal cell carcinoma. Construction of miRNA-based signature with differentially expressed miRNAs Four samples were removed because of lack of survival record in the training set. For each of the 138 differentially expressed miRNAs, we used the median expression level as a cutoff to stratify the remaining 237 patients into high-level and a low-level groups. The univariate Cox proportional hazards regression analysis revealed that eight miRNAs possessed prognostic value ([65]Table 2). Next, we performed a multivariate Cox proportional hazards regression analysis and identified four miRNAs, namely, miRNA-21-5p ([66]Figure 2A), miRNA-9-5p ([67]Figure 2B), miR-149-5p ([68]Figure 2C), and miRNA-30b-5p ([69]Figure 2D), as independent prognostic indicators. Thus, we used these four miRNAs to construct a four-miRNA PI as follows: [MATH: PI=miRNA215p*0.788+miR 95p*0.536+miR 1495p*0.566+miR 30b5p*0.683. :MATH] Table 2. Univariate and multivariate analyses in ccRCC patients miRNA Univariate analysis __________________________________________________________________ Multivariate analysis __________________________________________________________________ P 95% CI b P 95% CI __________________________________________________________________ miR-21-5p 0.000 1.884–5.149 0.788 0.031 1.073–4.502 miR-9-5p 0.004 1.270–3.465 0.536 0.044 1.014–2.883 miR-149-5p 0.003 1.303–3.473 0.566 0.027 1.066–2.908 miR-204-5p 0.003 0.293–0.775 –0.118 0.692 0.195–1.594 miR-146b-5p 0.001 1.436–3.800 –0.014 0.976 0.388–2.505 miR-223-3p 0.007 1.193–3.086 0.130 0.615 0.685–1.894 miR-30b-5p 0.009 1.173–3.069 0.683 0.006 1.211–3.236 miR-146b-3p 0.004 1.247–3.254 0.143 0.727 0.516–2.580 [70]Open in a new tab Notes: Bold figure indicates statistically significant, P<0.05. Abbreviation: ccRCC, clear cell renal cell carcinoma. Figure 2. [71]Figure 2 [72]Open in a new tab Four miRNAs were associated with overall survival in ccRCC patients using Kaplan–Meier curves and log-rank tests. Note: (A) miRNA-21-5p; (B) miRNA-9-5p; (C) miRNA-149-5p; (D) miRNA-30b-5p. Abbreviation: ccRCC, clear cell renal cell carcinoma. A PI was calculated for each patient in the training set. Then, 237 patients were separated into low- and high-risk groups according to median PI. Survival analysis was performed using the Kaplan–Meier method with log-rank test. Results revealed that patients in the high-risk group (n=119) had significantly shorter survival time than those in the low-risk group (n=118) (high-risk/low-risk group log-rank P=0.000; [73]Figure 3A). This four-miRNA signature is an independent prognostic factor compared with routine clinicopathological features ([74]Table 3). Figure 3. [75]Figure 3 [76]Open in a new tab The Kaplan–Meier curves obtained using the four-miRNA signature to separate patients into high- and low-risk groups. Notes: (A) Kaplan-Meier curve for training set; (B) Kaplan-Meier curve for testing set. Table 3. Univariate and multivariate analyses of routine clinicopathological features and four-miRNA prognostic signature PI in the training set Factors Univariate analysis __________________________________________________________________ Multivariate analysis __________________________________________________________________ P 95% CI P 95% CI __________________________________________________________________ Gender (female/male) 0.862 0.579–1.580 Age (<65 years/≥65 years) 0.149 0.885–2.245 Histologic (G3–4/G1–2) 0.000 1.819–6.294 0.641 0.490–3.189 M (M0/M1) 0.000 2.933–7.587 0.702 0.467–3.103 N (N1/N0) 0.000 2.518–18.038 0.048 1.008–8.724 T (T1–2/T3–4) 0.000 2.018–5.283 0.408 0.146–2.187 Stage (I–II/III–IV) 0.000 2.677–7.515 0.116 0.729–17.584 PI (high/low risk) 0.000 1.692–4.759 0.008 1.363–7.740 [77]Open in a new tab Notes: Bold figure indicates statistically significant, P<0.05. Abbreviation: PI, prognosis index. Verification of the four-miRNA signature in the test set Similar to that observed in the training set, patients in the test set were divided into low- and high-risk groups according to median PI, and Kaplan–Meier analysis was used to compare the patient’s OS. The results of survival analysis revealed that patients in the high-risk group (n=130) had significantly shorter survival time than those in the low-risk group (n=129) (high-risk/low-risk group log-rank P=0.000; [78]Figure 3B). The four-miRNA signature is an independent prognostic factor compared with routine clinicopathological features in the test set ([79]Table 4). Overall, these results are consistent with that in the training set. Table 4. Univariate and multivariate analyses of routine clinicopathological features and four-miRNA prognostic signature PI in the test set Factors Univariate analysis __________________________________________________________________ Multivariate analysis __________________________________________________________________ P 95% CI P 95% CI __________________________________________________________________ Gender (female/male) 0.490 0.767–1.740 Age (<65 years/≥65 years) 0.001 1.360–3.056 0.011 1.193–2.808 Histologic (G3–4/G1–2) 0.000 1.713–4.202 0.407 0.665–2.735 M (M1/M0) 0.000 2.057–4.719 0.108 0.820–7.363 N (N1/N0) 0.026 1.118–5.520 0.412 0.588–3.659 T (T3–4/T1–2) 0.000 2.688–6.388 0.000 2.189–9.396 Stage (I–II/III–IV) 0.000 2.314–5.529 0.349 0.155–1.931 PI (high/low risk) 0.000 2.481–6.265 0.000 1.847–8.377 [80]Open in a new tab Notes: Bold figure indicates statistically significant, P<0.05. Abbreviation: PI, prognosis index. Target gene prediction and functional enrichment analysis To investigate the potential biological functions of these four miRNAs, we predicted their target genes using miRNet. We observed that 1,634 genes were targeted by these four miRNAs and a miRNA-target gene network was constructed using miRNet ([81]Figure 4). KEGG pathway enrichment analyses of the target genes revealed that they were involved in various pathways related to cancer, such as the MAPK, p53, and Wnt signaling pathways, cell cycle, and RNA transport ([82]Figure 5). Figure 4. [83]Figure 4 [84]Open in a new tab The miRNA-target genes network constructed using the miRNet and the corresponding target genes involved in KEGG Pathways. Note: (A) The total network of four miRNAs and their targets; (B) pathways in cancer, (C) p53 signaling pathway; (D) cell cycle pathway; (E) renal cell carcinoma; (F) Wnt signaling pathway. Abbreviation: KEGG, Kyoto Encyclopedia of Genes and Genomes. Figure 5. [85]Figure 5 [86]Open in a new tab Kyoto Encyclopedia of Genes and Genomes pathway analysis of the predicted targets of the four miRNAs. Abbreviation: HTLV-1, human T lymphotropic virus type 1. Discussion ccRCC is one of the most common renal malignancies associated with high mortality and morbidity.[87]^17 However, clinical tools for predicting patient outcome utilize traditional clinical parameters. Therefore, accurate identification of predictive factors from data obtained from analysis of ccRCC specimens is clinically challenging. Identification and validation of novel biomarkers form an important part of practical studies on ccRCC. During tumorigenesis, miRNAs act as oncogenes or tumor suppressors; hence, the biological behavior of tumors can be inhibited by regulating miRNA levels for therapeutic purposes. Identification of RNA profiles and selective targets is the basis for individualized treatment of different tumors. Previous studies have shown that some specific miRNAs were aberrantly expressed in RCC and participate in its development.[88]^18^–[89]^21 However, detailed analyses of the associations between miRNA expression and prognosis of patients with ccRCC remain limited. In this study, we identified 138 miRNAs that were differentially expressed between ccRCC and normal kidney tissues. The univariate Cox proportional hazards regression analysis revealed that eight miRNAs possessed prognostic value. We confirmed that the four-miRNA (miRNA-21-5p, miRNA-9-5p, miRNA-149-5p, and miRNA-30b-5p) signature can be regarded as an independent predictor of prognostic OS after considering the various variables, including gender, age, histology, and stages. Previous studies have also identified these four miRNAs, and several studies have investigated the relationship between miRNA expression patterns and cancer. Kowalczyk et al[90]^22 reported that special AT-rich sequence binding protein 1 (SATB1) may be a potential prognostic marker for ccRCC, as low SATB1 expression in ccRCC may result from overexpression of miR-21-5p. SATB1 downregulation and miR-21-5p upregulation were associated with shorter patient survival. At present, the role of miRNA-9-5p in tumors has not been clarified. Certain studies show that downregulation of miR-9-5p expression can reverse the effect of miRNA-9-5p on proliferation, colony formation, cell cycle arrest, and apoptosis in osteosarcoma cells.[91]^23 Okato et al[92]^24 demonstrated that dual strands of pre-miR-149 (miRNA-149-5p and miRNA-149-3p) acted as antitumor miRNAs by targeting FOXM1, which was shown to be associated with survival of patients with ccRCC. Liu et al[93]^25 suggested that miR-30b-5p acts as a novel tumor suppressor to regulate RCC cell proliferation, metastasis, and epithelial to mesenchymal transition by downregulating GNA13 expression. In other words, miR-30b-5p may be considered a potential biomarker for RCC diagnosis. However, studies demonstrating that the four differentially expressed miRNAs were predictors of ccRCC were lacking. In this study, we constructed a four-miRNA signature, and the PI of this signature was calculated for each patient, which successfully separated patients into low- and high-risk groups. Specifically, patients considered high-risk by our four-miRNA signature had significantly poor prognosis than those in the low-risk group (P<0.001). We confirmed that the four-miRNA signature is an independent predictor of OS in patients with ccRCC. Similar to that observed in the training set, patients in the test set were divided into low- and high-risk groups based on the risk score of individual patients, and Kaplan–Meier analysis was used to compare patient survival differences. Statistically significant differences (P<0.0001) were observed between high- and low-risk groups. This confirmed that our four-miRNA signature is an independent and universal predictor of ccRCC. It is well known that miRNAs modulate gene expression. Hence, we screened the target genes of these four miRNAs and used bioinformatics to predict the pathways and biological functions associated with their targets. The target genes were significantly enriched in multiple cancer-associated pathways, such as MAPK, p53, and Wnt signaling pathways. Abnormal regulation of these signaling pathways is involved in the development of various human cancers, such as breast cancer, hepatocellular carcinoma, hematological cancer, and lung cancer.[94]^26^–[95]^28 Thus, abnormal regulation of signaling pathways may play a crucial role in the pathogenesis and progression of ccRCC. Further molecular investigations may provide new therapeutic targets for ccRCC. Limitations However, our study has few limitations. First, the ccRCC tissues were more than normal kidney tissues. Second, the miRNA expression profiles in the test set were not based on the same platform. Therefore, this four-miRNA signature has to be verified in a larger independent cohort of patients. Conclusion A comprehensive analysis of differentially expressed miRNA profiles and corresponding clinical information suggested that a four-miRNA signature was an independent and universal prognostic factor in patients with ccRCC. These miRNAs modulated genes associated with multiple cancer-associated pathways. However, further studies are required to verify our observations and establish the molecular mechanism underlying the interplay of miRNAs, their target genes, and ccRCC progression. Acknowledgments