Abstract Purpose To construct a competing endogenous RNA (ceRNA) topology network of RNA-seq data and micro RNA-seq (miRNA-seq) data to identify key prognostic long non-coding RNA (lncRNAs) in luminal breast cancer, and validate the results by human luminal breast cancer samples. Materials and Methods The RNA-seq data and miRNA-seq data of luminal A breast cancer in the The Cancer Genome Atlas (TCGA) database were downloaded and compared with those in the miRcode database to obtain lncRNA–miRNA relationship pairs. Final target genes were predicted by all three databases (miRDB, miRTarBase, and TargetScan), thereby obtaining the miRNA-messenger RNA (miRNA-mRNA) relationship pairs and a ceRNA topology network was constructed, then mRNA enrichment analysis, ceRNA topological and stability analysis, univariate and multivariate Cox regression analysis were performed. Overall survival (OS) was evaluated and the key prognostic RNAs were identified. The expression difference between normal and tumor, as well as the correlation of high expression in tumor with pathological parameters (Ki-67, Grade, tumor diameter) were validated by human breast cancer specimens. Results A ceRNA topology network was constructed and six lncRNAs were finally identified (The higher expression of PART1, IGF2.AS, WT1.AS, OIP5.AS1, and SLC25A5.AS1 was associated with poor prognosis while [32]AL035706.1 was adverse) and the poor prognostic ones were higher expressed in tumor tissue and correlated with a higher Ki-67 (>10%), tumor grades (II, III) and tumor diameters (>1.5 cm). Using six lncRNAs, we constructed a prognostic model, which performed well for the classification of prognosis in the module. Conclusion We identified and verified six biomarkers (OS-predicting) in luminal breast cancer, which significantly enriched the prediction and potential targets of this subtype. Keywords: competing endogenous RNA, breast cancer, long non-coding RNA, prognosis, clinical sample Introduction Breast cancer (BC) is a major concern, both morbidity and mortality, for women worldwide.[33]^1 Four subtypes of breast cancer (BC) defined by the estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER-2) exhibit differences in biological behavior. The molecular mechanism behind them remains unclear. The current target-therapies are mostly concentrated on HER-2 overexpressed subtype, such as Trastuzumab and Pertuzumab,[34]^2^,[35]^3 while immunotherapies are aimed at triple-negative breast cancer (TNBC) due to its higher immunogenicity.[36]^4^,[37]^5 Quite a number of prognostic markets and targets were identified.[38]^6^–[39]^8 Unlike the two subtypes, patients with ER- or PR-positive BC, termed luminal subtype, are considered at a lower risk and endocrine therapy has achieved a considerable success. However, the outcome of luminal BC rather varies after treatment[40]^9 and drug resistance is urgent to be responded. mTOR inhibitors and CDK4/6 inhibitors showed potential to solve the problem. Their clinical effective suggests that molecular target therapy could also be a wise strategy for luminal BC. Therefore, the identification of prognostic markers, which can be potential targets, is of importance. Recently, genes or proteins were proved to be an oncogene role or promoting metastasis in the luminal subtype,[41]^10^–[42]^13 while little were verified to be related to overall survival (OS). Moreover, most of them were genomic, along with some microRNAs (miRNA), while Long non-coding RNAs (lncRNA) were rarely mentioned. LncRNA, a type of RNA with limited protein-coding ability, is previously viewed as transcriptional “noise” without biological functions.[43]^14 Recently, lncRNAs are increasingly reported to be closely involved in progression and metastasis of various cancers.[44]^15^–[45]^18 In luminal BC, the oncogenic roles of lncRNAs were also unveiled[46]^19^,[47]^20 but rare, especially specific on luminal BC. And crosstalks between lncRNAs and miRNAs were also identified. For example, lncRNA BLACAT1 was found to promote MCF7 (luminal BC cell) proliferation and metastasis by miR-150-5p/CCR2.[48]^21 In addition, prognostic roles and relationship with tamoxifen resistance were also mentioned.[49]^13^,[50]^22 Recently, competitive endogenous RNA (ceRNA), as a whole view of transcription, has gradually become a useful tool for identifying prognostic lncRNAs. The ceRNA hypothesis suggests that lncRNAs can competitively bind to miRNAs, thereby preventing their binding to mRNA, and consequently regulating mRNA expression.[51]^23 The competitive relationship influences the inhibition of miRNAs to mRNAs, resulting in downstream pathological processes like osteoarthritis, cerebral ischemia, and cancers.[52]^24^–[53]^27 However, the prognosis function of lncRNAs in breast cancer, especially luminal, is less known. In the study, a ceRNA network was constructed by integrating analysis of several databases and six lncRNAs PART1, IGF2.AS, WT1.AS, OIP5.AS1, SLC25A5.AS1, and [54]AL035706 were identified and verified by human breast cancer specimens. Further, a prognostic model consisted of them was constructed. Materials and Methods Download of Data and Pre-Processing RNA-seq data and mRNA-seq data of BC were obtained from The Cancer Genome Atlas (TCGA) database.[55]^28 Samples meet requirements were screened to obtained three matrix files of mRNA, lncRNA, and miRNA expression profiles: 1) female samples; 2) positive for at least one of the Estrogen receptor (ER) or Progesterone receptor (PR) in the cancer samples; 3) both RNA-seq and miRNA-seq data. 4) miRNAs with a ratio of zero values ≤75%. Relationship with Prediction The miRcode[56]^29 is a database based on the human complete transcriptome of GENCODE annotations to predict miRNA targets, through which we obtained the lncRNA–miRNA relationship files. The miRDB[57]^30 is a miRNA target gene prediction database based on high-throughput sequencing experiments. miRTarBase[58]^31 is a database of integrated, experimentally validated miRNA targets. TargetScanS[59]^32 predicts miRNA target genes by searching for conserved 8 mer and 7 mer positions that match miRNA seed sequences point. The three databases are currently used for miRNA target gene predictions. Through this approach, the miRNA–mRNA relationship pair files were screened. Relationship Screening Pearson correlation coefficient was calculated between the lncRNA and miRNA in the lncRNA–miRNA relationships, and the miRNA and mRNA in the miRNA–mRNA relationships. The significant negative correlation (p<0.05) relationship pairs, which served as a pre-selected ceRNA regulatory relationship, were screened. Building a ceRNA Network Files that were only in the lncRNA–miRNA or miRNA–mRNA relationship files were excluded and the remained lncRNA–miRNA and miRNA–mRNA files were merged, and visualized by the Cytoscape software. mRNA Enrichment Analysis in the ceRNA Network KEGG is a group of databases ranging from genomes, biological pathways, and associated diseases, even drugs and chemical materials.[60]^33 Gene ontology analysis (GO) is commonly used for interaction network analysis of genes and their RNA or protein products.[61]^34 By KEGG and GO, mRNA enrichment analysis was performed in the ceRNA network. Topological Analysis and Stability Analysis of the ceRNA Network The topology properties of the network were analyzed using the NetWorkAnalyzer toolKit 10 in Cytoscape9.[62]^35^,[63]^36 The NetWork Analyzer mainly includes network distribution, number of connections, average aggregation coefficient, etc. In the present study, we calculated the number of connections, the length of the path, and the proximity center of the node. Prognostic Analysis of ceRNA Modules The clinical information of the above samples was download from the TCGA database, and then the survival data of each sample were extracted, which combined the previously obtained expression spectrum data to perform a Cox survival analysis on the lncRNA nodes in the network then identify prognostic lncRNAs. Single-Factor Regression, Multivariate Cox Regression Analysis, and Survival Analysis Multivariate Cox regression analysis and survival analysis were performed on prognostic lncRNAs by SPSS 19.0, and then a prognostic model was constructed and survival curves were plotted. Patients and Tissue Samples Paired tumor specimens and their adjacent non-tumor tissues were derived from 20 luminal A patients who underwent surgical resection at Second Affiliated Hospital, Zhejiang University School of Medicine (Hangzhou, China) between Jan 2019 and May 2019. All patients recruited in the study received no pre-operative chemotherapy or radiotherapy. Specimens were frozen in liquid nitrogen immediately and stored at −80°C. This study was approved by the Ethics Committees of Second Affiliated Hospital, Zhejiang University School of Medicine. RNA Extraction and RT-qPCR Analysis Using Trizol Reagent (Invitrogen), total RNA was extracted from tissues. A total of 2 μg RNA was reverse transcribed to cDNA according to the PrimeScript RT reagent Kit (TaKaRa, Dalian, China). Then, qPCR assay was performed by the SYBR Premix Ex Taq™ II kit (TaKaRa), primers, and cDNA templates on the ABI PRISM 7900 Sequence Detection System (Applied Biosystems, Foster City, CA, USA). The 2^−ΔΔCt method was used for data quantification. GAPDH was set as the internal control. The sequences of six lncRNAs were as follows IGF2-AS: Forward primer CCTGCCTAGAGCTCCCTCTT; Reversed Primer CCATCCTCACCCAGGAACAG. WT1-AS: Forward primer CTTTCGACTAGCGCCTCTCC; Reversed primer CTTTCGACTAGCGCCTCTCC. OIP5-AS1: Forward primer CCTCATGCAGTGCCATCTGA; Reverse primer TATCCACCTTGGGTTGCAGG. SLC25A5-AS1: Forward primer AGCGGCATCTGTCATGTTGA; Reverse primer AAATCGGCCATTTGCTTCGC. [64]AL035706.1-201: Forward primer GCCAAAGTGCTGGGATTACA; Reverse primer GCCACGGGAATCAAATAAGA. PART1: Forward primer AAACGCCTGAGGACTGAGAA Reverse primer TCTCCTGCTTGCCAAATCTT Logistic Regression Analysis for Pathological Parameters Using Ki67 (1 ≤ 10%, 0 = ≤10%), Grade (1 = Ⅲ, 0 = Ⅰ or Ⅱ), diameter (1 ≤ 1.5, 0 = ≤1.5), age (1 ≤ 50, 0 = ≤50), pathological types (1 = invasive, 0 = non-invasive), and the surgical strategy (1 = conservation, 0 = radical) as the dependent variables and the expression of six prognostic lncRNAs as the independent variable to perform binary logistic regression analysis. P < 0.05 is considered as a statistically significant difference. Stata 12.0 was used for statistical analysis. Results lncRNA–miRNA Relationship From TCGA, the RNA-seq data contained 113 normal samples and 1102 cancer samples and the miRNA-seq data contained 104 normal samples and 1096 cancer samples were download. After screening, we got 884 samples, including 102 normal samples and 782 cancer samples. Subsequently, we extracted the expression profiles of mRNA and lncRNA from the matrix files obtained from the RNA-seq data. An expression profile of 14,166 lncRNAs was revealed, and by miRcode, we further identified 7098 pairs of lncRNA–miRNA, including 1449 lncRNAs and 35 miRNAs. miRNA–mRNA Relationship An expression profile containing 2253 miRNAs was collected. After excluding miRNAs with a ratio of zero values >75% in all samples, 875 miRNAs remained. Using miRDB, miRTarBase, and TargetScan to predict the target genes of these miRNAs, we obtained 19,616 pairs, including 5322 target genes simultaneously predicted by all three databases and 785 miRNAs. Relationship Filtering By screening, we obtained 295 pairs of lncRNA–miRNA and 3204 pairs of miRNA–mRNA. After a future filtering of the non-compliant relationship, we finally identified 240 pairs of lncRNA–miRNA and 115 pairs of mRNA–miRNA. Several of those were shown in [65]Tables 1 and [66]2. Table 1. Several of the lncRNA–miRNA Pairs Obtained from This Analysis lncRNA miRNA IGF2-AS hsa-miR-193a-3p FAM182A hsa-miR-363-3p FAM182A hsa-miR-425-5p TTTY6 hsa-miR-429 LINC00525 hsa-miR-301b-3p PART1 hsa-miR-429 PART1 hsa-miR-22-3p PART1 hsa-miR-449c-5p CCDC13-AS1 hsa-miR-22-3p C11orf44 hsa-miR-193a-3p [67]Open in a new tab Table 2. Several of the miRNA–mRNA Pairs Obtained from This Analysis miRNA mRNA hsa-miR-24-3p BCL7A hsa-miR-140-5p PDGFRA hsa-miR-363-3p BCL11B hsa-miR-425-5p OCRL hsa-miR-33a-3p RARB hsa-miR-140-5p RALA hsa-miR-363-3p EXOC5 hsa-miR-24-3p VGLL3 hsa-miR-301b-3p PTPN4 hsa-miR-22-3p EDC3 [68]Open in a new tab Construction and Visualization of the ceRNA Network After combining the aforementioned 240 pairs of lncRNA–miRNA and 115 pairs of mRNA–miRNA, we used the Cytoscape software for network visualization to obtain a ceRNA network map. The network map contained 356 edges and 288 nodes, including 160 lncRNAs, 15 miRNAs, and 113 mRNAs ([69]Figure 1). Figure 1. [70]Figure 1 [71]Open in a new tab Diagram of the ceRNA network. Red point: miRNA; blue point: lncRNA; yellow point: mRNA. The size of the node point represents the degree of the node. Higher degrees indicate larger points. Topological Analysis and Stability of the ceRNA Network Topological analysis showed the distribution of the degree of nodes ([72]Figure 2A). The degree of most nodes was low. Only a small number showed a high degree, suggesting that these nodes acted as a hub for the entire network. Figure 2. [73]Figure 2 [74]Open in a new tab Topological analysis and stability analysis of the ceRNA network. (A) Node degree distribution map. (B) Closeness Centrality distribution map. (C) Shortest path distribution map. (D) The density map of node degree distribution. The Closeness centrality (CC) of the node could be used to calculate the number of connection steps between nodes. The more concentrated nodes yield higher scores; thus, the CC indicated the shortest path. [75]Figure 2B illustrated that nodes with a relatively low number of connections were relatively concentrated, whereas those with higher connectivity were scattered. The path reflected the combination of all nodes in the network. [76]Figure 2C showed the shortest path length distribution, suggesting that the path length distribution was concentrated, as the extreme values at both ends were lower. The upper and lower limit was 8 and 1, respectively, indicating that most nodes in the network could be connected through a shorter path. The density map of node degree distribution is shown in [77]Figure 2D. We found that node density decreased sharply with an increase in node degree, suggesting that most nodes in the network were isolated. In the process of disease, only a few key nodes changed and interacted with adjacent nodes. Subsequently, co-expression occurred, which in turn triggered the downstream biological processes. Enrichment Analysis In order to observe the function of the constructed ceRNA network, we selected all the mRNAs included in the network to perform functional enrichment analysis.[78]^37 The results of the Gene Ontology (GO) analysis of the ceRNA network showed that most of the enriched biological processes were related to blood agglutination and hormones, suggesting that hormone regulation may be the main processes, by which key RNAs involved in the development of BC ([79]Figure 3A). Figure 3. [80]Figure 3 [81]Open in a new tab Results of the enrichment analysis. (A) GO and (B) KEGG. The Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis revealed that choline metabolism, T-cell receptor signaling, the tumor necrosis factor signaling pathway, and the mitogen-activated protein kinase signaling pathway were related to the key nodes of the ceRNA ([82]Figure 3B). lncRNA Node Survival Analysis The 160 lncRNAs included in the network were subjected to Cox survival analysis, and six lncRNAs significantly correlated with OS were identified (P<0.05); PART1, IGF2.AS, WT1.AS, OIP5.AS1, and SLC25A5.AS1 were associated with an adverse OS (HR>1) while [83]AL035706.1 with a good one (HR<1) ([84]Table 3). Table 3. Prognosis-Related lncRNAs in the ceRNA Network Sig Name P value HR Low 95% CI High 95% CI PART1 0.042076 1.218052 1.00709 1.473207 IGF2.AS 0.008407 1.180625 1.043468 1.335809 WT1.AS 0.003515 1.26976 1.081608 1.490642 OIP5.AS1 0.008513 1.30421 1.070089 1.589554 [85]AL035706.1 0.007941 0.427966 0.228699 0.800855 SLC25A5.AS1 0.020755 1.230025 1.032058 1.465966 [86]Open in a new tab Prognosis-Related lncRNA Molecules as Potential Markers We performed single-factor regression on the six identified lncRNAs, and the forest map was shown in [87]Figure 4. The high expression of five of those was associated with poor prognosis, whereas the low expression level of the remaining one lncRNA was associated with poor prognosis. Figure 4. [88]Figure 4 [89]Open in a new tab Forest map of the six identified prognosis-related lncRNAs. Different Expression of Six lncRNAs Between Normal and Tumor Tissue of Breast Cancer Patients We next verified six lncRNAs expression by clinical samples. Ten pairs of normal and tumor tissue from fresh human breast cancer specimens (ER+) were collected and examined the expression of six lncRNAs. The results showed that among five adverse-OS ones, PART1, IGF2.AS, WT1.AS, and SLC25A5.AS1 significantly higher expressed in tumor than normal, especially for PART1. While OIP5.AS1 expressed no difference between normal and tumor. [90]AL035706.1 was higher expressed in normal, which was consistent with the former results ([91]Figure 5). Figure 5. [92]Figure 5 [93]Open in a new tab The expression of the six identified prognosis-related lncRNAs between normal and tumor tissue of luminal breast cancer specimens collected from our center. N=20; *P<0.05, **P<0.01, ***P<0.001. The Expression of Six lncRNAs Correlates with Higher Pathological Parameters To evaluate whether six lncRNAs’ expression correlated with pathological parameters or not, we chose the Ki-67, tumor grades and tumor diameters of patients to help perform the logistic regression analysis. We found that the expression of PART1 was significantly in accordance with all three parameters, and [94]AL035706.1 was adverse with three parameters but had not reached a significant difference. The higher expression of WT1.AS and SLC25A5.AS1 was associated with an advanced grade and the higher expression of OIP5.AS1 with a higher Ki-67. However, IGF2.AS did not show a relation with the parameters in our samples. To exclude the other clinical parameters’ influence, we also evaluated the age, pathological types, and surgical strategies, and the results were negative ([95]Table 4). Table 4. The Correlation Between Prognosis-Related lncRNAs and Pathological Parameters in Clinical Samples Gene Ki-67 Tumor Grade Tumor Diameter OR (95% CI) P-value OR (95% CI) P-value OR (95% CI) P-value PART1 1.623 (1.090–2.415) 0.017 1.703 (0.961–3.019) 0.028 1.276 (0.942–1.727) 0.016 IGF2.AS 1.226 (0.978–1.535) 0.077 1.102 (0.866–1.402) 0.428 1.127 (0.880–1.442) 0.343 WT1.AS 1.388 (0.988–1.949) 0.058 1.459 (1.005–2.118) 0.047 1.397 (0.931–2.097) 0.106 OIP5.AS1 1.227 (0.929–1.621) 0.043 1.124 (0.823–1.535) 0.462 1.383 (0.913–2.096) 0.026 [96]AL035706.1 0.178 (0.004–7.998) 0.374 0.049 (0.000–20.373) 0.328 0.557 (0.117–2.651) 0.462 SLC25A5.AS1 1.266 (0.928–1.726) 0.136 1.574 (1.007–2.459) 0.046 1.306 (0.871–1.959) 0.197 Gene Age Pathological Type Surgery OR(95% CI) P-value OR(95% CI) P-value OR (95% CI) P-value PART1 1.099 (0.079–1.433) 0.213 1.367 (0.356–2.923) 0.167 0.857 (0.422–1.397) 0.603 IGF2.AS 0.846 (0.205–1.249) 0.570 1.145 (0.571–1.899) 0.314 1.323 (0.560–1.827) 0.054 WT1.AS 1.233 (0.767–1.621) 0.182 0.799 (0.095–1.830) 0.108 1.518 (0.242–1.939) 0.091 OIP5.AS1 1.331 (0.826–1.832) 0.074 1.827 (0.983–3.002) 0.066 0.865 (0.322–1.926) 0.173 [97]AL035706.1 1.081 (0.479–1.759) 0.245 1.994 (0.725–3.621) 0.129 1.727 (1.202–2.475) 0.288 SLC25A5.AS1 0.768 (0.278–1.343) 0.096 1.577 (0.807–1.933) 0.097 1.150 (0.592–1.847) 0.551 [98]Open in a new tab Prognostic Model of lncRNA Further, we used the expression profiles of these six lncRNAs in a multivariate Cox regression analysis to construct a prognostic model: the Risk Score = (0.0906×PART1) + (0.1506×IGF2.AS) + (0.2048×WT1.AS) + (0.2087×OIP5.AS1) − (0.8068×[99]AL035706.1) + (0.2072×SLC25A5.AS1). Subsequently, we calculated the risk coefficient by taking the expression profile of each lncRNA into the model. The samples were classified according to the median risk coefficient. As shown in [100]Figure 6, the high-risk group had significantly more deaths than the low-risk group. Moreover, a higher risk coefficient was linked to a higher expression level for lncRNAsPART1, IGF2.AS, WT1.AS, OIP5.AS1, and SLC25A5.AS1. The opposite was observed for [101]AL035706.1. Figure 6. [102]Figure 6 [103]Open in a new tab The relationship between the expression of the six identified lncRNAs and risk scores. The horizontal axis represents the risk coefficient score of the sample, increasing from left to right. As shown in [104]Figure 7A, we performed a survival analysis and plotted Kaplan–Meier curves to determine prognostic differences between the high- and low-risk samples of the model. A significant difference in prognosis was observed between the two groups. In order to verify the heterogeneity and stability of the model, we analyzed the Receiver operating characteristic (ROC) curve of the model, as shown in [105]Figure 7B. The prognostic model exhibited a markedly greater area under the curve, and prognosis was classified as high and low risk according to these six lncRNAs. These findings suggested that the six identified lncRNAs may serve as meaningful prognostic markers. Figure 7. [106]Figure 7 [107]Open in a new tab (A) Kaplan–Meier curves from the multivariate regression analysis for the prognostic classification of the six lncRNAs. (B) ROC curve from the multivariate regression analysis for the six lncRNAs. Discussion Luminal BC is the most common type of whole BC.[108]^9 Owing to advanced screening and endocrine therapy, some success in the treatment of luminal BC has recently been achieved. However, the mortality rate has not declined, partly because of a lack of efficient biomarkers. Considering the existed heterogeneity of luminal BC reflected by its inconsistent clinical outcomes, it is necessary to discover effective biomarkers, especially for luminal BC. Previous researches have shown several dysregulated genes that involved in the progression of BC and possessed great potential to be used as biomarkers. However, these markers focused on TNBC or distant metastasis.[109]^6^–[110]^8^,[111]^10^–[112]^13 Recent studies have revealed that lncRNAs played vital roles in the development of cancers.[113]^21^,[114]^22 Further, germline mutations can also predict tumor recurrence in breast cancer patients.[115]^38 In the present study, we used ceRNA to detect and validate potential prognostic lncRNAs in the luminal BC. Firstly, using TCGA database, we obtained the RNA-seq data and miRNA-seq data of BC. These data were compared with the data in the miRcode[116]^29 database to get 7089 pair files of the lncRNA–miRNA relationship. Subsequently, we defined the miRNA-targeted genes that were predicted by all three databases (miRDB, miRTarBase and TargetScan) as the final target genes, thereby obtaining 19,616 pair files of the miRNA–mRNA relationship. After screening, 240 pairs of lncRNA–miRNA and 115 pairs of mRNA–miRNA remained. Thus, a ceRNA topology network with 356 edges and 288 nodes, including 160 lncRNAs, 15 miRNAs, and 113 mRNAs was constructed. Through topological analysis, we observed that only a small number exhibited a high degree, suggesting that these act as hubs for the entire network. Most nodes in the network can be connected through shorter path correlation and showed isolation. This result indicates that only a few co-expressed key nodes change and interact with adjacent nodes in the process of BC, which in turn triggered the downstream biological processes. Subsequently, we used the GO and KEGG databases to perform enrichment analyses to elucidate involved biological process and pathways.[117]^37 The analyses suggested that hormone regulation, choline metabolism, T-cell receptor signaling, the tumor necrosis factor (TNF), and the mitogen-activated protein kinase (MAPK) signaling pathway were involved in the development of BC. After Cox survival analysis, six lncRNAs, namely PART1, IGF2.AS, WT1.AS, OIP5.AS1, [118]AL035706.1, and SLC25A5.AS1, significantly correlated with the prognosis were finally identified. According to single-factor regression and forest map analysis, the high expression level of PART1, IGF2.AS, WT1.AS, OIP5.AS1, and SLC25A5.AS1 was associated with poor prognosis, while the low expression of [119]AL035706.1 was linked to poor prognosis. Furthermore, we used the expression profiles of these six lncRNAs in a multivariate Cox regression analysis to construct a prognostic model. The survival analysis and Kaplan–Meier curves showed a significant difference in prognosis between the high- and low-risk groups classified by the model, and a markedly greater area under the curve verified the heterogeneity and stability of the model. More importantly, 20 normal and tumor paired luminal breast cancer specimens were collected to examined six prognostic lncRNAs’ expression and their correlation with clinical pathological parameters. We verified that five poor-prognostic lncRNAs were higher expressed in tumor than normal, and their expression, except for IGF2.AS, was associated with, to a degree, adverse parameters, such as Ki-67, grade, and tumor diameter. The results supported that these lncRNAs had a specification in tumor so that could be candidate prognostic markers. As for the positive-prognostic one, [120]AL035706.1 was higher expressed in the normal tissue and showed a tendency, but not a significance, to oppose adverse parameters. For this, we supposed a larger number of specimens could give a definite conclusion. We found six potential lncRNAs, namely PART1, IGF2.AS, WT1.AS, OIP5.AS1, [121]AL035706.1, and SLC25A5.AS1. Only the high expression of [122]AL035706.1 predicted a better prognosis, others were all with worse. Consistent with our findings, it was previously reported that PART1 could predict early recurrence in patients with hepatocellular carcinoma after curative resection, and modulate the toll-like receptor (TLR) pathway to influence cell proliferation and apoptosis in prostate cancer cells.[123]^39^–[124]^41 The research showed that IGF2.AS was related to metabolism, and inhibition of IGF2.AS promoted angiogenesis in patients with type 2 diabetes.[125]^42 While the other four lncRNAs have not been well investigated thus far and all of their prediction in OS were not mentioned. We proposed that they may be promising new targets for further researches. Conclusion In the present study, we comprehensively analyzed several databases to construct a ceRNA network in luminal BC and identify six OS-related lncRNAs, by which a prognosis prediction module was constructed. The six lncRNAs’ higher expression and pathological features were verified by our paired breast cancer specimen. Therefore, we proposed that these six lncRNAs can be useful biomarkers for predicting the prognosis of patients with luminal BC. Acknowledgments