Abstract Background: Gliomas, the most prevalent type of primary brain tumor, stand out as one of the most aggressive and lethal types of human cancer. Methods & Results: To uncover potential prognostic markers, we employed the weighted correlation network analysis (WGCNA) on the Chinese Glioma Genome Atlas (CGGA) 693 dataset to reveal four modules significantly associated with glioma clinical traits, primarily involved in immune function, cell cycle regulation, and ribosome biogenesis. Using the least absolute shrinkage and selection operator (LASSO) regression algorithm, we identified 11 key genes and developed a prognostic risk score model, which exhibits precise prognostic prediction in the CGGA 325 dataset. More importantly, we also validated the model in 12 glioma patients with overall survival (OS) ranging from 4 to 132 months using mRNA sequencing and immunohistochemical analysis. The analysis of immune infiltration revealed that patients with high-risk scores exhibit a heightened immune infiltration, particularly immune suppression cells, along with increased expression of immune checkpoints. Furthermore, we explored potentially effective drugs targeting 11 key genes for gliomas using the library of integrated network-based cellular signatures (LINCS) L1000 database, identifying that in vitro, both torin-1 and clofarabine exhibit promising anti-glioma activity and inhibitory effect on the cell cycle, a significant pathway enriched in the identified glioma modules. Conclusions: In conclusion, our study provides valuable insights into molecular mechanisms and identifying potential therapeutic targets for gliomas. Keywords: glioma, transcriptome, WGCNA, LASSO, prognostic signature, tumor microenvironment, drug screening 1. Introduction Glioma is a prevalent malignant brain tumor that originates from neuroglial cells within the skull [[42]1]. According to the report summarized in the study of Lin et al. [[43]2], the overall annual incidence of glioma ranges from 2.82 to 7.70 per 100,000 individuals. Specifically, the incidence of non-glioblastoma (non-GBM) varies from 4.80 to 7.70 per 100,000 person-years, while the incidence of glioblastoma multiforme (GBM) ranges from 2.82 to 5.10 per 100,000 person-years. Pathologically, gliomas are typically classified into various types, including astrocytoma, oligodendroglioma, and GBM, with GBM accounting for about 50% of all glioma cases [[44]3]. The World Health Organization (WHO) classifies gliomas into four grades: WHO grade I, II, III, and IV. Grades I and II are considered low-grade gliomas (LGG), while grades III and IV are high-grade gliomas (HGG), with GBM being classified as WHO grade IV [[45]4]. The prognosis and survival outcomes of LGG are significantly better than those of HGG. Histological classification helps us understand the behavior of gliomas, but molecular classification based on molecular features can more accurately distinguish glioma groups. The 2016 World Health Organization Classification of Tumors of the Central Nervous System (2016 CNS WHO) [[46]5], for the first time, uses molecular features in addition to histology to perform the classification of tumors of the central nervous system, including gliomas. The CNS tumor diagnoses should consist of a histopathological name followed by the molecular features, separated by a comma. For example, glioblastoma, IDH-wildtype; astrocytoma, IDH-mutant; Oligodendroglioma, IDH-mutant and 1p/19q-codeleted; diffuse midline glioma, H3 K27M–mutant; and so on [[47]5]. Additionally, key genetic alterations have been reported in gliomas, such as O6-methylguanine-DNA Methyltransferase (MGMT) promoter methylation, mutations in phosphatase and tensin homolog (PTEN) and tumor protein p53 (TP53), amplification of epidermal growth factor receptor (EGFR), and fusion of the MET tyrosine kinase (MET) gene [[48]6,[49]7]. The identified molecular features provide an effective basis for predicting the prognosis of gliomas. Clinically, the treatment of glioma faces many challenges, including complex tumor locations, high recurrence rates, difficulty in complete resection, and resistance to radiotherapy and chemotherapy. Therefore, finding more effective treatment strategies and developing personalized treatment plans have become important directions in current glioma research. By deepening our understanding of the molecular mechanisms, gene expression characteristics, and the impact of the tumor microenvironment on gliomas, more targeted approaches can be provided for glioma treatment, thereby improving patient prognosis and quality of life. Several risk factors have been associated with clinical outcomes in glioma patients, including age, tumor grade, Karnofsky performance status (KPS), MGMT status, isocitrate dehydrogenase (IDH) mutation status, 1p/19q codeletion, extent of surgical resection, tumor location or multifocality, and treatment with radiotherapy and chemotherapy [[50]8,[51]9,[52]10]. However, predicting the survival rate of individual patients remains challenging. To more accurately assess the prognosis of glioma patients, researchers have developed hundreds of prognostic models based on prognostic marker genes. However, there are currently no widely accepted and applied prognostic models [[53]11]. In this study, we aimed to construct a free-scale gene co-expression network related to the clinical traits of glioma patients using a dataset from the Chinese Glioma Genome Atlas (CGGA) database. We developed a prognostic model based on 11 genes associated with the cell cycle and immunity, utilizing the CGGA 693 dataset, a transcriptome dataset containing 693 samples. Subsequently, we validated this model using two datasets: the CGGA 325 dataset and another dataset comprising 12 glioma patients with varying overall survival (OS) durations. The CGGA 325 dataset is another transcriptome dataset containing 325 samples from the CGGA database. We explored the close relationships between the risk score model, clinical traits, the tumor microenvironment, immune checkpoints, and the cell cycle. The prognostic model we constructed establishes an independent prognostic model for glioma and provides new insights into the molecular mechanisms of glioma. 2. Results 2.1. Identification of Four Modules Significantly Associated with Clinical Traits in Glioma We acquired the raw expression matrix of 693 glioma samples from the Chinese Glioma Genome Atlas (CGGA) database. After the trimmed mean of M-values (TMM) normalization, sample clustering was performed using t-distributed stochastic neighbor embedding (tSNE) and uniform manifold approximation and projection (UMAP) with default parameters, respectively. Consistent clusters were obtained by the two methods, and the results are depicted in [54]Figure S1a–c. The individual tumor sample was color-coded based on their respective clinical attributes, such as glioma grade, Primary-Recurrent-Secondary (PRS) types, and histology types. This analysis revealed two clusters: a major group comprising 626 samples and a small group consisting of 67 samples. While the samples in the smaller group were more concentrated in WHO grade IV glioma and primary types, there was no significant difference observed in the Kaplan–Meier survival analysis between the two groups ([55]Figure S1d). For more precise data, the samples in the major group were selected for further analysis. [56]Figure 1a illustrates the clustering of samples within the major group using tSNE and UMAP. Both methods demonstrate highly consistent results, revealing that different grades of glioma samples have a gradually separated trend as the grade increases from WHO II to WHO IV ([57]Figure 1a). Additionally, Kaplan-Meier survival analysis indicated that samples with isocitrate dehydrogenase (IDH) mutation or 1p/19q codeletion exhibited significantly more favorable prognostic outcomes compared to samples with wild-type variants in the CGGA 693 dataset ([58]Figure 1b). Samples categorized as WHO grade IV, commonly known as glioblastoma multiforme (GBM), exhibit a poorer prognosis in contrast to samples classified as WHO grades II and III (referred to as non-GBM) in CGGA 693 dataset ([59]Figure S1e). These results are in alignment with the current understanding of glioma [[60]12,[61]13]. Figure 1. [62]Figure 1 [63]Open in a new tab Identification and analysis of glioma-related modules by the weighted correlation network analysis (WGCNA) analysis based on the CGGA 693 dataset. (a) t-distributed stochastic neighbor embedding (tSNE) (left) and uniform manifold approximation and projection (UMAP) (right) clustering of 626 glioma samples from the CGGA 693 dataset based on their cancer grades by using the trimmed mean of M-values (TMM) normalized expression profiles; (b) Kaplan–Meier survival analysis for 626 of glioma patients based on IDH status (Wild or Mutant) (left), and 1p/19q status (Non-codel or Codel) (right); (c) Pearson correlation between gene modules identified by WGCNA and clinical traits in glioma patients. Each cell contains the corresponding correlation and p-value; (d) analysis of Pearson correlation among modules identified by WGCNA; (e) pathway enrichment analysis of reactome for core genes identified by the MCODE, a plugin in Cytoscape software, in the yellow and pink modules, and (f) that of the green module. A total of 5476 uniquely glioma-associated genes were derived from the DisGeNET ([64]https://www.disgenet.org/ (accessed on 17 November 2023)) and GeneCards ([65]https://www.genecards.org/ (accessed on 17 November 2023)) database ([66]Table S1). These genes were utilized in constructing a co-expression network using the weighted correlation network analysis (WGCNA) algorithm with a soft thresholding power of β = 10 across 413 high-quality samples, which were obtained by removing those with missing clinical data from the major group of 626 glioma samples ([67]Figure S2a). This network consisted of 9 modules or communities of proteins interconnected by their co-expression patterns ([68]Figure 1c and [69]Figure S2b). After combining clinical traits, four significantly correlated modules were pinpointed. Among these, modules colored in pink, yellow, and green displayed negative correlations with IDH (Pearson p-value = 4 × 10^−10, 3 × 10^−12, and 8 × 10^−6, respectively) or 1p/19q status (Pearson p-value = 0.001, 4 × 10^−10 and 2 × 10^−5, respectively), while the blue module exhibited a positive correlation (Pearson p-value = 0.01 and 3 × 10^−5, respectively). Coincidentally, these identified modules also exhibited reasonable Pearson correlations with other clinical traits such as glioma grade, overall survival (OS), and censor (alive = 0; dead = 1) ([70]Figure 1c). Analysis of Pearson’s correlation between the modules indicated that the identified modules are relatively independent of each other, except for a notably strong correlation between the pink and yellow modules ([71]Figure 1d). After applying filtering criteria with a threshold of module membership (MM) > 0.8 and gene significance (GS) > 0.2 for IDH status, a total of 96 candidate genes were identified, including 11 genes in the pink module, 31 in the yellow module, 38 in the green module, and 16 in the blue module ([72]Figure S2c). Subsequently, a protein–protein interaction network was constructed with the STRING database ([73]https://string-db.org/ (accessed on 4 March 2024)), and gene clusters were identified by the plugin molecular complex detection (MCODE) in Cytoscape software (v3.1.0) [[74]14]. Notably, the top three clusters identified by MCODE scores corresponded to the green, blue, and yellow modules in WGCNA, respectively. Additionally, genes in the cluster with the fourth highest score aligned with two modules, the yellow and pink modules, potentially due to the strong correlation between these two modules ([75]Figure S2d, Table S2), demonstrating the robustness of the protein modules identified by the WGCNA algorithm. By analysis of pathway enrichment for MCODE clusters, it was observed that genes from pink and yellow modules were enriched in the immune system, innate immune system, extracellular matrix organization, and neutrophil degranulation ([76]Figure 1e), while genes from green modules correlated more strongly with cell cycle, including cell cycle-mitotic, cell cycle checkpoints, etc. ([77]Figure 1f). The MCODE cluster corresponding to the blue module mainly correlated with ribosome biogenesis. For clarity, we defined the pink and yellow modules as the unified Immune module, the green module as the Cell cycle module, and the blue module as the Ribosome module in the subsequent content. These results suggest that the biological functions or processes represented by these co-expression modules may potentially play significant causal roles in the initiation or progression of glioma. Kaplan–Meier survival analysis showed that the majority of genes in the Immune and Cell cycle modules, which are expressed at lower levels in glioma, are associated with a more favorable prognosis compared to genes expressed at higher levels, while the opposite trend was observed in the Ribosome module ([78]Table S3). This finding was further supported by the expression of the module genes in patients, where reduced expression was noted in the Immune and Cell cycle modules in patients with IDH or 1p/19q mutations, typically associated with WHO II or WHO III glioma grades and a more favorable prognosis ([79]Figure 2). Figure 2. [80]Figure 2 [81]Open in a new tab Heatmap depicting the expression of 96 candidate genes in four modules significantly associated with glioma and their relationship with clinical traits including grade (including WHO II, WHO III, and WHO IV), IDH status (Wild or Mutant type), and 1p/19q status (Non-codel or Codel type) in 413 high-quality glioma samples, which were obtained by removing those with missing clinical data from the major group of 626 glioma samples; the four modules were represented by their primary biology function. Conversely, a contrasting pattern was observed in the Ribosome module ([82]Figure 2), where most genes displayed higher expression levels in patients with a favorable prognosis ([83]Table S3). Additionally, a significant positive correlation was identified between the gene expression within the module and the 1p/19q status in glioma patients in the CGGA 693 dataset, indicating that individuals with a 1p/19q codeletion are more likely to display elevated gene expression levels ([84]Figure 2 and [85]Figure S3a). Meanwhile, the correlation with 1p/19q status was also confirmed in the CGGA 325 dataset, where 7 out of 16 candidate genes were found to be highly expressed in patients with a 1p/19q codeletion ([86]Figure S3b). 2.2. Construction and Evaluation of Risk Score with the CGGA 693 Dataset Given the consistent correlations observed between IDH and 1p/19q status and the four significantly identified modules, we applied a filtering process to select module genes with a threshold of MM > 0.8 and GS > 0.2 for both IDH and 1p/19q status ([87]Figure S2c). A total of 69 genes were identified and considered as candidate genes, of which 53 genes belong to the Immune and Cell cycle modules. Subsequently, a least absolute shrinkage and selection operator (LASSO) Cox regression analysis was implemented based on the 53 genes to establish the prognostic signature for glioma patients, identifying 11 key genes ([88]Figure 3a): PLOD1, CCR5, CTSZ, ITGB2, TLR2, ASPM, GINS4, KIF14, KIF2C, KPNA2, and POLD3. Gene ontology (GO) enrichment analysis in the biological process (BP) revealed that many of these genes are linked to glioma-related pathways, particularly enriched in processes such as microglial cell activation, cell division, DNA-dependent DNA replication, inflammatory response, etc. ([89]Figure 3b). The enriched reactome pathways are associated with the immune system, neutrophil degranulation, and cell cycle ([90]Figure 3b). Subsequently, we established a comprehensive risk score consisting of 11 key genes as the glioma-related prognostic signature. The risk score was calculated as follows: expression of PLOD1 * 0.0035 + expression of CCR5 * (−0.0030) + expression of CTSZ * (−0.0003) + expression of ITGB2 * (−0.0010) + expression of TLR2 * 0.0148 + expression of ASPM * 0.0052 + expression of GINS4 * 0.0261 + expression of KIF14 * 0.0041 + expression of KIF2C * 0.0025 + expression of KPNA2 * 0.0067 + expression of POLD3 * 0.0121. For each patient in the CGGA 693 dataset, we computed the corresponding risk score and correlated it with clinical traits based on the Spearman test ([91]Figure 3c). The findings indicate positive correlations between risk score and WHO grade, along with negative correlations with overall survival, IDH status, and 1p/19q status. Specifically, patients with high-risk scores tend to have higher WHO grades and are more likely to exhibit IDH wild type or 1p/19q non-codeletion, which are typically associated with a poor prognosis ([92]Figure 3c,d). By utilizing the median risk score to classify glioma samples into high and low-risk groups, Kaplan–Meier survival curves revealed that glioma patients in the high-risk score group are significantly associated with poor prognosis across all groups in the CGGA 693 dataset, including all glioma patients, the non-GBM group (WHO II and WHO III) and the GBM group (WHO IV) ([93]Figure 3e). Collectively, these results indicated that the calculated risk scores are closely associated with clinical traits and exhibit robust predictive potential for glioma patients. Figure 3. [94]Figure 3 [95]Open in a new tab Identification of 11 key prognostic genes by the least absolute shrinkage and selection operator (LASSO) analysis and risk score calculation based on the 413 high-quality glioma samples. (a) Coefficient profiles (left, different colors represent different genes.) and 10-fold cross-validation results (right) of LASSO; (b) gene ontology (GO) enrichment analysis (top), and reactome pathway enrichment analysis (bottom) for 11 genes determined by LASSO analysis; (c) the Spearman correlation coefficient between risk score and clinical traits in glioma patients. Value in each cell represents a correlation between traits, and the red color indicates negative correlations, with blue indicating positive correlations; (d) survival time for glioma patients with high risk (red dots) and low risk (black dots); (e) Kaplan–Meier survival curves of different groups in 413 glioma samples based on the risk scores-all samples group (left), 257 non-GBM samples group (including WHO II and WHO III for grade, middle) and 156 GBM samples group(including WHO IV for grade, right); The patients with risk score greater than the median of risk scores of all patients were considered as high risk; otherwise, they are considered low-risk patients. 2.3. Analysis of Risk Score as an Independent Prognostic Signature Univariate and multivariate Cox regression analyses of risk scores for glioma were performed in the CGGA 693 dataset. The univariate Cox regression analysis revealed a statistically significant correlation between the risk score and overall survival (OS) (HR = 4.2, p-value = 1 × 10^−26) ([96]Figure 4a). Following adjustment for confounding factors in the multivariate Cox regression analysis, the risk score continued to demonstrate its potential as an independent prognostic indicator for OS in glioma patients (HR = 2.19, p-value < 0.001) ([97]Figure 4b). These findings suggest that the risk score serves as an independent prognostic factor for overall survival in patients with glioma. Figure 4. [98]Figure 4 [99]Open in a new tab Analysis and validation of prognostic signature in the CGGA 693 dataset. (a) Univariate and (b) multivariate Cox regression analyses of the association between clinic traits and overall survival (OS) of patients; (c) construction of a nomogram model for survival prediction; (d) ROC curves with confidence bands (±1 standard deviation, SD) showing the predictive value of the nomogram model for 1-, 3-, and 5-year survival rates; (e) calibration plots of the nomogram for 1-year, 3-year and 5-year survival. The X-axis represents the nomogram-predicted probability, and the Y-axis shows the observed probability. Subsequently, we developed a nomogram prediction model integrating the risk score and four clinical traits (Age, Grade, IDH status, and 1p/19q status) to forecast clinical outcomes ([100]Figure 4c). To evaluate the predictive efficacy of the nomogram, we performed a receiver operating characteristic (ROC) analysis ([101]Figure 4d) and generated calibration curves ([102]Figure 4e). The calibration curves demonstrated excellent concordance between the predictions and observations at 1-year, 3-year, and 5-year intervals, with the area under the ROC curve reaching 0.74 (±1 standard deviation, SD: 0.64–0.84) at 1-year, 0.83 (±1 SD: 0.76–0.89) at 3-year, and 0.82 (±1 SD: 0.78–0.86) at 5-year, respectively. These results signify a highly predictive performance in the model. 2.4. Validation of Prognostic Signature in Glioma Patients with Varying Severity Our prognostic signature was validated in two additional datasets: the CGGA 325 dataset and a cohort comprising 12 glioma patients with diverse prognoses and tumor types. Initially, we assessed the correlations between the risk scores of individual patients in the CGGA 325 dataset and their respective clinical traits, including age (categorized as young or old based on median age), tumor type, grade, IDH mutant status, and 1p/19q codeletion status ([103]Figure 5a). The results revealed significant positive correlations between the risk scores of glioma patients and their age and grade, as well as significant negative correlations with IDH and 1p/19q status. While no statistical differences were observed between primary and recurrent tumors or between recurrent and secondary tumors, it was observed that the risk scores of patients increased with the severity of tumor recurrence. These findings suggest strong correlations between the risk score and the clinical traits of patients within the CGGA 325 dataset. Kaplan–Meier survival also revealed that glioma patients in the high-risk score group are significantly associated with poor prognosis across all sample groups and non-GBM group (WHO II and WHO III) in this dataset ([104]Figure S4a,b), which is consistent with that of in CGGA 693 dataset. However, there was no significant difference in prognosis between high-risk and low-risk groups for the GBM group ([105]Figure S4c), potentially due to the smaller sample size as the number of follow-up days increased (with only four samples remaining after 3000 days of follow-up). Furthermore, based on the LGG and GBM datasets from the Cancer Genome Atlas (TCGA), consistent results were obtained ([106]Figure S4d,e). It was noteworthy that the expression levels of the 11 prognostic genes exhibited statistically significant positive correlations with glioma grade, with levels increasing as the tumor grade advanced ([107]Figure 5b). This finding further corroborates previous results, indicating that patients with lower expression levels of these genes trend to have a more favorable prognosis. For the purpose of organization, we categorized the 12 glioma patients into three distinct groups labeled A, B, and C. Group A comprised four patients diagnosed with LGG who demonstrated a longer OS ranging from 72 to 132 months across all groups. Groups B and C each consisted of 4 patients with GBM. In comparison to group B, where the OS ranged from 40 to 76 months, patients in group C displayed a poorer prognosis, with an OS ranging from 4 to 8 months. Figure 5. [108]Figure 5 [109]Open in a new tab Validation of risk score and 11 prognostic genes in CGGA 325 dataset and a cohort of 12 glioma patients. (a) The distributions of the risk score in the different groups of CGGA 325 patients in terms of each clinical trait, such as age, Primary-Recurrent-Secondary (RPS) type, grade, IDH status and 1p/19q status; (b) the distributions of expression of 11 identified prognostic genes among different levels of grades patients in the CGGA 325 dataset; (c) heatmap of expression for 96 candidate genes in three identified biology modules (Immune, Cell cycle and Ribosome modules, respectively); (d) the distribution of risk score in different groups for 12 glioma patients. A group represents low-grade glioma (LGG) patients, and both B and C groups represent glioblastoma (GBM) patients. Among the three groups, the patients in the A group have the longest overall survival (OS), followed by the B group, and group C has the shortest OS. Each group includes four patients. Statistical significances among different groups were determined using the Wilcox test, and a p-value < 0.05 was considered significant. The candidate genes identified within the Immune and Cell cycle modules exhibited increased expression levels in groups B and C compared to group A. In contrast, the blue module exhibited a reverse pattern, showing decreasing expression levels in groups B and C relative to group A ([110]Figure 5c), which is consistent with findings from the CGGA 693 dataset. Furthermore, as the OS of patients decreased, there was a gradual rise in risk scores, indicating a statistically significant negative correlation (Spearman R = −0.71, p-value = 0.01) between risk scores and OS across all patients ([111]Figure 5d). We also selected 5 out of the 11 key genes for immunohistochemical validation in these cohorts ([112]Figure S5). Overall, these findings confirm the reliability of the prognostic signature in glioma patients. 2.5. Correlation of Risk Score with Immunological Function Analysis in Glioma Patients To investigate the distinctions between patients classified as high risk and low risk, we conducted a cluster analysis on the gene expression of 413 glioma samples from the CGGA 693 dataset previously selected for WGCNA analysis. This analysis was carried out by using tSNE with default parameters, except for the perplexity set to 10, and the UMAP method with default parameters. The results revealed a clear differentiation between patients with high-risk scores and those with low-risk scores ([113]Figure 6a). Differential expression analysis comparing high-risk and low-risk score patients showed significant upregulation of the 11 identified prognostic genes in patients with high-risk scores ([114]Figure 6b). Interestingly, genes upregulated in patients with high-risk scores were significantly enriched for adaptive immune response in the gene set enrichment analysis (GSEA) analysis (NES = 2.75, FDR < 1 × 10^−10) ([115]Figure 6c), which is consistent with pathway enrichments observed for the 11 prognostic genes ([116]Figure 3b). These findings suggest a correlation between the risk score for glioma patients and immune function. Figure 6. [117]Figure 6 [118]Open in a new tab Analysis of the relationship between risk score and tumor microenvironment in the 413 glioma samples from the CGGA 693 dataset. (a) tSNE (left) and UMAP (right) plot of 413 samples based on their gene expression, colored by risk group of samples (high risk is defined as a patient with a risk score greater than the median risk score, and low risk is the opposite); (b) volcano plot of high-risk versus low-risk group (The red dots represent up-regulated genes, and the blue ones down-regulated genes), in which 11 prognostic genes were marked; (c) GSEA plot of high-risk patients compared to low-risk patients in term of adaptive immune response (NES = 2.75, FDR < 1 × 10^−10); (d) heatmap showing the relative abundance of 28 infiltrating immune cell types; (e) Violin plots showing the distribution of stroma score, immune score or tumor purity in the low risk and high-risk patients (Wilcox test; ***, p-value < 0.01; (f) the Spearman’s rank correlation with 95% confidence intervals (CIs) between immune infiltrating cells by CIBERSORTx and risk scores (***, p-value < 0.01; **, p-value < 0.05; NS., no significance), calculated using the JASP tool. To investigate the relationship between risk score and tumor immunity, we assessed immune heterogeneity between patients with high and low-risk scores using a single-sample gene set enrichment analysis (ssGSEA). The resulting heatmap displayed the relative abundance of 28 infiltrating immune cells in each glioma patient ([119]Figure 6d). Overall, patients with high-risk scores exhibited a higher degree of immune cell infiltration, indicating an immune-hot phenotype, while those with low-risk scores displayed lower levels of immune cell infiltration, suggesting an immune-cold phenotype, consistent with previous study findings [[120]13,[121]15]. In addition, we also calculated the stromal and immune scores for each glioma patient. The results indicated positive correlations between both stromal and immune scores with the risk score, while tumor purity showed a negative correlation ([122]Figure 6e). These results suggest that the infiltrating immune cells in gliomas increase with rising risk scores, and patients with high-risk scores exhibit an immune-hot phenotype. It is widely recognized that a high level of immune infiltration tends to be associated with a favorable prognosis [[123]16,[124]17]. However, in this study, there appears to be a contradiction between a high-risk score and an immune-hot phenotype. To address this, we further assessed the proportions of 22 immune cells in each glioma patient using the online software CIBERSORTx ([125]https://cibersortx.stanford.edu/ (accessed on 2 April 2024)), which estimates the abundances of different cell types within a mixed cell population based on gene expression data. Upon conducting a Spearman correlation analysis between risk scores and the proportions of the 22 infiltrating immune cells, we observed that the infiltrating cells in patients with high-risk scores were predominantly associated with immune suppression, such as macrophage, regulatory T cell, and neutrophil, etc. ([126]Figure 6f). Moreover, the risk score exhibited significant positive correlations with well-known checkpoints and their corresponding ligands ([127]Figure 7a,b, Spearman test). It is well-documented that overexpression of immune checkpoints in tumors is linked to a poor prognosis [[128]18,[129]19,[130]20]. In summary, the high proportion of immune suppression cells and high expression of checkpoints may contribute to the poor prognosis of patients with high-risk scores. Figure 7. [131]Figure 7 [132]Open in a new tab The Spearman correlation between risk score and immune checkpoint proteins. The upper panel (a) shows immune checkpoint receptors, including PD-1, CTLA-4, TIM-3, and LAG3, and the lower panel (b) shows corresponding ligands for each receptor. 2.6. Screening for Potentially Effective Molecules Targeting Prognostic Genes of Glioma Drug repurposing was conducted based on the 11 prognostic genes for glioma and the LINCS L1000 platform using the methodology reported in the study [[133]21]. Briefly, the method could screen the drugs that have the potential to reverse cancer-related gene expression and calculate a reversal score for each drug. Sorted by reversal scores for all cases of drugs in the L1000 dataset (a total of 60,510 cases), the drugs with high reversal scores have high potency to reverse the expression of 11 genes and are considered to have high efficacy in the treatment of glioma. Two drugs, torin-1 and clofarabine, were selected for testing their anti-GBM activity due to their high ranks in the list of drugs (20/60,510, 24/60,510, respectively) according to their reversal scores. Subsequently, the dose-dependent effects of torin-1 and clofarabine on the viability of glioma cells were assessed in the LN229 and U251 cell lines, respectively. In LN229 cells, we observed that torin-1 and clofarabine have a significant inhibitory effect on the cells, and the half-maximal inhibitory concentration (IC[50]) values were 21.6 nM and 12.96 nM, respectively ([134]Figure 8a,b, top). In U251 cells, the consistent inhibition of cell viability was observed, and the IC[50] values for torin-1 and clofarabine were 5.189 nM and 12.22 nM, respectively ([135]Figure 8a,b, bottom). Figure 8. [136]Figure 8 [137]Open in a new tab Effect of torin-1 and clofarabine on cell viability and cell cycle in glioblastoma (GBM) cells. (a) The images of LN229 (top) and U251 cells (bottom) treated with torin-1 and clofarabine; (b) the curves of inhibition and IC[50] values of torin-1 and clofarabine for LN229 (top) and U251 (bottom) cells. The cells were determined by CCK8 assay after treatment of the drugs for 48 h; (c) flow cytometry analysis for cell cycle distribution of LN229, and (d) that of U251 cells across different groups, including untreated, control, and perturbation with torin-1 and clofarabine. Cells were treated for 24 h with 3 μM of clofarabine and torin-1, respectively; proportions of cells in different cell cycle phases with 95% confidence intervals (CIs) across different groups for (e) LN229 and (f) U251 cells. The whole cycle for each group was assumed as 100%, with experiments conducted in triplicate. Given that the 11 prognostic genes displayed significant correlations with cell cycle pathways ([138]Figure 3b), we conducted a flow cytometry analysis to investigate whether the two drugs influenced cell viability by modulating cell cycle progression. The results for LN229 and U251 cells are presented in [139]Figure 8c,d. A chi-square test was performed to compare the proportions of cell populations in each cell cycle phase—G0/G1, S, and G2/M—between the control and drug-treated groups for both cell lines. The results indicate significant differences in the distribution of cell populations for cell cycle phases across all comparisons ([140]Table 1). Furthermore, torin-1 led to an increase in S phase cells in U251 and prolonged the G0/G1 phase in LN229, while clofarabine induced obvious cell cycle arrest in LN229 and U251 cells, characterized by an increase in G0/G1 phase cells ([141]Figure 8e,f). These results indicate that torin-1 and clofarabine may exert inhibitory effects by preventing cells from progressing into the G2/M phase and inhibiting cell proliferation. Table 1. Chi-square test results for the proportion of cell populations in each cell cycle phase (G0/G1, S and G2/M) between the control and drug-treated groups for the LN229 and U251 cell lines. LN229 Cells U251 Cells Control and Torin-1 Control and Clofarabine Control and Torin-1 Control and Clofarabine Chi-square statistic 9.110 6.042 8.342 12.849 Degree of freedom 2 2 2 2 p-value 0.011 0.049 0.015 0.002 [142]Open in a new tab 3. Discussion With rapid advancement in the basic and clinical research of gliomas, there has been considerable progress in the treatment of brain gliomas. However, improvement of survival time for glioma patients is still an extremely serious challenge due to the obscure pathogenesis, lack of precise diagnostic methods, and effective medicines. Since the combination of histopathological and molecular features was required for the classification and diagnosis of glioma in the 2016 World Health Organization Classification of Tumors of the Central Nervous System (2016 CNS WHO) [[143]5], molecular features in the diagnosis and prognosis for glioma patients have been widely accepted. Undoubtedly, the identification of key genes and the building of prognostic models based on the large public database for glioma, such as the Chinese Glioma Genome Atlas (CGGA) and the Cancer Genome Atlas (TCGA) database, has become increasingly appealing. Recently, to more accurately assess the prognosis of glioma patients, many prognostic models based on the identification of marker genes have been developed. However, there is currently no widely accepted and applied prognostic model [[144]11]. In this study, we have conducted WGCNA analysis based on the CGGA 693 dataset, a transcriptome dataset containing 693 samples, and identified four modules significantly associated with clinical traits for glioma. These modules are significantly enriched in the immune system, cell cycle, and ribosome biogenesis. Upon LASSO regression analysis, 11 key genes were identified. A risk score model was then constructed based on the 11 genes, which can effectively predict risk scores for individual patient with glioma and assess the prognosis. The risk score has been demonstrated to be an independent prognostic factor through both univariate and multivariate regression analysis. Using data from the CGGA 325 dataset and a separate group of 12 glioma patients with varying overall survival (OS) durations, the risk score proved to be an effective tool for assessing patient prognosis. The analysis procedure for this whole study was summarized in [145]Figure 9. Figure 9. [146]Figure 9 [147]Open in a new tab The flowchart of the whole study. The main procedure sequentially comprises six components: (1) Data Screening and Modules Identification, (2) Construction of Risk Score, (3) Evaluation of risk score, (4) Validation of risk score, (5) Immunological analysis, (6) Drug screening and Validation. Through WGCNA, LASSO, and multivariate Cox regression analysis, we selected 11 genes from the pink, yellow, and green modules as risk prediction indicators. KIF14, ASPM, CCR5, GINS4, POLD3, TLR2, KIF2C, KPNA2, ITGB2, PLOD1, and CTSZ were highly correlated with the overall survival of glioma patients. The total survival period of glioma patients, LGG, or GBM patients with high-risk scores was significantly lower than that of low-risk score individuals. Even after adjusting for confounding factors such as clinical traits, gene mutations, and 1/3/5-year survival periods, the risk score remained an independent prognostic factor for glioma. We conducted GO functional analysis and pathway enrichment analysis on these key genes and found that they mainly cluster in functions related to the immune system processes, DNA replication, cell cycle, and division, indicating their potential roles in enhancing immune evasion and cell proliferation in gliomas. The kinesin superfamily genes (KIFs) play crucial roles in the cell cycle and have been shown to be involved in chromosome and spindle movements during mitosis and meiosis. KIF14 is highly expressed in gliomas and is associated with higher mitotic and Ki67 indices, as well as lower patient survival rates [[148]22]. Knocking down KIF14 can reduce cell proliferation and invasion capabilities, induce apoptosis, and inhibit glioma growth [[149]23], suggesting that KIF14 may be a potential target for HGBT treatment [[150]24]. KIF2C is involved in regulating cell mitosis and the repair of double-strand DNA breaks in cancer cells. Research by Tu et al. found that KIF2C is highly expressed in primary and recurrent gliomas and is associated with shorter patient survival, potentially serving as a prognostic biomarker for GBM [[151]25]. Studies have shown that ASPM expression is associated with the WHO grade of gliomas, with higher expression in GBM and recurrent tumors. ASPM is involved in the cell cycle and microtubule stability in gliomas [[152]26,[153]27], suggesting that ASPM depletion may mimic microtubule destabilizers; however, specific ASPM inhibitors have not been discovered yet. CCR5 is highly expressed in glioblastomas and can trigger the PI3K/AKT pathway to promote proliferation, induce polarization of TAMs and immune suppression and facilitate the maintenance of glioma stem-like features [[154]28]. The GINS (Go-Ichi-Ni-San) complex plays a crucial role in DNA replication and the cell cycle. The GINS complex consists of four subunits encoded by the GINS1, GINS2, GINS3, and GINS4 genes. While there is abundant research reporting high expression of GINS2 in various tumors, including gliomas, promoting tumor cell proliferation and migration, inhibiting apoptosis, and regulating the cell cycle, it can serve as a new diagnostic marker and therapeutic biomarker for tumors [[155]29]. Although there are no reports on the expression and function of GINS4 in gliomas, as a member of the GINS complex, GINS4 plays an important role in cell cycle regulation. DNA polymerase delta (POLD) is a heterotetramer composed of a catalytic subunit and three accessory subunits. Abnormal expression and gene mutations of POLD can lead to tumorigenesis. Mutations in the POLD1 and POLD3 genes increase the risk of CRC and other malignant tumors [[156]30]. Dysregulation of POLD3 expression can affect genomic stability, promote cell differentiation and proliferation, and serve as a prognostic factor predicting the survival rate of LGG patients [[157]31,[158]32]. Recent studies have reported that POLD/POLE mutations are associated with the immune response to CRC [[159]33]. Toll-like receptors (TLRs) play a crucial role in innate immunity, and tumor-associated macrophages play a significant role in the tumor microenvironment. Activation of TLR2 can promote M2 polarization of TAMs to enhance tumor growth. Additionally, TLR2 activation can upregulate MMP1 expression in microglia and gliomas, promoting tumor growth and migration [[160]34]. KPNA2 is highly expressed in various malignant tumors. In gliomas, KPNA2 can promote tumor growth and metastasis by inhibiting the activation of the HIPPO signaling pathway and mediating TP53 nuclear translocation to promote EMT [[161]35,[162]36]. ITGB2 is a common and important gene in glioma immunity and stromal infiltration. Tumor infiltration analysis indicates that ITGB2 is associated with immune cells such as dendritic cells, macrophages, monocytes, neutrophils, and B cells. ITGB2 is considered a potential target for primary GBM [[163]37]. Cathepsin Z (CTSZ) is a protease that not only has protease activity but also interacts with integrins. Although there is limited research on the role of CTSZ in gliomas, studies in pancreatic neuroendocrine tumors have shown that intracellular CTSZ in tumor cells can promote proliferation, while both intracellular CTSZ in tumor cells and CTSZ secreted by TAMs can promote tumor invasion [[164]38]. CTSZ may have similar functions in gliomas and the microenvironment. Overall, the functions of these genes in tumors, most of which have been well established in gliomas, suggest that KIF14, ASPM, CCR5, GINS4, POLD3, TLR2, KIF2C, KPNA2, ITGB2, PLOD1, and CTSZ can be considered key prognostic factors for the overall survival of glioma patients. In addition, we established a prognostic scoring model based on 11 hub genes to predict risk in both the CGGA dataset containing 325 patients and 12 clinical samples of gliomas. We found that the expression of these 11 genes in WHO IV is significantly higher than in WHO II and WHO III, and the risk score is closely associated with age, grading, IDH mutation, and 1p/19q loss. However, this predictive model cannot effectively distinguish between GBM patients with long or short progression-free survival. Considering the high malignancy and rapid progression of GBM, as well as its significant tumor heterogeneity and short overall survival, existing predictive models struggle to provide accurate predictions for GBM. Future research focusing on prognostic models based on different pathological subtypes of GBM will be beneficial for survival and prognosis predictions in GBM. The tumor microenvironment has a complex relationship with the onset, progression, and metastasis of gliomas. Unlike other tumors, gliomas have a unique immune microenvironment. Due to the presence of the blood–brain barrier, tumor-infiltrating lymphocytes (TILs) are scarce in gliomas, with the microenvironment mainly infiltrated by microglia and tumor-associated macrophages (primarily M2-type macrophages). M2-type macrophages can produce growth factors, activate tissue repair and angiogenesis, suppress adaptive immunity, enhance immune escape, and promote tumorigenesis. TILs primarily consist of anti-tumor cells, including CD8+ and CD4+ T cells (helper T cells), with CD8+ T cells associated with a good prognosis. The TME also contains Treg cells, which participate in tumor immune suppression processes and are associated with poor prognosis [[165]39]. Previous studies have reported that CCR5, TLR2, ITGB2, and CTSZ, among the 11 hub genes, are involved in TAM polarization and immune suppression in gliomas. In this study, we found that the risk score significantly influences the adaptive immunity in gliomas and is strongly correlated with TAMs. Patients with high-risk scores have anti-tumor-related TILs in the TME predominantly in a dormant state, while patients with low-risk scores have higher levels of anti-tumor-related TILs in the TME. Furthermore, the risk score is positively correlated with stromal and immune scores and negatively correlated with tumor purity. These results suggest that our constructed risk prediction model may predict the immune-suppressive environment in glioma patients. Immune checkpoint inhibitors are widely used in cancer treatment and have shown good efficacy in various cancers; however, they have shown poor efficacy in glioma treatment [[166]40]. Our risk score indicates a significant positive correlation between immune checkpoint-related molecules, such as PD-1/PD-L1, CTLA-4/CD80, TIM-3/LGALS9, LAG3/HLA-DRA, and the risk score, indicating that higher risk scores are associated with higher immune checkpoint expression, stronger immune evasion capabilities, and potentially poorer response to the immune checkpoint blockade (ICB) therapy. Therefore, our established risk score may be useful for evaluating the prognosis of glioma patients after receiving ICB treatment. The conventional treatment methods for gliomas include surgery, chemotherapy, and radiotherapy; however, the prognosis for malignant glioma patients remains discouraging, with conventional therapies showing only limited improvements for glioma patients [[167]41]. Malignant gliomas exhibit genetic heterogeneity, and a single therapy is not sufficient to address clinical issues. The 11 hub genes included in our prognostic model are significantly associated with the clinical traits of gliomas. We conducted small molecule inhibitor screening for these 11 hub genes and found that clofarabine and torin-1 exhibit high inhibitory activity against two glioma cell lines, LN229 and U251, blocking cells in the G0/G1 phase, inhibiting the proportion of cells in the G2 phase, and affecting the cell cycle process, thereby inhibiting cell proliferation. This indicates that these hub genes can serve not only as prognostic indicators but also as targets for glioma treatment. 4. Materials and Methods 4.1. Data Acquisition and Reprocessing Two datasets from two distinct Chinese glioma cohorts (the 693 and 325 cohorts) were downloaded from the Chinese Glioma Genome Atlas database (CGGA, [168]http://www.cgga.org.cn (accessed on 4 July 2023)) [[169]12], named CGGA 693 and CGGA 325 datasets, respectively, which included the raw expression matrix and corresponding clinical information. In our study, the CGGA 693 dataset was used as the primary dataset for analysis, while the CGGA 325 dataset was used as one of the validation datasets. For the CGGA 693 dataset, we first performed cluster analyses using t-distributed stochastic neighbor embedding (tSNE) and uniform manifold approximation and projection (UMAP) methods and achieved two clustering groups. We selected the primary group of 626 samples as the object for the study. After removing missing clinical data, the primary group included a total of 413 glioma samples. The statistics of clinical characteristics for the datasets with 413 samples and the CGGA 325 dataset are shown in [170]Table S4. The expression matrix for low-grade glioma (LGG) and glioblastoma multiforme (GBM), comprising raw counts and fragments per kilobase per million mapped fragments (FPKM), along with the associated clinical information, were obtained from the Cancer Genome Atlas (TCGA, [171]https://www.cancer.gov/tcga (accessed on 4 July 2023)). The raw counts in all datasets were normalized using the Trimmed Mean of M-values (TMM) algorithm from the edgeR package (v3.36.0) in R (v4.1.3) [[172]42]. 4.2. Collection of Genes Associated with Glioma To obtain relatively reliable glioma-related genes, the DisGeNET database ([173]https://www.disgenet.org/ (accessed on 17 November 2023)) was initially consulted to compile genes linked to Glioma (C0017638) and malignant glioma (C0555198), respectively [[174]43]. After eliminating duplicates, a total of 3202 genes were obtained. Subsequently, the term “Glioma” was used to search the Genecards database ([175]https://www.genecards.org/ (accessed on 17 November 2023)), yielding 9446 glioma-related genes, in which filtering is carried out based on a criterion where only genes with relevance scores greater than the median score (0.75) were retained, resulting in 4621 genes. By integrating genes from both databases, a total of 5476 unique genes with entry IDs were compiled ([176]Table S1). These genes will be utilized for subsequent analyses. 4.3. Weighted Correlation Network Analysis (WGCNA) and Identification of Modules To identify the modules of co-expressed genes associated with clinical traits for glioma, we employed the WGCNA method as described by Langfelder and Horvath [[177]44], which was performed using the WGCNA package (v1.72.1) in R. After checking excessive missing values and identification of outliers, a soft thresholding power of β = 10 was selected for the dataset (a matrix of 5476 (genes) × 413 (samples)) by the function pickSoftThreshold() of WGCNA package to ensure scale-free topology, R^2 > 0.9, based on the criterion of approximate scale-free topology ([178]Figure S2a). The WGCNA::blockwiseModules() function was used with the following settings for co-expression network construction and module detection in a one-step way: soft threshold power (β) = 10, minimum module size of 30, merge cut height of 0.5. Briefly, a co-expression similarity matrix S[ij] was constructed by calculating Pearson’s correlation coefficients between gene pairs of the collected glioma-related genes (5476) across the 413 glioma samples. The co-expression similarity matrix S[ij] was defined as the absolute value of the correlation coefficient between the expression profiles of gene i (x[i]) and gene j (x[j]). The formula of S[ij] is as follows: [MATH: Sij=|cor(< msub>xi,xj)| :MATH] Subsequently, the similarity matrix was transformed into a weight adjacency matrix (a[ij]) using the selected soft thresholding power (β), and the formula was shown as follows: [MATH: aij=sijβ :MATH] The adjacency matrix was then converted into a topological overlap matrix (TOM) with TOM similarity and its dissimilarity (dissTOM). Hierarchical clustering on the dissTOM was performed to group genes into modules. A dynamic tree cut algorithm was conducted on the dendrogram for identifying co-expression gene modules, with a minimum size requirement of 30 and merge cut height of 0.5. The first principal component of the expression of each module was summarized as module eigengenes (MEs). Pearson’s correlations between MEs and clinical traits of gliomas were computed, and modules exhibiting significant correlations (p-value < 0.05) with clinical traits such as IDH and 1p/19q status were singled out for further analysis. The candidate genes in the selected modules are defined as having high significance with the traits (gene significance, GS) as well as high intramodular connectivity (module membership, MM), representing the correlation between gene expression and MEs. The candidate genes in the module were screened out by setting GS > 0.2 and MM > 0.8 as a threshold. 4.4. Gene Function Enrichment Analysis Gene ontology (GO) and reactome pathway analysis were performed by online DAVID ([179]https://david.ncifcrf.gov/home.jsp (accessed on 5 March 2024)) [[180]45] with default parameters (Count: 2 and EASE Score: 0.1), and terms with p-value < 0.05 were considered to be statistically significant. Gene set enrichment analysis (GSEA) was performed using the GSEA Preranked method in GSEA software (v4.1.0) with default parameters employed as follows: Number of permutations: 1000, Collaps/Remap to gene symbols: No_Collapse, Min size: 15, Max size: 500. The significance of the enrichment was assessed at p-value < 0.05 and FDR < 0.25 [[181]46,[182]47]. 4.5. Construction of Glioma-Related Prognostic Signature A total of 53 genes exhibiting a significant correlation with IDH and 1p/19q status within the Immune and Cell cycle modules were utilized in the least absolute shrinkage and selection operator (LASSO) Cox regression analysis to determine specific coefficients for each association. LASSO Cox regression is a method for variable selection and shrinkage in the Cox proportional hazards model proposed by Tibshirani et al. [[183]48]. LASSO Cox regression analysis was carried out using the glmnet package (v4.1.6) in R. Briefly, we generated a Surv object using the survival package (v3.55) in R based on the survival time and censoring information for 413 patients, which was used as the response variable in the function glmnet() of glmnet package. The expression matrix for 53 screened genes across 413 patients was used as predictor variables. Next, we ran the function glmnet() with the parameters family = “Cox” and alpha = 1 to fit the LASSO regression model. To determine what value to use for lambda, we performed 10-fold cross-validation using the function cv.glmnet() with parameter k = 10 and identified the lambda value that produces the lowest test mean squared error (MSE). The lambda value that minimized the test MSE turned out to be 0.04067 in our study. Lastly, we analyzed the final model produced by the optimal lambda value to obtain the coefficient estimates for this model. For the genes with no coefficients in predictor variables, this means it was completely dropped from the model because it was not influential enough. Then, 11 prognostic genes and corresponding coefficients were achieved to develop a glioma-related prognostic signature. The risk score for each patient was computed using the following formula: [MATH: RiskScore=i=1nexprgene i×coef fgene(i) :MATH] In this formula, n represents the number of genes selected by the LASSO method, expr[gene][(i)] is the expression value of gene(i), and coeff[gene][(i)] is the Cox coefficient of gene(i). 4.6. Protein-Protein Interaction (PPI) Analysis The key genes meeting the criteria of gene significance (GS) for IDH status > 0.2 and module membership (MM) > 0.8 within the four significantly identified modules were used as input for the STRING database ([184]https://cn.string-db.org/ (accessed on 4 March 2024)), a resource for predicting functional associations among proteins, to perform protein–protein interaction network analysis. The MCODE plugin (v2.0.0) in Cytoscape (v3.1.0) was used to detect densely connected regions within the resulting extensive protein–protein interaction networks. All network visualizations were performed by using Cytoscape. 4.7. Prognostic Model Based on Clinical Traits and Risk Scores Univariate survival analyses were performed on the risk scores and clinical traits of glioma patients using univariate Cox proportional hazard models. Subsequently, a multivariate Cox model was executed utilizing the same features to determine if each feature could serve as an independent prognostic variable for glioma patients. A nomogram model was constructed using the ‘rms’ package (v6.8.0) in R software to predict clinical outcomes for glioma patients by integrating risk scores with four clinical traits(Age, Grade, IDH status, 1p/19q status). To evaluate the effectiveness of this nomogram model in predicting the 1-year, 3-year, and 5-year survival rates of patients, a series of calibration plots and a receiver operating characteristic (ROC) curve were generated. 4.8. Glioma Tissues and RNA Sequencing A total of 12 patients diagnosed with high or low-grade glioma (4 LGG and 8 GBM) were selected from the archives of Tianjin Medical University Cancer Institute and Hospital (TMUCIH). Frozen tumor samples were collected from these patients for mRNA next-generation sequencing using the Illumina-HiSeq2000/2500 platform. All patients were newly diagnosed and had not undergone chemo-or radiotherapy before surgery. Patients received concurrent radiotherapy, temozolomide (TMZ) chemotherapy, and TMZ sequential chemotherapy after surgery until tumor recurrence. The patients were monitored for a period ranging from 4 to 132 months, and unfortunately, all patients had passed away by the end of the study. This study was thoroughly reviewed and approved by the Ethics Committee of TMUCIH. 4.9. Immunohistochemical Staining The tumor samples of patients were processed into 5 μm tissue sections after formalin-fixation and paraffin embedding. The sections underwent deparaffinization in xylene and rehydration using a descending series of ethanol. Subsequently, tissue antigens were repaired by microwave heating, followed by incubation with 10% normal goat serum to block nonspecific reactions at room temperature for 10 min. Primary antibodies against KPNA2 (1:200; Proteintech, Wuhan, China), ASPM (1:200; Proteintech, China), CCR5 (1:100; Proteintech, China), KIF2C and KIF14 (1:100; Proteintech, China) were applied separately and incubated overnight at 4 °C, and the biotin-labeled goat anti-mouse/rabbit IgG and streptavidin-peroxidase (UltraSensitiveTM SP IHC Kit; Fuzhou Maxim Biotech, Fuzhou, China) were subsequently used. After washing, the sections were then developed using a diaminobenzidine substrate. 4.10. Immune Cell Infiltration Analysis To quantify the infiltration of 28 types of immune cells in each glioma sample, ssGSEA analysis was performed using the GSVA packages (v1.42.0) in R [[185]49]. The ssGSEA enrichment score was utilized as the measure of immune cell infiltration in each sample. Gene signatures for each immune cell type were adopted from a previous study. The CIBERSORTx ([186]http://cibersort.stanford.edu (accessed on 2 April 2024)) method was performed to characterize cell composition based on gene expression profiles in previous studies. 4.11. Estimation of Stromal and Immune Cells The Estimation of Stromal and Immune cells in MAlignant Tumours using the Expression data (ESTIMATE) tool (1.0.13) was employed to calculate the immune score and stromal score [[187]50,[188]51], representing the degree of infiltration of immune cells and stromal cells, respectively. The ESTIMATE score was then produced by integrating the stromal scores and the immune scores. The formula for tumor purity is as follows [[189]51]: [MATH: Tumor purity=cos(0.6049872018+0.0001467884×ESTIMATEscore) :MATH] 4.12. Drug Screening Based on Prognostic Genes Expression regulatory profiles (upregulation or downregulation) of 11 glioma prognostic genes were analyzed to identify potential therapeutic drugs capable of reversing the expression of these genes by the method reported in the study [[190]21], using the LINCS L1000 database [[191]52]. The LINCS L1000 database offers gene expression profiles induced by over 10,000 compounds, shRNAs, and kinase inhibitors using the L1000 platform. Briefly, the method could screen the drugs that have the potential to reverse cancer-related gene expression and calculate a reversal score for each drug. Sorted by reversal scores for all cases of drugs in the L1000 database, the drugs with high reversal scores have high potency to reverse the expression of 11 genes and were considered to have high efficacy in the treatment of glioma. 4.13. Cell Culture and Drug Perturbation Human glioma cell lines LN229 were sourced from the American Type Culture Collection (ATCC), while the U251 cell line was obtained from Sigma. All cell lines were cultured in Dulbecco’s modified Eagle’s medium (DMEM) (Gibco, Carlsbad, CA, USA) supplemented with 10% fetal bovine serum (Sigma, St. Louis, MO, USA) and 1% penicillin/streptomycin (Gibco, Carlsbad, CA, USA) and maintained at 37 °C in a humidified atmosphere with 5% CO[2]. To assess the cytotoxic effect of compounds, LN229 and U251 cell lines were subjected to a CCK8 assay. Cells were seeded into 96-well plates at a density of 2–5 × 10^3 cells/well and allowed to adhere and grow overnight. After treatment with different concentrations of clofarabine and torin-1 (MedChemExpress, Shanghai, China) for 48 h, cell viability was detected by adding 10 µL CCK8 reagent (GLPBIO, Montclair, CA, USA) to each well and another 1 h normal culture and the optical density (OD) was measured at 450 nm using a Microplate Reader (Bio-Rad, Hercules, CA, USA). Cell proliferation inhibition rates were calculated to assess the impact of different treatments on cell viability, using the formula: cell proliferation inhibition rate = 100% × [mean OD value of control group − mean OD value of treatment group]/mean OD value of control group. 4.14. Experiment and Analysis of Cell Cycle The cell cycle was assessed using the Cell Cycle Staining Kit (MULTI SCIENCES). LN229 cells and U251 cells were seeded into 6cm dishes with 2–5 × 10^6 cells/well and allowed to adhere and grow overnight. Subsequently, the cells were treated with 3 μM of the respective drugs. After a 24-hour incubation period with the drugs, the cells were collected and centrifuged to remove the supernatant, and the cell pellet was resuspended and washed with PBS (Gibco). Single-cell suspensions were then stained with 1 mL of propidium iodide (PI) staining solution and 10 μL of permeabilization solution at 4 °C for 30 min in a final volume of 200 μL. The labeled cells were analyzed using flow cytometry, and the results were presented as the percentage of cells in each phase of the cell cycle. The distributions of cell populations in cell cycle phases between the control and drug-treated groups were assessed using the Chi-square test via the online tool Epitools ([192]https://epitools.ausvet.com.au/chisq (accessed on 27 August 2024)). 4.15. Statistical Analysis All statistical analyses and figures were performed using R language (version 4.1.3). Student’s t-test or Wilcox test was used to determine the significance of differences between groups. Kaplan−Meier survival analysis was used to assess the statistical significance between two groups using R packages (survival and ggplot2). Pearson or Spearman correlation was used to measure the correlation between two variables, which were calculated by function cor.test() in R or JASP tool [[193]53]. The details of the source and version of the algorithms and software involved in this study are listed in [194]Table S5. 5. Conclusions In conclusion, we constructed a gene co-expression network for gliomas and identified risk assessment characteristics of 11 hub genes. This prognostic model demonstrates good efficiency in predicting the prognosis of glioma patients. The high-risk scores of these characteristics predict immune suppression and evasion capabilities in gliomas, potentially serving as prognostic indicators for immune checkpoint blockade therapy. Interestingly, we identified two small molecule inhibitors targeting the risk feature genes, demonstrating effective inhibition of glioma cell activity and cell cycle progression, providing new insights for glioma treatment. Acknowledgments