Abstract Spinal cord injury (SCI) is a severe neurological disorder, with glucocorticoids like methylprednisolone commonly used for treatment. However, their efficacy and risks remain controversial. Programmed cell death (PCD) mechanisms have been increasingly implicated in SCI pathology. This study aimed to identify differentially expressed genes (DEGs) related to glucocorticoid receptors and PCD and to construct a diagnostic model to guide glucocorticoid use in SCI treatment. SCI datasets ([38]GSE5296, [39]GSE47681, [40]GSE151371, and [41]GSE45550) were analyzed using protein-protein interaction networks, consensus clustering, GSVA for PCD pathway enrichment, and WGCNA. A total of 113 diagnostic models were developed through 12 machine learning algorithms, with the optimal model, “Lasso + Stepglm[both],” featuring six genes: Abca1, Cdh1, Glipr1, Glt8d2, Il10ra, and Pde5a. Validation through qRT-PCR confirmed the differential expression of four genes (Abca1, Glipr1, Il10ra, and Cdh1), which demonstrated strong predictive performance. Pathway enrichment of GRRDEGs was analyzed using GO, KEGG, and Bayesian network methods, and immune cell infiltration was assessed via CIBERSORT. In this study, we identified GR- and PCD-related DEGs in SCI and constructed a diagnostic model that may improve understanding of SCI molecular mechanisms and inform future investigations of glucocorticoid use. Supplementary Information The online version contains supplementary material available at 10.1038/s41598-025-08060-9. Subject terms: Computational biology and bioinformatics, Neuroscience, Biomarkers Introduction Spinal cord injury (SCI) is a critical neurological condition that frequently results in motor and sensory impairments and, in severe cases, paralysis, profoundly affecting quality of life and societal productivity^[42]1. Despite advances in clinical interventions such as surgical decompression, pharmacotherapy, and rehabilitation, achieving meaningful neural functional recovery remains a substantial challenge^[43]2. To date, no universally effective therapeutic strategy has been developed to reverse or fully repair SCI-induced functional deficits, highlighting the urgent need to explore novel treatment approaches. Glucocorticoids, particularly methylprednisolone (MP), are among the most commonly used pharmacological agents for SCI treatment. Administered intravenously within 8 h post-injury, MP is believed to exert therapeutic effects by improving microcirculation, controlling inflammatory responses, inhibiting neuronal necrosis, and enhancing neuronal activity^[44]3. These effects can contribute to the recovery of neurological function in SCI patients. However, the clinical application of MP remains controversial due to variability in patient response and potential adverse effects, such as gastrointestinal bleeding, sepsis, pulmonary infections, steroid-induced myopathy, and delayed wound healing^[45]4. As a result, MP therapy is not universally recommended as a standard treatment for SCI, and many clinicians advocate its selective use based on individualized assessment^[46]5. Recent studies have highlighted the significant role of programmed cell death (PCD) mechanisms-such as necrosis, apoptosis, and autophagic cell death-in SCI pathology. Necrosis is often associated with exacerbation of inflammation, further aggravating spinal tissue damage^[47]6, while apoptosis involves activation of signaling pathways that lead to the programmed death of neurons and glial cells^[48]7. Emerging evidence also points to the role of autophagic cell death in neuronal injury^[49]8. These complex and interconnected cell death mechanisms present a challenge for understanding the pathological processes underlying SCI. While glucocorticoids have shown therapeutic potential, their precise mechanisms of action remain unclear, particularly concerning their impact on PCD pathways. In recent years, machine learning technology has shown significant potential in SCI classification and mechanism analysis. Many studies have constructed diagnostic models by integrating multimodal data. For instance, a study utilized^[50]9 the random forest algorithm to conduct a joint analysis of transcriptome data and immune infiltration characteristics, dividing the SCI mouse model into inflammatory and neurodegenerative subgroups, revealing the dynamic changes in the M1/M2 macrophage ratio. However, it did not delve deeply into the interaction between glucocorticoid receptors and the programmed cell death pathway. Another study^[51]10 screened out the hub genes related to the pathological process of SCI by combining the support vector machine with the WGCNA co-expression module. However, its model construction relied on single-modal data and lacked cross-species validation. In addition, some studies^[52]11 have identified molecular subtypes closely related to nerve injury by integrating pyroptosy-related genes and immune infiltration analysis, but the verification of the biological functions of key genes is insufficient. In response to these challenges, this study innovatively combines the following strategies: First, multi-omics data integration: Jointly analyze cross-species GEO datasets (mice, rats, and humans), remove batch effects through sva packages, and use WGCNA to identify co-expression modules strongly correlated with SCI phenotypes, overcoming the limitations of a single data type. Second, machine learning algorithm optimization: A combination of 12 algorithms was adopted to construct 113 models and conduct ten-fold cross-validation. Eventually, the optimal model was selected, significantly enhancing the classification robustness. Thirdly, The expression differences of key genes were verified by qRT-PCR, and the regulatory relationship between them and the immune microenvironment was analyzed by CIBERSORT to ensure the biological reliability of the results. Building on this framework, we systematically applied bioinformatic analyses-including protein-protein interaction networks, consensus clustering, gene set variation analysis (GSVA), and WGCNA—to identify glucocorticoid receptor- and PCD-related differentially expressed genes (GRRDEGs) in SCI. To further validate these findings, a mouse model of spinal cord injury was established, enabling functional assessment of candidate biomarkers in vivo. Materials and methods Data acquisition The SCI datasets [53]GSE5296, [54]GSE47681, [55]GSE151371, and [56]GSE45550 were downloaded from the Gene Expression Omnibus (GEO) database^[57]12 ([58]https://www.ncbi.nlm.nih.gov/geo/) via the R package GEOquery^[59]13. Detailed information is provided in Table [60]1. Table 1. GEO microarray chip Information. [61]GSE5296 [62]GSE47681 [63]GSE45550 [64]GSE151371 Platform [65]GPL1261 [66]GPL1261 [67]GPL1355 [68]GPL20301 Species Mus musculus Mus musculus Rattus norvegicus Homo sapiens Tissue Spinal cord Spinal cord Spinal cord Peripheral white blood cells Samples in SCI group 54 25 24 38 Samples in control group 42 9 6 10 Reference PMID: 35706450 PMID: 35706450 PMID: 26380273 PMID: 33512429 [69]Open in a new tab GEO, Gene Expression Omnibus; SCI, Spinal Cord Injury. Glucocorticoid receptor-related genes (GRRGs) were retrieved from the GeneCards database^[70]14 ([71]https://www.genecards.org/) using “glucocorticoid receptor” as the search term, filtered by “protein coding” and a relevance score > 2, resulting in 352 GRRGs. These genes were converted to their mouse counterparts using the R package homologene, yielding 335 mouse GRRGs. An additional 492 mouse GRRGs were sourced from the UniProt database^[72]15 ([73]https://www.uniprot.org/). After merging the datasets and removing duplicates, a final set of 761 GRRGs was compiled, as summarized in Table S1. Key regulatory genes for 13 PCD pathways were collected from a published study^[74]16, resulting in a total of 884 PCD-related genes. The final list of genes is available in Table S2. To eliminate batch effects, the [75]GSE5296 and [76]GSE47681 datasets were combined using the R package sva^[77]17. The resulting integrated dataset consisted of 79 SCI samples and 51 control samples. Standardization was performed with the R package limma^[78]18, followed by probe annotation and normalization. Principal component analysis (PCA)^[79]19 was conducted to validate the effectiveness of batch effect removal, as shown in Fig. S1. Glucocorticoid receptor (GR)-related differentially expressed genes (GRRDEGs) in SCI Samples from the integrated GEO dataset were categorized into SCI and control groups. Differential expression analysis was conducted using the R package limma, with thresholds set at |logFC| > 0.5 and p-value < 0.05 for identifying differentially expressed genes (DEGs). The Benjamini‒Hochberg (BH) method was used for p value correction. Volcano plots depicting differential expression results were generated via the R package ggplot2. Glucocorticoid receptor-related differentially expressed genes (GRRDEGs) were identified by intersecting the DEGs with GRRGs. A Venn diagram was created to illustrate the overlap, and the expression of GRRDEGs was visualized through heatmaps generated using the R package pheatmap. Functional analysis of GRRDEGs in SCI A genomic map featuring 24 chromosomes was created using the RCircos package. Functional roles of GRRDEGs were evaluated through Kyoto Encyclopedia of Genes and Genomes (KEGG)^[80]20–[81]22 and Gene Ontology (GO) analyses, focusing on biological processes (BPs), cellular components (CCs), and molecular functions (MFs). Key genes were further analyzed using the Metascape platform. A protein-protein interaction (PPI) network for the key genes was constructed via the STRING database^[82]23 ([83]https://string-db.org/) with an interaction score > 0.400. The network was visualized in Cytoscape (version 3.9.0), and potential hub genes were identified using five CytoHubba algorithms^[84]24,[85]25: maximal clique centrality (MCC), degree, stress, edge percolated component (EPC), and closeness^[86]26. Based on their scores, the top 20 GRRDEGs were selected, and a Venn diagram was generated to highlight overlapping genes for further analysis. Immune cell infiltration analysis CIBERSORT^[87]27, a method based on linear support vector regression, was applied to deconvolute the transcriptome expression matrix and estimate the composition and abundance of immune cells in mixed cell populations. Using the CIBERSORT algorithm with a mouse-specific gene signature matrix, immune cell infiltration scores were calculated. The proportions of immune cells were visualized in a bar plot. The differences in the proportions of infiltrating immune cells between the SCI and control samples were visualized via the R package ggplot2, and only those results with p values < 0.05 were retained. Immune cells with significant differences in proportions were selected for subsequent analysis. Spearman’s correlation analysis was performed to assess the relationships between immune cells proportions, and a heatmap of these correlations was created via the R package pheatmap. The correlations between the expression levels of hub genes and the proportions of infiltrating immune cells were similarly calculated and visualized. Unsupervised clustering of SCI samples Unsupervised clustering of the 79 mouse samples was performed using the expression profiles of GRRDEGs and the R package ConsensusClusterPlus^[88]28, applying the partitioning around medoids (PAM) algorithm. The optimal number of clusters was determined based on a consistency score (> 0.9), the cumulative distribution function (CDF) curve, and consensus matrix assessment, with a maximum of six subtypes considered. GSVA of 13 cell death pathways GSVA^[89]29 of the expression matrix was conducted via the R package GSVA. With this analysis, enrichment scores were calculated for each gene set, reflecting the activity of different cell death pathways across samples. Default parameters, such as Gaussian kernel density estimation, were used. A GSVA score matrix for all samples was obtained, with rows representing the 13 cell death pathways and columns representing the samples. WGCNA The WGCNA algorithm^[90]30 was employed to identify potential key genes across the results. Suitable thresholds were determined to construct weighted adjacency and topological overlap matrices (TOMs). Gene modules were analysed on the basis of their correlation with target phenotypes. Module membership (MM) represented the correlation between genes and module eigengenes, whereas gene significance (GS) reflected the correlation between genes and target phenotypes. The WGCNA results were intersected to identify key genes within coexpression modules for subsequent validation and analysis. Bayesian pathway enrichment analysis To infer the pathway interaction network, the CBNplot package^[91]31 was used. Sequence data from the 79 SCI mice in the combined dataset were used as a background to calculate interaction directions. The bngeneplot function was utilized to generate graphical representations, with a filtering standard set to 0.90. ML model construction To construct diagnostic models, 12 machine learning (ML) algorithms were employed: Lasso, Ridge, StepGLM, XGBoost, random forest (RF), elastic net (Enet), partial least squares regression for generalized linear models (plsRglm), gradient boosting machine (GBM), naive Bayes, linear discriminant analysis (LDA), glmBoost, and support vector machine (SVM). A total of 113 combinations of these algorithms were tested for variable selection and model development using the training dataset (integrated GEO dataset) with tenfold cross-validation. To mitigate overfitting risks, Lasso regularization was applied during feature selection to reduce model complexity by penalizing non-informative variables. The meta-cohort ([92]GSE151371 and [93]GSE45550) served as an external test set to validate model performance. The model achieving the highest average area under the receiver operating characteristic (ROC) curve (AUC) across both the training and test sets was selected as the best-performing model^[94]32. Prediction model construction A nomogram model was constructed with the R package rms^[95]33 using four key predictive genes to evaluate SCI occurrence. The “Total Points” represent the aggregate of the individual points assigned to each predictive variable. Decision curve analysis (DCA) and calibration plots were used to assess the prediction performance of the model. Group comparison plots based on hub gene expression levels were created to further explore the differences between SCI and control samples in the integrated GEO dataset. Animal study Healthy female Kunming mice, 7–8 weeks old and weighing 30–35 g, were housed under standard conditions and weres provided by Slike Jingda Laboratory Animal Co. Ltd. All animal procedures were conducted in compliance with the guidelines of the International Association for the Study of Pain (IASP) and were approved by the Ethics Committee of Gannan Medical University (Approval No. 2024 − 228). The SCI group and control group each consisted of 6 mice. Establishment of the SCI mouse model Anaesthesia was induced with a 1% sodium pentobarbital solution (40 mg/kg) via intraperitoneal injection. The mice were placed on a heating pad during the procedure. After the skin incision was made, soft tissue at the level of the eighth to twelfth thoracic vertebrae (T8-T10) was separated. A steel clip was inserted under the transverse process, and a laminectomy was performed at T9. The dorsal spinal cord was transected to a depth of approximately 2 mm. Haemostasis was achieved with the use of gelatine sponges. Sham-operated mice underwent the same procedures except for spinal cord transection. Tissue samples for further experiments were collected one week postsurgery from an 8–10 mm segment surrounding the injury site.The BMS locomotor scores and mechanical pain threshold of mice over 7 days are shown in Fig. S2. RNA isolation and quantitative real-time polymerase chain reaction (qRT–PCR) RNA was extracted using TRIzol reagent (TransGen Biotech, Beijing, China). OPN cDNA was synthesized from total RNA using the FastQuant RT Kit (TIANGEN, Shanghai, China). Quantitative real-time PCR (qRT-PCR) was carried out with SYBR Premix ExTaq (TaKaRa) following the manufacturer’s instructions. GAPDH served as the internal control, and the primer sequences are provided in Table S3. Relative gene expression levels were calculated via the 2^−ΔΔCt method. Statistical analysis All data processing and analyses were conducted using R software (version 4.3.0). Unless otherwise specified, the statistical significance of differences between two groups for normally distributed variables was assessed using an independent Student’s t-test. For non-normally distributed variables, the Mann-Whitney U test (Wilcoxon rank-sum test) was applied. Comparisons involving three or more groups were analyzed with the Kruskal-Wallis test. Spearman correlation analysis was performed to calculate correlation coefficients between gene expression levels. Unless otherwise noted, all statistical tests were two-sided, and a p-value < 0.05 was considered statistically significant. Software and package versions. All computations were performed in R (R Foundation for Statistical Computing, v4.3.0, [96]https://www.r-project.org). Gene-expression matrices were retrieved with GEOquery (Gene Expression Omnibus query, v2.69., ([97]https://bioconductor.org/packages/GEOquery) and batch effects were removed using sva (Surrogate Variable Analysis, v3.46.0, [98]https://bioconductor.org/packages/sva). Differential expression analysis employed limma (Linear Models for Microarray & RNA-seq Data, v3.58.1, [99]https://bioconductor.org/packages/limma); pathway activity scores were calculated with GSVA (Gene Set Variation Analysis, v1.50.0, [100]https://bioconductor.org/packages/GSVA); and weighted gene co-expression networks were built with WGCNA (Weighted Gene Co-expression Network Analysis, v1.72-3, [101]https://cran.r-project.org/package=WGCNA). Unsupervised consensus clustering used ConsensusClusterPlus(v1.72.0, [102]https://bioconductor.org/packages/ConsensusClusterPlus). Machine-learning models were trained with randomForest (Breiman & Cutler Random Forests, v4.7-1.1), glmnet (Lasso / Elastic-Net GLMs, v4.1-9), xgboost (Extreme Gradient Boosting, v1.7.6), gbm (Generalized Boosted Regression Models, v2.1-8), mboost (Model-Based Boosting, v2.9-7), plsRglm (Partial Least Squares for GLM, v1.2.4), e1071 (Miscellaneous Functions of TU Wien, v1.7-16) and MASS (Modern Applied Statistics with S, v7.3-60) - all from CRAN. Receiver-operating-characteristic curves were drawn with pROC (v1.18.5), heatmaps with pheatmap (v1.0.12), circular ideograms with RCircos (v1.2.1) and general graphics with ggplot2 (Grammar of Graphics for R, v3.5.0). Bayesian pathway inference relied on CBNplot (v0.99.7, [103]https://bioconductor.org/packages/CBNplot); nomograms were produced with rms (Regression Modeling Strategies, v6.6-1). Protein-protein interaction networks were queried from STRING (v11.5, [104]https://string-db.org) and visualized in Cytoscape (v3.9.0, [105]https://cytoscape.org). Immune-cell infiltration was estimated with CIBERSORTx (v1.0, [106]http://cibersortx.stanford.edu), and functional enrichment of gene lists was performed on Metascape (v3.5, [107]https://metascape.org). Results Technical roadmap The workflow of this study comprises four steps: (1) data acquisition and batch-effect correction; (2) bioinformatic screening of GR/PCD-related differentially expressed genes; (3) construction of a machine-learning model with external validation; and (4) qRT-PCR validation in a mouse SCI model (Fig. [108]1). Fig. 1. [109]Fig. 1 [110]Open in a new tab Technology roadmap. DEGs, Differentially Expressed Genes; GRRGs, Glucocorticoid receptor-Related Genes; GRRDEGs, Glucocorticoid receptor-Related Differentially Expressed Genes; GO, Gene Ontology; GSVA, Gene Set Variation Analysis; KEGG, Kyoto Encyclopedia of Genes and Genomes; ROC, Receiver Operating Characteristic; SCI, spinal cord injury; PPI, Protein-protein Interaction; WGCNA, Weighted Gene Co-expression Network Analysis; GRRDEGs in SCI Differential expression analysis of the integrated GEO dataset identified 457 DEGs, including 257 upregulated and 200 downregulated genes. A volcano plot was generated to display the results (Fig. [111]2A). Further analysis of GRRDEGs expression differences in the integrated dataset was performed. The results were visualized using a volcano plot (Fig. [112]2B) and a heatmap (Fig. [113]2C), both generated with the R package pheatmap. Fig. 2. [114]Fig. 2 [115]Open in a new tab Differential gene expression analysis. (A) Volcano plot illustrating the differential expression analysis of SCI and Control samples in the combined GEO datasets. (B) Volcano plot showing the analysis of GRRDEGs in the integrated GEO dataset. (C) Heat map displaying GRRDEGs in the integrated GEO datasets. PPI network construction and GO/KEGG pathway enrichment analysis A PPI network of 54 GRRDEGs was constructed via the STRING database (Fig. [116]3A). The top 20 GRRDEGs, on the basis of the five algorithm scores, were intersected and analysed via a Venn diagram (Fig. [117]3B). GO and KEGG pathway enrichment analyses were conducted to investigate the BP, CC, MF, and KEGG pathways associated with the 17 GRRDEGs and their relevance to SCI. (Table S4). These genes were associated with functions such as protein kinase activator activity, interleukin-1 receptor binding, kinase regulator activity, and cytokine activity. KEGG pathway analysis indicated that these genes were significantly enriched in the PI3K-Akt, NF-κB, MAPK, IL-17, and TNF signaling pathways. The results of the GO and KEGG enrichment analyses were visualized using bar plots (Fig. [118]3C) and bubble charts (Fig. [119]3D). Fig. 3. [120]Fig. 3 [121]Open in a new tab PPI interaction network and GO/KEGG enrichment analysis. (A) The PPI network of GRRDEGs calculated using the STRING database. (B) Venn diagram of the top 20 GRRDEGs from five algorithms in the CytoHubba plugin. (C–D). Bar plot (C) and bubble plot (D) showing the results of GO and KEGG enrichment analysis of GRRDEGs. Pathway diagram adapted from KEGG (Kyoto Encyclopedia of Genes and Genomes)^[122]20–[123]22. Dysregulation of GRRDEGs and immune cell activation in SCI samples The expression levels of 17 GRRDEGs were compared between control and SCI samples (Fig. [124]4A), revealing significant upregulation of genes such as Ccl5, Ccnd1, Cebpa, Cebpb, Egr1, Fos, Icam1, Igf1, Il1a, Il1b, IL6, Jun, Myc, Nfe2l2, Ptgs2, Tgfb1, and Tnf in SCI samples (p < 0.05). Next, the expression patterns of these genes between the spinal cord injury group and the control group, as well as their locations on the chromosomes(Fig. [125]4C), were visualized using a heat map(Fig. [126]4B). We then constructed a gene coexpression network to explore interactions between these genes (Fig. [127]4D), with the chord diagram highlighting the key connections, especially among Il1b, Tnf, and Jun. Correlation analysis quantified the relationships between these genes (Fig. [128]4E), with Ptgs2 and Cebpb showing the strongest positive correlation, whereas Il6 and Igf1 had the strongest negative correlation. These results suggest significant positive and negative correlations among the genes, supporting their synergistic or antagonistic roles in SCI. Fig. 4. [129]Fig. 4 [130]Open in a new tab Identification of GRRDEGs in spinal cord injury. (A) Box plot showing the expression profiles of 17 GRRDEGs. (B) Heatmap depicting the expression patterns of 17 GRRDEGs between SCI and Control samples. (C) Chromosomal locations of the 17 GRRDEGs. (D) Gene relationship circle diagram for the 17 GRRDEGs, with violet lines indicating positive correlations and green lines indicating negative correlations. (E) Relationships among the 17 GRRDEGs. Significance levels were indicated by *p < 0.05, **p < 0.01, ***p < 0.001. To further investigate immune system changes in SCI, immune cell infiltration analysis was conducted. The relative proportions of immune cells were visualized via a stacked bar plot (Fig. [131]5A). Boxplots revealed significant differences in the proportions of infiltrating immune cells between the control and SCI groups (Fig. [132]5B). Notably, the proportion of M2 macrophages was significantly higher in the SCI group, while M0 macrophages showed a reduced proportion. Correlation analysis of infiltrating immune cell types (Fig. [133]5C) identified significant relationships, such as a positive correlation between M2 macrophages and T cells CD4 memory resting, and a negative correlation between M0 macrophages and M2 macrophages. Further analysis of the relationships between immune cell proportions and GRRDEG expression (Fig. [134]5D) showed that Il1b expression had the strongest positive correlation with activated mast cells (p < 0.0001), while Myc expression was negatively correlated with follicular helper T cells (p < 0.0001). These findings highlight the potential immunoregulatory roles of GRRDEGs in SCI. Fig. 5. [135]Fig. 5 [136]Open in a new tab Combined datasets immune infiltration analysis by CIBERSORT algorithm. (A–B) The proportion of immune cells in theCombined Datasets bar graph (A) and group comparison graph (B). (C) Correlation heatmap showing immune cell infiltration abundance in the combined datasets. (D) Bubble plot illustrating the correlation between GRRDEGs and immune cell infiltration abundance in the combined datasets. * represents p value < 0.05, statistically significant; ** represents p value < 0.01, highly statistically significant; *** represents p value < 0.001 and highly statistically significant. In the proportion bar chart and group comparison chart, violet is SCI samples, and green is Control samples. green is negative correlation, violet is positive correlation, and the depth of color represents the strength of correlation. Characterization of GR-related clusters in SCI Clustering analysis of the 17 GRRDEGs expression profiles revealed the most stable and distinct clustering results when k = 2 (Fig. [137]6A). To determine the optimal k, we analysed changes in the area under the CDF curve (Fig. [138]6B) for k values from 26, revealing relative stability at k = 2. The consensus index distribution (Fig. [139]6C) further supported k = 2, as the consensus scores for each subtype were significantly greater than 0.9 at this value (Fig. [140]6D). PCA was used to divide the 79 SCI samples into two subtypes, Cluster 1 (C1, N = 49) and Cluster 2 (C2, N = 30), with distinct differences in GRRDEG expression levels (Fig. [141]6E). Fig. 6. [142]Fig. 6 [143]Open in a new tab Consensus clustering based on GRRDEGs expression matrices. (A) Consensus clustering matrix for k = 2. (B) Relative change in the area under the CDF delta curves. (C) Cumulative distribution function (CDF) for consensus clustering at k = 2–6. (D) Consensus clustering scores. (E) PCA plot of the two clusters, with each scatterplot point representing a sample based on the two principal components (PC1 and PC2) derived from GRRDEG expression profiles. Immunological and functional pathway differences between clusters To investigate the molecular features of C1 and C2, key gene expression differences were analysed, and significantly different patterns between the clusters were observed (Fig. [144]7A). C1 presented elevated expression levels of Tgfb1, Igf1, Cebpa, and Nfe2L2, whereas C2 presented increased expression levels of IL6, Tnf, IL1b, Lcam1, Jun, Fos, Myc, Ptgs2, Cebpb, and Egr1 (Fig. [145]7B). Immune cell infiltration analysis (Fig. [146]7C) revealed significant differences between the clusters, with a notable increase in the proportion of infiltrating M1 macrophages in C1. Correlation analysis between GRRDEGs expression and immune cell proportions (Fig. [147]7D) showed that Fos expression was strongly positively correlated with activated mast cells (cor = 0.814) and significantly negatively correlated with resting dendritic cells (cor = -0.65). Fig. 7. [148]Fig. 7 [149]Open in a new tab Gene abundances and immune characteristics of GRRDEGs. (A) and (B) display the expression patterns of 17 GRRDEGs across two distinct glucocorticoid receptor-related phenotypes using boxplots and heatmaps, respectively. (C) Boxplot showing the proportions of 22 immune-infiltrated cell types analyzed via the CIBERSORT algorithm. (D) Bubble plot illustrating the correlation between GRRDEGs and immune cell infiltration abundance. Significance levels were indicated by *p < 0.05, **p < 0.01, ***p < 0.001. Bayesian pathway enrichment analysis of 14 GRRDEGs with differential expression between clusters revealed significant enrichment in immune-related pathways, such as the signalling by interleukins, interleukin-4 (IL-4), and interleukin-13 (IL-13) signalling pathways (Fig. [150]8A–C). Fig. 8. [151]Fig. 8 [152]Open in a new tab Bayesian network enrichment analysis. Bar plot (A) and bubble plot (B) showing the results of Bayesian network enrichment analysis of GRRDEGs. (C) Bayesian network inference of the pathway regulatory network of GRRDEGs. Each node represents a pathway; the redder and larger the node, the higher the gene expression value. The thicker the line, the stronger the regulatory effect. PCD pathway activity and key co-expression modules To further investigate the activation status of various cell death pathways in SCI, enrichment scores for 13 PCD-related pathways were calculated using GSVA. The results indicated increased activity in eight PCD pathways, including akaliptosis, apoptosis, disulfidptosis, ferroptosis, lysosome-dependent cell death, necroptosis, oxidative stress-induced cell death, and pyroptosis. Conversely, decreased activity was observed in three pathways: autophagy, cuproptosis, and parthatos (Fig. [153]9A). Seven coexpression modules were identified with the brown and turquoise modules showing a high correlation with the cluster traits (Fig. [154]9B). Fig. 9. [155]Fig. 9 [156]Open in a new tab GSVA analysis of PCD pathways and Identification of the critical gene modules. (A) GSVA analysis of 13 PCD pathways displayed as box plots comparing Control and SCI groups. (B) Module-trait correlations, with columns representing traits and rows representing module eigengenes. (C) The correlation between cluter and module eigengenes. Each column matches a trait, and each row corresponds a module eigengene. Each unit cell includes the corresponding p-value and correlation. SCI, Spinal Cord Injury. With the use of the WGCNA algorithm, a gene coexpression network was constructed to analyse gene expression data from patients with SCI and controls, with a focus on the top 25% most highly variable genes. The soft-threshold power was set to 2, achieving a scale-free network topology with R^2 = 0.9 and high average connectivity. Correlation analysis between modules and clinical traits revealed that the brown module, containing 855 genes, had the highest correlation with SCI (cor = 0.58, p = 6e-13) (Fig. S3). For the GR-related gene clusters, the brown module, containing 996 genes, had the highest correlation with the cluster trait (cor = 0.74, p = 1e-14) (Fig. [157]9C). Identification of core genes and development of diagnostic models The module genes most correlated with SCI and clusters were intersected with the PCD-related brown and turquoise module genes, resulting in 5 and 32 genes, respectively, for a total of 37 genes (Fig. S4). We combined 12 ML algorithms under a tenfold cross-validation framework to develop the most robust diagnostic model on the basis of 37 intersecting genes. The AUC values of different combinations were visualized via a heatmap (Fig. S5), where the best model, Lasso + Stepglm[both]. The volcano plot displayed the included genes, with the six genes from the optimal model specifically annotated. The results showed that Glt8d2 and Pde5a exhibited no differential expression, while Abca1, Cdh1, Glipr1, and Il10ra were upregulated (Fig. [158]10A), We refer to these four genes as glucocorticoid receptor and PCD-related differentially expressed genes(GRPRDEGs). The ROC curve analysis indicated that Glipr1 had the highest diagnostic value, with an AUC of 0.906, Abca1 followed closely, with an AUC value of 0.8 (Fig. [159]10B). Finally, qRT-PCR results showed that, compared to the control group, the mRNA expression levels of Abca1, Glipr1, and Il10ra were significantly increased, while Cdh1 expression was decreased. In contrast, no statistically significant differences were observed for Glt8d2 and Pde5a (Fig. [160]10C). Fig. 10. [161]Fig. 10 [162]Open in a new tab Diagnostic performance of our model. (A) Volcano plot of the included genes. (B) ROC curves for the included genes. (C) qRT-qPCR validation of GRPRDEG mRNA expression. (D) Nomogram predicting the risk of SCI based on GRPRDEGs (Abca1, Glipr1, Il10ra). (E–G) ROC curves evaluating the prediction model in the combined dataset, [163]GSE151371 dataset, and [164]GSE45550 dataset. A nomogram model was developed based on the GRPRDEGs (Abca1, Glipr1, and Il10ra) (Fig. [165]10D). Decision curve analysis (DCA) and calibration curve analysis confirmed the high predictive accuracy of the model (Fig. [166]6). The model’s performance was validated across both the training and external datasets, achieving AUCs of 0.920 in the combined datasets, 0.858 in [167]GSE151371, and 0.772 in [168]GSE45550 (Fig. [169]10E–G). Discussion SCI is a highly complex and severe neurological condition that is often accompanied by long-term functional impairments and a significant reduction in quality of life. Glucocorticoids have become a common choice for treating acute SCI because of their ability to regulate inflammatory responses, inhibit neuronal necrosis, and increase microcirculation^[170]2. However, despite the demonstrated clinical benefits of glucocorticoids, their precise molecular targets in SCI remain unclear. This ambiguity limits their potential in personalized treatment. Binds molecules associated with PCD, identifying molecular targets could lead to the optimization of therapeutic strategies, increase glucocorticoid efficacy, and reduce the risk of unnecessary complications. In future clinical applications, the expression levels of key GRRDEGs such as Abca1, Glipr1 and Il10ra, could serve as biomarkers to predict a patient’s response to glucocorticoid treatment. By measuring these biomarkers in patient blood samples, clinicians may be able to selectively administer glucocorticoids to those most likely to benefit, thereby optimizing treatment and minimizing potential side effects. Case reports and clinical trials would be valuable in further validating these biomarkers and their clinical applicability. Notably, during rigorous model refinement, we excluded Cdh1 from the predictive signature due to discordant expression trends between RNA-seq and qRT-PCR validation. Post-revision analyses demonstrated preserved diagnostic accuracy in the training set (AUC = 0.920 vs. 0.920 pre-exclusion), while external validations showed modest yet statistically significant reductions in [171]GSE45550 (AUC = 0.711 vs. 0.772) and [172]GSE151371 (AUC = 0.834 vs. 0.858). Importantly, this exclusion enhanced the model’s biological plausibility by ensuring complete concordance between omics and experimental data - a critical prerequisite for clinical translation. Our findings indicated that GRRDEGs are significantly upregulated in SCI patients compared with controls, suggesting that these genes may play a role in the pathogenesis of SCI. KEGG pathway analysis revealed significant enrichment of GRRDEGs in the PI3K-Akt, NF-κB, MAPK, IL-17, and TNF signalling pathways. A large number of studies have proved that these pathways play an important role in the occurrence and development of SCI^[173]34–[174]46. We identified two distinct GR-related clusters on the basis of GRRDEGs and investigated differences in immune characteristics and BPs among these clusters. Our analysis identified distinct differences in the immune microenvironment between SCI and control samples. The proportion of infiltrating M2 macrophages was significantly increased in the SCI group, whereas the proportion of infiltrating M1 macrophages was notably reduced in C2 compared with C1. Bayesian pathway enrichment analysis of GRRDEGs between the clusters revealed significant enrichment in signalling pathways such as signalling by interleukins, interleukin-4 (IL-4), and interleukin-13 (IL-13), which are key regulators of immune inflammation. Macrophages derived from bone marrow cells play critical roles in initiating, maintaining, and resolving inflammation^[175]47. M0 macrophages polarize into two phenotypes: M1 (proinflammatory) and M2 (anti-inflammatory). M1 macrophages increase antigen presentation and secrete IL-12 and IL-23^[176]48, whereas M2 macrophages promote cell proliferation, tissue repair, angiogenesis, and phagocytosis^[177]49. Studies have shown that the transition from M1 to M2 macrophages offers clinical benefits in neurological diseases. IL-4 induces M2 polarization and reduces neuronal apoptosis, limiting secondary injury after damage^[178]50,[179]51. Next, we used WGCNA to identify the module genes most closely associated with SCI and clusters, and then intersected these genes with PCD-related module genes, ultimately obtaining the intersecting genes associated with all three factors. To address personal bias and inherent preferences, we combined 12 ML algorithms and evaluated their