Abstract
Glioblastoma multiform (GBM) is a malignant central nervous system
cancer with dismal prognosis despite conventional therapies. Scientists
have great interest in using immunotherapy for treating GBM because it
has shown remarkable potential in many solid tumors, including
melanoma, non-small cell lung cancer, and renal cell carcinoma. The
gene expression patterns, clinical data of GBM individuals from the
Cancer Genome Atlas database (TCGA), and immune-related genes (IRGs)
from ImmPort were used to identify differentially expressed IRGs
through the Wilcoxon rank-sum test. The association between each IRG
and overall survival (OS) of patients was investigated by the
univariate Cox regression analysis. LASSO Cox regression assessment was
conducted to explore the prognostic potential of the IRGs of GBM and
construct a risk score formula. A Kaplan–Meier curve was created to
estimate the prognostic role of IRGs. The efficiency of the model was
examined according to the area under the receiver operating
characteristic (ROC) curve. The TCGA internal dataset and two GEO
external datasets were used for model verification. We evaluated IRG
expression in GBM and generated a risk model to estimate the prognosis
of GBM individuals with seven optimal prognostic expressed IRGs. A
landscape of 22 types of tumor-infiltrating immune cells (TIICs) in
glioblastoma was identified, and we investigated the link between the
seven IRGs and the immune checkpoints. Furthermore, there was a
correlation between the IRGs and the infiltration level in GBM. Our
data suggested that the seven IRGs identified in this study are not
only significant prognostic predictors in GBM patients but can also be
utilized to investigate the developmental mechanisms of GBM and in the
design of personalized treatments for them.
Keywords: glioblastoma, expression profile, immune-related genes,
prognosis prediction, overall survival
Introduction
Glioblastoma constitutes the most recurrent and aggressive primary
malignant tumor of the central nervous system ([33]Yan et al., 2012).
In spite of multimodal conventional treatments consisting of
neurosurgical resection as well as radiotherapy with accompanying
adjuvant alkylating agent temozolomide chemotherapy, the prognosis for
glioblastoma multiform (GBM) individuals remains dismal, with a median
survival time ranging from 9.4 to 19.0 months ([34]Yang et al., 2014).
This poor outcome is due to the highly invasive nature, malignant
progression, drug resistance, and tumor recurrence, which are regulated
by a large number of oncogenes and tumor suppressor genes ([35]Liu et
al., 2015; [36]Cao et al., 2019). Next-generation sequencing
technologies have made great progress recently, enabling scientists to
gain profound insights into the molecular level of GBM pathophysiology
([37]Aldape et al., 2015). As a consequence, many prospective
diagnostic and prognostic biosignatures have been discovered, which
enable a more distinct classification and a more precise outcome
estimation of GBM. Nonetheless, given the dismal prognosis of GBM, a
multiple-gene signature derived model is still urgently required to
estimate the prognosis and treatment response more accurately for GBM
patients.
The immune microenvironment has been chronicled to play a pivotal
function in tumor biology ([38]Hanahan and Weinberg, 2011), and cancer
immunotherapy has been demonstrated to have a significant preclinical
or clinical value to many patients with some sensitive types of cancer
([39]Schumacher and Schreiber, 2015; [40]Steven et al., 2016;
[41]Odunsi, 2017; [42]Morrison et al., 2018; [43]Christofi et al.,
2019). Increasing research evidence supports the idea that although the
brain constitutes an immunologically specific site, the immune
microenvironment provides ample opportunities for immunotherapy of
brain tumors ([44]Lim et al., 2018). Many kinds of immunotherapy,
including GBM vaccines, oncolytic viral therapies, immune-checkpoint
suppressors, and chimeric antigen receptor T cell therapy, have been
tested in clinical trials, but the results are not satisfactory. Tumors
are insensitive to immunotherapy due to the immunosuppressive tumor
microenvironment, defects in tumor antigen presentation, and
characteristics of the physical microenvironment, including hypoxia and
necrosis ([45]Lim et al., 2018; [46]Pombo Antunes et al., 2020). The
precise mechanism of immune escape is unclear. Glioblastoma usually has
a low mutational load and lower T cell invasion relative to other tumor
types ([47]Li et al., 2016). Thus, it is imperative to better
comprehend the progress and mechanisms of the GBM immune
microenvironment. Multiple recent studies have suggested that immune
gene expression profile biosignatures may be used as a prediction for
clinical outcomes in many cancers ([48]Bremnes et al., 2016;
[49]Campbell et al., 2017; [50]Öjlert et al., 2019). [51]Li et al.
(2017) created a personalized immune-related gene prognostic
biosignature to improve the prognosis of individuals with NSCLC.
In a previous study, a prognostic immune-related gene signature with
nine IRGs based on a total of 161 samples from the Cancer Genome Atlas
database (TCGA) was generated ([52]Liang et al., 2020), and the 9-IRG
model was identified as an independent predictor in glioblastoma. These
researchers established a crosstalk network between prognostic
immune-related genes (IRGs) and transcription factors. Correlations
between immune infiltration cells and risk score were also identified.
However, the potential molecular mechanisms were not clarified in their
study. Thus, it is necessary to elucidate the function of these genes
in the risk score and poor survival outcomes.
Here, we generated a seven immune-linked gene biosignature to exhibit
the connection between gene expression and GBM prognosis, and we
verified this biosignature in the TCGA and GEO dataset. These data may
provide a novel reference for the prognostic prediction of GBM. We also
confirmed the relevance of the seven IRGs to immune checkpoints, immune
cell infiltration, oncogenesis pathway, and drug sensitivity. As a
result, we not only generated a predictive model for GBM prognosis but
also indicated the potential function of these IRGs in the occurrence
and development of glioblastoma.
Materials and Methods
Data Sources and Preliminary Processing
The RNA-Seq data of 169 GBM samples and five normal brain samples, as
well as the clinical data of these GBM patients, such as age, gender,
molecular subtype, gene mutation status, survival time, and survival
status, were obtained from the TCGA dataset^[53]1. Additionally, the
GBM patients’ microarray and clinical data were collected from
independent datasets in the GEO database, including [54]GSE74187 (n =
60) and [55]GSE4412 (n = 59). These gene expression data were generated
and annotated on [56]GPL6480 or [57]GPL97 platform. The immune-related
gene set, including 2,498 genes, was downloaded from the ImmPort
database. The RNA-Seq and microarray data were normalized using scale
method, and the data were pre-processed through the following steps:
(1) patients with unavailable clinical and/or survival information were
removed, (2) only the expression profiles of IRGs were preserved, and
(3) genes with exceeding low abundance were filtered out (the
expression value was 0 in more than half of the samples, or the average
expression value was less than 0.3 in the samples). Finally, 1,100
genes were used for univariate Cox regression analysis and LASSO
analysis.
Differential Gene and Functional Enrichment Analysis
The expression analysis of 2,498 immune-linked genes was conducted to
identify the differentially expressed IRGs by the limma R package
[false discovery rate (FDR) < 0.05 and log[2] | fold change| > 1]
([58]Ritchie et al., 2015). We conducted functional enrichment analyses
to identify potential molecular biomechanisms of the differentially
expressed IRGs via GO analysis and KEGG pathways ([59]Yang et al.,
2018). GOplot package was used for illustrating the relationship
between genes and enriched KEGG pathways. Gene Set Enrichment Analysis
(GSEA) ([60]Mootha et al., 2003; [61]Subramanian et al., 2005) was
employed to examine the signaling cascades in which the IRGs were
enriched between the high- and low-risk subgroups.
Establishment of the Immune-Associated Gene Biosignature
The univariate Cox regression analysis was applied to investigate the
association between each IRG and OS of patients based on the TCGA
dataset. To build the immune-related risk model, the genes with p value
< 0.01 were considered as candidate survival-associated IRGs. The LASSO
regression model was used to determine the most significant
survival-correlated IRGs. First, the GBM patients in TCGA dataset were
randomly divided into training and internal validation cohorts at a 4:1
ratio, forming a training cohort (n = 134) and an internal validation
cohort (n = 33). The LASSO regression was employed based on 10-fold
cross-validation to minimize the risk of overfitting. LASSO tends to
“shrink” the regression coefficients to zero as λ increases. The
optimal λ that yielded minimum cross validation error in 10-fold cross
validation was chosen. The risk score was calculated by using the sum
of normalized expression weighted by the LASSO regression coefficients
([62]Zhong et al., 2020):
[MATH:
Risk score=EmRNA1×CmRNA1+EmRNA2×C
mRNA2+EmRNAn×CmRNAn :MATH]
where E designates the expression level of each gene; and C designates
the lasso regression coefficient of each gene.
The patients were separated into low- and high-risk groups according to
the median of the risk score. OS of the patients in the two groups was
analyzed by the log-rank test with “survival” package in R. Receiver
operating characteristic (ROC) curve and the corresponding area under
the ROC curve (AUC) were calculated to evaluate the prognostic value of
the risk score by using “ROC” package.
CIBERSORT and Assessment of Tumor-Infiltrating Immune Cells
CIBERSORT is a computational technique that predicts the cell type
signature in mix tissues through gene expression levels ([63]Newman et
al., 2015). Cell types can be identified using RNA mixtures in nearly
any tissue ([64]Yang et al., 2019). For this study, we employed
CIBERSORT to examine the 22 types of immune cells in tumor tissues and
show the percentages of 22 sets of tumor-infiltrating immune cells
(TIICs) with bar plots and a corheatmap.
Analysis of Immune Infiltration
To analyze the correlation between the risk signature and infiltrating
levels of six immune cells, including B cells, CD4+ T cells, CD8+ T
cells, neutrophils, macrophages, and dendritic cells, Spearman’s
correlation was calculated and the strength of correlation for the
absolute value of r was as follows: r between 0 and 0.3 indicates a
weak correlation; r between 0.3 and 0.7 indicates a moderate
correlation; r between 0.7 and 1.0 indicates a strong correlation
([65]Akoglu, 2018).
Statistical Analysis
Boxplot was generated using the “ggplot2” package in R. Heat map was
generated using the “pheatmap” package in R. A correlation analysis of
the seven immune genes was performed using the R “corrplot” package in
the Pearson’s method. Circular plot was generated using the “circlize”
package in R. Student’s t test was used to compare data from subgroups.
Pearson’s correlation test was used to analyze the correlation between
the IRGs signature and the expression of immune checkpoint genes. K-M
survival curves were compared using log-rank test. All statistical
analyses were conducted on R software (version 3.6.0). A p value of <
0.05 was considered to indicate significance. Other statistical methods
were described throughout the study.
Results
Identification of Differentially Expressed IRGs in GBM
The mRNA levels of 2,498 IRGs in GBM (n = 169) and normal brain tissues
(n = 5) from TCGA were compared via the Wilcoxon rank-sum test. In
total, 595 differentially expressed IRGs comprising 416 upregulated
genes and 179 downregulated genes were identified ([66]Supplementary
Table 1). The volcano plot and heat map of differentially expressed
IRGs are shown in [67]Figures 1A,B.
FIGURE 1.
[68]FIGURE 1
[69]Open in a new tab
Identification of differentially expressed IRGs between GBM and normal
brain tissues. (A) Volcano plots showing the log[2] (fold change) of
mRNA in GBM compared to normal brain tissues, and the
corresponding-log[10] (P value) in TCGA datasets. Genes with adjusted P
value below 0.05 and fold change above one (below –1) were marked with
red (green) dots. (B) Heatmap of the differentially expressed IRGs in
TCGA datasets.
Functional Characterization of DEIRGs
The gene functional enrichment assessment showed that immune responses
were the most common. The most significant biological terms were
“regulation of leukocyte activation,” “plasma membrane protein complex”
and “receptor ligand activity” among biological processes, cellular
components, and molecular functions, respectively ([70]Figure 2A). With
regard to the KEGG cascades, most of signaling cascades were linked to
immune reactions, and cytokine-cytokine receptor crosstalk was the most
significantly enriched term ([71]Figure 2B). For better visualization,
two heatmaps of these values were plotted using the logFC, including
one for GO terms ([72]Figure 2C) and the other for KEGG pathways
([73]Figure 2D). Some GO terms and KEGG cascades were linked to certain
immune processes.
FIGURE 2.
[74]FIGURE 2
[75]Open in a new tab
GO terms and Enrichment of KEGG pathways for differentially expressed
IRGs. (A) GO biological process analysis for the immune-related DEGs.
(B) KEGG pathway enrichment analysis for the immune-related DEGs. (C)
Heatmap of the GO terms by logFC. (D) Heatmap of the KEGG pathways by
logFC.
Identification of Prognostic Genes
The univariate Cox regression model was applied to select IRGs with the
patient OS, and a total of 15 IRGs were discovered to be significantly
associated with OS (p < 0.01). These genes were subjected to the LASSO
regression analysis to calculate the correlation coefficients. The
signature performed best when only seven genes were included
([76]Figures 3A,B). For this analysis, we used LASSO regression to
obtain the following seven optimal IRGs (risk genes) for incorporation
into the prognostic risk model in TCGA training cohort
([77]Supplementary Figure 1): Bone Morphogenetic Protein Receptor Type
1A (BMPR1A), Cathepsin B (CTSB), NFKB Inhibitor Zeta (NFKBIZ), TNF
Superfamily Member 14 (TNFSF14), C-X-C Motif Chemokine Ligand 2
(CXCL2), Semaphorin-4F (SEMA4F), and Oncostatin M Receptor (OSMR).
Among these genes, CTSB, NFKBIZ, TNFSF14, CXCL2, SEMA4F, and OSMR were
characterized as high-risk genes (estimating a poor prognosis), whereas
BMPR1A was identified as low-risk genes (functioning as a protective
factor) with regard to the OS of patients (see detailed information in
[78]Table 1).
FIGURE 3.
[79]FIGURE 3
[80]Open in a new tab
Seven-immune-related gene signature prognostic risk model analysis of
GBM patients in TCGA dataset. (A) LASSO coefficient profiles of the 15
IRGs in TCGA-GBM. (B) A coefficient profile plot was generated against
the log (lambda) sequence. Selection of the optimal parameter (lambda)
in the LASSO model for TCGA. (C) Kaplan–Meier survival curves for
high-risk and low-risk groups. (D) ROC curves to examine the predictive
accuracy of the model for OS at 1-, 2-, and 3- years.
TABLE 1.
Risk genes in the prognostic risk model.
Gene Coef HR Low. 95%CI Upp. 95%CI p-value
BMPR1A −0.194 0.691 0.556 0.859 8.86E-4
CTSB 0.011 1.280 1.104 1.484 1.06E-3
NFKBIZ 0.050 1.442 1.200 1.731 9.10E-5
TNFSF14 0.081 1.320 1.138 1.532 2.58E-4
CXCL2 0.090 1.350 1.152 1.583 2.11E-4
SEMA4F 0.217 1.490 1.199 1.852 3.25E-3
OSMR 0.250 1.475 1.239 1.757 1.30E-5
[81]Open in a new tab
7 prognostic immune-related genes screened out by the univariate Cox
regression and LASSO Cox proportional hazards regression.
Construction of a Seven-Gene Prognostic Biosignature
The LASSO regression analysis was used to screen the risk genes for
estimating the prognosis of GBM individuals ([82]Friedman et al., 2010;
[83]Simon et al., 2011). We utilized mRNA contents and predicted the
regression coefficients of the risk genes to compute a risk score for
each GBM individual. The prognostic estimation model was created, which
incorporated seven immune-linked genes. The following formula was used
for the calculation:
[MATH: Riskscore
=(-0.194)BMPR1A+0.011CTSB+0.050NFKBIZ<
/mrow>+0.081TNFSF14+ 0.090CXCL2
+0.217SEMA4F
+0.250OSMR :MATH]
According to the formula, we calculated the risk scores of each GBM
individual and clustered them into low-risk and high-risk classes
according to the median risk score. According to the log-rank test, the
Kaplan–Meier curve revealed that the prognosis in the high-risk class
was worse compared to the low-risk class in TCGA training cohort (p =
0.012) ([84]Figure 3C). We employed the time-dependent ROC curves to
explore the estimation accuracy of the model for OS in TCGA training
cohort. The prognostic model area under the ROC values were 0.71 at
1-year, 0.71 at 2-year, and 0.82 at 3-year ([85]Figure 3D). Suggesting
our 7-gene model had a favorable efficiency in predicting prognosis.
Verification of the Immune-Linked Gene Biosignature
The prognostic value of the seven IRGs signature was further evaluated
in three validation sets (TCGA internal validation set, [86]GSE74187,
and [87]GSE4412 datasets). The risk score for each patient was
calculated following the same formula. Patients in three validation
sets were classified into high- and low-risk groups based on the median
of the risk score. Survival analysis in the three validation sets
confirmed a lower survival rate in the high-risk group ([88]Figures
4A–C). The AUC of ROC curves for 1-, 2-, and 3-year survival rate in
the validation dataset were 0.79, 0.91, and 0.93 (TCGA internal
validation cohort) 0.64, 0.67, and 0.6 ([89]GSE74187); 0.58, 0.77, and
0.99 ([90]GSE4412) ([91]Figures 4D–F). In summary, the prognosis model
created according to the expression patterns of these seven
prognosis-distinct immune-linked genes had high estimation accuracy and
stability in identifying immune features. These data demonstrated that
our prognostic risk model precisely estimates the prognosis of GBM
individuals.
FIGURE 4.
[92]FIGURE 4
[93]Open in a new tab
Validation of seven-immune-related gene signature prognostic risk model
of GBM patients in validation datasets. (A) Kaplan–Meier survival
curves for high-risk and low-risk groups in TCGA internal validation
dataset (p < 0.001). (B) Kaplan–Meier survival curves for high-risk and
low-risk groups in [94]GSE74187 dataset (p = 0.048). (C) Kaplan–Meier
survival curves for high-risk and low-risk groups in [95]GSE4412
dataset (p = 0.07). (D–F) ROC curves to examine the predictive accuracy
of the model for OS at 1-, 2-, and 3- years in validation cohorts.
Relationship Between the Risk Score and Clinical Factors
The relationship between the seven IRGs signature and clinical factors,
including age, gender, IDH1 mutation, 1p/19q mutation, and subtype was
further investigated using data from the TCGA dataset. The results
showed that a higher risk score was always associated with IDH1
mutation, 19q mutation, and subtype. No differences were observed
between the risk score and age, gender, or 1p mutation
([96]Supplementary Figure 2).
Functional Annotations and Signaling Pathway Enrichment in High- and Low-Risk
Score Groups
Because the monitoring of disease outcome is imperative for clinical
management, we aimed to identify molecular biosignatures that could be
utilized as viable prognostic indicators. Functional gene annotation
and KEGG enrichment analyses focused on the above mentioned seven
prognosis-distinct immune-linked genes were conducted ([97]Yu et al.,
2012). We demonstrated that these survival-linked IRGs were most
abundant in gene ontology (GO) terms linked to “cell adhesion mediated
by integrin,” “granulocyte migration,” “platelet degranulation,”
“regulation of leukocyte adhesion to vascular endothelial cell,” “rna
capping” and “transcription preinitiation complex assembly” ([98]Figure
5A). Gene set enrichment analysis (GSEA) was performed to identify the
prospective cascades that differentiated the high- or low-risk groups.
The following cascades were significantly enriched: “complement and
coagulation cascades,” “cytokine cytokine receptor interaction,”
“hematopoietic cell lineage,” “leukocyte transendothelial migration,”
“rna polymerase,” and “spliceosome” ([99]Figure 5B). These results
suggested that the prognosis-specific immune-related gene risk score
using the seven IRGs may affect these cascades and estimate the
survival of GBM patients.
FIGURE 5.
[100]FIGURE 5
[101]Open in a new tab
Functional gene annotations and KEGG enrichment analysis between high
and low risk groups. (A) ClusterProfiler was selected for functional
gene annotations. (B) GSEA analysis was performed to identify the
potential pathways differentiate the high and low risk groups.
Correlation Between the Risk Score and Immune Response
To better comprehend the connection between the risk score and immune
response, we calculated the association between the risk score and the
expression levels of core immune checkpoints in GBM, such as CD28,
TIM-3, B7-H3, PD-1, B7-H4, CD40, LAG3, and PD-L1. Interestingly, the
Circos plot ([102]Gu et al., 2014) showed that the risk score was
strongly linked to expression levels of B7-H3, CD40, and PD-L1 in TCGA
cohorts ([103]Figure 6A).
FIGURE 6.
[104]FIGURE 6
[105]Open in a new tab
Correlation between the risk score and immune response and the
distribution of immune infiltration in GBM. (A) Circos plot shows the
relationship between the risk score and the expression levels of some
important immune checkpoints in GBM. (B) The proportions of immune
cells in each GBM sample are indicated with different colors, and the
lengths of the bars in the bar chart indicate the levels of the immune
cell populations. (C) Correlation matrix for all 22 immune cell
proportions. Some immune cells were negatively related, represented in
blue, and others were positively related, represented in red. The
darker the color, the higher the correlation.
Distribution of Immune Invasion in Glioblastoma
We first assessed immune invasion in glioma tissue in 22 subpopulations
of immune cells by employing the CIBERSORT algorithm. In [106]Figure
6B, the percentage of immune cells in each GBM sample is shown in
different colors, and the lengths of the bars indicate the immune cell
population levels. We then speculated that the divergence in TIIC
proportions may function as a critical feature of individual
differences and possess prognostic significance. Based on the chart, we
established that glioma tissues had comparatively high proportions of
M1, M0, and M2 macrophages as well as monocytes, which were responsible
for approximately 70% of the 22 subpopulations of immune cells. In
contrast, B cell and neutrophil proportions were comparatively low, and
they were responsible for approximately 10% of the immune cell
subpopulations ([107]Figure 6B). Proportions of different types of
immune cells subsets were weakly and then moderately correlated
([108]Figure 6C). Populations with a negative correlation consisted of
monocytes/M2 macrophages (Pearson’s correlation = −0.41) and resting NK
cells/activated NK cells (Pearson’s correlation = −0.43). Given the
important role of these hub immune genes, the genetic variations of
five of them with a mutation rate ≥ 5% were further explored
([109]Supplementary Figure 3).
Prognostic Model Associates With Immune Invasion in GBM
Clinical studies on immunotherapy have verified that tumor-invading
lymphocytes in the tumor microenvironment possess an estimation
significance for prognosis and treatment using immunotherapy in some
solid tumors ([110]Bremnes et al., 2016; [111]Lee et al., 2016;
[112]Badalamenti et al., 2019). Given that our risk score was centered
on seven immune-linked genes, we investigated whether it was linked to
the invading levels of six immune cell types in the TCGA GBM cohort
acquired from TIMER. We examined the link between the expression levels
of seven immune-linked genes and the invading contents of six immune
cell types. The findings demonstrated that the expression of these
seven genes exhibited remarkably positive correlation with immune cell
invasion. The expressions of CTSB, NFKBIZ, CXCL2, and OSMR were all
correlated with the invading levels of dendritic cells
([113]Supplementary Figure 4). To better understand the impact of the
seven IRGs signature on the infiltration of immune cells, the relevance
of the risk score and six immune cells was investigated. Results
indicated that the risk score was positively related to neutrophil
cells (r = 0.188), dendritic cells (r = 0.404), and CD4+ T cells (r =
0.169) ([114]Supplementary Figure 5). Collectively, these data
indicated that our model system is partially linked to the invading
level of immune cells in the tumor microenvironment of GBM.
Particularly, BMPR1A was significantly correlated with the infiltrating
levels of CD4+ T cells, macrophages, and dendritic cells. TNFSF14 and
OSMR were significantly correlated with the invading levels of CD4+ T
cells and dendritic cells. CXCL2 was significantly associated with the
invading levels of dendritic cells ([115]Figure 7).
FIGURE 7.
[116]FIGURE 7
[117]Open in a new tab
Correlations of seven immune-related gene copy member with immune
infiltration level in GBM. These seven-immune-related gene CNV affects
the infiltrating levels of different immune cells in GBM. *p < 0.05,
**p < 0.01, ***p < 0.001.
Effects of Prognosis-Specific Immune-Related Genes on Oncogenic Pathways
To further elucidate the molecular mechanisms for prognosis-specific
IRGs participating in tumorigenesis, we explored the link between the
expression of individual genes and activation or repression of 10 core
signaling cascades based on a pathway score computed from the sum of
the relative protein contents for all positive modulatory constituents
less that of all negative modulatory constituents ([118]Akbani et al.,
2014). Our data demonstrated that seven genes were highly correlated to
the activation or suppression of numerous oncogenic cascades
([119]Supplementary Figure 6). For example, CTSB was highly correlated
with the repression of DNA damage response and AR hormone, as well as
the activation of apoptosis and EMT signaling pathways. CXCL2 was
associated with the inhibition of cell cycle, DNA damage response, and
AR hormone, as well as activation of apoptosis, EMT, and RAS/MAPK
signaling pathways. These results suggested that prognosis-specific
IRGs are linked to alterations of diverse oncogenic cascades.
Hub Gene Drug Sensitivity
GSCALite constitutes a web-based analysis portal for gene set cancer
analysis ([120]Liu C. J. et al., 2018), based on which the drug
sensitivity of the hub genes was analyzed to provide support on
drug-targeted therapy ([121]Supplementary Figure 7). Low NFKBIZ level
is resistant to 11 drugs or small molecules, low BMPR1A level is
resistant to seven drugs or small molecules, low SEMA4F level is
resistant to 16 drugs or small molecules, and low levels of OSMR,
CXCL2, and CTSB are resistant to more than 32 drugs or small molecules.
Discussion
Glioblastoma is a fatal human cancer. Despite of the years of research
focused on GBM biology and the numerous clinical trials to evaluate new
treatments, the prognosis of individuals with glioblastoma remains
dismal ([122]Thakkar et al., 2014). Patients diagnosed with GBM undergo
treatments, including neurosurgery, radiotherapy, and chemotherapy,
with unsatisfactory survival.
There has been great advancement in the comprehension of the genetic
and molecular underpinnings of glioblastoma with the emergence and
progression of microarray technology and sequencing technology. The
IDH1 mutant was found in an integrated genomic analysis in 2008
([123]Parsons et al., 2008). Many studies have been performed in recent
years and suggest that mutated IDH1 participates in the pathogenesis of
glioma. According to the WHO categorization of central nervous system
tumors, glioblastoma is divided into IDH-mutant and IDH-wildtype
subtypes ([124]Louis et al., 2016). This categorization is based
entirely on histological features. There are many specific genetic
changes in glioblastoma cases, and the most frequently mutated or
deregulated gene is epidermal growth factor receptor (EGFR), which is
amplified in approximately 60% of glioblastomas ([125]Huang et al.,
2007). Many deregulations with certain pathways, such as PI3K, P53, and
RB, have also been identified. Overall, these studies show the prospect
of the gene signature in tumor diagnosis and prognosis, and they
provide new evidence for tumor biology. With the progression of
bioinformatics and open access of high-throughput data, researchers
have studied multiple gene prognostic signatures for GBM, which result
in more accuracy than single gene prognostic signatures ([126]Colman et
al., 2010; [127]Yin et al., 2019).
The CNS has been considered as an immune-favored system based on the
initial experimental data documented more than 50 years ago
([128]Medawar, 1948; [129]Billingham et al., 1954), but many findings
have suggested that the immune microenvironment provides sufficient
opportunities to treat brain tumors with immunotherapy even though the
brain is an immunologically distinct region ([130]Schiffer et al.,
2017). Scientists have great interest in utilizing immunotherapy to
treat glioblastoma because it has shown considerable improvements in
the management of numerous solid tumors, including melanoma, renal cell
carcinoma, and NSCLC. There are many ongoing clinical trials for
immunotherapy, but the results are not satisfactory. Thus, we need more
knowledge about the GBM immune microenvironment.
Herein, we constructed a robust seven immune-linked gene biosignature
for risk stratification in glioblastoma patients. In contrast to a
previous studies ([131]Liang et al., 2020), we used univariate Cox
regression analysis and LASSO regression assessment to classify genes
as independent prognostic indicators. Among them, CTSB, NFKBIZ,
TNFSF14, CXCL2, SEMA4F, and OSMR were characterized as high-risk genes,
whereas BMPR1A was identified as low-risk gene.
The protease cathepsin B (CTSB) has been identified to highly express
in cancer ([132]Mijanovic et al., 2019), and associate with poor
prognosis of a variety of cancers, including breast cancer, pancreatic
cancer, and lung squamous cell carcinoma, which could be used as an
independent predictor of these tumors ([133]Gong et al., 2013;
[134]Zhang et al., 2014). It was previously found that the absence of
CTSB delays the growth and invasion of pancreatic neuroendocrine tumors
([135]Gocheva et al., 2006). Here, we identified CTSB as a risk pattern
based on our risk model, which is in consistent with previous studies.
NFKBIZ mutation is associated with ulcerative colitis, and the repeated
inflammation and repair are closely related to the occurrence of
colorectal cancer ([136]Kakiuchi et al., 2020). Thus, chronic
inflammation might be related to GBM. TNFSF14 is also known as LIGHT,
which has been studied at preclinical level for more than 10 years and
has shown the prospect of strengthening cancer immunotherapy
([137]Skeate et al., 2020). CXCL2 can promote the recruitment of MDSC
and is associated with the prognosis of bladder cancer ([138]Zhang et
al., 2017). SEMA4F is expressed in adults and related to the neural
guidance of embryos. It can induce neurogenesis in prostate cancer,
thus promoting cancer growth and migration ([139]Ding et al., 2013).
The cytokine receptor for oncostatin M (OSMR) regulates self-renewing
brain tumor stem cells and promotes the resistance of GBM to ionizing
radiation ([140]Sharanek et al., 2020). In breast cancer,
BMPR1-knockdown can inhibit RANKL production through p38 pathway,
thereby inhibiting breast cancer-induced osteolysis ([141]Liu Y. et
al., 2018). Above all, the above mentioned seven genes play important
roles in the occurrence and development of tumors.
We next created a landscape of 22 subtypes of immune cells and acquired
the status of immune infiltration in the GBM microenvironment. Our
results were similar to those of previous studies ([142]Lu et al.,
2019; [143]Liang et al., 2020). Furthermore, we analyzed the
relationship between the expression levels of seven immune-linked genes
and the invading levels of six immune cells. The data demonstrated that
the expression of these seven genes exhibited positive correlation with
immune cell invasion ([144]Supplementary Figure 4). All these findings
indicated that our prognostic model may aid in understanding the immune
status of glioblastoma patients. We also generated a circo plot to show
the relationship between the risk score and expression levels of core
immune checkpoints in GBM. This study may provide new targets or
effective biomarkers for glioblastoma immunotherapy.
In summary, the immunotherapy of GBM patients should be individualized
to obtain a better curative effect. Our study provides a prognosis
prediction based on IRGs, which may reflect the immune status of GBM
patients. However, our study had limitations as our study was based on
databases and bioinformatics analyses. Immunohistochemistry, flow
cytometry, and RT-PCR should be used to verify our research results.
Conclusion
In our study, IRGs were identified to generate a prediction model of
glioblastoma patient prognosis. We also explored the connection between
these genes and the immune cells and immune checkpoints. Further
research on these genes may provide new insights in GBM biology and
promote immunotherapy.
Data Availability Statement
The original contributions presented in the study are included in the
article/[145]Supplementary Material, further inquiries can be directed
to the corresponding author/s.
Author Contributions
LH and ZH performed all experiments, prepared figures, and drafted the
manuscript. LH, ZH, XC, SW, and YF participated in data analysis and
interpretation of results. LH and ZL designed the study and
participated in data analysis. All authors have read and approved the
manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of
any commercial or financial relationships that could be construed as a
potential conflict of interest.
Acknowledgments