Abstract
Major depressive disorder (MDD) is a prevalent, devastating and
recurrent mental disease. Hippocampus (HIP)-prefrontal cortex (PFC)
neural circuit abnormalities have been confirmed to exist in MDD;
however, the gene-related molecular features of this circuit in the
context of depression remain unclear. To clarify this issue, we
performed gene set enrichment analysis (GSEA) to comprehensively
analyze the genetic characteristics of the two brain regions and used
weighted gene correlation network analysis (WGCNA) to determine the
main depression-related gene modules in the HIP-PFC network. To clarify
the regional differences and consistency for MDD, we also compared the
expression patterns and molecular functions of the key modules from the
two brain regions. The results showed that candidate modules related to
clinical MDD of HIP and PFC, which contained with 363 genes and 225
genes, respectively. Ninety-five differentially expressed genes (DEGs)
were identified in the HIP candidate module, and 51 DEGs were
identified in the PFC candidate module, with only 11 overlapping DEGs
in these two regional modules. Combined with the enrichment results,
although there is heterogeneity in the molecular functions in the
HIP-PFC network of depression, the regulation of the MAPK cascade, Ras
protein signal transduction and Ephrin signaling were significantly
enriched in both brain regions, indicating that these biological
pathways play important roles in MDD pathogenesis. Additionally, the
high coefficient protein–protein interaction (PPI) network was
constructed via STRING, and the top-10 coefficient genes were
identified as hub genes via the cytoHubba algorithm. In summary, the
present study reveals the gene expression characteristics of MDD and
identifies common and unique molecular features and patterns in the
HIP-PFC network. Our results may provide novel clues from the gene
function perspective to explain the pathogenic mechanism of depression
and to aid drug development. Further research is needed to confirm
these findings and to investigate the genetic regulation mechanisms of
different neural networks in depression.
Keywords: depression, hippocampus-prefrontal circuit, integrative
approaches, gene expression, clinical state
Introduction
Major depressive disorder (MDD) is a highly prevalent and debilitating
psychiatric illness that can severely impair quality of life
([37]Belzung et al., 2015). Estimates indicate that depression-related
suicide is responsible for the loss of 1 million lives per year
([38]Perlis et al., 2006; [39]Kuo et al., 2015). The physical symptoms
of MDD, which are chronic and interfere with daily tasks, behavior, and
quality of life, are considered to result from detrimental emotional
and cognitive processes.
Since the development of next-generation sequencing, numerous studies
have performed transcriptome profiling to explore the mechanisms
underlying neural plasticity and brain pathology ([40]Galfalvy et al.,
2013; [41]Riglin et al., 2019). Chronic stress can cause wide-ranging
changes in gene expression in the human brain, some of which contribute
to functional deficits in brain cells ([42]Kajiyama et al., 2010). Many
previous expression profiling studies on human samples have identified
individual genes as candidate contributors to the depression phenotype,
but a comprehensive systematic analysis of the molecular and cellular
changes in MDD is still lacking ([43]Iwamoto et al., 2004a; [44]Kroes
et al., 2006; [45]Zhang G. et al., 2020). Further analysis of only the
representative molecules with the highest differential expression
levels will cause the underlying connections among genes to be ignored.
Fortunately, a very well-known method, gene set enrichment analysis
(GSEA), can be applied to analyze overall gene expression data among
different groups ([46]Subramanian et al., 2005; [47]Reimand et al.,
2019). GSEA can be used to identify functional categories or pathways
in which genes exhibit coordinated alterations in expression under
different kinds of conditions and is not limited to analyses of sets of
differentially expressed genes (DEGs). One advantage of GSEA is its
ability to highlight and analyze genes with significant phenotypes but
relatively minimal changes in expression that may be difficult to
detect with classical univariate statistics. Additionally, weighted
gene correlation network analysis (WGCNA) is an efficient method that
can be used determine the interactions between genes and
disease-related phenotypes ([48]Langfelder and Horvath, 2008). This
approach involves the analysis and calculation of connectivity weights
and topological overlap. The correlation matrix of co-expressed genes
and the adjacency function formed by the gene network are defined, and
the coefficient of dissimilarity of different nodes is calculated.
Then, genes with similar expression profiles are clustered in gene
modules. If certain genes always exhibit similar expression changes in
the context of a specific physiological process or tissue, the genes
are likely functionally related and can be defined as a module. Unlike
the clustering criteria of conventional clustering methods (such as
geometric distances), the clustering criteria of WGCNA have important
biological meaning; thus, the results obtained by WGCNA have high
reliability and can be used for numerous further analyses
([49]Rangaraju et al., 2018; [50]Zhang et al., 2019).
Previous autopsy and imaging studies on patients with MDD have
indicated the existence of abnormalities in several brain regions,
including the prefrontal cortex (PFC), cingulated cortex, hippocampus
(HIP), and other brain areas ([51]Li et al., 2015; [52]Mamdani et al.,
2015). The HIP-PFC circuit, the critical neural circuit in MDD
research, has important functions in cognitive and emotional regulation
([53]Carreno et al., 2016); however, little is known about the
gene-related signatures and their correlations in the HIP-PFC neural
circuit.
Therefore, we used WGCNA and GSEA with clinical information to explore
the phenotype-centric gene networks of patients with MDD. This study
used public data sets to ensure a comprehensive analysis, and the
included data sets met the following fundamental criteria: (1) all
patients had confirmed MDD; (2) complete public transcription data were
available; and (3) annotation platform files were available. The
available transcriptome datasets on CNS diseases, especially on mental
disorders, are limited, and most prior studies focused on a single
brain area. Previously, [54]Lanz et al. (2019) collected multiple brain
regions from patients with different psychiatric disorders, and
systematically evaluated the genes shared between mental diseases. In
this analysis, different brain region tissues were collected from the
same patient, and all samples are based on the same platform for
annotation. Therefore, to explore the transcriptional insights into HPC
and PFC in major depression, we performed WGCNA and GSEA of the
clinical information in [55]GSE53987, which is the main dataset
analyzed in the current study, and other single brain area datasets
were incorporated for verification.
Materials and Methods
Data Collection and Processing
Gene microarray datasets ([56]GSE53987, [57]GSE42546, and [58]GSE12654)
were retrieved from the Gene Expression Omnibus (GEO) database, and the
basic information is listed in [59]Table 1. [60]GSE53987 contains the
gene profiles of both the HIP (CTRL/MDD: 18/17) and PFC (CTRL/MDD:
19/17), and no demographic differences were found across the groups
([61]Supplementary Table S1) ([62]Lanz et al., 2019). Under the
corresponding platform annotation, 20174 probes can be converted into
genes. Then in the verification section, the [63]GSE42546 is a
high-throughput sequencing dataset for the HIP, which contains 29
healthy controls and 17 MDD patients with a total of 6297 genes
([64]Kohen et al., 2014). [65]GSE12654 is a gene set for PFC, which
includes 15 healthy controls and 11 MDD, and a total of 8622 genes were
annotated ([66]Iwamoto et al., 2004b). We used R programming language
software to analyze the datasets after converting the corresponding IDs
of the probes to the official gene symbols. The subsequent analyses
were based on the official gene symbols. Before further analysis, the
raw profile data of the CEL files were processed for background
correction, and the robust multiarray average (RMA) algorithm of the
affy Bioconductor/R oligo package was used for noise reduction and
normalized by applying quantile normalization ([67]Gautier et al.,
2004). The expression data of each dataset before and after adjustment
are displayed in the [68]Supplementary Figures S1, [69]S2. The
differential expression analysis of [70]GSE53987 was performed using
the R package limma, and the adjusted P-value was calculated using
Benjamini and Hochberg (BH) correction. Genes with an adjusted P-value
of <0.05 were statistically significant, log (fold change) > 0 was
considered to be upregulated, and <0 was considered to be downregulated
compared with the control. For the RNA-seq data ([71]GSE42546), the
calcNormFactors function in the edgeR R package was used for correction
and normalization ([72]Ritchie et al., 2015). The flow chart of the
overall study is shown in [73]Figure 1 (created with BioRender).
TABLE 1.
Fundamental information of included gene chips.
Data sets Tissues Samples (CTRL/MDD) Platform Type Purposes
[74]GSE53987 Hippocampus 18/17 [75]GPL570 Affymetrix U133_Plus2 chips
Analysis
Prefrontal cortex 19/17
[76]GSE42546 Hippocampus 29/17 [77]GPL13393 RNA-seq Validate
[78]GSE12654 Prefrontal cortex 15/11 [79]GPL8300 Affymetrix U95 Version
2 Validate
[80]Open in a new tab
FIGURE 1.
[81]FIGURE 1
[82]Open in a new tab
The flow chart of overall study.
Gene Set Enrichment Analysis
GSEA version 3.0 software was used to perform the overall gene analysis
on two groups of different samples (control versus depression). To
examine whether significant regulatory differences were caused by
depression at the functional and pathway levels, background data from
the Molecular Signatures Database (MsigDB^[83]1) were analyzed. The
normalized enrichment score (NES) and normalized P-value were used to
determine statistical significance, as previously described
([84]Subramanian et al., 2005). By calculating the NES, GSEA can
resolve differences in gene set size and correlations between gene sets
and the expression dataset. The NES value calculation is performed as
follow:
[MATH:
NES=act
ualES<
/mi>mea
n(ESsag<
/mi>ain
mo>stal<
/mi>lpe<
/mi>rmu
mo>tationsofth<
/mi>eda<
/mi>tas
mo>et) :MATH]
According to the NES calculation method in GSEA, NES can be a positive
or negative value. Positive and negative NES values indicate enrichment
genes mainly at the top and bottom of the list, respectively. According
to GSEA’s instruction and recommendations^[85]2, the nominal P-value
estimates the statistical significance of the enrichment scores, thus a
conventional 1000 permutation was used in our study to obtain a more
accurate P-value.
WGCNA Analysis
To determine the interaction between genes and the disease-related
phenotype, an expression matrix was calibrated though the system
biology method using the R package WGCNA ([86]Langfelder and Horvath,
2008). This method used gene expression data from two sample groups
(the control and MDD groups in this study) to build co-expression
pairwise correlation matrices. Because low-expressing or unvarying
genes usually means noise, the expression matrix was pre-processed by
calculating the variance and screening the genes in the top 50% of the
variance. The algorithm assumes that the network is an adjacency matrix
a[ij], and the co-expression similarity S[ij] is defined as the
absolute value of the correlation coefficient between the profiles of
nodes i and j based on the default method, S[ij] = | cor(x[i], x[j])|.
Hereafter, the pickSoftThreshold function was used to calculate the
soft threshold β when constructing each module. Then, by defining the
gene co-expression correlation matrices, the algorithm constructs a
hierarchical clustering tree and different gene expression modules. For
each module, the quantification of module membership (MM) is defined as
the correlation between the module eigengene (ME) and the gene
expression profile. The modules are defined as having different degrees
of clinical relevance based on the calculation of the association
between ME and external features. The gene significance (GS) is
represented as the correlation (absolute value) between the gene and
the clinical traits. For the intramodular analysis of each module, a
significant correlation between GS and MM indicates that the genes
within that module have consistency with clinical traits. A module was
considered important and analyzed further if it correlated with
clinical traits and its internal genes significantly correlated with
clinical properties. The important modules related to MDD were screened
for further analysis.
Enrichment Analyses for Key Modules
Gene Ontology (GO) enrichment analysis was carried out to determine the
key gene modules in the biological process (BP) category. Pathway
enrichment analysis was carried out for Kyoto Encyclopedia of Genes and
Genomes (KEGG) pathways and Reactome pathways. All genes in the genome
were used as the enrichment background. The enrichment analysis was
based on the Metascape platform ([87]Saldanha, 2004; [88]Zhou et al.,
2019). The P-value was calculated for each enriched term ([89]Hochberg
and Benjamini, 1990). Terms with P-values < 0.05, and enrichment
factors (ratios between the observed counts and the counts expected by
chance) > 1.0 were clustered based on their membership similarities.
More specifically, the P-values were calculated based on the cumulative
hypergeometric distribution. Kappa scores were used as the similarity
metric when performing hierarchical clustering on the enriched terms,
and sub-trees with a similarity of >0.3 were considered to form a
cluster. The most statistically significant term within a cluster was
chosen to represent the cluster.
Identification and Validation of Hub Genes
The hub genes were identified based on protein interaction evidence
from the STRING database ([90]Altaf-Ul-Amin et al., 2006). Evidence for
protein interaction of significantly dysregulated genes in key modules
were retrieved from the STRING database, which required an interaction
score with high confidence (0.7). The Cytoscape plugin cytoHubba was
used to rank nodes based on the Maxial clique centrality (MCC)
topological method, which predicts performance ([91]Chin et al., 2014),
and the top-10 genes were selected as hub genes for verification. MCC
assumes that the node network is an undirected network; given a node v,
S(v) is the set of the maximal cliques containing v, and (| C| −1)! is
the product of all positive integers less than | C |. The calculation
is as follows:
[MATH:
MCC(V)=∑C∈S<
/mo>(v)(|C|-1)! :MATH]
The separate datasets [92]GSE42546 (HIP) and [93]GSE12654 (PFC) were
analyzed to verify gene expression with unpaired two-samples Wilcoxon
tests.
Results
GSEA for Different Brain Regions in MDD
GSEA was utilized to explore the functions and signaling pathways of
gene sets that differed in the MDD groups compared with controls in
different brain regions and thus to determine the potential biological
significance of these genes sets in MDD. As listed in [94]Supplementary
Table S2, the HIP genes in the MDD group were primarily enriched for
the GO processes of “dorsal/ventral neural tube patterning,” “negative
regulation of protein exit from endoplasmic reticulum” and “regulation
of GTP binding,” and the suppressed HIP genes were enriched for the
“neuron remodeling,” “serotonin secretion,” and “synapse pruning”
processes. The GSEA-based pathway results for the HIP indicated that
several critical upstream pathways were suppressed in MDD, such as the
“NF-kappa B signaling pathway” and the “TGF-beta signaling pathway”;
downstream neuron-related pathways were also inactivated, such as the
“GABAergic synapse” and “neuroactive ligand-receptor interaction”
pathways ([95]Supplementary Table S2).
For the PFC region, the GSEA results based on GO terms suggested that
“stress-induced intrinsic apoptotic signaling pathway,” “negative
regulation of fibrinolysis,” and “regulation of neuron projection
arborization” were significantly activated in MDD ([96]Supplementary
Table S2). Moreover, pathways and GO terms related to synaptic
functions and metabolic processes, such as “negative regulation of
long-term synaptic potentiation,” “synapse pruning,” and the
glutathione and glycerolipid metabolism pathways, were the most
inhibited pathways/processes in MDD.
Key Gene Module Identification
The input for WGCNA contained all the gene profiles in the [97]GSE53987
dataset; the corresponding clinical characteristics for different brain
regions were analyzed separately. The R package WGCNA was applied to
classify genes with similar expression patterns into different modules.
We first selected a suitable soft threshold β value that met the
scale-free conditions (R^2 = 0.9; HIP: 14; PFC: 12; [98]Figures 2A,B).
As shown in [99]Figures 2C,D, we used a tree-cutting algorithm to
calculate the average linkage clustering, obtain gene co-expression
modules, merge similar modules, and gradually build a co-expression
network. Ultimately, the gene modules were identified in the dataset
for each brain area (HIP: 10; PFC: 12).
FIGURE 2.
[100]FIGURE 2
[101]Open in a new tab
Selecting the appropriate soft threshold and network construction in
WGCNA. (A) Soft threshold filtering for HIP. (B) Soft threshold
filtering for PFC. The horizontal axis is Soft threshold (power), and
the vertical axis is the evaluation parameter of the scale-free
network. The red horizontal line can clearly select the soft threshold
when R^2 = 0.8. (C) Cluster dendrogram for HIP. (D) Cluster dendrogram
for PFC. Hierarchical clustering tree showing each module, different
colors represent different gene modules.
Then, we performed a relevance test to relate each module to clinical
diagnostic parameters. The heatmap shown in [102]Figure 3 clearly
indicates the clinical relevance of each module with an MDD diagnostic
status. The clinical correlation analysis results suggested that the
yellow module in the HIP ranked higher than any other module
([103]Figure 3A) and was significantly related to clinical MDD (r =
0.37, P = 0.03). In the PFC ([104]Figure 3B), the red module (r = 0.32,
P = 0.07), magenta module (r = 0.30, P = 0.09), and turquoise module (r
= 0.33, P = 0.06) ranked higher than other modules correlated with MDD.
We then calculated the relationships between the nodes and modules
([105]Figure 4). The correlation results indicated that the genes
within the yellow module in the HIP were significantly correlated (cor
= 0.44, P = 1.3e–18), and the genes within the red module in the PFC
were significantly correlated (cor = 0.15, P = 0.024). These modules
were consistent with the module analysis results, so they were
identified as key modules for subsequent analyses ([106]Figure 4).
Additionally, the different modules contained different numbers of
genes: the blue module in the HIP contained 363 genes, and the red
module in the PFC contained 225 genes. Other genes were not classified
into these modules because they lacked clinical relevance or
co-expression characteristics; therefore, the modules containing these
genes were not further analyzed.
FIGURE 3.
[107]FIGURE 3
[108]Open in a new tab
(A,B) Module-trait relationships heatmaps. The heatmaps show the
clinical relevance of different modules when the diagnosis status is
MDD. The intensity of the color represents the strength of the
correlation of modules. Red represents positive correlation and blue
represents negative correlation. The specific correlation value and
P-value are displayed in the corresponding module.
FIGURE 4.
[109]FIGURE 4
[110]Open in a new tab
Scatter plots for intramodular analysis. Each dot represents a gene,
and different colors represent its module. (A) the intramodular
analysis for the yellow model in hippocampus, (B–D) the intramodular
analysis for the modules in prefrontal cortex.
Regional Comparison of Eigengene Expression via MDD Modules
To analyze the expression patterns of the MDD co-expression modules of
the two brain regions, heatmaps were generated ([111]Figures 5A,C). In
the yellow module of HIP, the MDD co-expression genes have a different
expression pattern than the normal control, with most showing
upregulated gene expression; however, in the red module of PFC, the
expression patterns of the module genes are not as clear as in HIP, and
a wide range of differential expression is shown compared with the
control group. To clearly determine the significance compared with the
control, we combined the results of the differential analysis of the
expression profiles (adjusted P-value < 0.05, [112]Figures 5B,D and
[113]Supplementary Table S2). Among the 343 genes in the yellow module
of HIP, 45 genes were significantly upregulated and 50 genes were
significantly downregulated ([114]Figure 5B). Compared with HIP,
relatively few DEGs were identified in the PFC red module. Of the 225
genes in the PFC module, 22 genes were significantly upregulated and 29
were significantly downregulated ([115]Figure 5D).
FIGURE 5.
[116]FIGURE 5
[117]Open in a new tab
(A,C) Heatmaps for key modules. The heatmaps of the modular genes
expression levels between groups used the data normalization method of
Z-score standardization. (B,D) Volcano plots for key modules. Gray dots
mean genes with no significant difference, red color present
up-regulated genes, and blue color dots indicate down-regulated genes
(E) Venn diagram of gene intersection between two modules, *means the
overlapping significant differentially expressed genes.
Interestingly, for these two gene sets based on clinical co-expression,
few overlapping genes were found (32, [118]Figure 5E). Eleven genes
were differentially expressed in both HIP and PFC with similar
expression trends. Among these intersecting dysregulated genes, 10
genes were significantly downregulated in HPC and PFC compared to the
control group, and BMP6 was significantly upregulated in both brain
regions ([119]Supplementary Table S3).
Regional Comparison of Functions via MDD Modules
To analyze the functional differences between the two key co-expression
modules, we identified the BP and pathway characteristics with the
Metascape database, which can provide GO clusters to more clearly
define the enrichment results. The top significant BP enrichment
clusters for each brain region are shown in [120]Figure 6A and
[121]Table 2. The findings indicate significant differences between the
HIP and PFC regions; the depression-related genes of the HIP are most
significantly involved in processes associated with neuronal
interaction and synaptic structure, such as signal release and cerebral
cortex GABAergic interneuron migration, while the depression-related
genes of the PFC are significantly involved in processes related to
cellular component organization and development, such as telencephalon
development, membrane depolarization and MAPKKK activity. The enriched
pathways of HIP and PFC have distinguishable attributes ([122]Table 3
and [123]Figure 6B); the HIP yellow module dysregulated genes were
significantly enriched in signaling pathways associated with signal
transmission between neurons, and synaptic transmission. The top
enriched pathways included “Amine ligand-binding receptors,” the
downstream signaling pathways of neuronal inflammation and immunity,
and “ADORA2B mediated anti-inflammatory cytokines production.” The
dysregulated genes in the red module of the PFC participate in the
“MAPK signaling pathway” and downstream pathways of molecular
metabolism.
FIGURE 6.
[124]FIGURE 6
[125]Open in a new tab
(A) GO enrichment heatmap for regional specific DEGs (HIP, PFC) and
overlapping DEGs. (B) KEGG enrichment heatmap for regional specific
DEGs (HIP, PFC) and overlapping DEGs. Gray indicates terms have not
been enriched, and the shade of orange represents the degree of
enrichment. (C) Network of enriched terms represented as pie charts,
where pies are color-coded based on the identities of the gene lists.
(D) Network of enriched terms for HIP and PFC, colored by cluster name,
where nodes that share the same cluster ID are typically close to each
other.
TABLE 2.
GO enrichment for key modules.
GO_ID Description P-value
__________________________________________________________________
Co-terms (HIP) (PFC) (Overlapping)
GO:0002380 Immunoglobulin secretion involved in immune response 0.0005
GO:0032346 Positive regulation of aldosterone metabolic process 0.0009
GO:0097037 Heme export 0.0009
GO:0004222 Metalloendopeptidase activity 0.0009
GO:0001602 Pancreatic polypeptide receptor activity 0.0018
GO:0043409 Negative regulation of MAPK cascade 0.0031
GO:0036312 Phosphatidylinositol 3-kinase regulatory subunit binding
0.01791 0.0045
GO:0018108 Peptidyl-tyrosine phosphorylation 3.3E-05 0.0116
GO:0046580 Negative regulation of Ras protein signal transduction
0.01252 0.0228
GO:0048168 Regulation of neuronal synaptic plasticity 0.00399 0.0232
GO:0045936 Negative regulation of phosphate metabolic process 0.00067
0.0285
GO:2000474 Regulation of opioid receptor signaling pathway 0.00664
0.00361
GO:0048167 Regulation of synaptic plasticity 0.00366 0.00482
GO:0055074 Calcium ion homeostasis 0.00551 0.00174
GO:0098662 Inorganic cation transmembrane transport 0.001 0.00243
GO:0021895 Cerebral cortex neuron differentiation 0.00312 0.0442
GO:0009410 Response to xenobiotic stimulus 0.01562 0.01514
GO:0031749 D2 dopamine receptor binding 0.01651 0.009
GO:1905702 Regulation of inhibitory synapse assembly 0.01323 0.0072
GO:0019900 Kinase binding 0.03684 0.00204
Hippocampus
GO:0021853 Cerebral cortex GABAergic interneuron migration 0.00016
GO:0023061 Signal release 0.00016
GO:0071398 Cellular response to fatty acid 0.00091
GO:0008422 Beta-glucosidase activity 0.00016
GO:0098793 Presynapse 6E-05
GO:0034765 Regulation of ion transmembrane transport 4.6E-06
GO:0061098 Positive regulation of protein tyrosine kinase activity
3.6E-05
GO:1903305 Regulation of regulated secretory pathway 1.7E-05
GO:0019905 Syntaxin binding 0.00209
GO:0034373 Intermediate-density lipoprotein particle remodeling 0.00332
Prefrontal cortex
GO:0007612 Learning 0.00013
GO:0021537 Telencephalon development 0.0001
GO:0021549 Cerebellum development 0.00091
GO:0051899 Membrane depolarization 0.0006
GO:0030029 Actin filament-based process 7.1E-05
GO:0008349 MAP kinase kinase kinase kinase activity 1.9E-05
GO:0086047 Membrane depolarization during Purkinje myocyte cell action
potential 9.5E-06
GO:0004597 Peptide-aspartate beta-dioxygenase activity 0.00181
GO:0052593 Tryptamine:oxygen oxidoreductase (deaminating) activity
0.00361
GO:0005668 RNA polymerase transcription factor SL1 complex 0.00361
[126]Open in a new tab
TABLE 3.
Pathway enrichment for key modules.
Pathway_ID Description P-value
__________________________________________________________________
Co-terms (HIP) (PFC) (Overlapping)
R-HSA-1257604 PIP3 activates AKT signaling 0.00603
R-HSA-3928664 Ephrin signaling 0.00855
R-HSA-983695 Antigen activates B Cell Receptor (BCR) leading to
generation of second messengers 0.01435
R-HSA-5083635 Defective B3GALTL causes Peters-plus syndrome (PpS)
0.01658
hsa03022 Basal transcription factors 0.02013
R-HSA-6807505 RNA polymerase II transcribes snRNA genes 0.03291
R-HSA-211945 Phase I – Functionalization of compounds 0.04864 0.01575
Hippocampus
R-HSA-375280 Amine ligand-binding receptors 0.00037
R-HSA-1296071 Potassium Channels 0.00039
R-HSA-9660821 ADORA2B mediated anti-inflammatory cytokines production
0.00105
R-HSA-422356 Regulation of insulin secretion 0.00225
R-HSA-975576 N-glycan antennae elongation in the medial/trans-Golgi
0.00337
R-HSA-451326 Activation of kainate receptors upon glutamate binding
0.00507
R-HSA-373760 L1CAM interactions 0.00735
R-HSA-190374 FGFR1c and Klotho ligand binding and activation 0.00994
R-HSA-1299503 TWIK related potassium channel (TREK) 0.00994
R-HSA-8873719 RAB geranylgeranylation 0.01983
R-HSA-391908 Prostanoid ligand receptors 0.02953
R-HSA-8984722 Interleukin-35 Signaling 0.03918
R-HSA-375281 Hormone ligand-binding receptors 0.03918
R-HSA-1170546 Prolactin receptor signaling 0.04873
Prefrontal cortex
hsa04010 MAPK signaling pathway 0.00114
R-HSA-5619104 Defective SLC12A1 causes Bartter syndrome 1 (BS1) 0.00181
R-HSA-8862803 Deregulated CDK5 triggers multiple neurodegenerative
pathways in Alzheimer’s disease models 0.00072
R-HSA-141405 Inhibition of the proteolytic activity of APC/C required
for the onset of anaphase by mitotic spindl 0.03552
R-HSA-5620916 VxPx cargo-targeting to cilium 0.03726
R-HSA-450341 Activation of the AP-1 family of transcription factors
0.01791
R-HSA-196836 Vitamin C (ascorbate) metabolism 0.01436
R-HSA-1475029 Reversible hydration of carbon dioxide 0.02146
R-HSA-9008059 Interleukin-37 signaling 0.03726
R-HSA-2672351 Stimuli-sensing channels 0.0166
R-HSA-210993 Tie2 Signaling 0.03202
R-HSA-388844 Receptor-type tyrosine-protein phosphatases 0.03552
hsa00360 Phenylalanine metabolism 0.03027
[127]Open in a new tab
The enriched results revealed identical BPs and pathways both in
region-specific DEGs and overlapping DEGs ([128]Tables 2–[129]4 and
[130]Figure 6), which indicate common biological changes in the HIP and
PFC with regard to coping with MDD, including PIP3-activated AKT
signaling, Ephrin signaling and transcription-related downstream
signaling pathways. These pathways are involved in the critical BPs of
the immune response, and participate in the regulation of the MAPK
cascade, neuronal differentiation and synaptic plasticity.
TABLE 4.
Overlapping DEGs of key modules.
PFC
__________________________________________________________________
HIP
__________________________________________________________________
Gene Fold Change adj.P.Val Fold Change adj.P.Val
ADAM28 −1.22718 0.0012 −1.4067 1.44E-05
ADAMTS19 −1.24018 0.0005 −1.1426 0.0129
EPHB2 −1.10922 0.0349 −1.2243 0.0011
FLVCR2 −1.18999 0.0084 −1.1668 0.0120
GTF2I −1.41651 0.0003 −1.3687 0.0008
MECOM −1.42665 3.96E-05 −1.4524 0.0039
NPY6R −1.1682 0.0088 −1.1726 0.0002
PIK3AP1 −1.22638 0.0226 −1.2861 0.0172
POU2F2 −1.19051 0.0007 −1.2575 0.0050
SH2D1A −1.14261 0.0069 −1.3322 2.80E-08
BMP6 1.31198 2.73E-05 1.27219 0.0024
[131]Open in a new tab
adj.P.Val, adjust P-value.
Hub Gene Selection and Verification
In total, the PPI network annotated by STRING includes 31 points, and
the top-10 hub genes were identified though the cytoHubba algorithm,
among which seven genes in the HIP, two genes in the PFC and EPHB2 in
both regions were considered to have strong connections and more
primary biological functions than the other genes in the network
([132]Figure 7A and [133]Table 5). The heatmaps were constructed for
the related biological functions and signaling pathways for the hub
genes ([134]Figure 7B and [135]Table 6). Neuroactive ligand-receptor
interaction is the most significantly enriched pathway for these hub
genes (ADRA1D, DRD1, PTGDR), and the process of “long-term synaptic
potentiation” was regulated by the most hub genes (DRD1, EPHB2, VAMP2,
GNB5). In addition, GNRH2, DRD1, ADRA1D, VAMP2, and SYCE1 were
significantly upregulated and GNB5 and PTGDR were downregulated in the
HIP of MDD patients compared with healthy controls ([136]Figure 7C).
BUB1B was significantly downregulated and LMNB1 was significantly
upregulated in the PFC of MDD patients. EPHB2, which is involved in the
regulation of synaptic enhancement, cation channel activity, neuron
projection retraction, central nervous system neuron development and
MAPKK activity, was downregulated in both the HIP (P = 0.0009) and PFC
(P = 0.0048) of MDD patients ([137]Figures 7C,D).
FIGURE 7.
[138]FIGURE 7
[139]Open in a new tab
(A) PPI network and hub components identified in the HIP and PFC
region. Colored by Counts (full connection which according to STRING’s
minimum confidence of 0.15); nodes with 0.7 STRING high-confidence
nodes only keep in the red circle that shows the high-confidence PPI
network and the hub genes identified by cytoHubba. The larger the nodes
of genes with high coefficients; the red dots represent the common
genes of the two brain regions, blue nodes mean HIP genes, and green
nodes indicate PFC genes. (B) Heatmaps of hub gene enrichment results
(pathway and GO biological process). (C) Boxplots of hub genes
expression in HIP of the analysis set ([140]GSE53987). (D) Boxplots of
hub genes expression in PFC of the analysis set ([141]GSE53987).
TABLE 5.
Hub node’s information.
Gene Score Region-specific Dysregulated
GNRH2 12 HIP Up
DRD1 6 HIP Up
GNB5 6 HIP Down
ADRA1D 6 HIP Up
PTGDR 6 HIP Down
VAMP2 3 HIP Up
BUB1B 2 PFC Down
EPHB2 2 both Down
LMNB1 2 PFC Up
SYCE1 2 HIP Up
[142]Open in a new tab
TABLE 6.
Enrichment results for hub genes.
Term Description P-value Symbols
Pathway
hsa04080 Neuroactive ligand-receptor interaction 0.000164 ADRA1D, DRD1,
PTGDR
hsa04970 Salivary secretion 0.000596 ADRA1D, VAMP2
hsa05032 Morphine addiction 0.000609 DRD1, GNB5
hsa04727 GABAergic synapse 0.03554 GNB5
hsa04912 GnRH signaling pathway 0.037128 GNRH2
hsa04110 Cell cycle 0.049748 BUB1B
Biological progress
GO:0060291 Long-term synaptic potentiation 4.83E-06 DRD1, EPHB2, VAMP2,
GNB5
GO:0019933 cAMP-mediated signaling 5.05E-05 ADRA1D, DRD1, PTGDR
GO:0007212 Dopamine receptor signaling pathway 0.000162 DRD1, GNB5,
VAMP2
GO:2001258 Negative regulation of cation channel activity 0.000248
EPHB2, GNB5
GO:0106028 Neuron projection retraction 0.00041 EPHB2, DRD1
GO:0021954 Central nervous system neuron development 0.000495 DRD1,
EPHB2
GO:0071798 Response to prostaglandin D 0.00205 PTGDR, VAMP2
GO:0035722 Interleukin-12-mediated signaling pathway 0.019125 LMNB1
GO:0051301 Cell division 0.024569 BUB1B, SYCE1
GO:0051129 Negative regulation of cellular component organization
0.037388 BUB1B, EPHB2
GO:0051389 Inactivation of MAPKK activity 0.000821 EPHB2
[143]Open in a new tab
To verify our findings, we used different gene chips to analyze the
expression of the hub genes. For the HIP ([144]Figure 8A), 4 hub genes
could be annotated in the [145]GSE42546 gene set. The expression of
GNB5 (P = 0.037), VAMP2 (P = 0.0025), and SYCE1 (P = 0.031) were
significantly upregulated in the HIPs of MDD patients, which is
consistent with the results of the analysis set. EPHB2 showed a
non-significant decreased expression trend compared to healthy controls
in the validation set (P = 0.42). Additionally, three hub genes (BUB1B,
EPHB2, LMNB1) in the PFC could be annotated in the [146]GSE12654 gene
set ([147]Figure 8B). BUB1B was significantly downregulated (P =
0.0037), and LMNB1 was significantly upregulated in MDD (P = 0.011),
but EPHB2 expression did not differ compared with the control (P =
0.087).
FIGURE 8.
[148]FIGURE 8
[149]Open in a new tab
Verify the expression of the hub genes. (A) Hub genes expression in the
validation set [150]GSE42546 (HIP) (B) hub gene expression in the
validation set [151]GSE12654 (PFC).
Discussion
Building on increasing evidence of the effects of depression on the
brain, a previous study revealed that depression is associated with
widespread network dysconnectivity rather than aberrant responses of
individual brain regions ([152]Li et al., 2018). Both structural and
functional abnormalities in different areas have been found in MDD
([153]Park and Friston, 2013; [154]Dai et al., 2019). For example,
abnormal functional neural circuitry under chronic stress conditions
and decreased volumes of many brain regions have been noted in patients
with depression ([155]Korgaonkar et al., 2014). Furthermore, prior
studies have revealed the importance of communication from the PFC to
the HIP; neural circuitry disinhibition is the main cause of cognitive
deficits in depression. Thus, the current study investigated gene
signatures in distinct brain areas differentially affected by MDD given
that genetic factors have been confirmed to play roles in MDD
development ([156]Bast et al., 2017).
First, we used a GSEA method to analyze overall gene data of the HIP
and PFC in a holistic manner. The GSEA revealed that the involved genes
in the PFC are mainly related to synapse generation and molecular
metabolism. For example, the glutathione metabolism pathway is
associated with genes that are downregulated in MDD samples. Previous
studies reported that glutathione is essential for cellular functions
and plays a key role in redox balance in vivo mainly by eliminating
free radicals to protect cells from oxidative stress ([157]Bi et al.,
2020). The current GSEA results for the PFC are consistent with
previous clinical data showing that cortical glutathione is
dysregulated in MDD. In addition, mounting evidence indicates that
glutathione deficits are associated with increased inflammation and
oxidative stress in MDD ([158]Lapidus et al., 2014; [159]Lindqvist et
al., 2017). The GSEA results for the HIP showed that genes related to
neuronal plasticity and serotonin secretion were downregulated and that
genes related to the NF-κB signaling pathway and the TGF-β signaling
pathway were also dysregulated in MDD samples. Several lines of
evidence indicate that TGF-β pathway signaling is necessary for
dopaminergic neuron development and function ([160]Tesseur et al.,
2017), and TGF-β1 is correlated with pathology in late-onset
Alzheimer’s disease and with MDD susceptibility ([161]Zhang K. et al.,
2020). The overall results of GSEA indicate that in the context of MDD,
the transcriptomic changes in the PFC lead to the abnormal regulation
of ion metabolism, while the transcriptomic changes in the HIP lead to
abnormalities in the immune and neurotransmitter systems.
To identify specific genes closely related to the progression of
depression, we performed WGCNA of these brain areas. Among the genes in
the co-expression module of HIP and PFC under major depression
conditions, more differences and less gene expression pattern overlap
were found, and the HIP yellow module showed more DEGs than the PFC red
module. Few overlapping genes were found between regions, indicating
decreased expression consistency of HIP and PFC in MDD. With regard to
the connections of functions, the two regional co-expression modules
also showed heterogeneity. The results point to specific signaling
pathways and BP clusters in the PFC and/or HIP. Both brain regions are
enriched in the BP of the regulation of neuronal synaptic plasticity;
however, the genes in the HIP are still more enriched in processes
related to ion transport and signal release, while the
depression-related genes in the PFC are significantly enriched in
processes related to membrane depolarization and brain tissue
development. The enhancement of synaptic potency and the plasticity of
neural circuits are considered the primary cellular mechanism of
learning and memory. Notably, the PFC has been related to cognitive and
executive functions, and PFC dysfunction is indeed a pathological
mechanism of depression. In addition, in primates, the HIP is located
near the center of the brain and is one of the main limbic brain
regions that stores memories and regulates cortisol production;
however, its most important function is in adult neurogenesis.
Different BPs and signaling pathways are affected in these brain
regions in the context of depression due to the physiological
distinction between the HIP and PFC. Previous constructive research on
schizophrenia suggested that there are few overlapping genes and BPs in
the HIP and PFC, which means that the regional consistency in
schizophrenia is reduced ([162]Collado-Torres et al., 2019). Our
results are consistent with previous studies and provide further
molecular biological evidence of the decreased regional connectivity in
the depressive state.
Notably, MDD-related genes in both regions were found to be involved in
Ephrin signaling, possibly due to the structural changes in neuronal
connections in neurological diseases, from synaptic changes to the loss
or reconnection of entire axon bundles. Ephrin ligands and their Eph
receptors are membrane-bound molecules that mediate the axon guidance
through cell-cell contacts, and play a vital role in the guidance of
neuronal growth cones, synaptic plasticity, and neuron migration
([163]Torii et al., 2009; [164]Klein and Kania, 2014). Previous studies
demonstrated that Ephrin type-B receptor 2 (EPHB2) in the hypothalamus
was significantly increased and interacted with the accumulated NMDAR
subunit GluN2A in LPS-induced depressive phenotype mice ([165]Wu et
al., 2019), and the ratios of p-EphA4/EphA4 and p-ephexin1/ephexin1 in
the PFC and HIP were increased in the social frustration depression
mouse model ([166]Zhang et al., 2017). In addition, the “MAPK signaling
pathway” was significantly enriched in PFC, and “negative regulation of
MAPK cascade” and “negative regulation of Ras protein signal
transduction” were significantly enriched in the HIP-PFC network. These
pathways and biological progresses are involved in the classic
brain-derived neurotrophic factor (BDNF)-MAPK pathway. Stress factors
alter BDNF activity and influence the BDNF-Ras-MAPK pathway, impairing
neuronal cell survival and neuroplasticity, thereby resulting in
depression symptoms ([167]Easton et al., 2006). Previous studies have
demonstrated that the Ras/MAPK pathway can regulate synaptic plasticity
and affect the expression of hippocampal neuronal plasticity-related
proteins in mice with chronic depression-like symptoms induced by
morphine withdrawal ([168]Zhu et al., 2002; [169]Jia et al., 2013).
Overactivation of the Ras/MAPK pathway impacts memory and learning
behaviors in rats. Although preceding studies have reported the
potential roles of MAPK signaling in depression ([170]Aguilar-Valles et
al., 2018), the downstream cascade targets of the MAPK pathway and the
roles of their interactions in functional neural circuits in the
regulation of depression are still unclear.
The top-10 genes were selected as hub genes from among the DEGs based
on the STRING annotation and the cytoHubba plugin. The selected genes
are involved in BPs such as long-term synaptic potentiation (LTP) and
neuron projection retraction, which are representative BPs in the
mechanism of depression. LTP and long-term inhibition (LTD) are two
opposite forms of synaptic plasticity that contribute to fine-tuning
neural connections to store information in the brain. Neural circuit
information is disrupted during depression, resulting in learning and
memory deficits associated with impaired synaptic plasticity
([171]Adhikari et al., 2010; [172]Padilla-Coreano et al., 2016;
[173]Bast et al., 2017). In one study, [174]Zheng and Zhang (2015)
clarified the potential causes of synaptic plasticity deficits in the
HIP-PFC network during depression and recorded local field potentials
(LFPs) in two brain regions of rats under normal and chronic
unpredictable stress (CUS). The authors found that impaired synaptic
plasticity in the ventral CA1 (vCA1)–medial PFC (mPFC) pathway was
reflected in weakened theta coupling and theta-gamma cross-frequency
coupling of LFPs in the depressed state. EPHB2 was identified as a hub
gene that was significantly downregulated in both the HIP and PFC of
depressed patients (analysis set). Even though it did not show
significant differential expression in the validation set, many
previous studies reported that the receptor tyrosine kinase EphB2 is
inactivated in neuropsychiatric disorders including depression and
memory disorders. Moreover, EPHB2-knockdown mice exhibit
depression-like behaviors and spatial memory deficiency compared with
wild-type mice ([175]Zhen et al., 2018). In mice affected by chronic
social stress, the expression level of EphB2 and its downstream
molecules in mPFC are reduced, and the specific cleavage of the EphB2
receptor can increase sensitivity to stress and induce depression-like
behavior ([176]Bouzioukh et al., 2007). The synapses in the HIP of
EphB2 mutant mice had normal morphology, but synaptic plasticity was
defective and the LTP was attenuated in EphB2(−/−) hippocampal slices
([177]Grunwald et al., 2001).
According to previous reports, HIP-PFC functional circuit interactions
mainly occur in the theta frequency range and reflect the processes of
working memory. In addition, the active processes in the two brain
regions are associated with enhanced oscillatory activities
([178]Spellman et al., 2015); thus, impaired functional neural circuits
affect cognitive and memory processes in depression. In addition to
discovering only a few overlapping molecular pathways in the HIP-PFC
circuit, indicating weak regional coherence, we found that the
overlapping pathways may be regularized responses between the HIP-PFC
neural circuits that underlie the critical mechanism involved in the
pathogenesis of MDD. Therefore, based on the identified gene
characteristics of the two brain regions, further research could be
performed on the molecular mechanism of this abnormal neural circuit in
depression, particularly with regard to the overlapping pathways and
several BPs that are closely associated with the HIP-PFC circuit.
The current study has certain limitations. First, most of the current
evidence suggests that several brain regions are involved in MDD,
including the PFC, HIP, amygdala, thalamus, and STR, and both
structural and functional abnormalities in these areas have been found
to occur in depression. There are also abnormalities in other neural
circuits, such as the basal ganglia-thalamus-cortex circuit, that could
be involved in the development of depression; however, we only analyzed
the HIP-PFC neural network in this paper, given the few human disease
samples and difficulty in analyzing each neural network or neural
circuit extensively. WGCNA analysis associates clustered genes with a
variety of external clinical information, and the features of clinical
depression include the occurrence of anxiety, cognitive symptoms, and
sleep phase disorder. This study was conducted based on the current
limited clinical information, which may lead to the potential external
clinical features of other gene modules not being discovered. This
study analyzed the two brain regions of MDD based on the gene
expression level, therefore, we verified the DEGs from key modules,
however, the analysis and verification of the connection between gene
expression and function requires further exploration. Last, we used
other gene datasets for verification, but several of the hub genes were
not expressed or had no expression differences between the groups. The
different annotation platforms and sample sizes might have caused this
discrepancy.
Above all, the present study shows the gene expression characteristics
of MDD and reveals common and unique molecular features and patterns in
the HIP-PFC network. Our results may provide novel clues from the gene
function perspective to explain the pathogenic mechanism of MDD and to
aid in the development of drug interventions for depression. Further
research is needed to confirm our findings and to investigate the
mechanisms of gene regulation of different neural networks in
depression.
Data Availability Statement
The original contributions presented in the study are included in the
article/[179]Supplementary Material, further inquiries can be directed
to the corresponding author/s.
Author Contributions
NY, QM, and JC conceived of and designed the study. NY, KT, LH, XL, XD,
and HG collected and processed the data. NY and KT wrote the
manuscript. All the authors contributed to manuscript revision and read
and approved the final submitted version. QM and JC take primary
responsibility for communication with the journal and editorial office
during the submission process, throughout peer review and during
publication.
Conflict of Interest
The authors declare that the research was conducted in the absence of
any commercial or financial relationships that could be construed as a
potential conflict of interest.
Acknowledgments