Abstract

Purpose

   Despite advances in characterizing the neurobiology of emotional
   disorders, there is still a significant lack of scientific
   understanding of the pathophysiological mechanisms governing major
   depressive disorder (MDD). This study attempted to elucidate the
   molecular circuitry of MDD and to identify more potential genes
   associated with the pathogenesis of the disease.

Patients and Methods

   Microarray data from the [32]GSE98793 dataset were downloaded from the
   NCBI Gene Expression Omnibus (GEO) database, including 128 patients
   with MDD and 64 healthy controls. Weighted gene coexpression network
   analysis (WGCNA) was performed to find modules of differentially
   expressed genes (DEGs) with high correlations followed by Gene Ontology
   (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway
   enrichment analyses to obtain further biological insight into the top
   three key modules. The protein-protein interaction (PPI) network, the
   modules from the PPI network, and the gene annotation enrichment of
   modules were analyzed, as well.

Results

   We filtered 3276 genes that were considered significant DEGs for
   further WGCNA analysis. By performing WGCNA, we found that the
   turquoise, blue and brown functional modules were all strongly
   correlated with MDD development, including immune response, neutrophil
   degranulation, ribosome biogenesis, T cell activation,
   glycosaminoglycan biosynthetic process, and protein serine/threonine
   kinase activator activity. Hub genes were identified in the key
   functional modules that might have a role in the progression of MDD.
   Functional annotation showed that these modules primarily enriched such
   KEGG pathways as the TNF signaling pathway, T cell receptor signaling
   pathway, primary immunodeficiency, Th1, Th2 and Th17 cell
   differentiation, autophagy and RNA degradation and oxidative
   phosphorylation. These results suggest that these genes are closely
   related to autophagy and cellular immune function.

Conclusion

   The results of this study may help to elucidate the pathophysiology of
   MDD development at the molecular level and explore the potential
   molecular mechanisms for new interventional strategies.

   Keywords: major depressive disorder, bioinformatic analysis,
   differentially expressed genes, immune response, autophagy

Introduction

   Major depressive disorder (MDD) is one of the most common forms of
   mental disease and includes a wide range of symptoms, such as prolonged
   episodes of low mood, lack of motivation, retraction from social
   interaction, altered patterns of sleep and appetite, cognitive
   difficulties and, at its worst, suicidal tendency.[33]^1^,[34]^2
   Therefore, depression is associated with serious consequences,
   including excessive mortality, disability, secondary morbidity, and
   contributors to the global burden of disease.[35]^3 According to the
   World Health Organization (WHO), an estimated 350 million people of all
   ages suffer from depression disorder globally.[36]^4 Despite the
   significant social burden that stems from this disease, there is still
   a significant lack of scientific understanding of the
   pathophysiological mechanisms governing depression. Furthermore, the
   current diagnostic systems do not adequately reflect the relevant
   neurobiological alterations that drive the modified behavioral patterns
   found in patients.[37]^5 To date, the microarray technique has
   increasingly been used for life science research purposes, and
   bioinformatic data‑mining of the gene has widely been used for
   differential expression analysis to identify novel diagnostic and
   therapeutic biomarkers of diseases.[38]^6^,[39]^7

   Extensive research efforts investigating depression have been performed
   in past decades, and potential new therapeutic targets are being
   identified, including oxidative stress, immune system effects[40]^8 and
   neuroinflammation[41]^9 in the central nervous system (CNS), through
   cytokines, which regulate brain activities and emotions. In particular,
   a recognition that immune system actions profoundly impact mood
   regulation[42]^10 has been particularly well established for
   inflammatory signals from the innate immune system linked to depression
   in human and rodent studies. As such, it was important to identify the
   critical genes and key-related pathways that may be related to innate
   immunity and neuroinflammation and thus can further understand the
   pathophysiological mechanisms of depression.[43]^11^,[44]^12 Recent
   genome-wide association studies (GWAS) and many studies on common
   variants increase our understanding of MDD; however, the underlying
   genetic basis remains largely unknown.[45]^13^–[46]^15 The weighted
   gene coexpression network analysis (WGCNA) algorithm can group genes
   into modules based on the gene coexpression similarities across the
   samples, resulting in a cluster of genes with similar functions, for
   relating modules to one another and external sample traits, and then
   the correlation networks can be used to identify tissue-specific
   biomarkers and pathophysiological-related pathways.[47]^16^,[48]^17

   To this end, within the current study, we constructed correlation
   networks using gene expression data from publicly accessible resources.
   Based on a microarray dataset [49]GSE98793 from Gene Expression Omnibus
   (GEO), the study was designed to construct a gene coexpression network
   to predict clusters of candidate genes involved in the pathogenesis of
   MDD. The DEGs were screened, and the WGCNA algorithm was employed to
   construct a coexpression network. Then, the gene modules related to MDD
   were identified. We conducted Gene Ontology (GO) and Kyoto Encyclopedia
   of Genes and Genomes (KEGG) pathway enrichment analyses to obtain
   further biological insight into the highly correlated module, and the
   modules of the protein-protein interaction (PPI) network in MDD were
   screened, as well. In our research, all the possible genes were
   analyzed, and our results would be more complete and more reliable. The
   results of this study may help to elucidate the pathophysiology of MDD
   development at the molecular level and explore the potential molecular
   targets for new interventional strategies.

Materials and Methods

Microarray Data

   The mRNA microarray expression profile dataset [50]GSE98793 from whole
   blood samples for 128 patients with MDD and 64 healthy controls
   provided by Leday et al[51]^18
   ([52]https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE98793) was
   downloaded from the Gene Expression Omnibus database, which is a public
   functional genomics data repository of high-throughput gene expression
   data, chips, and microarrays. The [53]GSE98793 dataset was based on the
   [54]GPL570 [HG-U133_Plus_2] platform using the Affymetrix Human Genome
   U133 Plus 2.0 Array (Affymetrix, Inc., Santa Clara, CA). We downloaded
   the raw CEL file and the probe annotation file. The probes were
   converted into the corresponding gene symbol according to the
   annotation information on the platform. All of the data were freely
   available online, and this study did not involve any experiments on
   humans or animals performed by any of the authors.

Data Preprocessing and DEG Screening

   After the raw data in CEL format were downloaded, the Affymetrix
   package[55]^19
   ([56]http://www.bioconductor.org/packages/release/bioc/html/affy.html)
   in R software (version 3.5.2, [57]https://www.r-project.org/) was used
   for data preprocessing and then obtained after removing the batch
   differences and performing data background correction, normalization
   and summarization to create a robust multiarray average (RMA).

   The series matrix files of the dataset were normalized log-expression
   values available for further analysis. To characterize DEGs, the
   control group and the MDD group were analyzed using the LIMMA (linear
   models for microarray data) package[58]^20 in the R/Bioconductor
   platform. Benjamini–Hochberg’s method was used to control the false
   discovery rate, and P-values < 0.05 were defined as the DEGs, which
   were used to screen gene modules with similar expression profiles for
   the subsequent coexpression network construction.

Construction of Weighted Gene Coexpression Networks

   As a systems biology method, the construction of gene coexpression
   networks and the identification of gene clusters or modules is
   especially useful in identifying transcriptional alterations in
   multigene diseases, where the phenotypic state emerges from the
   convergence of numerous small changes, rather than from isolated
   single-gene effects.[59]^21^,[60]^22 In the present study, the Weighted
   Gene Co-expression Network Analysis (WGCNA) package (Version 1.68,
   [61]https://cran.rstudio.com/web/packages/WGCNA/index.html) within R
   software was used to construct groups of strongly coexpressed genes
   into coexpression networks according to the DEG expression matrix,
   which included 3276 selected genes. The gene network was visualized
   with selected DEGs, and visualization in the gene network of module
   eigengenes was also undertaken.

   The selection of the soft-thresholding power is an important step when
   constructing a WGCNA.[62]^16 In this study, the soft thresholding power
   was screened out by analysis of network topology when the degree of
   scale independence was set as β=5 (scale-free R^2=0.88). When the
   degree of scale independence was set as β=5, the summary connectivity
   measures decline steeply with increasing soft-thresholding power, it is
   advantageous to choose the lowest power that satisfies the approximate
   scale-free topology criterion. Then, gene coexpression module
   similarity and adjacency matrices were successively constructed using
   the power values obtained by the WGCNA algorithm. During module
   selection by cluster analysis, the adjacency matrix was used to
   calculate the topological overlap measure (TOM) and measure the
   connection strength between all gene pairs.[63]^23^,[64]^24 Modules of
   coexpressed genes were generated by hierarchical clustering dendrograms
   with different colors, and the module structure was visualized by
   topological overlap matrix plots. Finally, the minimum number of genes
   was set at 100 to ensure the reliability of the result in each module,
   after which we performed further analysis and validation as the
   candidates on gene modules to characterize modules related to MDD.

Functional Enrichment Analysis of Genes in Three Key Modules

   The genes inside coexpression modules have high connectivity, and the
   genes within the same module may play similar roles. The top 3 modules
   with more than 100 genes that are significantly associated with MDD
   were selected. To further analyze the cellular component (CC),
   molecular function (MF), and potential biological process (BP),
   functional enrichment analysis was performed to identify and interpret
   complex biological functions based on the GO terms and KEGG pathway
   annotation in the coexpression modules. The genes of each selected
   module were submitted to the online Database of Enrichr
   ([65]http://amp.pharm.mssm.edu/Enrichr/) to conduct functional and
   pathway enrichment analysis in this study. Enrichr is a useful online
   tool for annotating genes,[66]^25^–[67]^27 which provides the
   functionality to perform simultaneous GO and KEGG pathway analysis.
   P-values < 0.01 were considered to indicate significant differences.

Integration of Genetics and Highly Connected Hubs in Modules

   The top-ranked genes in the modules are thought to be hub genes. To
   systematically analyze the hub genes from each module and the module
   eigengene, the genes obtained from each module previously were mapped
   into the online search tool STRING database[68]^28 (STRING, V11.0;
   [69]https://string-db.org/), which could play a critical role in the
   protein-protein network (PPI). A combined score ≥ 0.4 of PPI pairs was
   considered significant. Then, the CytoHubba plugin based on Cytoscape
   software[70]^29 ([71]http://www.cytoscape.org/, version 3.7.1;
   Institute for Systems Biology, Seattle, WA, USA) was used to construct
   and visualize the transcriptional regulatory network of each module,
   and the highest degree connectivity was identified as the hub genes.

Results

Identification of DEGs Associated with Normal and MDD Samples

   A total of 192 tissue sample raw files (. CEL format) were downloaded
   from the Gene Expression Omnibus (GEO) database. There were a total of
   54,675 probes on the integrated dataset [72]GSE98793 after batch
   normalization by the sva and limma packages in R, and 22,484 gene
   expressions were extracted. Furthermore, P-values < 0.05 with 3276 gene
   expressions were considered to be significant as the DEGs for further
   WGCNA analysis.

Weighted Coexpression Network Construction and Key Module Identification

   To explore the potential functions and mechanisms of these RNAs in the
   progression of major depression in patients, the WGCNA package in R
   software was used to construct a gene coexpression network to identify
   biologically meaningful gene modules. After screening the 3276 DEGs, we
   analyzed the data with WGCNA to identify the modules containing highly
   correlated genes, and the soft‐threshold power 5 was optimized based on
   the criterion of approximate scale-free topology ([73]Figure 1A and
   [74]B). We set MEDissThres as 0.25 to merge similar modules, and 9
   modules were generated ([75]Figure 2A). There were 491 genes in the
   turquoise module, 348 genes in the blue module, 144 genes in the brown
   module, 50 genes in the black module, 64 genes in the green module, 34
   genes in the magenta module, 41 genes in the pink module, 63 genes in
   the red module and 71 genes in the yellow module. The 296 genes that
   could not be included in any modules were put into the gray module,
   which was reserved for genes identified as not coexpressed.

Figure 1.

   [76]Figure 1
   [77]Open in a new tab

   Clustering of samples and determination of soft-thresholding power.

   Notes: (A) Analysis of the scale-free fit index for various
   soft-thresholding powers (β). The red line represents the merging
   threshold. (B) Analysis of the mean connectivity for various
   soft-thresholding powers. In all, 5 was the most fit power value.

Figure 2.

   [78]Figure 2
   [79]Open in a new tab

   Construction of coexpression modules by the WGCNA package in r.

   Notes: (A) The cluster dendrogram of genes in [80]GSE98793. Branches of
   the cluster dendrogram of the most connected genes gave rise to nine
   gene coexpression modules. Genes that could not be clustered into one
   of these modules were assigned to the gray module. Every gene
   represents a line in the hierarchical cluster. The distance between two
   genes is shown as the height on the y-axis. (B) Interaction
   relationship analysis of coexpressed genes. Different colors of the
   horizontal axis and vertical axis represent different modules. The
   brightness of yellow in the middle represents the degree of
   connectivity of different modules. There was no significant difference
   in interactions among different modules, indicating a high degree of
   independence among these modules. (C) Hierarchical clustering of module
   hub genes that summarize the modules yielded in the clustering
   analysis. (D) Heatmap plot of the adjacencies in the hub gene network.

   Abbreviation: WGCNA, weighted gene coexpression network analysis.

Correlation Between Modules and Identification of Key Modules

   Interactions between the nine coexpression modules were analyzed, and
   all genes were identified for the network heatmap ([81]Figure 2B).
   Interestingly, the results revealed that some of these gene modules
   were independently validated, especially the turquoise, blue, brown and
   magenta modules, which demonstrated a high level of independence among
   the modules and relative independence of genes expressed in each
   module. To determine the connections and interactions among these nine
   coexpressed modules, we analyzed the connectivity of eigengenes.
   Combined with [82]Figure 2C, we observed that these nine modules were
   classified into two main clusters: one included five modules
   (turquoise, black, brown, magenta and green modules), while the other
   included four modules (red, pink, yellow and blue modules).
   Furthermore, the results were demonstrated by the heatmap plotted
   according to the connectivity of eigengenes ([83]Figure 2D). We found
   that the three pairs of modules had higher adjacencies, and they were
   the yellow and blue, brown and green, and black and magenta modules,
   respectively.

Functional Enrichment Analysis of Genes in Modules of Interest

   Because the turquoise, blue and brown module modules were all highly
   correlated with MDD development, GO and KEGG enrichment analyses were
   performed to obtain further biological insight for the genes in the
   three constructed modules. As shown in [84]Figure 3A, the genes in the
   turquoise module were mainly enriched in neutrophil activation involved
   in the immune response, neutrophil degranulation, neutrophil-mediated
   immunity and cAMP-dependent protein kinase activity (cAPK). Genes in
   the blue module ([85]Figure 3B) were enriched in ribosome biogenesis,
   ncRNA and rRNA metabolic processes, T cell activation and DNA helicase
   activity. Genes in the brown module ([86]Figure 3C) were primarily
   involved in mRNA 5ʹ-splice site recognition, glycosaminoglycan
   biosynthetic process, protein serine/threonine kinase activator
   activity and cellular response to caffeine. KEGG pathway analysis
   ([87]Figure 3D) showed that the top enriched terms were the T cell
   receptor signaling pathway, primary immunodeficiency, the TNF signaling
   pathway, T-helper type 1 (Th1), T-helper type 2 (Th2) and T-helper type
   17 (Th17) cell differentiation, autophagy and RNA degradation and
   oxidative phosphorylation. These results suggest that these genes are
   closely related to autophagy and cellular immune function.

Figure 3.

   [88]Figure 3
   [89]Open in a new tab

   Top 5 GO analysis and KEGG pathway enrichment results for genes in
   three modules.

   Notes: (A) turquoise module, (B) blue module, and (C) brown module. The
   size of the bubble shows the enrichment significance, while colors
   indicate the enrichment score. Bubble nodes represent BP, triangular
   nodes represent CC, and square nodes represent MF. (D) KEGG pathway
   enrichment results. Bubble nodes represent the blue module, triangular
   nodes represent the brown module, and square nodes represent the
   turquoise module.

   Abbreviations: GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and
   Genomes; BP, biological process; MF, molecular function; CC, cellular
   component.

PPI Network Construction and Analysis of Selected Modules

   The protein-protein interaction (PPI) network of top-ranked genes for
   three selected modules was constructed based on the STRING database,
   including the turquoise, blue, and brown modules. All the modules were
   visualized by Cytoscape software as shown in [90]Figure 4. The high
   degree of connectivity was calculated as the hub genes of the selected
   module by the cytoHubba plugin. Mitogen-activated protein kinase
   3(MAPK3), resistin (RETN), matrix metallopeptidase 9 (MMP9), cathepsin
   D (CTSD) and cyclic adenosine monophosphate (cAMP) were identified in
   the turquoise module ([91]Figure 4A and [92]Table 1), which was
   enriched in neutrophil degranulation and activation involved in an
   immune response. Fibrillarin (FBL), ribosomal protein L11 (RPL11),
   ribosomal protein S18 (RPS18), NOP56 ribonucleoprotein (NOP56),
   ribosomal protein S5 (RPS5), ribosomal protein S14 (RPS14), MRT4
   homolog (MRTO4), ribosomal protein L18a (RPL18A), ribosomal protein L35
   (RPL35) and RNA polymerase I and III subunit C (POLR1C) were identified
   in the blue modules ([93]Figure 4B and [94]Table 2), which were
   enriched in ribosome biogenesis. Solute Carrier organic anion
   transporter family member 4C1 (SLCO4C1), vascular endothelial growth
   factor A (VEGFA) and serine and arginine rich splicing factor 1 (SRSF1)
   were identified in the brown module ([95]Figure 4C and [96]Table 3),
   which was enriched in mRNA 5ʹ-splice site recognition and
   glycosaminoglycan biosynthetic process.

Figure 4.

   [97]Figure 4
   [98]Open in a new tab

   PPI network of hub genes in three modules.

   Notes: (A) turquoise module, with 65 nodes and 248 edges, (B) blue
   module, with 55 nodes and 387 edges, and (C) brown module, with 29
   nodes and 42 edges. Triangular nodes represent hub genes; up nodes
   represent upregulated genes; down nodes represent downregulated genes.

   Abbreviation: PPI, Protein-protein interaction.

Table 1.

   Most Significant GO Terms of the Turquoise Module
   GO ID P-value No. of Genes Description Hub Gene
   GO:0043312 1.93E-11 40 Neutrophil degranulation RETN, CAMP, CTSD, MMP9
   GO:0002283 2.49E-11 40 Neutrophil activation involved in immune
   response RETN, CAMP, CTSD, MMP9
   GO:0002446 3.21E-11 40 Neutrophil mediated immunity CAMP, CTSD, MMP9
   GO:0035580 3.61E-09 13 Specific granule lumen CAMP, CTSD
   GO:0034774 6.63E-09 28 secretory granule lumen CAMP, CTSD
   [99]Open in a new tab

Table 2.

   Most Significant GO Terms of the Blue Module
   GO ID P-value No. of Genes Description Hub Gene
   GO:0003723 9.81E-08 52 RNA binding FBL, RPL11, RPS18, NOP56, RPS5,
   RPS14, MRTO4, RPL18A, RPL35
   GO:0042254 4.63E-07 17 Ribosome biogenesis FBL, RPL11, RPS18, NOP56,
   RPS5, RPS14, MRTO4, RPL18A, RPL35
   GO:0016072 2.30E-06 15 rRNA metabolic process FBL, RPL11, RPS18, NOP56,
   RPS5, RPS14, MRTO4, RPL18A, RPL35
   GO:0034470 2.37E-06 16 ncRNA processing FBL, RPL11, RPS18, NOP56, RPS5,
   RPS14, MRTO4, RPL18A, RPL35
   GO:0006364 2.60E-06 15 rRNA processing FBL, RPL11, RPS18, NOP56, RPS5,
   RPS14, MRTO4, RPL18A, RPL35
   [100]Open in a new tab

Table 3.

   Most Significant GO Terms of the Brown Module
   GO ID P-value No. of Genes Description Hub Gene
   GO:0070034 5.09E-04 3 Telomerase RNA binding N/A
   GO:0035579 5.37E-04 5 Specific granule membrane SLCO4C1
   GO:0002151 7.58E-04 2 G-quadruplex RNA binding N/A
   GO:0006024 7.85E-04 5 Glycosaminoglycan biosynthetic process N/A
   GO:0050650 8.42E-04 3 Chondroitin sulfate proteoglycan biosynthetic
   process N/A
   [101]Open in a new tab

Discussion

   Depression is a major human disease that has become a pervasive public
   health problem.[102]^30^,[103]^31 Despite advances in understanding the
   neurobiology of MDD, further research is warranted to determine the
   distinctive pathophysiology of MDD in contrast to many other brain
   disorders. Microarray technology has become a popular technology in
   recent years and is commonly used to obtain data regarding genetic
   alterations during the pathological process of disease.[104]^32 The
   [105]GSE98793 dataset may be critical in the identification of
   pathophysiology and biomarkers in MDD. Several researchers have
   previously analyzed the dataset. Leday et al, who provided the original
   data for [106]GSE98793, demonstrated that MDD was significantly
   enriched for an immune response to infection coupled with relative
   inactivation of the adaptive immune system, indicating the potential of
   transcriptional biomarkers for immunological stratification of patients
   with depression. Wang et al divided randomly into two nonoverlapping
   groups for cross-validation and found that MDD is mainly enriched in
   such pathways as ribonucleoprotein complex biogenesis, the Toll-like
   receptor signaling pathway, the apoptosis pathway, and the structural
   constituent of ribosomes.[107]^33 Unlike the previous studies mentioned
   in this report, the present study applied a new approach, based on
   WGCNA, to investigate the molecular mechanisms underlying MDD samples
   compared with control samples. A total of 3276 DEGs were used to build
   coexpression networks and identify groups of highly coexpressed genes.
   Moreover, we identified nine distinct coexpression modules based on the
   DEGs, and three highly correlated modules (turquoise, blue and brown
   modules) were obtained after the functions of the module’s analysis
   based on the coexpression network.

   Over the last two decades, the hypothesis emerged that inflammatory
   processes and neural-immune interactions are involved in the
   pathogenesis of major depression.[108]^34^–[109]^36 Many studies have
   shown that patients with MDD exhibit higher neutrophil and monocyte
   counts and an increased neutrophil to lymphocyte ratio (NLR) than
   controls. From our research, the turquoise module identified by WGCNA
   was the most significantly related module to MDD status. Genes in the
   turquoise module were mainly enriched in neutrophil degranulation,
   neutrophil-mediated immunity and neutrophil activation involved in the
   immune response, suggesting that genes in the turquoise module might
   play important roles in immune functions during MDD. According to the
   PPI network analysis from the turquoise module, RETN, cAMP, CTSD and
   MMP9 were identified as the high degree genes. Mutations in CTSD play a
   causal role in neuronal ceroid lipofuscinosis-10 and may be involved in
   the pathogenesis of several devastating neurodegenerative diseases.
   CTSD-deficient mice manifest depressive-like behavior, including
   anhedonia, behavioral despair, and enhanced learned helplessness. RETN
   also plays an important role in the pathophysiology of MDD, and
   resistin serum levels were lower in MDD individuals than in healthy
   controls.[110]^37 cAMP is a multifaceted modulator of immune synapse
   assembly and the inflammatory response. It has been reported that
   5-HT7R not only stimulates cAMP production[111]^38 but also forms
   heterooligomers with 5-HT1AR,[112]^39 which are important for the
   pathogenesis of MDD. MMP-9 (also known as C1562T) has been confirmed to
   participate in the development of depression.[113]^40 Changes in MMP
   expression may be a common element in, or perhaps even a marker for,
   recurrent depressive disorders.[114]^41

   Pathway enrichment analysis indicated that genes in the turquoise
   module were enriched in lysosomes and autophagy. Autophagy has been
   recognized as a pivotal process to ensure homeostasis of cells through
   lysosomal degradation of damaged macromolecules and organelles, which
   is linked to several diseases.[115]^42 Recently, autophagy has also
   been linked to depression, mainly through its involvement in the action
   of antidepressants. There are several publications that report that
   antidepressants impact autophagy, as has been reported very
   recently.[116]^43^,[117]^44 One finding supporting the role of
   antidepressants in autophagy was that fluoxetine (10 μM), a selective
   serotonin reuptake inhibitor (SSRI), could promote unblocked autophagic
   flux by enhancing the fusion of autophagosomes with lysosomes in
   primary astrocytes.[118]^45 The tricyclic antidepressant amitriptyline
   was found to increase autophagy in primary neurons and astrocytes,
   similar to the selective serotonin reuptake inhibitor fluoxetine. A
   further expanded study found that amitriptyline and fluoxetine lead to
   the gradual accumulation of sphingomyelin in lysosomes, which
   stimulates autophagy via protein phosphatase 2A, ULK, Beclin, and
   LC3B.[119]^46^,[120]^47 Although the molecular mechanism of the
   autophagy-modulating function of antidepressants is described in detail
   in several excellent reviews, it remains a major challenge that
   scientists have largely not explored.

   The blue module was enriched in RNA binding, ncRNA processing, rRNA
   metabolic process and ribosome biogenesis. According to the network
   analysis of the blue module, FBL, RPL11, RPS18, NOP56, RPS5, RPS14,
   MRTO4, RPL18A and RPL35 were identified as high degree hub genes. The
   gene FBL is a component of a nucleolar small nuclear ribonucleoprotein
   (snRNP) particle thought to participate in the first step in processing
   pre-ribosomal RNA.[121]^48 NOP56 is required for assembly of the 60S
   ribosomal subunit and is involved in pre-rRNA processing.[122]^49 MRTO4
   appears to be involved in mRNA turnover and ribosome assembly.[123]^50
   The RPL11, RPS18, RPS5, RPS14, RPL18A and RPL35 genes encode ribosomal
   proteins. Each of the proteins belongs to the family of ribosomal
   proteins. All of the proteins are located in the cytoplasm.

   Many cytokines with vital roles in the regulation of the immune system
   have been investigated in the pathology of MDD. Pathway enrichment
   analysis indicated that genes in the blue module were enriched in
   primary immunodeficiency, the T cell receptor signaling pathway, and
   Th1, Th2 and Th17 cell differentiation. Several studies have
   established that Th17 cells may contribute to the development of
   depression-like behavior.[124]^10^,[125]^51^,[126]^52 Th17 cells are a
   new T cell lineage CD4+ T helper cell. In addition to the traditional
   Th1 and Th2 subsets, Th1 cells may be derived from Th17
   cells.[127]^53^,[128]^54 Th17 cells are critical mediators of the
   cellular immune system that are characterized by the production of the
   inflammatory cytokines interleukin (IL)-17, IL-21 and IL-22. Th1 and
   Th2 cells may also play a regulatory role, as Th1 increased in mouse
   brains after depression-inducing stimuli. One study observed Th1 and
   Th2 cytokine imbalance in the subpopulation of depressed patients. Th2
   cells can stimulate their own growth by producing IL-4,[129]^55 which
   is a cytokine. As the major Th2-type cytokine, IL-4 might be protective
   against depression due to its ability to counter regulate inflammation
   and to inhibit serotonin transporter activity. A recent study
   discovered that IL-4 was a critical participant in the regulation of
   depressive-like behavior in an untreated baseline condition,[130]^56
   and the increase in depressive-like behavior during inflammation in
   wild-type mice might be mediated to some extent by a reduction in IL-4
   signaling.

   We acknowledge there were some limitations and shortcomings in this
   study. First, this study was mainly focused on data mining and data
   analysis, which were based on methodology and the results were not
   validated by experiments. Second, the samples from peripheral blood
   cells of patients, so the associated analysis of mRNAs in the brain
   regions with depression-related dysfunction may validate the data and
   strengthen the conclusion. Third, the results of this study should be
   interpreted within the context of important limitations.

Conclusion

   Our study adopted a systems biology-based WGCNA method and identified
   numerous useful molecular targets for future investigation of the
   mechanisms and selection of biomarkers for MDD. Some important
   biological processes and pathways, including neutrophil activation
   involved in the immune response, T cell receptor signaling pathway,
   Th1, Th2 and Th17 cell differentiation and the hub genes functioning in
   these processes, may help to elucidate the development and progression
   of MDD. Furthermore, the potential molecular mechanisms that have been
   identified simultaneously include ribosome biogenesis, lysosomes, and
   autophagy. Our findings may help to elucidate the pathophysiology and
   progression of MDD. In addition, further molecular biological
   experiments will be performed by our team to confirm the function of
   the identified genes in MDD.

Acknowledgments