Abstract

Objective

   The pathogenesis of sepsis is still unclear due to its complexity,
   especially in children. This study aimed to analyse the immune
   microenvironment and regulatory networks related to sepsis in children
   at the molecular level and to identify key immune-related genes to
   provide a new basis for the early diagnosis of sepsis.

Methods

   The [40]GSE145227 and [41]GSE26440 datasets were downloaded from the
   Gene Expression Omnibus. The analyses included differentially expressed
   genes (DEGs), functional enrichment, immune cell infiltration, the
   competing endogenous RNA (ceRNA) interaction network, weighted gene
   coexpression network analysis (WGCNA), protein–protein interaction
   (PPI) network, key gene screening, correlation of sepsis molecular
   subtypes/immune infiltration with key gene expression, the diagnostic
   capabilities of key genes, and networks describing the interaction of
   key genes with transcription factors and small-molecule compounds.
   Finally, real-time quantitative PCR (RT–qPCR) was performed to verify
   the expression of key genes.

Results

   A total of 236 immune-related DEGs, most of which were enriched in
   immune-related biological functions, were found. Further analysis of
   immune cell infiltration showed that M0 macrophages and neutrophils
   infiltrated more in the sepsis group, while fewer activated memory
   CD4^+ T cells, resting memory CD4^+ T cells, and CD8^+ T cells did. The
   interaction network of ceRNA was successfully constructed. Six key
   genes (FYN, FBL, ATM, WDR75, FOXO1 and ITK) were identified by WGCNA
   and PPI analysis. We found strong associations between key genes and
   constructed septic molecular subtypes or immune cell infiltration.
   Receiver operating characteristic analysis showed that the area under
   the curve values of the key genes for diagnosis were all greater than
   0.84. Subsequently, we successfully constructed an interaction network
   of key genes and transcription factors/small-molecule compounds.
   Finally, the key genes in the samples were verified by RT–qPCR.

Conclusion

   Our results offer new insights into the pathogenesis of sepsis in
   children and provide new potential diagnostic biomarkers for the
   disease.

   Keywords: sepsis in children, key immune-related genes, transcriptomic
   analysis, Gene Expression Omnibus, diagnostic biomarkers

Introduction

   Sepsis is a common critical condition in paediatric emergency
   departments and inpatient wards. This condition is caused by an
   imbalance in the body’s response to infection and can lead to
   microcirculation disorders, immune dysfunction, and tissue and organ
   dysfunction, seriously threatening the life and health of
   children.[42]^1 In the treatment of sepsis in children, in addition to
   early fluid resuscitation, anti-infection efforts, symptomatic
   treatment, and prevention and treatment of complications,[43]^2^,[44]^3
   some new practices have been used to improve the prognosis of the
   disease, including continuous blood purification[45]^4 and
   extracorporeal membrane oxygenation.[46]^5 Nevertheless, the mortality
   rate is still very high in children with sepsis. Of the 7.6 million
   children worldwide under 5 years of age who died each year, 64% died of
   sepsis or septic shock caused by severe infection.[47]^6^,[48]^7 This
   percentage is mainly due to the lack of effective treatment targets and
   effective diagnosis and treatment strategies that make it possible to
   interrupt the occurrence of the disease and prevent deterioration due
   to the disease in a timely manner. Therefore, early identification and
   timely treatment of sepsis in children are critical in improving the
   outcome of this disease.

   The immune inflammatory response plays an important role in the
   development of sepsis. In the early stage of the disease, a variety of
   immune cells (such as neutrophils and macrophages) and the inflammatory
   mediators and cytokines that they release can induce excessive
   inflammatory responses.[49]^8 In contrast, in the late stage of the
   disease, the body is dominated by vigorous and long-lasting
   immunosuppression. This immune-mediated damage, which is thought to be
   an important cause of functional damage to organs, is regulated by a
   complex molecular network throughout the body.[50]^9^,[51]^10 Exploring
   the immune-response link that plays an important role in the
   development and progression of sepsis in children, elucidating the
   relevant functions and mechanisms of the cells and molecules involved,
   identifying the key nodes and the methods that can be used to intervene
   in their function, and improving the success rate of sepsis treatment
   by intervening at these nodes will be of great significance in clinical
   practice.

   As science and technology advance and the methods used to obtain big
   data mature, relevant analysis based on bioinformatic technology has
   given us a certain understanding of the mechanism of the occurrence and
   development of sepsis. The identification of the molecular
   characteristics and biochemical pathways operating in sepsis patients
   has stimulated researchers’ interest in new biomarkers.[52]^11 To the
   best of our knowledge, no previous study has reported the
   immune-related molecular characteristics of sepsis in children. In this
   study, the Gene Expression Omnibus (GEO) was used to analyse and
   explore immune-related key genes, their interactions, and their
   regulatory networks in childhood sepsis. Our findings provide new ideas
   that support in-depth analyses of the mechanisms through which sepsis
   occurs and progresses in children and that may aid in the
   identification of new diagnostic biomarkers.

Materials and Methods

Downloading and Preprocessing of Data

   The reliable original microarray results (CEL format) of the
   transcriptome datasets [53]GSE145227[54]^12 and [55]GSE26440[56]^13 of
   children with sepsis and normal control blood samples were downloaded
   from the GEO database, and the annotation files of [57]GPL23178
   (OElncRNAs520855F, Affymetrix Human Custom lncRNA Array) and [58]GPL570
   (HG-U133_Plus_2, Affymetrix Human Genome U133 Plus 2.0 Array) were
   downloaded. The [59]GSE145227 dataset contains the transcriptome data
   (mRNA + lncRNA) of 10 septic children and 12 normal controls. All the
   samples in [60]GSE145227 came from the same hospital. The [61]GSE26440
   dataset contains the transcriptome data (mRNA) of 98 septic children
   and 32 normal controls. The [62]GSE26440 samples came from 11
   hospitals. The baseline characteristics of the recruited datasets are
   shown in [63]Table 1. The original data in the CEL format were read
   using the oligo package,[64]^14 and background correction and data
   normalization were performed to obtain expression matrices for
   analysis.

Table 1.

   Baseline Characteristics of the Recruited Datasets
          Characteristic             [65]GSE26440        [66]GSE145227
                                   Sepsis    Control    Sepsis    Control
   Number                        98         32         10        12
   Age, median months (range)    26 (0–131) 20 (0–116) 4 (1–97)  13 (6–20)
   Sex, n (%)
    Male                         52 (53.1)  19 (59.4)  8 (80)    11 (91.7)
    Female                       46 (46.9)  13 (40.6)  2 (20)    1 (8.3)
   Infection site, n (%)
    Lung                         –          –          2 (20.0)  –
    Brain                        –          –          4 (20.0)  –
    Other                        –          –          4 (40.0)  –
   Comorbidity, n (%)            41 (41.8)  –          –         –
   Immune suppression, n (%)     23 (23.5)  –          –         –
   Using hydrocortisone, n (%)   35 (35.7)  –          –         –
   Gram-positive bacteria, n (%) 23 (23.5)  –          –         –
   Gram-negative bacteria, n (%) 20 (20.4)  –          –         –
   Negative cultures, n (%)      45 (45.9)  –          –         –
   ICU stay, median days (range) –          –          11 (1–42) –
   ICU mortality, n (%)          18 (18.4)  –          3 (30.0)  –
   [67]Open in a new tab

   Abbreviation: ICU, intensive care unit.

Differential Analysis and Immune-Related Genes

   We used the limma package[68]^15 of the R language to identify
   differentially expressed genes (DEGs) in the sepsis and the control
   groups using a linear model. The selection criteria of DEGs were an
   adjusted P value < 0.05 and |log[2] fold change (FC)| > 1. The volcano
   map and heatmap of DEGs were plotted using the ggplot2 package and the
   pheatmap package.

   The immune-related gene list was obtained from the Immunology Database
   and Analysis Portal (ImmPort) of InnateDB website
   ([69]https://www.innatedb.com/)[70]^16 and contained approximately 4815
   immune-related genes. The immune genes among the DEGs of the two
   datasets were extracted by referencing this list, and the DEGs the two
   datasets shared in common were displayed in a Venn diagram.

Functional Enrichment Analysis

   Gene Ontology (GO) functional annotation is a common method for
   large-scale gene functional enrichment studies. The three top-level
   categories are biological process, molecular function, and cellular
   component. The Kyoto Encyclopedia of Genes and Genomes (KEGG) is a
   widely used database that stores information about genomes, biological
   pathways, diseases, and drugs. The clusterProfiler package[71]^17 was
   used to perform GO and KEGG enrichment analysis on
   DEGs.[72]^18^,[73]^19 An adjusted P < 0.05 was considered statistically
   significant. Gene set enrichment analysis (GSEA) is an analytical
   method applied to microarray data from whole-genome expression
   profiling and compares expressed genes with predefined gene sets. GSEA
   was performed on the gene expression matrix using the clusterProfiler
   package, “c2.cp.kegg.v7.4.symbols.gmt” was selected as the reference
   gene set, and P < 0.05 indicated significant enrichment. Gene set
   variation analysis (GSVA) is a nonparametric, unsupervised algorithm
   that can calculate the enrichment score of a specific gene set in each
   sample. The GSVA package[74]^20 was used to detail the DEG matrix,
   “c2.cp.kegg.v7.4.symbols.gmt” was selected as the reference gene set,
   and the kcdf parameter was set to a Gaussian distribution. Finally, the
   limma package was used for differential pathway screening, in which P <
   0.05 was set as the screening threshold.

Immune Cell Infiltration Analysis

   We calculated the immune cell infiltration of the samples based on the
   gene expression matrix through the CIBERSORTx
   ([75]https://cibersortx.stanford.edu/) online analysis tool[76]^21 and
   filtered the samples for P < 0.05. Then, the degree of immune cell
   infiltration was compared between the septic children and the normal
   controls. For immune cells with significant differences between groups,
   we used Pearson correlation analysis to evaluate the correlation
   between the degrees of infiltration of different immune cells.

Analysis of Differentially Expressed lncRNAs (DElncRNAs) and Construction of
the ceRNA Interaction Network

   We used the limma package of the R language to identify DElncRNAs
   characterizing the sepsis and the control groups of the [77]GSE145227
   dataset. The DElncRNA screening criteria were an adjusted P < 0.05 and
   |log[2] FC| > 1. The miRNAs that bound to these DElncRNAs according to
   the miRNA binding profile database (miRcode)
   ([78]http://www.mircode.org/) were extracted and stored. The screened
   miRNAs were also compared against the miRNA target gene databases
   TargetScan, miRDB, and miRTarBase
   ([79]http://www.targetscan.org/vert_72/; [80]http://mirdb.org/;
   [81]https://mirtarbase.cuhk.edu.cn/~miRTarBase/miRTarBase_2019/php/inde
   x.php) to extract the target genes predicted by all three databases.
   The target genes of the three databases were saved. Finally, we
   compared the obtained target genes with the DEGs obtained from the two
   GEO datasets and obtained the intersection of the lncRNA–miRNA–mRNA
   interactions, which were imported into the regulatory network
   visualization software CytoScape to draw a ceRNA network related to
   sepsis in children.[82]^22

Weighted Gene Co-Expression Network Analysis (WGCNA) and Immune-Related Gene
Screening

   WGCNA was performed on the DEGs selected from the two GEO datasets
   using the WGCNA package.[83]^23 Pearson’s correlation coefficients
   between genes were calculated, and an appropriately soft threshold β
   was selected to make the constructed network more in line with the
   standard of a scale-free network. The gene network was constructed
   using the one-step method, and the adjacency matrix was converted to a
   topological overlap matrix. Hierarchical clustering was used to
   generate a hierarchical clustering tree of genes, and alpha-beta
   pruning was used to divide the gene modules. Finally, the correlation
   between the obtained modules and the differentially infiltrated immune
   cells was analysed, the modules with significant correlation with the
   immune phenotype were selected, and the gene set in the module was
   obtained for subsequent analysis.

Protein–Protein Interaction (PPI) Network Analysis and Immune-Related Key
Genes Screening

   We used the Search Tool for the Retrieval of Interacting Genes (STRING,
   [84]http://www.string-db.org/, Version: 11.0) online tool[85]^24 and
   Cytoscape software to construct the PPI network of DEGs from the
   immune-related genes obtained by WGCNA. The cytoHubba plug-in was used
   to screen the hub genes. In each dataset, the genes that scored in the
   top 20 by the maximum correlation criteria algorithm were that
   dataset’s hub genes. The intersection of the two datasets made up the
   key genes for sepsis.

Construction of Sepsis Molecular Subtypes and Correlation Analysis of
Molecular Subtypes/Immune Cell Infiltration and Key Genes

   Consensus clustering is an algorithm that can be used to identify
   cluster members (such as microarray gene expression data) and their
   numbers in a dataset and is usually used to determine the optimal
   number of clusters k. Consensus clustering validates the rationality of
   clustering through resampling-based methods. Its main purpose is to
   assess the stability of clustering. Consensus clustering and molecular
   subtype screening of [86]GSE26640 were performed using the
   ConsensusClusterPlus package.[87]^25 In short, hierarchical clustering
   is used, with 1000 iterations (80% of the samples are used each time).
   The optimal number of clusters was determined by the clustering score
   of the cumulative distribution function (CDF) curve, and the relative
   change in the area under the CDF curve was evaluated. The reliability
   of the consensus clusters was then verified using principal component
   analysis (PCA) clustering graphs. Pearson correlation analysis was used
   to evaluate the correlations between key genes and molecular
   subtypes/immune cell infiltration.

Analysis of Key Gene Diagnostic Capability

   The diagnostic ability of key genes for sepsis was evaluated. We first
   compared the expression levels of key genes in the disease group and
   the normal control group and then used receiver operating
   characteristic (ROC) curves to analyse the ability of key genes to
   distinguish between the sepsis and normal control groups.

Analysis of Key Gene–Transcription Factor Interaction Network, and Key
Gene–Small-Molecule Compound Interaction Network

   We further analysed the transcription factor network and small-molecule
   compound action network with the key genes. Six key genes was uploaded
   to NetworkAnalyst ([88]https://www.networkanalyst.ca/)[89]^26 and was
   analysed using the transcription factor–gene interactions and
   protein-chemical interactions modules. The reference databases were the
   transcription factor database ENCODE ([90]http://cistrome.org/BETA/)
   and Protein-chemical Interactions ([91]http://ctdbase.org/). The
   analysed network was visualized in Cytoscape.

Analysis of Key Genes Real-Time Quantitative PCR (RT-qPCR)

   Samples were collected from 12 healthy controls and 12 septic patients
   who were hospitalized in the paediatric department of the Seventh
   Affiliated Hospital of Sun Yat-Sen University from October 1, 2021 to
   November 1, 2021. This study was approved by the Ethics Committee of
   the Seventh Affiliated Hospital of Sun Yat-Sen University. Consent was
   obtained by the guardians prior to study commencement. Inclusion
   criteria: 1 month to 14 years of age, meeting the diagnostic criteria
   for sepsis, and hospitalization longer than 24 hours. Exclusion
   criteria: malignant tumours, autoimmune diseases, and use of
   immunosuppressive agents in the past 2 weeks.

   Peripheral-blood samples were collected from all patients within 24
   hours of admission, and peripheral-blood mononuclear cells (PBMCs) were
   immediately isolated using human lymphocyte separation medium (Dayou,
   7111011). Total RNA was extracted from PBMCs using TRIzol reagent
   (Invitrogen, 15596026) according to the manufacturer’s instructions.
   Reverse transcription was performed using Evo M-MLV reverse
   transcription reagent master mix (Accurate Biology Co., Ltd., AG11706).
   RT-qPCR was performed using the SYBR Green Pro Taq HS premixed qPCR
   reagent kit (Accurate Biology Co., Ltd., AG11701). GAPDH was the
   reference gene. The relative expression levels were calculated using
   the 2^-ΔΔCT method. [92]Supplementary table 1 lists the primer pairs
   used in the experiment.

Statistical Analysis

   All data calculations and statistical analyses were performed using R
   programming software ([93]https://www.r-project.org/, version 4.1.0).
   The Benjamini–Hochberg method was used for multiple test correction,
   and false discovery rate correction for multiple tests was performed to
   reduce this rate. Continuous variables were compared between groups by
   Student’s t-test when normally distributed and the Mann–Whitney U-test
   when skewed. The pROC package was used to draw the ROC curve for the
   sepsis diagnostic ability of the key genes, and the area under the
   curve (AUC) was calculated to evaluate the discriminant ability of the
   model. All P values were two-sided tests, and P < 0.05 was considered
   statistically significant.

Results

Data Preprocessing and DEGs Screening

   The data processing of this study is depicted in [94]Figure 1. We
   normalized the original gene expression microarray files of the two
   datasets [95]GSE145227 and [96]GSE26440 to obtain the gene expression
   matrix for analysis. The average gene expression box plot and PCA plot
   of the dataset processing results are shown in [97]Supplementary figure
   1. After preprocessing and quality control of the data, we used the
   limma package of the R language to select a total of 2339 DEGs from the
   [98]GSE145227 mRNA expression matrix and a total of 1146 DEGs from the
   [99]GSE26440 mRNA expression matrix. All these DEGs were displayed in
   the form of volcano plot and heatmaps ([100]Figure 2A–D). The total
   number of DEGs shared in the two datasets was 585, which were used as
   the gene set for subsequent analysis ([101]Figure 2E). Among the DEGs,
   there were 236 immune-related genes shared by both datasets
   ([102]Figure 2F).

Figure 1.

   [103]Figure 1
   [104]Open in a new tab

   Flow chart.

   Abbreviations: CeRNA, competing endogenous RNA; DEG, differentially
   expressed genes; DElncRNA, differentially expressed lncRNA; PPI,
   protein-protein interaction; WGCNA, weighted gene co-expression network
   analysis.

Figure 2.

   [105]Figure 2
   [106]Open in a new tab

   Differentially expressed genes (DEGs) analysis and immune-related
   genes. (A and B) The volcano plot of [107]GSE145227 and [108]GSE26440
   expression matrix. (C and D) The heatmap of [109]GSE145227 and
   [110]GSE26440 expression matrix. (E) Venn diagram of DEGs in
   [111]GSE145227 and [112]GSE26440. (F) Venn diagram of immune-related
   genes in DEGs.

Functional Enrichment Analysis of DEGs

   In the enrichment results, we found many immune-related biological
   functions. KEGG analysis showed that the pathways enriched by DEGs
   mainly included haematopoietic cell lineage, Th1 and Th2 cell
   differentiation, PD-L1 expression and the PD-1 checkpoint pathway in
   cancer, Th17 cell differentiation, and osteoclast differentiation
   ([113]Figure 3, [114]Supplementary table 2). GO analysis showed that
   DEGs were mainly associated with neutrophil degranulation, neutrophil
   activation involved in the immune response, specific granules, tertiary
   granules, immune receptor activity, and non-membrane spanning protein
   tyrosine kinase activity ([115]Figure 4, [116]Supplementary table 3).
   GSEA enriched pathways mainly involve T cell receptor signalling
   pathways and natural killer cell-mediated cytotoxicity-related pathways
   ([117]Supplementary figure 2, [118]Supplementary table 4). The results
   of GSVA analysis are shown in [119]Supplementary figure 3. The DEGs of
   the two datasets were mainly enriched in pathways involving galactose
   metabolism, circadian rhythm mammals, one-carbon pool by folate,
   citrate cycle, and selenoamino acid metabolism.

Figure 3.

   [120]Figure 3
   [121]Open in a new tab

   Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment
   analysis of differentially expressed genes (DEGs). (A) The KEGG
   analysis of DEGs is displayed in the form of bubble graphs. The closer
   the colour is to red, the smaller the P is. The larger the bubble is,
   the more enriched DEGs are in the pathway. (B) The KEGG analysis of
   DEGs is displayed in the form of bar graphs. The closer the colour is
   to red, the smaller the P is. The horizontal axis indicates the number
   of genes enriched in the pathway. (C) Network diagram of the KEGG
   results.

Figure 4.

   [122]Figure 4
   [123]Open in a new tab

   Gene Ontology (GO) enrichment analysis of differentially expressed
   genes (DEGs). (A) The enrichment results of biological process,
   cellular component, and molecular function in the GO analysis are shown
   in the form of bubble graphs. The closer the colour is to red, the
   smaller the P is. The larger the bubble is, the more enriched DEGs are
   in the pathway. (B) The biological process, cellular component, and
   molecular function enrichment results of the GO analysis are shown as a
   bar graph. The closer the colour is to red, the smaller the P is. The
   horizontal axis represents the number of enriched genes in the pathway.

Immune Infiltration Analysis

   The infiltration of 22 types of immune cells in different groups is
   shown in [124]Figure 5A–B. A total of five types of immune cells showed
   significant differences between the two groups. M0 macrophages and
   neutrophils infiltrated more in the sepsis group, while activated
   memory CD4^+ T cells, resting memory CD4^+ T cells, and CD8^+ T cells
   infiltrated less. We calculated the correlations between immune cell
   types with significant differences in the degree of infiltration
   between the two groups, and the correlation matrix is shown in
   [125]Figure 5C–D. The correlation between neutrophils and resting
   memory CD4^+ T cells and the correlation between neutrophils and CD8^+
   T cells were positive and high. The correlation between neutrophils and
   M0 macrophages was negative, and the absolute value of the coefficient
   was large.

Figure 5.

   [126]Figure 5
   [127]Open in a new tab

   Immune cell infiltration analysis. (A and B) Comparison of the degree
   of immune cell infiltration in the [128]GSE145227 and [129]GSE26440
   datasets between the sepsis group and the normal control group. (C and
   D) The correlation matrix of immune cells with significant differences.
   Red indicates positive correlation, blue indicates negative
   correlation, and a darker colour indicates a stronger correlation. The
   nonsignificant ones are indicated by a black “X”. *P < 0.05, **P <
   0.01, ***P < 0.001, ****P < 0.0001.

CeRNA Interaction Network Analysis

   We screened 2219 DElncRNAs from the lncRNA data of the [130]GSE145227
   dataset. miRNAs targeting the DElncRNAs were searched through the
   miRcode database, and then miRNA target mRNAs were searched in the
   TargetScan, miRDB, and miRTarBase databases. A total of 242 mRNAs
   shared by the three databases were found. We cross-checked these 242
   mRNAs with the 585 selected DEGs and found 10 mRNAs. Finally, we
   obtained the lncRNA–miRNA–mRNA interaction relationship and imported it
   into the network visualization software CytoScape to obtain the
   sepsis-related ceRNA network ([131]Figure 6A and [132]B).

Figure 6.

   [133]Figure 6
   [134]Open in a new tab

   Construction of competing endogenous RNA (ceRNA) network and Weighted
   gene co-expression network analysis (WGCNA). (A) ceRNA network. Orange
   is the miRNA, grey is the lncRNA, red is the upregulated mRNA, and blue
   is the downregulated mRNA. (B) Venn diagram of the comparison between
   the mRNA predicted by the lncRNA and the differentially expressed genes
   (DEGs). A total of 10 mRNAs that competed with the lncRNA for binding
   to the miRNA were found. (C and E) WGCNA results of the DEGs of
   [135]GSE145227 and [136]GSE26440. The cluster analysis of genes is
   represented by a tree diagram, and the modules of gene classification
   are represented by different colours. (D and F) Correlation matrix
   between the module scores obtained by the WGCNA of [137]GSE145227 and
   [138]GSE26440 and the degree of differential immune cell infiltration.
   Red indicates a positive correlation, and green indicates a negative
   correlation. The darker the colour, the higher the degree of
   correlation. The correlation coefficient and P value are displayed in
   the grid in the matrix.

WGCNA

   We performed WGCNA on the DEGs selected from the two datasets
   ([139]Figure 6C–F). Three modules were calculated in [140]GSE145227,
   and 10 modules were calculated in [141]GSE26440. Among them, two
   modules of [142]GSE145227 had a strong correlation with immune cell
   infiltration, and six modules of [143]GSE26440 had a strong correlation
   with immune cell infiltration. We selected the two immune-related
   modules with the highest proportions of genes and took these two gene
   sets as the immune-related DEGs in the subsequent analysis. There were
   411 genes in [144]GSE145227 and 197 genes in [145]GSE26440.

PPI Analysis and Screening for Key Genes

   We performed PPI analysis of the immune-related genes obtained by
   WGCNA. The PPI network of [146]GSE145227 has 291 nodes and 593 edges.
   The PPI network of [147]GSE26440 has 136 nodes and 593 edges
   ([148]Figure 7). The node sizes of the PPI networks correlate with the
   numbers of edges that the networks contain. The MAPL14, FYN, HCK, UBC,
   LCK, and FYN genes had high weights in the PPI networks of both
   datasets and showed strong correlations with other nodes (genes). Hub
   gene analysis can identify genes that occupy core regulatory positions
   in PPI networks. We took the top 20 hub genes in the PPI networks of
   [149]GSE145227 and [150]GSE26440. The hub genes that appear in both
   datasets are FYN, FBL, ATM, WDR75, FOXO1, and ITK; these are the key
   immune-related genes identified in this study.

Figure 7.

   [151]Figure 7
   [152]Open in a new tab

   Results of protein-protein interaction (PPI) analysis and screening of
   immune-related key genes. (A and B) The PPI network diagram of
   [153]GSE145227 and [154]GSE26440. The larger the node, the larger the
   clustering coefficient, indicating that the gene occupies a more
   important spot in the network. (C and D) Diagrams of hub genes obtained
   from [155]GSE145227 and [156]GSE26440. The hub genes were the top 20
   differentially expressed genes by the Matthews correlation coefficient
   score of the maximum correlation algorithm. The darker the colour, the
   greater the proportion of interactions this gene has in the network.
   (E) The comparison of the hub genes obtained from the two datasets
   yielded six key genes.

Analysis of Sepsis Molecular Subtypes and the Correlation Between Molecular
Subtypes and Key Genes

   The molecular subtypes of sepsis were determined by unsupervised
   clustering of 98 sepsis samples in the [157]GSE26440 dataset. The
   optimal number of clusters was determined by the k value. The
   ConsensusClusterPlus of R package was used to divide all sepsis samples
   into k subtypes (k = 2–6). After evaluating the relative change in the
   area under the CDF curve and the consensus heatmap, we chose a
   four-cluster solution (k = 5), which did not increase the area under
   the CDF curve significantly ([158]Supplementary figure 4A–F). The PCA
   cluster map showed that sepsis samples could be divided into five
   subtypes (C1, C2, C3, C4, and C5) ([159]Supplementary figure 4G). When
   we analysed the correlation between different molecular subtypes and
   key genes, we found that there were strong correlations between certain
   key genes and different molecular subtypes ([160]Supplementary figure
   4H).

Association Between Key Genes and Immune Cell Infiltration

   We analysed the correlation between key genes and immune cell
   infiltration in the two datasets. As shown in [161]Supplementary figure
   5, there was a strong correlation between key genes and infiltrating
   immune cells. Among them, in the [162]GSE145227 dataset, the key genes
   had the strongest correlation with resting memory CD4^+ T cells, and in
   the [163]GSE26440 dataset, the key genes had the strongest correlation
   with neutrophils.

Diagnostic Capabilities of Key Genes for Sepsis

   We compared the expression levels of key genes in the sepsis and the
   normal control groups, and the results showed that the expression
   levels of these six key genes in the sepsis group were lower than those
   in the control group (P < 0.05). This indicates that the inactivation
   of these genes is of great significance to the occurrence and
   progression of sepsis. The AUCs of the diagnostic ROC curves of the key
   genes in the two datasets were all above 0.84, indicating that the key
   genes had a good ability to distinguish between children with and
   without sepsis ([164]Figure 8A–D).

Figure 8.

   [165]Figure 8
   [166]Open in a new tab

   The ability of key genes to diagnose sepsis and the interactions
   between transcription factors and small-molecule compounds. (A and C)
   Differences in the expression levels of key genes in the [167]GSE145227
   and [168]GSE26440 datasets between the sepsis and normal control
   groups. (B and D) Receiver operating characteristic curves of the
   ability of the key genes in the [169]GSE145227 and [170]GSE26440
   datasets to differentiate between the sepsis and the normal control
   group. (E) Network diagram of the interaction between key genes and
   transcription factors. (F) The network diagram of the interaction
   between key genes and small-molecule compounds. **P < 0.01, ***P <
   0.001, ****P < 0.0001.

Analysis of Transcription Factors and Small-Molecule Compounds

   Key gene–transcription factor interaction network analysis showed that
   ATM, FBL, FOXO1, and WDR75 were associated with 25, 51, 23, and 16
   transcription factors, respectively ([171]Figure 8E). Key
   gene–small-molecule compound interaction network analysis showed that
   ATM was correlated with cyclosporine and benzo(a)pyrene, FOXO1 was
   correlated with Antirheumatic agents, benzo(a)pyrene and cyclosporine,
   FYN was correlated with benzo(a)pyrene, cyclosporine and (+)-JQ1
   compound, FBL was correlated with cyclosporine and sodium selenite,
   WDR75 was correlated with cyclosporine and (+)-JQ1 compound, and ITK
   was correlated with antirheumatic agents, benzo(a)pyrene, (+)-JQ1
   compound, and sodium selenite ([172]Figure 8F).

Verification of Selected Key Genes at the Transcriptional Level

   RT-qPCR was used to compare the expression of the six key genes between
   children with sepsis (n=12) and the control group (n=12). The
   expression of all six was consistent with the trend observed in the
   microarray analysis, and the differences between groups were all
   significant ([173]Figure 9).

Figure 9.

   [174]Figure 9
   [175]Open in a new tab

   Expression of six key genes were compared between controls and septic
   children by real-time quantitative polymerase chain reaction.
   Differences between two groups were analyzed using the t-test or
   Mann–Whitney U-test. *P < 0.05, **P < 0.01, ***P < 0.001.

Discussion

   At present, the pathogenesis of sepsis in children is not clear, and
   methods for the timely diagnosis of sepsis are lacking. Microarray
   analysis is an ideal method for use in the comprehensive study of
   sepsis in children. To explore the molecular mechanism of sepsis in
   this study, we performed transcriptome analysis of two datasets related
   to sepsis in children. From the enrichment results, we found many
   immune-related biological functions. Immune cell infiltration analysis
   showed that immunosuppression occurred during the course of the
   disease. Six key genes related to sepsis in children (FYN, FBL, ATM,
   WDR75, FOXO1, and ITK) were identified through the WGCNA and PPI
   networks. The networks demonstrate that these genes are associated with
   immune cell infiltration and with molecular subtypes of sepsis and that
   they have good specificity for the diagnosis of sepsis in children.
   These findings will be helpful for the diagnosis of sepsis in children.

   Because of the immaturity of the immune system in children, when
   confronted with pathogen invasion, children’s bodies generate immune
   responses that are not exactly the same as those in adults. During the
   acute inflammatory response caused by a severe infection, the human
   immune system undergoes various changes that trigger a variety of
   immune responses, resulting in direct effects on the production,
   function, and survival of immune cells. When the disease progresses to
   a certain extent, the body develops an immune imbalance that may result
   in persistent and recurrent infections, which increases the mortality
   rate of patients with sepsis.[176]^27 Alterations in the immune system
   at the genetic level affect the occurrence and development of sepsis,
   and they are characterized by rapid response and high sensitivity for
   the diagnosis of sepsis; these features can reduce the time required
   for diagnosis. Early diagnosis of sepsis can help physicians provide
   early treatment, thereby reducing patient mortality.[177]^28–30

   In our functional enrichment of DEGs, a variety of common immune
   responses participated in the development and progression of sepsis in
   children. Interestingly, in the analysis of immune cell infiltration,
   innate immune cells, such as macrophages and neutrophils, infiltrated
   more in the sepsis group, while acquired immune cells, such as
   activated memory CD4^+ T cells, resting memory CD4^+ T cells, and CD8^+
   T cells, infiltrated them less. This is consistent with a new viewpoint
   about sepsis. Immune activation and immunosuppression occur throughout
   the course of the disease. Dysfunction of the body’s innate immune
   system and immune suppression of the acquired immune system cause an
   imbalance and persistence of the inflammatory response and
   anti-inflammatory response.[178]^31 It has been well documented that
   multiple immune cells undergo numerical and functional abnormalities
   during sepsis. When infection initially occurs, neutrophils and
   monocyte-macrophages undergo proliferation and are recruited to the
   site of infection to exert immune effects upon stimulation by multiple
   factors. However, the increased number of neutrophils and
   monocyte-macrophages does not represent an enhanced immune function.
   With the progress of the disease, the percentage of immature
   neutrophils increases, and the phagocytic capacity of neutrophils, as
   well as the production of myeloperoxidase, lactoferrin, and reactive
   oxygen species, decline significantly.[179]^32^,[180]^33 This
   illustrates that the functional status of neutrophils can be switched
   from an initial immune activity enhancement to immunosuppression.

   On the other hand, the decrease in the number of T cells confirms the
   immunosuppressive state of the body in sepsis. If the acquired immune
   system is severely inhibited, it will lead to recurrent and persistent
   infections that are difficult to control, leading to multiple-organ
   failure. Zhao et al found that memory CD4^+ T cells played an
   immune-protective role in mice infected with severe acute respiratory
   syndrome (SARS) coronavirus or middle East Respiratory Syndrome (MERS)
   coronavirus and activated the innate immune response and CD8^+ T cell
   immune response by secreting γ-interferon.[181]^34 Elimination of
   memory CD4^+ T cells significantly reduced the survival rate of
   infected mice.

   We found six key genes in the development and progression of sepsis in
   children, namely, FYN, FBL, ATM, WDR75, FOXO1, and ITK. All six key
   genes were downregulated in children with sepsis by both microarray
   analysis and our RT-qPCR experiment. All six key genes were also
   associated with immune cell infiltration and molecular subtypes of
   sepsis. The six key genes could also be used to distinguish between
   sepsis and healthy children.

   FOXO1 and ITK have been associated with sepsis in previous studies.
   FOXO1, one of the key downstream targets of the AKT pathway, is an
   important nuclear transcription factor that regulates various
   physiological and pathological processes such as cell differentiation,
   metabolism, and inflammation.[182]^35 Crossland et al[183]^36 and
   Castillero et al[184]^37 both demonstrated that the FOXO1 signaling
   pathway was associated with damage to skeletal muscle during sepsis.
   Wang et al used an enzyme-linked immunosorbent assay to measure the
   levels of various proteins in the serum of patients with sepsis and
   found that the expression level of FOXO1 successively increased in the
   control, sepsis, severe sepsis, and septic shock groups (P <
   0.01).[185]^38 With the 28-day mortality rate as the study endpoint,
   FOXO1 expression was also significantly higher in survivors than in
   nonsurvivors (P < 0.001).

   ITK belongs to the Tec tyrosine kinase family and is involved in the
   regulation of a variety of immune cells, such as the activation of T
   cells and the differentiation of T helper cells.[186]^39 Consistent
   with the results of our study, ITK has been found in other
   bioinformatics analyses to act as a key gene in sepsis.[187]^40 In a
   lipopolysaccharide-induced sepsis mouse model, Itk−/− mice exhibited a
   more severe hypothermic response and more tumour necrosis factor α
   secretion than control mice. These effects are achieved through
   regulation of the nuclear factor kappa B, PI3K/Akt, and p38 signaling
   pathways in mastocytes by Itk.[188]^41

   In this study, we also found some genes whose functions in sepsis have
   not been determined, suggesting that they may become biomarkers of
   sepsis. FYN belongs to the Src family of protein tyrosine kinases and
   can activate a variety of downstream signaling pathways, such as
   oxidative stress, endoplasmic reticulum stress, and autophagy
   dysfunction.[189]^42 In immune cells, FYN is involved in the
   development and activation of T lymphocytes and NK
   cells.[190]^43^,[191]^44 FYN is also essential for the production and
   activation of platelets.[192]^45 However, the role of FYN in sepsis has
   not been fully elucidated. In this study, the expression of FYN was
   decreased in children with sepsis, which may explain the functional
   inhibition of T cells, NK cells, and platelets in sepsis.

   ATM is a major regulator of double-strand DNA break signalling and the
   stress response. ATM loss can result in mitochondrial
   dysfunction,[193]^46 higher reactive oxygen species levels,[194]^47 and
   DNA oxidative damage.[195]^48 ATM expression is also downregulated in
   osteoarthritis and rheumatoid arthritis.[196]^49^,[197]^50 Figueiredo
   et al found that the protective effect of anthracyclines on septic mice
   depended on ATM and found that ATM was a negative regulator of
   interleukin-1β.[198]^51 These data indicate that ATM is an important
   regulator of the immune response and may provide protection for
   inflammatory diseases, including sepsis.

   Both FBL and WDR75 are nucleolar proteins, and both are involved in the
   processing of pre-rRNA and the regulation of rRNA
   transcription.[199]^52^,[200]^53 FBL is highly conserved in its amino
   acid sequence. It can catalyse the 2′-O-methylation of rRNA and is one
   of the key components of ribosome formation, thus affecting the
   translation efficiency of mRNA. In the past, FBL has been widely
   studied in cancer. Overexpression of FBL can promote tumour progression
   and is associated with the low survival rate of breast cancer and
   hepatocellular carcinoma patients.[201]^52^,[202]^54^,[203]^55
   Recently, FBL has been found to be downregulated in some inflammatory
   diseases. In a meta-analysis of the gene expression profiles of
   peripheral blood cells in patients with systemic lupus erythematosus,
   the gene with the greatest downregulation was FBL (ES = −1.8543,
   P-value = 8.56E-06).[204]^56 Tiku et al found that HeLa cells and
   macrophages showed downregulation of FBL 24 hours after bacterial
   infection.[205]^57 Knockdown of FBL increased the clearance rate of
   intracellular bacteria, inhibited the secretion of inflammatory
   cytokines, increased the secretion of anti-inflammatory cytokines, and
   improved the survival rate of cells. At the same time, the binding of
   bacteria to intracellular lysosomes increased, which may explain the
   increase in the intracellular bacterial clearance rate and the increase
   in the cell survival rate after FBL knockdown.

   The understanding of WDR75 is still very limited, and studies on its
   association with disease are limited to a small number of cancers.
   Uhlen et al analysed the pathological profile of the human cancer
   transcriptome and found that abnormally high expression of WDR75 mRNA
   was an unfavourable prognostic factor for renal carcinoma and hepatic
   carcinoma.[206]^58 In a recent study, researchers knocked down WDR75 in
   a variety of cultured human cell models. In both tumour cell models and
   nontumour cell models, this process activated the RPL5/RPL11-dependent
   stable p53 checkpoint, eventually leading to impaired cell
   proliferation and senescence.[207]^53 In summary, given our
   bioinformatic analysis, we cannot ignore the association between genes
   related to the occurrence of ribosomal abnormalities, ie, FBL and
   WDR75, and the occurrence and progression of sepsis in children.

   We constructed a ceRNA interaction network and a key gene-transcription
   factor interaction network to better understand the complex molecular
   regulation of sepsis in children. To search for effective treatments of
   sepsis, we used the protein–chemical interactions module of
   NetworkAnalyst to screen drugs or small-molecule compounds that may
   target the key genes of sepsis in children. The role of these drugs or
   small-molecule compounds in sepsis in children needs to be further
   explored.

   Our study is the first to comprehensively analyse the molecular
   biological characteristics of septic immunity in children and provides
   new insights for further understanding the immune mechanism of sepsis.
   One limitation of our study is that the data were obtained from public
   databases without validated in our unit, which may have led to a biased
   interpretation of the results. Second, the validation experiments used
   a small number of clinical samples. In the future, a regression model
   can be established using larger numbers of samples and used to further
   analyse the relationship between key genes and disease diagnosis and
   prognosis. Third, little is known about the mechanisms of action of the
   key genes identified in this study, and exploring this will be one of
   our future research directions. Fourth, due to the lack of clinical
   information on individual patients in the datasets, it was not possible
   to fully investigate the clinical characteristics represented by the
   different subtypes.

Conclusion

   We investigated the immune microenvironment and regulatory networks
   related to sepsis in children at the molecular level using
   transcriptomic profiling and screened for and validated key immune
   genes related to sepsis. Our results suggest that FYN, FBL, ATM, WDR75,
   FOXO1, and ITK act as key immune genes in children with sepsis and that
   these genes may be directly involved in the pathophysiology of this
   condition. These findings provide new directions for the investigation
   of potential diagnostic markers and molecular mechanisms of sepsis in
   children. Our results, especially the identification of FBL and WDR75
   in the context of ribosome biogenesis, may stimulate further studies in
   this important but still relatively undeveloped field of biomedical
   research and provide a basis for the use of precision medicine in
   children with sepsis.

Acknowledgments