Abstract Objective: To explore immune-related feature genes in patients with dilated cardiomyopathy (DCM). Methods: Expression profiles from three datasets ([42]GSE1145, [43]GSE21610 and [44]GSE21819) of human cardiac tissues of DCM and healthy controls were downloaded from the GEO database. After data preprocessing, differentially expressed genes (DEGs) were identified by the ‘limma’ package in R software. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were then performed to identify biological functions of the DEGs. The compositional patterns of stromal and immune cells were estimated using xCell. Hub genes and functional modules were identified based on protein-protein interaction (PPI) network analysis by STRING webtool and Cytoscape application. Correlation analysis was performed between immune cell subtypes and hub genes. Hub genes with |correlation coefficient| > 0.5 and p value <0.05 were selected as feature biomarkers. A logistic regression model was constructed based on the selected biomarkers and validated in datasets [45]GSE5406 and [46]GSE57338. Results: A total of 1,005 DEGs were identified. Functional enrichment analyses indicated that extracellular matrix remodeling and immune and inflammation disorder played important roles in the pathogenesis of DCM. Immune cells, including CD8^+ T-cells, macrophages M1 and Th1 cells, were proved to be significantly changed in DCM patients by immune cell infiltration analysis. In the PPI network analysis, STAT3, IL6, CCL2, PIK3R1, ESR1, CCL5, IL17A, TLR2, BUB1B and MYC were identified as hub genes, among which CCL2, CCL5 and TLR2 were further screened as feature biomarkers by using hub genes and immune cells correlation analysis. A diagnosis model was successfully constructed by using the three biomarkers with area under the curve (AUC) scores 0.981, 0.867 and 0.946 in merged dataset, [47]GSE5406 and [48]GSE57338, respectively. Conclusion: The present study identified three immune-related genes as diagnostic biomarkers for DCM, providing a novel perspective of immune and inflammatory response for the exploration of DCM molecular mechanisms. Keywords: dilated cardiomyopathy, immune infiltration, diagnosis model, biomarkers, logistic regression Introduction Dilated cardiomyopathy (DCM) is defined as left or both ventricles enlargement and contraction impairment in the absence of abnormal loading conditions or coronary disease. The estimated prevalence of DCM was >1:250 of the population ([49]Hershberger et al., 2013). It accounts for a considerable portion of heart failure (HF) ([50]Reichart et al., 2019) and is the leading cause of heart transplantation ([51]Weintraub et al., 2017). DCM results from a diverse range of etiologies including genetic alteration, viral infection, drug and alcohol with a heterogeneous pathophysiological mechanism.([52]Felker et al., 2000). Immune and inflammatory response plays an important role in cardiovascular disease, such as myocardial infarction ([53]Kologrivova et al., 2021), atrial fibrillation ([54]Li et al., 2021). As for DCM, myocardial damage, whether from a genetic or environmental etiology, triggers inflammation and recruits immune cells to the heart. Regional inflammation causes tissue fibrosis, which stiffens the heart and promotes the progression to dilation and HF ([55]Schultheiss et al., 2019). Myocardial inflammation was related to poor long-term outcome for DCM ([56]Nakayama et al., 2017). Immune cells, especially T lymphocytes and macrophages, promote myocardial inflammation and contribute to ventricular remodeling ([57]Comarmond and Cacoub, 2017; [58]Jain et al., 2021). For example, it is been reported that polarization of macrophages towards M2 was associated with ventricular remodeling and poor long-term prognosis in DCM ([59]Nakayama et al., 2017). In addition, a significant increase of the number of Th1 and Th17 cells was observed while the number of Treg cells decreased in DCM patients ([60]Wei et al., 2017; [61]Liu et al., 2021). Infiltrated immune cells release cytokines and chemokines, such as TGF-β1, IL-1β, and TNF, promoting collagen deposition, fibrosis and cardiac remodeling ([62]Schultheiss et al., 2019). Microarray profiling research has also shown that the expression of some immune-related genes in left ventricle of DCM was dysregulated, such as IL-6, CXCL10, TLR3 ([63]Qiao et al., 2017). Based on the potential relationship between inflammation and DCM, some immunological therapies have been reported, including immunosuppressants, ([64]Parrillo et al., 1989; [65]Frustaci et al., 2009), immunoadsorption, ([66]Bian et al., 2021), IL-1 inhibitors, ([67]Van Tassell et al., 2017; [68]De Luca et al., 2018). However, these immune-based therapies are either unsatisfactory or not fully confirmed by large randomized, multi-center research. It is necessary to better understand DCM pathogenetic mechanism. As shown in [69]Figure 1, we integrated several GEO datasets. Through systematically bioinformatics analyses, we identified differentially expressed genes (DEGs) between DCM and healthy cardiac samples and explored the potential pathological mechanism of DCM by functional enrichment analysis, immune cell infiltration analysis and protein-protein network analysis. Moreover, we constructed a three-gene diagnostic model via logistic regression analysis. Finally, we confirmed the validity of the diagnostic model in another two datasets [70]GSE406 and [71]GSE57338. This article is the first to explore the pathogenesis of DCM from the perspective of immunity and inflammation with bioinformatics, and we hope our analyses will provide potential targets for future in-depth research. FIGURE 1. [72]FIGURE 1 [73]Open in a new tab Flowchart of the study. Materials and Methods GEO Datasets The DCM RNA expression datasets were collected from the online GEO database ([74]http://www.ncbi.nlm.nih.gov/geo/). The keywords “dilated cardiomyopathy”, “Homo sapiens”, and “expression profiling by array” were used on the initial search, and 184 DCM related studies were found. Then the following criteria were used to further screen datasets: 1) The study includes DCM case vs healthy control; 2) tissue samples obtained from left ventricle; 3) sample size was bigger than 10. Three datasets qualified for the above criteria and performed on the same platform were combined for analysis. Another two datasets derived from other platforms were used as validating datasets. The processed data of [75]GSE1145 (platform: [76]GPL570, including 11samples of control and 12 samples of DCM), [77]GSE21610 ([78]Schwientek et al., 2010) (platform: [79]GPL570, including eight samples of control and 21 samples of DCM), [80]GSE29819 ([81]Gaertner et al., 2012) (platform: [82]GPL570, including six samples of control and seven samples of DCM), [83]GSE5406 ([84]Hannenhalli et al., 2006) (platform: [85]GPL96, including 16 samples of control and 86 samples of DCM) [86]GSE57338 ([87]Liu et al., 2015) (platform: [88]GPL11532, including 132 samples of control and 82 samples of DCM) were downloaded as expression matrix with R package ‘GEOquery’.([89]Davis and Meltzer, 2007). The mRNA expression profiles of controls and targeted patients were extracted and were performed log2 transformation before further analysis (Only if they have not be log2 transformed). Data Preprocessing and DEGs Identification The ‘limma’ ([90]Ritchie et al., 2015) and ‘sva’ ([91]Parker et al., 2014) packages in R software (R version 4.0.3) were used to correct intra- and inter-batch effect. Prcomp function was used to perform principal component analysis (PCA). Next, probes annotations were performed. Probes annotated to >1 gene were removed. For multiple probes annotated to the same gene, the first one appeared retained. Finally, the ‘limma’ package was also used for DEGs identification between different groups with cut-off values of adjusted p-value < 0.05 and |fold change| ≥ 1.5. In addition, DEGs were also identified by applying robust rank aggregation (RRA) algorithm ([92]Kolde et al., 2012) with the same criteria. Venn diagrams ([93]http://bioinformatics.psb.ugent.be/webtools/Venn/) were used to summarize the overlapping DEGs between ‘sva’ and RRA algorithms. Boxplot function, ‘ggplot2’ and ‘pheatmap’ packages were then used to plot gene expression boxplot, volcano plot and heatmap. Functional Enrichment Analysis To further analyze the functions of DEGs, the R package ‘clusterProfiler’ ([94]Yu et al., 2012) was used to perform Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. The cut-off values for GO and KEGG were set as p < 0.05. The ‘enrichplot’ package was used to draw dot plots for the results of functional enrichment analysis. Immune Cell Infiltration Analysis xCell ([95]Aran et al., 2017) is a new gene signature-based method to estimate the content of immune and stromal cells. It was validated using extensive in-silico simulations and cytometry immunophenotyping. XCell package was applied to normalized merged data to portray the cellular heterogeneity landscape of left ventricular expression profiles. We compared the cell distribution differences between the two groups through t-test, and the cutoff value was set as p < 0.05. The results were significantly different between the two groups, categorized based on their traits into three categories: “lymphoid and myeloid cells”, “stem cells” and “stromal cells and others.” These were visualized by using ‘ggplot2’ and ‘ggpubr’ packages. Correlation analyses of immune cell subtypes and hub genes were performed with ‘psych’ and ‘corrplot’ packages. Pearson correlation coefficient was used to assess the strength of correlation. Hub genes whose absolute correlation coefficient with immune cells >0.5 and p-value < 0.05 are selected to further study. Protein-Protein Interaction Network and Gene Module Identification Search Tool for the Retrieval of Interacting Genes (STRING, [96]https://string-db.org) is a webtool that provide validated and predicted information of PPIs ([97]Szklarczyk et al., 2019). The list of DEGs was uploaded into STRING website to detect significant protein interactions with minimum interaction score >0.7. The network was then exported and visualized by Cytoscape 3.7.1 software ([98]Cline et al., 2007). The CytoHubba plugin ([99]Chin et al., 2014) was used to calculate hub genes with a high degree. The results were directly visualized by Cytoscape. Additionally, the MCODE plugin ([100]Bader and Hogue, 2003) was used to identify highly interconnected clusters with the cutoff parameters set as follows: degree cutoff = 2, node score cutoff = 0.2, k-Core = 2, Max. Depth = 100. The results were further screened with criteria set as MCODE score >4 and nodes number >5. Gene ontology biological process enrichment analysis was performed on the significant modules. Hub Genes Verification and Diagnosis Model Construction Hub genes with correlation coefficient >0.5 and p < 0.05 with immune cells were selected. The expression profiles of the selected biomarkers were visualized in boxplot and validated in another two datasets [101]GSE5406 and [102]GSE57338. Then, a diagnosis model combining the selected biomarkers was constructed in the merged dataset by logistic regression using ‘glm’ function and verified in [103]GSE5406 and [104]GSE57338. Receiver-operating characteristic (ROC) curves were used to assess the discrimination ability of the key genes and the diagnosis model. Results Data Preprocessing and DEGs Identification Expression profiles of healthy controls and DCM patients from [105]GSE1145, [106]GSE21610 and [107]GSE21869 were combined into a merged dataset. The merged dataset contains 25 healthy controls and 40 DCM patients. As shown in [108]Figures 2A,C, the merged dataset had strong batch effect. After normalization, it was effectively removed ([109]Figures 2B,D). Then in the differential expression analysis, 1,005 DEGs were recognized in the merged dataset, which includes 385 down-regulated genes and 620 up-regulated genes by integration method of ‘sva’ ([110]Figure 2E). 179 DEGs including 101 up-regulated genes and 78 down-regulated genes were identified by integration method of RRA. Venn diagrams depict DEGs across two different integration methods ([111]Supplementary Figure S1). DEGs obtained by ‘sva’ method were used in the following analyses. All of the DEGs were displayed in heatmap ([112]Figure 2F). FIGURE 2. [113]FIGURE 2 [114]Open in a new tab The differentially expressed genes in merged dataset (A,B) The boxplot of merged dataset before and after normalization (C,D) Two-dimensional principal component analysis cluster plot before and after normalization (E) Volcano plot of 1,005 DEGs (F) Heatmap of all DEGs. Functional Enrichment Analysis To assess the functions of DEGs, enrichment analyses of GO and KEGG were performed. The categories of GO analysis include biological process (BP), cellular component (CC) and molecular function (MF). The leading 10 enriched terms of each GO categories and KEGG with p value <0.05 were visualized in [115]Figure 3. FIGURE 3. [116]FIGURE 3 [117]Open in a new tab GO and KEGG enrichment analyses of DEGs in merged dataset (A) The dot plots of the top 10 enriched GO BP terms (B) The dot plots of the top 10 enriched GO MF terms (C) The dot plots of the top 10 enriched GO CC terms (D) The dot plots of the top 10 enriched KEGG terms. The mainly enriched BP terms included epithelial cell proliferation and its regulation, extracellular matrix and structure organization, cell chemotaxis, leukocyte migration and regulation, and inflammatory response ([118]Figure 3A). The mainly enriched MF terms were extracellular matrix structural constituent, glycosaminoglycan, collagen and fibronectin binding, G protein-coupled purinergic nucleotide receptor activity ([119]Figure 3B). The mainly enriched CC terms contained collagen-containing extracellular matrix, endoplasmic reticulum lumen, external side of plasma membrane, myofibril, sarcomere, basement membrane, and collagen trimer ([120]Figure 3C). In whole, GO results indicated that the gene function of DEGs mainly associated with both extracellular structure reorganization and fibrosis, immune, and inflammatory abnormalities. In KEGG pathway enrichment analysis, DEGs mainly enriched in PI3K-Akt signaling pathway, Cytokine-cytokine receptor interaction, AGE-RAGE signaling pathway in diabetic complications, Phagosome, viral protein interaction with cytokine and cytokine receptor, and HIF-1 signaling pathway ([121]Figure 3D). Immune Cell Infiltration Analysis xCell was used to estimate the cell composition heterogeneity of left ventricle between DCM and controls. As shown in [122]Figure 4, 18 cell types were significantly changed in DCM cardiac tissue compared to control samples, among which the scores of CD8^+ T-cells, cDC, adipocytes, fibroblasts, and smooth muscle in DCM were significantly increased, while the scores of macrophages M1, Th1 cells were significantly decreased. FIGURE 4. [123]FIGURE 4 [124]Open in a new tab xCell scores of immune and stromal cells between DCM and control heart tissues in merged dataset (A–C) Boxplots of “lymphoid and myeloid cells”, “stem cells”, “stromal cells and others” respectively. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001. Moreover, the correlation among immune cells were calculated by using the Pearson’s correlation coefficients. As shown in [125]Figure 5, iDC has the highest positive correlation with DC (Pearson’s coefficient = 0.89). The correlation between macrophages and macrophages M1 was the second strongest positive (Pearson’s coefficient = 0.78). Additionally, aDC, DC, CD8^+ Tcm, macrophages, monocytes, and macrophages M1 had strong correlation coefficient with most of the remaining immune cells. FIGURE 5. [126]FIGURE 5 [127]Open in a new tab Correlation matrix of immune cells. Blue squares represent negative correlation, and red squares represent positive correlation. The size of the square in the upper right is positively correlated with the correlation coefficient. The size of the circle in the lower left is positively correlated with the significance of the correlation. *p < 0.05, **p < 0.01, ***p < 0.001. PPI Network and Gene Module Identification As shown in [128]Figure 6A, there were 633 edges among 350 proteins in the PPI network. The top 10 genes with the highest degree in the above network were selected as the hub genes. These are STAT3, IL6, CCL2, PIK3R1, ESR1, CCL5, IL17A, TLR2, BUB1B, and MYC ([129]Figure 6B). The expression of the top 10 hub genes across all datasets was displayed in supplementary material. Then, the correlation coefficients between 10 hub genes and significantly changed immune cells were calculated. As shown in [130]Figure 6C, CD8^+ T-cells and macrophages M1 had significant correlation with most of the hub genes. Moreover, with the screening rules as |Pearson’s coefficient| > 0.5 and p < 0.05, we obtained three immune-related hub genes with potential diagnostic value: TLR2, CCL2 and CCL5. FIGURE 6. [131]FIGURE 6 [132]Open in a new tab PPI network of DEGs in merged dataset (A) PPI network of DEGs. The color depth and shape size of the dots are positively correlated with degree (B) Top 10 hub genes in the network (C) Correlation matrix of xCell scores of significantly changed immune cell subtype with top 10 hub genes. *p < 0.05, **p < 0.01, ***p < 0.001. Additionally, the densely connected modules were identified by using MCODE plug-in. Nine modules were obtained from the PPI network ([133]Supplementary Figure S2). The most enriched GO BP terms of each module were listed in [134]Table 1. Module_1 was enriched in negative regulation of transcription from RNA polymerase II promoter. Module_2 was enriched in sister chromatid cohesion. Enriched BP terms of Module_3 and four were associated with metabolism. Module_5 was enriched in proteolysis. Module_6 and eight were enriched in extracellular matrix organization. Module_9 was enriched in immune response. TABLE 1. Gene ontology biological process enrichment analysis. Module Term Count p Value 1 negative regulation of transcription from RNA polymerase II promoter 6 7.94E-07 2 sister chromatid cohesion 4 3.98E-06 3 metabolic process 2 3.98E-02 4 keratan sulfate catabolic process 5 1.58E-13 5 proteolysis 4 4.00E-04 6 extracellular matrix organization 3 7.94E-04 8 extracellular matrix organization 5 2.00E-08 9 immune response 6 1.00E-08 [135]Open in a new tab Hub Genes Verification and Diagnosis Model Construction As it is shown in [136]Figure 7A, CCL2 and TLR2 were significantly downregulated, while CCL5 was significantly upregulated in the merged dataset and verified in another two datasets [137]GSE5406 and [138]GSE57338 ([139]Figures 7B,C). Next, ROC curve analysis was performed to verify the diagnostic value of the selected biomarkers. In the merged dataset, the ROC curves of the three biomarkers revealed high diagnostic value for DCM ([140]Figure 8A). The AUC (area under the curve) scores of CCL2, CCL5, TLR2 were 0.963, 0.803, 0.766 respectively. In verification dataset [141]GSE5406, the AUC scores of CCL2, CCL5, TLR2 were 0.733, 0.821, 0.794, respectively ([142]Figure 8C). In verification dataset [143]GSE57338, the AUC scores of CCL2, CCL5, TLR2 were 0.738, 0.831, 0.836, respectively ([144]Figure 8E). Finally, we used the three biomarkers to construct a diagnosis model by logistic regression and visualized in ROC curves. The AUC score of the diagnosis model was 0.981, 0.867 and 0.946 in the merged dataset, [145]GSE5406 and [146]GSE57338, respectively ([147]Figures 8B,D,F). FIGURE 7. [148]FIGURE 7 [149]Open in a new tab Gene expression of the three selected hub genes (A) Boxplot of the three selected hub genes in merged dataset (B) Boxplot of the three selected hub genes in [150]GSE5406 (C) Boxplot of the three selected hub genes in [151]GSE57338. *p < 0.05, **p < 0.01, ***p < 0.001 ****p < 0.0001. FIGURE 8. [152]FIGURE 8 [153]Open in a new tab Analysis of the disease predictive abilities of the three selected hub genes (A) ROC curve analysis of three selected hub genes in merged dataset (B) ROC curve analysis of diagnosis model using the three selected hub genes in merged dataset (C) ROC curve analysis of three selected hub genes in [154]GSE5406 (D) ROC curve analysis of diagnosis model using the three selected hub genes in [155]GSE5406 (E) ROC curve analysis of three selected hub genes in [156]GSE57338 (F) ROC curve analysis of diagnosis model using the three selected hub genes in [157]GSE57338. Discussion DCM, a heterogeneous disease, is the major cause of heart failure and heart transplantation worldwide. Both genetic mutation and many different environmental changes can cause cardiomyocyte damage or death and may trigger myocardial inflammation in both directions, further promoting the progression of cardiomyopathy ([158]Whelan et al., 2010; [159]Lynch et al., 2015). Recent experimental and clinical evidence has suggested that abnormal activation of immune system may be involved in the process of cardiac function deterioration ([160]Kawai, 1999; [161]Lynch et al., 2017; [162]Brayson et al., 2019). Exploring the mechanism of key immune cells, pathways, and molecules in the pathophysiological process of cardiomyopathy can help clarify the special role of the immune system in the maintenance and imbalance of cardiac function to some degree, so as to provide potential immunosuppressive targets for future immunotherapy. By bioinformatics analyses, GO annotation and KEGG pathway enrichment analysis of the DEGs in merged dataset revealed that immune and inflammatory response and extracellular matrix remodeling play important roles in the pathogenesis of DCM, such as in myeloid leukocyte migration, cell chemotaxis, mononuclear cell migration, and extracellular matrix organization. Immune cell infiltration analysis indicated that the infiltration degrees of CD8^+ T-cells and cDC were significantly enriched in DCM samples while Th1 cells and macrophages M1 were in healthy control tissues. According to PPI network analysis, ten hub genes were selected as potential biomarkers of DCM. Finally, three feature immune-related hub genes were identified as biomarkers by both correlation and logistic analyses with an excellent AUC score in the merged dataset of 40 DCM patients and 25 healthy controls. Our novel diagnosis model of DCM was constructed based on these three feature molecules and verified in another two datasets [163]GSE5406 and [164]GSE57338. The functions of the three biomarkers were summarized and displayed in [165]Table 2. TABLE 2. summary of three biomarkers. Gene General biological function Function in cardiovascular disease CC2 Chemoattractant for activated T lymphocytes and monocytes Promotes the infiltration of mononuclear cells Primary activator for macrophages Protects cardiomyocytes from death by autocrine and paracrine effects CCL5 Chemoattractant for T lymphocytes, monocytes, eosinophils and NK cells Promotes neutrophil and macrophage activator in myocardial infarction and myocarditis TLR2 Recognizes pathogen-associated molecular patterns (PAMP) and damage-associated molecular patterns (DAMP) Induces expression of inflammatory cytokines and chemokines, resulting in an invasion of inflammatory cells Protects heart in aged animals after transverse aortic constriction surgery [166]Open in a new tab CCL2 (C-C motif chemokine ligand 2, also named as MCP-1) and CCL5 (C-C motif chemokine ligand 5, also named as RANTES) are two kinds of CC chemokines and play dual roles in inflammation and tissue repair. CCL2, the best studied chemokine, can be released by dendritic cells, monocytes, macrophages, smooth muscle cells and cardiomyocytes ([167]Hanna and Frangogiannis, 2020). CCL2 is a potent chemoattractant for activated T lymphocytes and monocytes as well as a primary activator for macrophages ([168]Rollins et al., 1988; [169]Rollins, 1997). Previous studies suggested that CCL2 was upregulated in cardiac injury and was the key culprit for cardiac disease development and progression by promoting the infiltration of mononuclear cells ([170]Hanna and Frangogiannis, 2020). CCL2 could also protect cardiomyocytes from cell death by its autocrine effect on cardiomyocytes and paracrine effect on endothelial cells, which stimulated angiogenesis ([171]Tarzami et al., 2002; [172]Hong et al., 2005; [173]Tarzami et al., 2005). CCL5 is known to be a potent chemoattractant for T lymphocytes, monocytes, eosinophils, and natural killer cells ([174]Jarrah and Tarzami, 2015). It is released by endothelial cells, smooth muscle cells, activated T-cells, macrophages, and so on upon inflammatory stimulus or infection ([175]Jarrah and Tarzami, 2015). Although their function has been well studied on a systemic level, their role in cardiovascular disease, especially DCM, has not been fully elucidated. TLR2 (Toll-like receptor 2) is a pattern-recognition receptor protein critical for the initiation of the innate immune system, which recognizes both pathogen-associated molecular patterns (PAMP) and damage-associated molecular patterns (DAMP) ([176]Bryant et al., 2015). Besides TLR4, TLR2 is the next most abundant toll-like receptor in heart tissue ([177]Mann, 2011). The binding of ligand to TLR2 induces expression of inflammatory cytokines and chemokines (e.g., IL-1β, TNF-α, CCL2), resulting in an invasion of macrophages and other inflammatory cells ([178]Kawasaki and Kawai, 2014). Most of the previous reports suggested that TLR2 played a detrimental role upon pro-inflammation ([179]Yu and Feng, 2018). However, other studies indicated a protective role of TLR2 in aged animals and mice after transverse aortic constriction surgery ([180]Bualeong et al., 2016; [181]Spurthi et al., 2018). In our study, we found that TLR2 was downregulated in DCM, which may lead to more sensitivity of DCM patients to injury stimulus, causing eventual cardiac dysfunction. The novelties of our study were as follows. Firstly, we were the first to use bioinformatics analyses to investigate the molecular mechanism of DCM from the perspective of immunity and inflammation. Secondly, we identified that CCL2, CCL5 and TLR2 could be potential diagnostic biomarkers of DCM. Nonetheless, there are several limitations that should not be ignored. First, it cannot be determined whether there is a cause-and-effect relationship between gene expression differences and the pathophysiological mechanism of DCM or whether it was just compensatory change. Second, the study was a retrospective data analysis; thus, detailed clinical and prognostic profiles, such as the left ventricular eject fraction and the occurrence of adverse cardiovascular events in patients with DCM, were absent. This restricted the further exploration of the key genes about their clinical features and outcomes. Finally, our study was based on bioinformatics analyses of transcriptomic data of public datasets, which may be inconsistent with actual situation. Further clinical trials are needed to test our findings by bioinformatics analyses. Conclusion By bioinformatics analyses of public transcriptional data, CCL2, CCL5, and TLR2 were identified as potential biomarkers of DCM from the perspective of immune cell infiltration combined with logistic regression. More importantly, a diagnostic model of DCM based on these three feature genes was developed, which brings a new aspect to the current understanding of the pathogenesis in DCM and may serve as interesting targets for future in-depth studies. Acknowledgments