Abstract The commonalities and differences in cell-type-specific pathways that lead to Alzheimer disease (AD) and Parkinson disease (PD) remain unknown. Here, we performed a single-nucleus transcriptome comparison of control, AD and PD striata. We describe three astrocyte subpopulations shared across different brain regions and evolutionarily conserved between humans and mice. We reveal common features between AD and PD astrocytes and regional differences that contribute toward amyloid pathology and neurodegeneration. In contrast, we found that transcriptomic changes in microglia are largely unique to each disorder. Our analysis identified a population of activated microglia that shared molecular signatures with murine disease-associated microglia (DAM) as well as disease-associated and regional differences in microglia transcriptomic changes linking microglia to disease-specific amyloid pathology, tauopathy and neuronal death. Finally, we delineate undescribed subpopulations of medium spiny neurons (MSNs) in the striatum and provide neuronal transcriptomic profiles suggesting disease-specific changes and selective neuronal vulnerability. __________________________________________________________________ Neurodegenerative disorders such as AD, PD, Huntington disease and amyotrophic lateral sclerosis are all characterized by the aggregation and deposition of abnormal proteins^[56]1,[57]2. However, the composition of protein aggregates is unique and distinct to each disorder. For example, intracellular neurofibrillary tangles (NFTs) formed by hyperphosphorylated tau proteins and extracellular Aβ plaques generated from amyloid precursor protein (APP) are hallmarks of AD, whereas Lewy bodies resulting from misfolded α-synuclein proteins occur in PD. These observations suggest there are both common and divergent mechanisms of neurodegenerative pathogenesis, but the identity of these pathways remains mysterious. Damage to the basal ganglia occurs in many neurodegenerative diseases, including AD, PD and Huntington disease^[58]3, yet little is known about the underlying molecular mechanisms. The striatum, which comprises of the caudate and putamen, is the main input structure of the basal ganglia, which is crucial for motor learning and a variety of cognitive functions. Striatal Aβ plaques and NFT deposits, and a reduction in striatal volume, are common features of AD and PD. However, cortical amyloid plaques appear early in the disease and are present in many nondemented older adults, whereas striatal plaques usually occur at later stages of AD and largely after dementia onset^[59]4–[60]6. Furthermore, the neuronal heterogeneity of the human striatum has not been characterized in detail, and the vulnerability of neuronal populations remains unclear. To answer these questions, we used single-nucleus RNA sequencing (snRNA-seq) to compare transcriptomes from postmortem striata of well-characterized cognitively normal controls, AD and PD cases and with previously published transcriptomes from postmortem entorhinal cortex (ec)^[61]7, anterior cingulate cortex (acc)^[62]8 and prefrontal cortex (pfc)^[63]9,[64]10. We identified evolutionarily conserved astrocyte and microglia subpopulations shared across different disease conditions and multiple brain regions. We further describe unique human astrocyte and microglia activation states, their regional differences in transcriptomic changes in disease conditions and their contributions to Aβ pathology, tauopathy and neuronal death. Finally, we observed greater striatal MSN heterogeneity than previously shown^[65]11,[66]12 and neuronal transcriptomic profiles that indicate disease-specific changes and selective neuronal vulnerability. Results The single-nucleus transcriptomes of human AD and PD brains We selected the precommissural putamen postmortem brains of four patients with AD, four patients with PD and four control cases matched for sex, age (range 69–85.4 years) and postmortem interval (Methods, [67]Extended Data Fig. 1a,[68]b and [69]Supplementary Table 1). We obtained 30,908 high-quality single-nucleus gene expression profiles after quality filtering and doublets removal with a comparable number of cells, genes and transcripts across diagnostic groups ([70]Extended Data Fig. 1b). We performed unsupervised clustering using Seurat^[71]13 and mapped clusters to six major cell types by comparing conserved marker genes (markers that are conserved among the groups of AD, PD and controls) with the expression patterns of known cell-type-specific markers, including astrocytes (AQP4, SLC1A2), endothelial cells and pericytes (FLT1, RGS5), immune cells (CSF1R, PTPRC, RUNX1), neurons (RBFOX1, RBFOX3, SYT1), oligodendrocytes (MBP, PLP1) and oligodendrocyte precursor cells (PCDH15, MEGF11 and VCAN) ([72]Fig. 1a,[73]b and [74]Extended Data Fig. 1c,[75]d). Clustering was not driven by experimental batch or individual samples, and the percentage of cells from each case that make up each cluster was not statistically different across the groups ([76]Extended Data Fig. 1e–[77]g). Fig. 1 |. Characterization of six major cell types and three distinct astrocyte subpopulations. Fig. 1 | [78]Open in a new tab a, Unsupervised clustering of snRNA-seq data and UMAP (Uniform Manifold Approximation and Projection) plot of all cells from putamen (pu) colored by cluster identity. UMAP plots were generated using default parameters except reduction = ’pca’, dims = 1:20. b, UMAP plot of all cells colored by marker gene expression levels. c–e, UMAP visualization of astrocyte subpopulations colored by cluster identity for putamen (c; total nuclei: control 1,203, AD 1,642, PD 1,433), ec (d; control 1,660, AD 702) and pfc (e; control 6,109, AD 7,144) astrocytes. f–h, UMAP visualization of astrocyte subpopulations colored by conserved marker gene expression levels for putamen (f), ec (g) and pfc astrocytes (h). i,j, Dot plot of conserved marker genes (i) and CD44 and TNC expression levels (j) in Ast-0, Ast-1 and Ast-2 astrocytes from the three brain regions. k, Venn diagram demonstrating overlap of conserved marker genes among the three brain regions for each astrocyte subpopulation. l,m, Violin plot showing the expression of Ast-2 conserved marker genes CD44 (l) and TNC (m) measured by snRNA-seq. n,o, CD44 (n) and TNC (o) expression validated by RNAScope in situ hybridization together with AQP4 immunohistochemistry staining in the putamen of control, AD and PD samples. CD44 and TNC, red; AQP4, tan. For all data, the experiment was performed once. FindConservedMarkers using Wilcoxon rank sum test and metap R package with meta-analysis combined P value < 0.05. Scale bars, 100 μm. CTRL, control; immune, immune cell; Ast, astrocyte; EP, endothelial cell and pericyte; OLIGO, oligodendrocytes; OPC, oligodendrocyte precursor cell. Conserved existence of three distinct astrocyte subtypes We first examined if there were distinct populations of astrocytes in our samples. We selected the parameters resulting in the most stable clustering (dimensionality = 15, resolution = 0.25, adjusted rand index = 0.96; [79]Extended Data Fig. 2a), for all downstream analyses. Subclustering analysis of all astrocyte nuclei revealed three subpopulations: Ast-0 (n = 2,301), Ast-1 (n = 1,338) and Ast-2 (n = 638) ([80]Fig. 1c). Twenty-four genes for Ast-0 (for example GPC5, NRXN1); 261 genes for Ast-1, including AD risk genes APOE, CLU and APOC1 and 160 genes for Ast-2 (for example, DPP10, GFAP) were identified to be conserved marker genes for the given cell types ([81]Fig. 1f,[82]i,[83]k and [84]Supplementary Tables 2–4). CD44 and TNC were uniquely enriched in Ast-2 cells ([85]Fig. 1j,[86]l,[87]m). AQP4 immunohistochemistry staining combined with CD44 or TNC mRNA RNAscope in situ hybridization in the adjacent tissue sections of control, AD and PD samples analyzed by snRNA-seq confirmed the existence of the Ast-2 astrocytes in both the putamen ([88]Fig. 1n,[89]o) and the white matter tissue of the internal capsule in all three groups ([90]Extended Data Fig. 2f,[91]g). The conservation of these marker genes among all groups suggested there are three distinct astrocyte subpopulations regardless of disease status. To determine whether astrocyte subpopulations identified in the striatum existed in other human brain regions, we analyzed data from previously published snRNA-seq studies using the same parameters. Grubman et al.^[92]7 sampled the ec of six patients with AD and six matched controls. Three astrocyte subpopulations were detected ([93]Fig. 1d) with 4/8 (50%), 20/23 (87%, P value = 6.1 × 10^−31), 19/23 (82.6%, P value = 2.2 × 10^−40, hypergeometric test) conserved marker genes overlapped with those identified in the putamen ([94]Fig. 1g,[95]i,[96]k and [97]Supplementary Tables 5–7). Lau et al.^[98]9 sampled pfc tissues from 12 patients with AD and nine matched controls and identified four astrocyte subpopulations ([99]Fig. 1e). The first three most abundant populations had 5/11 (45%), 46/67 (68.7%, P value =1.1 × 10^−77), 20/22 (90.9%, P value = 2.7 × 10^−44, hypergeometric test) conserved marker genes ([100]Supplementary Tables 8–[101]10) that overlapped with those identified in the putamen ([102]Fig. 1h,[103]i,[104]k). We further validated the existence of the three astrocyte subpopulations in the pfc of a much larger AD cohort with 24 patients with AD and 24 matched controls (referred to as ‘AD pfc Mathys’ data)^[105]10 as well as in the acc samples from a Lewy body disease (LBD) cohort of 28 cases, including healthy controls, PD, PD dementia and dementia with Lewy bodies (n = 7 per group, referred to as ‘LBD acc Feleke’ data)^[106]8. Most of the conserved marker genes were shared by the astrocyte subpopulations in these two datasets ([107]Extended Data Fig. 3a–[108]f), although some genes were not detected or did not reach statistical significance in the pfc Mathys data, likely due to the low cell numbers and sequencing depth ([109]Supplementary Table 11). Therefore, the three astrocyte populations identified in the putamen also present in the other three brain regions. Cell clustering was not driven by experimental batch or by individual samples, and the percentage of cells from each sample was not statistically different between AD and controls for any of the datasets ([110]Extended Data Figs. 2b–[111]e and [112]3g,[113]i,[114]k). The three human astrocyte subpopulations resembled the Gfap-low, disease-associated astrocyte (DAA) and Gfap-high astrocytes identified in the mouse hippocampi^[115]14. Gfap-low astrocytes and human Ast-0 share the marker gene features GPC5 and NRXN1^[116]14 ([117]Fig. 1f,[118]i). Analysis of DAA and Gfap-high astrocyte marker gene expression in the putamen astrocytes and the principal-component analysis (PCA) using all distinguishing marker genes ([119]Extended Data Fig. 4a–[120]c) indicated that although human and mouse astrocytes differ substantially, Ast-1 was more similar to DAA, whereas Ast-2 was more similar to the Gfap-high astrocytes. These results demonstrate homology among human Ast-0, Ast-1 and Ast-2 astrocytes with murine Gfap-low, DAA and Gfap-high astrocytes, respectively. Three astrocyte subsets represent distinct activation states The expression levels of reactive astrocyte markers^[121]15 S100B, VIM, MT2A, MT1E, CRYAB and MT1G were significantly higher in putamen Ast-1 than in the other two cell populations in all three diagnostic groups ([122]Extended Data Figs. 4b,[123]d and [124]Supplementary Table 3). Meanwhile, GFAP, CD44, C3, SYNM and MAOB were more highly expressed in Ast-2 compared to the other two populations in all groups ([125]Fig. 1i,[126]l and [127]Extended Data Fig. 4d). These observations suggested that Ast-1 and Ast-2 both represented activated astrocytes with distinct activation states whereas Ast-0 represented homeostatic astrocytes. VIM, MT2A and MT1E were also highly expressed in the pfc Ast-1 in both AD and controls, suggesting shared astrocyte activation features in different brain regions ([128]Extended Data Fig. 4e and [129]Supplementary Table 9). There were no consistent expression differences between cell clusters across diagnostic groups for murine A1-and A2-specific reactive astrocyte marker or signaling pathway genes ([130]Extended Data Fig. 4f)^[131]16,[132]17, suggesting that Ast-1 and Ast-2 represent astrocyte activation states distinct from A1 or A2 states. Regional divergence of the astrocyte transcriptomes The identification of homologous astrocyte subpopulations provides an opportunity to compare gene expression patterns across different brain regions. Some genes had conserved expression patterns between putamen and one brain region but not the other ([133]Fig. 2a,[134]b). Gene Ontology (GO) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analysis on the conserved cluster marker genes revealed unique enrichment of GO terms for each putamen astrocyte subpopulation ([135]Extended Data Fig. 4g), suggesting distinct functions of each. Pathways including the regulation of apoptotic signaling and gliogenesis were shared among Ast-1 from all three brain regions ([136]Fig. 2d and [137]Extended Data Fig. 4g), supporting their common functionality. However, the majority of top 10 KEGG terms and disease-related GO terms were highly enriched for the putamen Ast-1 marker genes only ([138]Fig. 2c,[139]d), such as multiple neurodegenerative disease pathways, amyloid fibril formation, tau protein binding, ferroptosis, regulation of inflammatory response and neuron death pathways. These pathways include AD and PD risk genes such as APOE and PARK7 and genes encoding proteins such as metallothionein protein MT3, superoxide dismutase 1 (SOD1) and stress-inducible heat shock protein HSP90AB1. These proteins can be secreted extracellularly by astrocytes to protect neurons from the toxic effect of Aβ, dopamine quinone neurotoxicity or oxidative stress^[140]18–[141]21 suggesting a potential neuroprotective role of Ast-1. In summary, our results suggest that homologous astrocyte subpopulations from different brain regions may share certain functionalities but also have differences that may contribute to regional differences in neuronal vulnerability, amyloid pathology and tauopathy. Fig. 2 |. Transcriptomic comparison of astrocyte subpopulations. Fig. 2 | [142]Open in a new tab a,b, Violin plots showing genes with conserved expression patterns in the putamen and ec (a) or the putamen and pfc (b) in FindConservedMarkers using Wilcoxon rank sum test and metap R package with meta-analysis combined P value < 0.05). c,d, KEGG pathway terms (c) and disease-related GO terms (d) enriched in the subcluster conserved marker genes (false discovery rate (FDR)-adjusted P value < 0.05, hypergeometric test, ≥ 5 query genes). e–j, Volcano plots showing significant DEGs comparing cells from AD (e–g, Ast-0 = 834, Ast-1 = 553, Ast-2 = 255 cells) or PD (h–j, Ast-0 = 784, Ast-1 = 427, Ast-2 = 222 cells) with cells from the controls (CTRL, Ast-0 = 683, Ast-1 = 358, Ast-2 = 161 cells). The x-axis specifies the log fold changes (logFCs), and the y-axis specifies the negative logarithm to the base 10 of the adjusted P values (−log[10](P[adj])). Magenta and cyan dots represent genes upregulated and downregulated in disease brains, respectively (Wilcoxon rank sum test, FDR-adjusted P value < 0.05 and absolute logFC > 0.25 using natural logarithm (ln)). k–m, Violin plots showing the expression level distributions of example DEGs of Ast-0 (k), Ast-1 (l) and Ast-2 (m). n, Violin plots showing F3 gene expression in all major cell types in the putamen. o, Representative images of RNAScope in situ hybridization analysis of F3 transcript expression in the putamen. p, Single-cell F3 in situ hybridization signal from four images each for four subjects from each group were quantified, AD (n = 863 cells), PD (n = 387 cells) and control (CTRL, n = 1,120 cells) using one-way analysis of variance with Tukey’s multiple comparisons test, ***P < 0.001, *P < 0.05, AD versus CTRL P value < 0.001, PD versus CTRL P value < 0.001, PD versus AD P value = 0.016). Data are presented as mean values ± standard deviation (s.d.). Minima = 3.69, maxima = 5.41, mean CTRL = 4.63, AD = 4.38, PD = 4.27. The lower and upper hinges correspond to the 25th and 75th percentiles. The upper/lower whisker extends from the hinge to the largest/smallest value no further than 1.5× interquartile range from the hinge. Down: downregulated; Up: upregulated; NotSig: not statistically significant; log[10](IntDen + 1): logarithm to the base 10 of the integrated density. Shared astrocytic transcriptomic changes between AD and PD To understand the functional significance of astrocyte subpopulations in pathogenesis, we compared astrocyte gene expression in disease samples to that in controls within each population and detected 124 to 668 differentially expressed genes (DEGs) ([143]Figs. 2e–[144]m and [145]3a and [146]Supplementary Tables 12–[147]17). Interestingly, all putamen astrocyte subpopulations had nearly 2/3 or more DEGs upregulated in AD and PD, including CRYAB, HSPB1, multiple metallothionein (MT) family genes, EGFR and TLR4 ([148]Fig. 2e–[149]m), consistent with observed expression increase in patients with AD and PD^[150]22–[151]27. Downregulated DEGs include the AD risk gene APOE ([152]Fig. 2f,[153]h,[154]i,[155]l), consistent with previous snRNA-seq studies^[156]7,[157]10. Tissue factor (F3) expression was significantly downregulated in AD and PD as validated using RNAScope in situ hybridization and quantification ([158]Fig. 2e–[159]j,[160]m–[161]p). Within each disease condition, astrocyte subpopulations shared extensive transcriptomic changes (p < 0.01, hypergeometric test) with 100% concordance in the directions of the gene expression change ([162]Extended Data Fig. 5a). Gene expression fold changes of all genes in the genome were highly correlated in all pair-wise correlation analyses (Pearson’s correlation coefficient r = 0.718 to 0.894, FDR-adjusted P value < 0.01; [163]Fig. 3b). Pathway analysis revealed concordant changes of multiple pathways across all putamen astrocyte subpopulations ([164]Fig. 3c). Together, these results demonstrate shared transcriptomic changes among the three astrocyte subpopulations in each disease condition. Fig. 3 |. Regional differences in astrocytic transcriptomic changes in disease. Fig. 3 | [165]Open in a new tab a, Bar plot showing the number of up- and downregulated differentially expressed genes (DEG) in the three astrocyte subpopulations from the putamen (pu), ec and pfc (Wilcoxon rank sum test, FDR-adjusted P value < 0.05 and logFC > 0.25 using natural logarithm (ln)). Number of subjects: AD (n = 4), PD (n = 4) controls (n = 4). b, Heatmap of Pearson’s correlation coefficient of genome-wide gene expression logFC among the three astrocyte subpopulations from the pu, ec and pfc. The color represents the correlation’s directionality, and the shade of color represents the significant levels. Only significant correlations were plotted (FDR-correlated P value < 0.05). c, Top two biological process pathways enriched in the DEGs. d, Neurodegenerative disease-related KEGG pathways enriched in the DEGs (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query gene). e, Heatmaps showing the logFC of significant DEGs for GWAS AD- and PD-risk genes; GWAS genes differentially expressed in at least two subpopulations were plotted for visualization. Upregulated: upregulated in disease samples. Downregulated: downregulated in disease samples. Our analysis indicated that transcriptomic changes in astrocytes are highly concordant between AD and PD samples. For each astrocyte subpopulation, AD and PD samples had significant overlap and concordant change of DEGs (P < 0.01, hypergeometric test), a significant correlation of genome-wide gene expression level changes (r = 0.542–0.795, P < 0.001), and common pathways dysregulated in disease conditions ([166]Fig. 3b,[167]c and [168]Extended Data Fig. 5b,[169]d). Interestingly, upregulated DEGs in both AD and PD astrocytes were significantly enriched for multiple neurodegenerative disease pathways ([170]Fig. 3d). However, discordant transcriptomic changes between AD and PD were also detected, especially for the downregulation of the amyloid-beta (Aβ) binding pathway in PD astrocytes ([171]Extended Data Fig. 5d). Thus, our results suggest a possible link between astrocytic transcriptomic changes to neurodegenerative diseases and reveal common and unique dysregulated genes in AD and PD, possibly linking to differential Aβ pathology between the two diseases. Regional differences in astrocytic transcriptomic changes Next, we compared astrocytic gene expression changes in disease conditions across different brain regions. We detected 181–263 and 90–123 DEGs for the three ec and pfc astrocyte populations, respectively ([172]Fig. 3a and [173]Supplementary Tables 18–[174]23). Although numbers of downregulated genes in each brain region were comparable, over 90% of DEGs were downregulated for all cortical astrocytes (91.1–97.6%), in contrast to fewer than 1/3 in putamen astrocytes ([175]Fig. 3a). Additionally, DEGs overlapped appreciably among cortical astrocyte subpopulations and between the two brain regions but were largely non-overlapping with, or regulated in opposite directions to putamen astrocyte DEGs ([176]Extended Data Fig. 5c). Furthermore, genome-wide gene expression changes of the cortical astrocytes were significantly correlated (r = 0.51 to 0.68, FDR-adjusted P value < 0.01) but were not with that of the putamen astrocytes (r ≤ 0.14; [177]Fig. 3b). Many pathways were differentially regulated between cortical and putamen astrocytes, including multiple neurodegenerative disease pathways ([178]Fig. 3c,[179]d). We therefore examined AD and PD risk genes^[180]7,[181]28–[182]30 identified by genome-wide association studies (GWASs), and found ~10% of these genes differentially expressed in at least two clusters ([183]Fig. 3e). Expression changes of AD risk genes in the putamen were concordant between AD and PD but largely in opposite directions compared with the cortex, except for APOE and CLU ([184]Fig. 3e). Many PD risk genes were differentially expressed in putamen astrocytes, but few were affected in cortical astrocytes. In summary, our results demonstrate that cortical astrocytes had similar transcriptomic changes in AD but were distinct from putamen astrocytes. Four distinct immune cell populations in the human brain We next investigated immune cell heterogeneity in the putamen and identified four subpopulations ([185]Fig. 4a,[186]b). Conserved marker genes of T cell cluster included T cell-specific markers BCL11B, CD247 and SKAP1^[187]31,[188]32, and were enriched for T cell-specific functions ([189]Fig. 4c,[190]e). Perivascular macrophages (PVMs) and microglia shared macrophage markers CSF1R and C1QB but can be distinguished by the expression of PVM-specific markers such as MRC1, LYVE1, CD163 and F13A1 ([191]Fig. 4c and [192]Extended Data Fig. 6a) and microglia-specific markers such as P2RY12 and CX3CR1^[193]33–[194]35. The two microglia subpopulations, referred to as Micr-0 and Micr-1 herein, can be distinguished by the unique enrichment of AIF1 and APOC1 in Micr-1 cells as validated using immunohistochemistry staining of P2RY12 combined with in situ hybridization for AIF1 or APOC1 ([195]Fig. 4d,[196]f,[197]g and [198]Extended Data Fig. 6d,[199]e). Decreased P2RY12 expression and higher AIF1, CD14, FTL and MHC-II gene expression in Micr-1 suggested that Micr-0 and Micr-1 represented homeostatic and activated microglia, respectively. Consistently, Micr-1 signature genes were enriched for pathways associated with activated microglia^[200]36 ([201]Fig. 4e). Each cluster contained a similar percentage of cells from all cases of all diagnostic groups ([202]Extended Data Fig. 6c). The conservation of marker genes in control, AD and PD brains indicated these distinct immune cell populations existed independent of disease status. Fig. 4 |. Four distinct immune cell populations. Fig. 4 | [203]Open in a new tab a,b, UMAP visualization of subclusters of immune cells (total nuclei: control 558, AD 827, PD 619) colored by cell cluster (a) or disease diagnosis (b). c,d, Violin plots showing the expression level distributions of genes for T cell, microglia and PVM shared markers and PVM unique markers (c); microglia-specific markers and microglia subpopulation markers (d). The color code is the same as in panel a. e, Subcluster signaturegene enriched GO terms in the Biological Process category (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query genes). f,g, Immunohistochemistry staining (brown) of marker protein P2RY12 and RNAscope in situ hybridization analysis (red) of AIF1 (f) and APOC1 (g) transcript expression in the adjacent tissue sections from the putamen tissue of a control, AD or PD brain. Hematoxylin-positive cell nuclei are shown in blue. For all data, the experiment was performed once. UMAP were generated using default parameters except reduction = ’pca’, dims = 1:30. Cell cluster were defined using resolution = 0.2. Conserved marker genes were determined by FindConservedMarkers using Wilcoxon Rank Sum test and metap R package with meta-analysis combined P value < 0.05. Number of subjects: AD (n = 4), PD (n = 4) and the controls (n = 4). Scale bars, 100 μm. To determine whether immune cell subpopulations identified in the striatum existed in other human brain regions, we analyzed the four published snRNA-seq data independently^[204]7–[205]10. All four immune cell subpopulations were detected in the Mathys-pfc^[206]10 and the Feleke-acc^[207]8 data with most of the conserved marker genes shared by the immune cells in these two datasets ([208]Extended Data Fig. 7). However, some genes were not detected or statically insignificant in the Mathys-pfc data likely due to the low cell numbers and the sequencing depth ([209]Supplementary Table 11). Both microglia subpopulations were also present in the ec and Lau-pfc data. FRMD4A and ST6GALNAC3 were identified as the marker genes of Micr-0, whereas activated microglia markers such as APOE, HLA-DRA, HLA-DPB1, FTH1 and FTL were identified as the marker genes of Micr-1, which were shared across all five datasets and among the control, AD, PD, PD dementia and dementia with Lewy bodies samples ([210]Extended Data Figs. 6f–[211]k and [212]7d,[213]h). These results suggest that the two microglia subpopulations identified in the putamen were also present in all the other brain regions, irrespective of disease status. Micr-1 share transcriptomic signatures with murine DAM Interestingly, 32 of the 83 Micr-1 conserved marker genes overlapped with signature genes of murine DAM^[214]37 (P value = 6.46 × 10^−33, hypergeometric test; [215]Fig. 5a and [216]Supplementary Table 24). When we compared Micr-1 with homeostatic Micr-0 microglia from AD samples, we identified more DAM signature genes and a greater significant overlap with murine DAM gene signatures ([217]Fig. 5a; 40 out of 111 genes, P value = 3.03 × 10^−40, hypergeometric test). The upregulation of APOE, B2M and TYROBP in Micr-1 cells compared to homeostatic Micr-0 microglia was observed across all diagnostic groups ([218]Fig. 5b), which was similar to the reported expression changes during murine DAM activation^[219]37. In particular, TYROBP-APOE signaling is implicated in the initiation of DAM phenotypes independent of TREM2^[220]38, whereas TREM2 is critical for the transition of DAM from an intermediate state to a fully activated state^[221]37,[222]39. Consistent with this, TREM2 expression was more enriched in Micr-1 cells in AD and PD brains ([223]Fig. 5b,[224]e), suggesting a possible transition from an intermediate activated state in controls to a fully activated state in the disease conditions^[225]37. Thus, Micr-1 represents human activated microglia and shares similar transcriptomic changes with murine DAM, suggesting a general microglia activation response to central nervous system (CNS) challenge conserved in the murine models^[226]37. Fig. 5 |. Characterization of the human activated microglia. Fig. 5 | [227]Open in a new tab a, Venn diagram demonstrating overlap (hypergeometric test) between murine DAM marker genes and the conserved marker genes of human activated microglia (Micr-1) of AD, PD and controls (left, P value = 6.46 × 10^−33) or marker genes of AD-only human activated microglia (right, P value = 3.03 × 10^−40). b–d, Violin plots showing the expression level distributions of (b) TREM2, APOE, B2M and TYROBP; (c) M1- and M2- microglia markers; (d) M1- and M2- microglia regulatory transcription factors (TF). e, Immunohistochemistry staining (brown) of marker protein P2RY12 and RNAscope in situ hybridization analysis (red) of TREM2 transcript expression in the adjacent tissue sections from the putamen of an AD and a PD case. For all data, the experiment was performed once. Hematoxylin-positive cell nuclei are shown in blue. f, Heatmap of Pearson’s correlation coefficient of M1- and M2- microglia marker gene expression for Micr-0 and Micr-1 microglia respectively. The shade of the color represents the significance levels (FDR-correlated P value < 0.05). The color represents the directionality of correlation. APOE, B2M and TYROBP were determined to be conserved cluster marker for Micr-1 by FindConservedMarkers using Wilcoxon rank sum test and metap R package with meta-analysis combined P value < 0.05 comparing gene expression in Micr-1 cluster with the other cell clusters for AD (n = 4), PD (n = 4), and the controls (n = 4). Scale bars, 100 μm. Undescribed activation states of human microglia To define the activation states of human microglia, we investigated the expression of canonical M1 and M2 marker genes. M1 markers, such as IL-18 and CD86, and M2 markers, such as IL4R and TGFB1^[228]40, were expressed in both microglia subpopulations at similar levels across all diagnostic groups ([229]Fig. 5c). We calculated pairwise expression correlation among all known M1 and M2 marker genes^[230]36,[231]40 expressed in at least 20% of Micr-0 and Micr-1 cells. Most of these genes were expressed in the same cells and were positively correlated irrespective of being an M1 or M2 marker ([232]Fig. 5f). STAT1, PPARG, STAT3 and MEF2C, transcription factors critical for M1 or M2 polarization^[233]41,[234]42, were expressed at similar levels in both populations ([235]Fig. 5d). Therefore, the activation state of human microglia could not be distinguished using known markers, suggesting that human microglia may have an activation state distinct from what had been observed in vitro and in animal models. Co-expression modules shared by control, AD and PD microglia To better understand gene expression dynamics during human microglia activation, we reconstructed microglia activation trajectories ([236]Fig. 6a). We identified 549, 516 and 437 genes whose expression changes were significantly associated with pseudotime progression (pseudotime DEG) for control, AD and PD samples, respectively ([237]Supplementary Tables 25–[238]27). Interestingly, pseudotime DEGs were significantly enriched for AD risk genes^[239]43,[240]44 ([241]Extended Data Fig. 8a; FDR-adjusted P value < 0.05, hypergeometric test), resembling findings in an App knockin mouse model^[242]45, with APOC1, APOE, HLA-DRB1, INPP5D and MEF2C shared by all diagnostic groups. PD risk genes CHCHD2, FBXO7 and PARK7^[243]28,[244]29 also overlapped with pseudotime DEGs. Consensus k-means partitioning identified three co-expression modules for each diagnostic group ([245]Fig. 6d and [246]Supplementary Tables 25–[247]27). Module 1 consisted of genes downregulated during microglia activation. Modules 2 and 3 were comprised of genes upregulated in early and late activated microglia. Many genes were shared among all three conditions with highly concordant expression within each module and between different conditions ([248]Fig. 6g,[249]j and [250]Extended Data Fig. 8b). Top-ranking genes in modules 2 and 3 (APOE, B2M, FTH1, FTL and CD74; [251]Fig. 6j) are involved in microglia transition from a homeostatic to an activated stage associated with neurodegeneration in humans and mice^[252]37,[253]46,[254]47. We defined the set of genes that were present in all three conditions in module 1 (26 genes, downregulated) or modules 2 + 3 (179 genes, upregulated) as the core gene co-expression modules. Upregulated core genes were enriched for hallmark pathways of activated microglia ([255]Fig. 6k), confirming that our pseudotime analysis captured the true transcriptomic dynamics associated with microglia activation. Fig. 6 |. Transcriptome transition during microglia activation. Fig. 6 | [256]Open in a new tab a–c, UMAP visualization of pseudotemporal trajectory for microglia of putamen (a), ec (b) and pfc (c). UMAP were generated using default parameters except reduction = ’pca’, dims = 1:10. Cell cluster were defined using resolution = 0.15. d–f, Heatmap of the three pseudotime DEG co-expression modules in the microglia of putamen (d), ec (e) and pfc (f) for each condition. Pseudotime DEGs are genes whose expression significantly associated with pseudotime progression (generalized additive model, FDR-adjusted P value < 0.05). g–i, Gene expression changes along pseudotime trajectory for example genes in module 1, 2 or 3 in the microglia of putamen (g), ec (h) and pfc (i) for each condition. LOESS Regression were performed using loess() function in R with 95% confidence intervals plotted. Only statistically significant pseudotime DEGs (FDR-adjusted P value < 0.05) were shown. j, Expression dynamics along the pseudotime trajectory of pseudotime DEGs shared by the control, AD and PD samples in module 1, 2 or 3. k, Microglia-activation-related gene ontology terms enriched in the core gene-co-expression module genes in module 1 (Mod1) or module 2 and 3 combined (Mod2+3) (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query genes). Core modules shared by diverse brain regions and disorders We took two different approaches to investigate whether the microglia activation-associated core modules were shared by activated microglia in other human studies. First, we analyzed pfc microglia and identified 835 and 819 pseudotime DEGs for control and AD microglia, respectively ([257]Fig. 6f and [258]Supplementary Tables 29 and [259]30). AD risk genes APOC1, APOE, HLA-DRB1, INPP5D and MEF2C were associated with pseudotime progression in both AD and control samples ([260]Fig. 6i), similar to their expression changes in putamen microglia. Analysis of ec microglia identified only 163 and 56 pseudotime DEGs likely due to the limited number of cells ([261]Fig. 6e and [262]Supplementary Tables 31 and [263]32). However, our analysis was sufficiently powered to identify APOE and APOC1 as pseudotime DEGs of modules 2 and 3, respectively ([264]Fig. 6h), as in the other two brain regions. One hundred and two genes were shared by six out of the seven datasets with concordant direction of changes across all datasets detected ([265]Extended Data Fig. 8c; P value < 0.001). Second, we retrieved pseudotime DEGs reported by Sankowski et al.^[266]48, which examined glioma-associated microglia (GAM) from the temporal or frontal lobes of glioblastoma multiforme patients. Out of the 545 reported pseudotime DEGs 79 were shared by seven out of the eight datasets with concordant direction of change across all datasets, including many genes commonly upregulated during microglia activation ([267]Fig. 7a). To investigate whether the core modules are present in mice, we compared the pseudotime DEGs of DAMs^[268]37 and ARMs (activated response microglia)^[269]45 from AD mouse models. Only a partial list of ARM pseudotime DEGs is available^[270]45, and limited similarities were observed between DAMs and ARMs ([271]Extended Data Fig. 8e). However, APOE and TYROBP were shared between human and mouse with concordant expression changes, and TYROBP-APOE signaling has been implicated in the initiation of DAM phenotypes^[272]38. In conclusion, human activated microglia, including Micr-1 subpopulations identified in our study, the previously identified GAM, and likely other reported activated microglia subpopulations, share a core gene co-expression module regardless of disease status and brain region. The shared TYROBP-APOE signaling also suggests that some core changes are likely evolutionarily conserved. Fig. 7 |. Comparison of genes and pathways of microglia activation-associated transcriptome changes. Fig. 7 | [273]Open in a new tab a, Pseudotime DEGs (generalized additive model, FDR-adjusted P value < 0.05) shared by human activated microglia isolated from the putamen of cognitively normal controls (CTRL), AD and PD samples, pfc of control and AD samples, ec of control and AD samples and temporal or frontal lobes (tc) of glioblastoma multiforme samples reported by Sankowski et al. b, Cell death-related GO terms enriched in the pseudotime DEGs. c, GO terms related to Aβ pathology, tauopathy, and autophagy that were enriched in the pseudotime DEGs. d, Top KEGG pathways enriched in the pseudotime DEGs. Pathways with FDR-adjusted P value < 0.05 (hypergeometric test) and at least five query genes were considered statistically significant. Up: upregulated; down: downregulated; NotDEG: not pseudotime DEG. Common and disease-only changes during microglia activation Next, we compared pathways that were dynamically regulated during microglia activation between control, AD and PD, as well as between different brain regions. We focused our pathway analysis on the putamen and pfc microglia and GAMs as they have sufficient numbers of pseudotime DEGs. The most prominent observation was the upregulated pathways shared among all conditions and across all brain regions. Many of the pathways are known to change during microglia activation in animal studies, such as the regulation of the intrinsic apoptotic signaling pathway, positive regulation of cytokine production including interferon-γ, response to interferon-γ, nuclear factor κB signaling, phagocytic capacity and antigen processing and presentation^[274]33,[275]49(module 2 + 3; [276]Fig. 7b,[277]d and [278]Extended Data Fig. 8d,[279]f). These results suggest shared changes in major metabolic states and immune properties during human microglia activation across all conditions and different brain regions. Interestingly, there were also major differences indicating diseasespecific transcriptome changes associated with human microglia activation. First, pathways downregulated during microglia activation were largely unique to each condition and brain region (module 1; [280]Fig. 7d and [281]Extended Data Fig. 8f). Second, some pathways were uniquely linked to disease pathology. For example, the ‘transcriptional misregulation in cancer’ pathway was specifically downregulated in the GAMs, whereas multiple neurodegenerative disease pathways were uniquely upregulated in the putamen and pfc microglia ([282]Fig. 7d). Third, although the positive regulation of cytokine production pathway was shared by all microglia there were significant differences in the specific cytokines being regulated in different conditions and brain regions ([283]Extended Data Fig. 8d). Lastly, the Aβ-binding pathway was upregulated in most datasets ([284]Fig. 7c) providing in vivo evidence linking microglia activation and Aβ pathology in humans, a phenomenon widely-observed in animal studies^[285]50. Interestingly, many more Aβ-related pathways and the tau protein binding pathway were uniquely regulated during GAM activation ([286]Fig. 7c), matching with the speculated role of GAM in reducing the risk of developing AD^[287]51. Whether and how these disease-specific transcriptome changes associated with human microglia activation could contribute to disease pathogenesis remain to be elucidated. The robust core gene signature of microglia activation defined in our study will help reveal disease-specific roles of microglial activation in various CNS diseases. Microglia and Aβ pathology, tauopathy and neuron death Comparing microglia from AD and PD brains with those of controls identified 112–245 DEGs with the majority (65.3–75.4%) being downregulated in the diseased brains ([288]Extended Data Fig. 9a and [289]Supplementary Tables 33–[290]36). Micr-0 and Micr-1 shared many DEGs and their genome-wide transcriptional changes were significantly correlated ([291]Extended Data Figs. 9a,[292]b and [293]10a,[294]b). Global gene expression changes between AD and PD were highly correlated sharing many downregulated DEGs (45.8% and 35.1%) and a few upregulated DEGs (7.4% and 8.0%) ([295]Extended Data Fig. 10a,[296]c). Pathway analyses revealed many disease-related changes in AD and PD microglia. For example, the Aβ binding pathway was downregulated in both diseases including MSR1 and CST3 ([297]Extended Data Fig. 9a,[298]b,[299]e). In contrast, multiple protein folding pathway components including genes encoding stress-inducible heat shock protein HSP90AA1 and HSP90AB1 and their co-chaperone, FKBP4, were uniquely upregulated in PD microglia ([300]Extended Data Figs. 9d and [301]10d). Hsp90 regulates tauopathy through co-chaperone complexes, and overexpression of FKBP4 prevents the accumulation of tau^[302]52. Therefore, our data suggest that microglia may play a beneficial role in preventing tau pathology, consistent with the less severe tauopathy in patients with PD. Additionally, multiple neuronal death pathways were downregulated in AD ([303]Extended Data Fig. 9e). Gene expression analysis of GWAS AD and PD risk genes demonstrated largely distinct up-regulation of disease risk genes between AD and PD microglia ([304]Extended Data Fig. 9f). We detected 14–56 DEGs in the ec and pfc microglia subpopulations ([305]Extended Data Fig. 10e), which was much fewer than that of the putamen microglia. Nonetheless, gene expression analysis of GWAS AD and PD risk genes demonstrated largely distinct regulation of disease risk genes between AD and PD microglia and between cortical and subcortical brain regions ([306]Extended Data Fig. 9f). Some pathways were regulated in the opposite direction between the putamen and cortical microglia ([307]Extended Data Fig. 10d). Our results suggest significant regional differences in microglial responses which may contribute to differential regional vulnerability. Neuronal diversity of the human putamen The human striatum neuronal heterogeneity has not been characterized at the single-cell level. We identified 13 transcriptionally distinct neuronal populations through subclustering analysis: eight MSN and five interneuron clusters ([308]Fig. 8a and [309]Supplementary Tables 38–[310]50), revealing a greater MSN heterogeneity than previously suggested in the mouse and the non-human primate (NHP) striata. The expression of PPP1R1B and MEIS2, MSN marker genes^[311]53,[312]54, versus ELAVL2^[313]55, broadly distinguished the MSNs from the interneurons ([314]Fig. 8b and [315]Supplementary Fig. 1a). Conventionally, the co-expression of dopamine receptor DRD1 with neuropeptide TAC1 versus DRD2 with proenkephalin PENK has been used to delineate D1 direct-pathway neurons from D2 indirect-pathway neurons, respectively^[316]11,[317]56. Four MSN subpopulations displayed conventional combinatorial expressions patterns, whereas the other four subpopulations had distinct combinatorial patterns. The co-expression of DRD1 with TAC1 by ‘mD1’ and ‘pD1’ neurons is consistent with their D1 neuron identity. The expression of STXBP6 and EPHA4 in mD1 neurons indicates their matrix compartment localization, whereas the expression of KCNIP1, BACH2 and the D1-patch– specific marker PDYN in pD1 neurons indicates patch compartment localization ([318]Fig. 8c and [319]Supplementary Fig. 1a)^[320]11,[321]57. In contrast, ‘ncD1’ neurons expressed DRD1 but had a very low levels of TAC1 expression (thus named ‘non-canonical D1’). Moreover, ncD1 expressed multiple unique marker genes, such as RXFP2, FAM210B and MPP3, supporting its unique identity ([322]Fig. 8b and [323]Supplementary Fig. 1a). Similarly, the co-expression of DRD2 with PENK by ‘mD2’ and ‘pD2’ neurons is consistent with their D2 neuron identity, and the matrix/patch marker genes indicate matrix versus patch localization of mD2 and pD2 respectively ([324]Fig. 8b,[325]c). ‘ncD2,’ the non-canonical D2 MSN, expressed DRD2 but had a low expression level of PENK. Instead, ncD2 exhibited aberrantly high levels of TAC1, the canonical D1 MSN marker gene ([326]Fig. 8b and [327]Supplementary Fig. 1a). Furthermore, ncD2 expressed multiple unique marker genes, such as POU6F2, LGR5 and FHAD1, supporting its unique identity. The third type of non-canonical MSN exhibited expression of both dopamine receptor genes and both TAC1 and PENK, named hybrid MSN (hMSN) herein. hMSN expressed unique conserved marker genes RXFP1, MKX and JAG1, indicating it is a distinct neuronal subtype rather than an artifact of doublets ([328]Fig. 8b and [329]Supplementary Fig. 1a). The D1/D2 ‘hybrid’ MSN has been reported in NHP striatum^[330]11 and human nucleus accumbens^[331]12, with the same co-expression of D1 and D2 markers and unique RXFP1 expression. In contrast to these undescribed MSN subtypes having their own unique conserved marker genes, cluster ‘sMSN’ did not have any well-characterized marker genes uniquely expressed in the cluster. Instead, the conserved marker genes of this cluster had unique enrichment for many stress- and unfolded protein response-related pathways ([332]Fig. 8e,[333]f). We thus referred to this cluster as stressed MSN (sMSN). Fig. 8 |. Characterization of neuronal subpopulations and gene expression changes in disease conditions. Fig. 8 | [334]Open in a new tab a, UMAP visualization of neuron subpopulations colored by cluster identity. b, Dot plot of neuronal subpopulation conserved marker gene expression (FindConservedMarkers using Wilcoxon rank sum test and metap R package with meta-analysis combined P value < 0.05). c, Violin plot showing the expression of matrix- and patch-compartment marker gene expression in mD1, pD1, mD2 and pD2 neurons (FindConservedMarkers using Wilcoxon rank sum test and metap R package with meta-analysis combined P value < 0.05). d–f, GO terms enriched in the conserved cluster marker genes of each neuronal subpopulation related to cell death (d), stress response (e), amyloid and tau metabolism and unfolded protein response pathways (f). g, Neurodegeneration and addiction-related KEGG pathway terms enriched in the conserved cluster marker genes of each neuronal subpopulation. h, UMAP visualization of MSN neuron subpopulations with sMSN colored by cells expressing only DRD1 (left), only DRD2 (middle) or both DRD1 and DRD2 (right). i, Top five GO terms in the Biological Process category enriched in the DEGs of each neuronal subpopulation in AD and PD. Only cell subpopulations with enriched GO terms are shown. Pathways with FDR-adjusted P value < 0.05 (hypergeometric test) and at least five query genes were considered statistically significant. Five interneuron subtypes were identified, accounting for 9.32% of all neurons in the precommissural putamen, as reported^[335]58. The ‘in.PDGFD’ neurons had specific and robust expression of PDGFD, OPN3 and PTHLH ([336]Fig. 8b and [337]Supplementary Fig. 1b), similar to the co-expression of Opn3 with Pthlh in mouse striatal interneurons^[338]59. Cluster ‘in.CALB2’ co-expressed calretinin (CALB2) and TAC3. Striatal CALB2^+ and TAC3^+ interneurons have been described in rodents, NHP^[339]59 and human striata^[340]11,[341]61. The ‘in.TH’ neurons robustly expressed tyrosine hydroxylase (TH), dopamine reuptake transporter SLC6A3, vesicular monoamine transporter-2 SLC18A2, dopaminergic neuron-specific transcription factor BNC2 and ALDH1A1 (ref.^[342]61), which defines a dopaminergic neuron subpopulation in the substantia nigra pars compacta, suggesting their dopaminergic identity. The ‘in.SST’ neurons expressed SST, NPY and NOS1 as described in the mammalian striatum^[343]11,[344]59,[345]60. The ‘in.SLC5A7’ cluster expressed the high-affinity choline uptake transporter SLC5A7, choline acetyltransferase CHAT and the key cholinergic neuronal function regulator LHX8, which are characteristic features of cholinergic neurons^[346]59. Neuronal transcriptome and selective neuronal vulnerability Pathway enrichment analysis on the conserved marker genes in each cluster found extensively shared pathways across different neuronal subpopulations related to neuronal functions ([347]Supplementary Fig. 1c). Many neuronal clusters were uniquely enriched for specific pathways ([348]Supplementary Fig. 1d), suggesting they represent neurons with distinct functions. Multiple neuronal death, neurodegenerative disease and amyloid- and tau-related pathways were selectively enriched in the hMSN, sMSN, in.PDGFD, in.CALB2 and in.SST neurons ([349]Fig. 8d,[350]f,[351]g), suggesting a possible link to selective neuronal vulnerability in the putamen. The enrichment of multiple addiction pathways across many neuronal clusters is consistent with the known involvement of the striatum in addictive behavior^[352]63. Analysis of MSN subtype marker gene expression revealed that sMSN represents a heterogenous population composed of both D1 and D2 MSNs, as well as a population of unknown identity. Of its 755 MSNs, 42.5% express DRD1 receptors only, 36.8% express DRD2 receptors only, and 13.5% co-expressed DRD1 and DRD2 but lacked any of the MSN subtype-specific markers. The conserved marker genes were uniquely enriched for multiple stress-related pathways, amyloid and tau protein binding, response to unfolded protein binding, chemical carcinogenesis-reactive oxygen species and negative regulation of endoplasmic reticulum stress-induced intrinsic apoptotic signaling pathway ([353]Fig. 8d–[354]g), suggesting that sMSN may represent MSNs that had responded to various types of stress and altered their original cellular identity or function. Interestingly, HPCA, PSAP and DKK3 are conserved marker genes overexpressed in sMSN ([355]Fig. 8b and [356]Supplementary Fig. 1a), all of which have neuroprotection function against various types of stress such as endoplasmic reticulum stress-induced neurodegeneration, heat stress, Aβ susceptibility and oxidative stress^[357]64–[358]67. These results suggest that this heterogenous group of MSNs shares the same stress responses irrespective of disease. To understand neuronal damage in AD and PD, we compared the transcriptomic responses of each neuronal cluster between disease and control conditions and detected 23–652 DEGs per cluster ([359]Supplementary Fig. 1e and [360]Supplementary Tables 51–[361]69). AD neurons generally had more upregulated genes, whereas PD neurons had more downregulated genes. Pathway analyses showed broad disturbance of pathways critical to normal neuronal function in MSNs, but not in interneurons ([362]Fig. 8i). Regulations of long-term synaptic potentiation, synaptic plasticity, neurotransmitter transport and learning or memory pathways were uniquely downregulated in D1 neurons from PD samples ([363]Fig. 8i). This finding is consistent with previous findings that long-term synaptic potentiation is selectively impaired in dSPNs (D1 neurons) in the parkinsonian state^[364]68,[365]69, which is a critical cellular and circuit mechanism associated with PD pathophysiology. Interestingly, pD2 and hMSN from AD samples were enriched for the APP metabolic process pathway, signifying a differential predisposition to Aβ-related pathology in these neurons. Discussion Astrocytes and microglia exert many essential actions that are crucial for neuronal survival and function in healthy CNS tissues. Mounting evidence demonstrates how glial cells play a key role in neurodegenerative disease^[366]33,[367]44,[368]70. However, most current knowledge is derived from in vitro systems or animal models whose relevance to human disease remains under debate^[369]71. Using unbiased snRNA-seq technology, we identified three astrocyte and two microglia subpopulations in human putamen that are conserved across multiple brain regions and different disease conditions, as well as between humans and mice. Importantly, our study revealed common microglia activation-associated genes and pathways shared by cognitively normal controls and diverse human disease conditions such as AD, PD and glioblastoma multiforme, irrespective of the brain region being studied. These analyses also revealed diseasespecific microglia-activation-associated transcriptomic changes linking human microglia activation to Aβ pathology, neuroinflammation and neurodegeneration. Our study discovered regional differences in pathology-associated transcriptomic changes in astrocytes and microglia, which may underlie selective regional vulnerability of the cortex and striatum. Finally, we described striatal MSN heterogeneity and neuronal transcriptomic profiles indicating disease-specific changes and selective neuronal vulnerability. Our analyses revealed regionally distinct astrocytic transcriptomic changes relevant to neurodegenerative diseases, including differences related to Aβ pathology, neurodegeneration, neuroinflammation and synapse organization. DEG overlap examination, genome-wide gene expression change correlation analysis, pathway analysis and GWAS AD and PD risk gene expression analysis together provide strong support for distinct transcriptomic changes in AD between putamen and cortical astrocytes. Although cortical astrocytes were examined at a lower sequencing depth than the putamen astrocytes ([370]Supplementary Table 11), the directionality of gene expression changes, the proportion of genes being dysregulated in the disease condition, the genome-wide gene expression change correlation analysis and the GWAS risk gene analyses results are independent of sequencing depth supporting the biological relevance of the findings rather than being sequencing artifacts. Regional heterogeneity in glia response to aging and injury has been described in mice and humans^[371]72. It is well documented that cortical amyloid plaques appear early in the disease and present in many nondemented older adults, whereas striatal plaques occur only at later histopathological stages of AD, and largely after dementia onset^[372]4–[373]6. The observed region-specific astrocytic and microglial disease-pathologyrelated transcriptomic alterations might contribute to these spatial variations in disease pathology and neuronal vulnerability. Previous comparison of signature genes of reactive microglial populations revealed a heterogenous microglial response to AD pathology in mice and humans^[374]73. By comparing pseudotime DEGs, we observed a high concordance of a set of microglia activation-associated genes shared by diverse human disease conditions and brain regions ([375]Fig. 7a), supporting the hypothesis that reactive microglia share a common microglial response to CNS pathology, irrespective of the disease etiology^[376]37,[377]74. The number of cells, sequencing depth, temporal dynamic nature of gene transcription regulation and the end stage at sampling all could contribute to the discrepancy of signature genes, whereas pseudotime DEGs are less affected by those factors, which may explain the differences between the two comparison methods. APOE and TYROBP were shared between human and mice with concordant expression changes in all 10 datasets (probability of 10^−6), and TYROBP-APOE signaling has been implicated in the initiation of DAM phenotypes^[378]38 suggesting evolutionally conserved microglial activation mechanism. We also discovered that microglia of varying disease conditions elicit a disease-specific response relevant to disease pathology ([379]Fig. 7d). The robust core gene signature identified in this study will help to define disease-specific roles of microglial activation in various CNS diseases. Our study reveals shared astrocytic transcriptome changes between AD and PD but largely distinct changes for microglia, suggesting common and divergent cell-type-specific mechanisms of pathogenesis. Interestingly, the ‘amyloid precursor protein metabolic process’ was uniquely upregulated in AD MSNs, whereas ‘long-term synaptic potentiation’ was uniquely downregulated in PD neurons. Furthermore, multiple neurodegenerative disease pathways were dysregulated in astrocytes, but not in neurons. Whether disease-specific glial transcriptomics changes are causal to disease-specific neuronal changes needs future investigation. A better understanding of how neurons and glia communicate as well as whether and how those pathways in corresponding glial cells contribute to disease pathogenesis may offer new ways to combat neurodegenerative diseases. Limitations of the study The sample size of our study is limited. However, using the same approaches, we validated our findings in four independent datasets for both astrocytes and microglia subpopulations, suggesting that our findings are representative. All snRNA-seq studies used samples from postmortem brain tissues and shared the limitations associated with all postmortem tissue studies. Methods Subjects In agreement with local ethical committee requirements, patients provided written informed consent before cognitive impairment, or the next of kinsperson supplied consent antemortem or postmortem (Washington University Institutional Review Board, Washington University School of Medicine, St Louis, MO). Clinically and neuropathologically well-defined human brain tissues were collected from the Charles F. and Joanne Knight Alzheimer Disease Research Center (Knight ADRC) and The Movement Disorders Center (MDC) Brain Bank at Washington University School of Medicine. (The clinical information and pathological characteristics are summarized in [380]Supplementary Table 1.) The use of tissue for genetics, autoradiography and biochemistry research was approved by the Knight ADRC and MDC Leadership Committees (ethics approval reference number T1705). Dementia level was assessed by Clinical Dementia Rating (CDR)^[381]75 according to the CDR criteria for diagnosing dementia in PD. Individuals with a CDR ≥ 1 were taken. AD pathological changes were classified using Braak staging^[382]76. Braak stages of Aβ accumulation use letter ratings: (A) the initial deposits in the basal neocortex, (B) deposits that extend into the adjacent areas of the neocortex and (C) heavy deposition throughout the entire cortex. Stages of neurofibrillary pathology were characterized as transentorhinal (I-II), limbic (III-IV) and neocortical (V and VI). All the cases were previously comprehensively examined by the MDC and Knight ADRC neuropathology core for their pathologic stages for Aβ (plaque), tau (NFT) and α-synucleinopathy (Lewy body) before being released to research laboratories. The systematic neuropathological characterization and AD grading were conducted and previously reported by our team^[383]77, and α-synucleinopathy with Lewy bodies was rated according to the scheme proposed by McKeith et al.^[384]78 and modified by Burack et al.^[385]77. Histologic stains include hematoxylin and eosin. Pathological staining was performed using the related antibodies Aβ (10D5, Elan Pharmaceuticals), tau (PHF-1, Abcam) and α-synuclein (LB- 509, Zymed), as we have reported^[386]79–[387]83. The selected cases had highly similar pathological profiles for each group in this study. The AD cases had Braak Aβ stage of C (n = 4), NFT stage V (n = 1) and VI (n = 3) and Lewy body stage of 0 (n = 4). The PD cases had Braak Aβ stage of C (n = 2), 0 (n = 1) and B (n = 1); NFT stage I (n = 2), II (n = 1), and III (n = 1); and Lewy body stage of 6 (n = 4). The cognitively healthy control cases had Braak Aβ stage of 0 (n = 4), NFT stage I (n = 1), II (n = 1) and III (n = 2), and Lewy body stage of 0 (n = 4). Although there are unavoidable pathological overlaps in Aβ and NFT in these aged patients between the three groups, the AD group had higher average stages of Aβ and NFT than the control and PD groups. In addition, the PD group had a higher Lewy body stage than AD and controls. The average age and postmortem interval time did not significantly differ across groups. The tissues harvested were as follows: four AD (two males, two females) aged 71–82 (mean: 75 ± 2) years at death, four PD (two males, two females) aged 69–77 (mean: 73 ± 2) years at death, and four age-matched healthy control cases (two males, two females) aged 73–85 (mean: 79 ± 2) years at death. No statistical methods were used to predetermine sample sizes, but our sample sizes are similar to those reported in previous publications^[388]7,[389]8. Sample processing and nuclei isolation Brains were obtained at the time of the autopsy. The right hemisphere was coronally sectioned and snap-frozen using liquid nitrogen vapor. The tissue blocks were preserved at −80 °C until use. The putamen tissue (~50 mg) was carefully dissected out with a scalpel at −20 °C using the autoradiography images of dopaminergic biomarkers^[390]80–[391]84, and the tissue was then homogenized using a glass Dounce grinder in 4 ml ice-cold homogenization buffer (HB; consisting of 0.1 mM DTT (Promega, P1171), 1X Protease inhibitor cocktail (Promega, G6521), 0.2 U μl^−1 RNasin Plus RNase Inhibitor (Promega, N2615), and 0.1% Triton X-100 (Sigma, T8787) in nuclei isolation media (consisting of 10 mM Tris buffer pH 8.0 (ThermoFisher Scientific, AM9856), 250 mM sucrose (VWR 97061), 25 mM KCl (ThermoFisher Scientific, AM9640G) and 5 mM MgCl[2] (ThermoFisher Scientific, AM9530G) in molecular biology grade water)). An additional 2 ml HB was added to the solution, and it was incubated on ice for five min. Then, the homogenized solutions were passed through a 70 μm cell strainer first and then filtered for an additional time with a 30 μm cell strainer. The double-strained homogenate was centrifuged (900 g, for 10 min at 4 °C), and the supernatant was removed. The remaining nuclei pellet was resuspended in 3 ml blocking buffer (BB; consisting of 1X PBS (Life Technologies, AM9625), 1% BSA (Sigma, 126625), and 0.2 U μl^−1 RNasin Plus RNase Inhibitor (Promega, N2615) in molecular biology grade water). Then, 30 μl myelin remove beads was added to the nuclei suspension, and the solution was incubated at 4 °C for 15 min after resuspension. After incubation, an additional 3 ml BB was added to the solution, which was centrifuged (500 g, for five min at 4 °C), and then the supernatant was removed. Then, the resulting pellet was resuspended in 3 ml BB and incubated on a Dynamag magnet (ThermoFisher Scientific, 12301D) for 15 min at 4 °C refrigerator. Finally, the supernatant was removed and filtered through a 30 μm cell strainer. For accurate quantification, the nuclei-enriched supernatant was stained with DAPI (1:1,000) (ThermoFisher Scientific, D1306), and the nuclei were counted with a Countess II Automated Cell Counter (ThermoFisher Scientific, AMQAX1000). Library construction and sequencing Purified nuclei were delivered to the McDonnell Genome Institute (MGI) at Washington University School of Medicine to generate 10x Genomics libraries using Chromium Single Cell 3′ V3 Reagent Kits according to the 10x Genomics protocol. The generated libraries were sequenced on the NovaSeq S4 platform (Illumina). Sequencing saturation ranged from 60.8% to 94.8%. MGI demultiplexed raw base sequence calls generated from the sequencer into sample-specific FASTQ files. Mapping snRNA-seq data to the reference genome and cell quality control CellRanger 3.0.2 (10x Genomics, [392]https://support.10xgenomics.com/) was used to align FASTQ files to the human GRCh38 pre-mRNA reference genome. The aligned reads were traced back to individual cells, and the gene expression level of individual genes was quantified based on the number of UMIs (unique molecular indices) detected in each cell. The filtered gene-cell barcode matrices generated with CellRanger were used for further analysis with the R package Seurat v3.0 (ref.^[393]13), 4.0.5 (ref.^[394]85). R version 4.0.2, 4.0.4, 4.1.1 was used for statistical analysis and plotting (R Core Team (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria; [395]http://www.R-project.org/). Quality control was implemented as the first step in data analysis. We first filtered out genes that were detected in less than five nuclei. Nuclei that were doublets or low quality were further filtered out by two criteria. First, nuclei with less than 500 genes, more than 10% mitochondrial content or an extremely high number of detected genes or UMIs were filtered out. Cutoffs for UMI and gene number were determined on the basis of a scatter plot showing the number of genes as a function of the number of UMI per cell. A cutoff of 500–70,000 UMI and 500–9,000 genes was applied. Next, after unsupervised clustering, cell clusters with a mixed expression of markers from different cell types and clusters with low quality were removed, including clusters with a high percentage of mitochondrial genes and clusters without marker genes conserved across control, AD and PD groups. The initial dataset contained 38,929 cells. After these quality control procedures, we obtained 30,908 high-quality single-nucleus gene expression profiles (an average of 2,576 cells/subject) and detected a median of 2,187 genes and 4,363 transcripts per nucleus. Cell clustering and cell-type identification For integrative analysis, we followed the workflow described in the Seurat guided analysis for ‘Performing integration on datasets normalized with SCTransform^[396]86‘. We use SCTransform to normalize gene expression levels and to regress out variations from mitochondrial gene expressions. To integrate the single-cell data from individual donor samples, we used function SelectIntegrationFeatures (nfeatures = 3,000) to identify highly variable genes. Functions PrepSCTIntegration, FindIntegrationAnchors (normalization.method = ’SCT’) and IntegrateData (normalization.method = ’SCT’) from Seurat v3.0 were implemented. The top 3,000 most variable genes were selected as integration features and used for integration anchor selection. Principal component analysis was performed using the top 30 PCAs. UMAP analysis was performed with the top 20 dimensions. Clusters were identified with the functions FindNeighbors (dims = 1:20) and FindClusters (resolution = 0.1). A resolution of 0.1 was selected for the downstream analysis because clusters were clearly separated and matched visual inspection. Default parameters were used unless noted. Cell-type identification The function FindConservedMarkers (assay = ’SCT’, slot = ’data’, test. use = ’wilcox’, min.pct = 0.2, grouping.var = ’Genotype’, only.pos = TRUE) implemented in Seurat v3.0 was used for identifying marker genes that were conserved in the control, AD and PD groups. This function first calculates differentially expressed genes of each cluster against all other clusters for each condition using the function FindMarkers, and then the metap R package with default meta-analysis method metap::minimump to perform meta-analysis of P values (significance values) to generate a combined P value. max_pval < 0.05 was used as a cutoff to determine the conserved marker genes for each cluster, identifying positive markers for a given cluster that are shared by control, AD and PD. Identified markers were compared with celltype-specific markers from mouse striatum^[397]87 and human brains^[398]7–[399]10,[400]88. Cell type was manually annotated based on the expression for the following known marker genes: astrocytes (GFAP, AQP4, SLC1A2, ALDH1L1, GJA1, SLC1A3), endothelial cells and pericytes (FLT1, CLDN5, RGS5, PDG-FRB), immune cells and microglia (PTPRC, C1QB, CSF1R, CD74, CX3CR1, P2RY12, HLA-DRA, ITGAM, RUNX1), neurons (SYT1, SNAP25, RBFOX1, RBFOX3, GRIK2, GRIA1, GRIN2B, GAD1, GAD2, GRIN1), oligodendrocytes (MOG, MBP, MOBP, PLP1, CLDN11, SOX10, OLIG1, OLIG2) and oligodendrocyte precursors (VCAN, PCDH15, MEGF11, SOX10, OLIG1, OLIG2). Astrocyte subcluster analysis We first isolated nuclei of the astrocyte cluster (control = 1,203, AD = 1,642, PD = 1,433 nuclei) from the original Seurat object using the subset function. The data were split into individual samples based on the subject identity. Then we performed data integration and unsupervised clustering following the procedure similar to that used for our initial cell-type clustering using default parameters except noted below: SCTransform^[401]86 normalization (vars.to.regress = c(‘nCount_ RNA’, ‘percent.mt’), SelectIntegrationFeatures (nfeatures = 3,000), PrepSCTIntegration(anchor.features = selected.features), FindIntegrationAnchors (normalization.method = ’SCT’, anchor.features = selected.features, normalization.method = ’SCT’, reduction = ’cca’, k.filter = 170), IntegrateData (anchorset = selected.anchors, normalization.method = ’SCT’), RunPCA(npcs = 30). Additional low-quality cell clusters, including cell clusters that showed mixed expression of markers of astrocyte with markers from other cell types, which likely represent doublets, and cell clusters containing a high percentage of reads mapped to mitochondrial genes, were further filtered. To measure the effect of parameters on clustering results and determine the most stable cell population structure, a total of 48 different combinations of parameters for dimensionality (5, 10, 15, 20, 25, 30) and resolution (0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.35, 0.4) were used to perform cell clustering. The concordance of cluster identity was measured using the adjusted Rand index (ARI) calculated using the adjustedRandIndex function implemented in the mclust R package. To measure the concordance of cell identity between using all cells (cells from control, AD and PD cases) for clustering and using only cells from AD and controls for clustering, we used different combinations of parameters to perform cell clustering in each situation, and ARI for shared cells were calculated. The clustering results were stable across a wide range of parameter combinations. With a dimensionality of 15 and resolution of 0.25, the parameters resulted in the highest ARI (0.96) between the two situations and were used for all the downstream analyses for astrocytes. The same approach and parameters were used for analyzing the Grubman et al.^[402]7, Lau et al.^[403]9 and Feleke et al.^[404]8 datasets whereas different parameters were used for Mathys et al.^[405]10 data (dimensionality = 30, resolution = 0.5). Immune cell subcluster analysis We first isolated nuclei of the immune cell cluster (control = 558, AD = 827, PD = 619 nuclei) from the original Seurat object using the subset function. The data were split into individual samples based on the subject identity, and we performed data integration and unsupervised clustering following the procedure similar to that used for our initial cell-type clustering using default parameters except noted below. Due to the small number of cells in one subject, some parameters were adjusted. SCTransform normalization (vars.to.regress = c(‘nCount_ RNA’, ‘percent.mt’), SelectIntegrationFeatures (nfeatures = 3000), PrepSCTIntegration(anchor.features = selected.features), FindIntegrationAnchors (normalization.method = ’SCT’, anchor.features = selected. features, normalization.method = ’SCT’, dims = 1:20, k.anchor = 5, k.filter = 20, k.score = 20, max.features = 200), IntegrateData (anchorset = selected.anchors, normalization.method = ’SCT’), RunPCA(npcs = 30). Cell clusters were defined using resolution = 0.2. To perform microglia pseudotime analysis, we isolated microglia nuclei using the subset function and determined the best parameters for obtaining the most stable microglia subpopulation structure as described above for the astrocyte subpopulation analysis. A dimensionality of 10 and resolution of 0.15 were selected for our microglia data analyses, and for the Grubman et al.^[406]7 and Lau et al.^[407]9 datasets whereas different parameters were used for the Feleke et al.^[408]8 (dimensionality = 20, resolution = 0.15) and the Mathys et al.^[409]10 (dimensionality = 40, resolution = 0.5) data. Neuronal cell subcluster analysis We first isolated the nuclei of the neuronal clusters (control = 2259, AD = 3343, PD = 3286 nuclei) from the original Seurat object using the subset function. The data were split into individual samples based on the subject identity, Then we performed data integration and unsupervised clustering following the procedure similar to that used for our initial cell-type clustering using default parameters except noted below. SCTransform normalization (vars.to.regress = c(‘nCount_RNA’, ‘percent.mt’)), SelectIntegrationFeatures (nfeatures = 3,000), PrepSCTIntegration(anchor.features = selected.features), FindIntegrationAnchors (normalization.method = ’SCT’, anchor. features = selected.features, normalization.method = ’SCT’, dims = 1:30), IntegrateData (anchorset = selected.anchors, normalization. method = ’SCT’), RunPCA(npcs = 30). Additional low-quality cell clusters, including cell clusters that showed mixed expression of markers of neurons with markers from other cell types, which likely represent doublets, and cell clusters that containing a high percentage of reads that mapped to mitochondrial genes, were further filtered. Cell clusters were defined using a dimensionality of 20 and a resolution of 0.3. Cluster ‘ncD1,’ a cluster discretely separate from the other clusters but clustered with ‘sMSN,’ was manually selected using the CellSelector function of the Seurat R package. Conserved marker analysis of the cluster confirmed it was transcriptionally distinct, with its unique marker genes. The neuronal subtypes were manually annotated using the following markers: MSNs (PPP1R1B, MEIS2), interneurons (ELAVL2, PDGFD, OPN3, CALB2, TAC3, TH, SLC6A3, SST, NPY, SLC5A7, LHX8), D1 and D2 (DRD1, DRD2), classical D1 (TAC1), classical D2 (PENK), matrix MSNs (STXBP6, EPHA4) and patch MSNs (KCNIP1, BACH2). Analysis of gene differential expression FindConservedMarkers function (assay = ’SCT’, slot = ’data’, test. use = ’wilcox’, min.pct = 0.2) was used to determine statistically significant cluster marker genes that were conserved in the control, AD and PD cases. A gene with meta-analysis combined P value < 0.05 determined by the default function metap was determined to be statistically significant. FindMarkers function (assay = ’SCT’, slot = ’data’, test.use = ’wilcox’, min. pct = 0.2) was used to determine differentially expressed genes (DEGs) in disease conditions compared to the controls. We used base = exp(1) (the default parameter for calculating fold change in the DEG analysis implemented in Seurat v3.0) for all the comparisons to keep the analysis consistent within the manuscript. A gene with a Benjamini–Hochberg (FDR) adjusted P value < 0.05 and a natural logarithm of fold change > 0.25 or < −0.25 was determined to be statistically significant. An absolute value of logFC > 0.25 (natural logarithm of fold change, Seurat v3.0 default parameter) is equivalent to 1.28-fold. Gene set enrichment analysis and comparison with previously published microglia-activation-associated pseudotime DEGs GO and KEGG^[410]89 pathway enrichment analyses were performed using the R package clusterProfiler v3.16.1 (ref.^[411]90). Results with FDR-corrected P value < 0.05 and at least five query genes were reported as significantly enriched pathways. We performed GO term enrichment analysis under the following three sub-ontologies: biological process, molecular function and cellular component. Gene signatures of DAM were obtained from Keren-Shaul et al.^[412]37. Gene signatures of disease-associated astrocyte (DAA) were obtained from Habib et al.^[413]14. Signature gene enrichment was evaluated using the hypergeometric test implemented in the phyper function in the R Hypergeometric package with lower.tail= FALSE. The total number of features that were detected at least once in the cell population being analyzed was used as the background gene set in GO and KEGG pathway enrichment analysis. To compare with previously published pseudotime DEGs of activated microglia populations, pseudotime DEGs and direction of change as the cells transition from homeostatic to activated states were retrieved from the following publications: From Sankowski et al.^[414]48, ‘[415]supplementary table 11‘ was downloaded. From Keren-Shaul et al.^[416]37, [417]Supplementary Table 7 was downloaded. Gene names and gene expression direction changes were obtained Sala Frigerio et al.^[418]45 ([419]Figure S5 and main text; this is not a complete list of pseudotime DEGs, as these authors only reported AD risk genes in [420]Figure S5). The data were plotted using R package ComplexHeatmap version 2.12.0 (ref. ^[421]91). Statistics and reproducibility No statistical methods were used to predetermine sample sizes, but our sample sizes are similar to those reported in previous publications. Wilcoxon rank sum test is a nonparametric test, which does not require normal distribution of the data. The study participants were allocated into groups based on their clinical diagnoses. We selected one case from each group for RNAscope mRNA in situ hybridization combined with immunohistochemistry assays. We were not blinded to allocation during experiments and outcome assessment, although the F3 RNAScope in situ hybridization signal quantifications were conducted blind to the conditions of the experiments. The sample of PD subject 1654 was replaced with the sample of PD subject 5212 in F3 RNAScope in situ hybridization signal quantifications experiment because not enough tissue from subject 1654 was available for the experiment. Our findings were replicated in four independent datasets for both astrocytes and microglia subpopulations, suggesting that our findings are representative. Reporting summary Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article. Extended Data Extended Data Fig. 1 |. snRNA-seq profiling and characterization of major cell types. Extended Data Fig. 1 | [422]Open in a new tab a, Brain region analyzed with snRNA-seq. Created with BioRender. com. b, Comparison of age, postmortem interval (PMI), number of cells, the median number of transcripts and median number of genes per nucleus among control, AD and PD groups. c, Heatmap of the relative expression level of top 10 marker genes for each cell type. d, Violin plots of gene expression levels of known cell-type-specific marker genes. e, UMAP plot colored by experimental batch or individual label. UMAP were generated using the same parameters as described in [423]Fig. 1. f,g, Percentage of cells from (f) each disease group or (g) individuals of each disease group in each of the major cell type. Ast: Astrocyte; EP: Endothelia cell and pericyte; Immune: Immune cell including microglia; OLIGO: Oligodendrocyte; OPC: Oligodendrocyte precursor cell. Conserved marker genes were determined by FindConservedMarkers using Wilcoxon Rank Sum test and metap R package with meta-analysis combined P value < 0.05 comparing gene expression in the given cluster with the other cell clusters for AD (n = 4), PD (n = 4) and the controls (n = 4). Extended Data Fig. 2 |. Identification and validation of the three astrocytes subpopulations. Extended Data Fig. 2 | [424]Open in a new tab a, Heatmap plot of the adjusted rand index (ARI) of pair-wise clustering result comparison using all cells with a range of dimensionality (5–30) and resolution (0.05–0.35). The black star indicates the parameter selected for all downstream analyses including analyses of entorhinal and prefrontal cortex astrocytes (dimensionality = 15, resolution = 0.25). The black lines delineate the range of parameters that generated high ARIs. b,c, UMAP visualization of subclusters of astrocytes colored by (b) disease diagnosis or (c) individual identity. d, Distribution of cells from each diagnostic group in the astrocyte subpopulations. Each dot represents an individual except entorhinal cortex data where each dot represents samples from two subjects that were processed together. e, Distribution of cells from each astrocyte subpopulation in different diagnostic groups. f,g, RNAscope in situ hybridization (ISH) analysis of Ast-2 conserved marker genes CD44 (f) and TNC (g) transcript expression (red) and immunohistochemistry staining (brown) of AQP4 in the internal capsule tissue sections of the same subjects of the control (CTRL), AD and PD groups shown in [425]Fig. 1. For all data, the experiment was performed once. Hematoxylin-positive cell nuclei are shown in blue. Scale bar = 100 μm. Extended Data Fig. 3 |. Characterization of astrocyte subpopulations in the prefrontal cortex (pfc) of Mathys et al., 2019 and the anterior cingulate cortex (acc) of the Feleke et al. 2021 data. Extended Data Fig. 3 | [426]Open in a new tab a,b, UMAP visualization of astrocyte subpopulations colored by cluster identity for (a) prefrontal cortex and (b) anterior cingulate cortex astrocytes. c,d UMAP visualization of astrocyte subpopulations colored by conserved marker gene expression levels for (c) prefrontal cortex and (d) anterior cingulate cortex. e, Dot plot of conserved marker gene expression levels in Ast-0, Ast-1 and Ast-2 astrocytes from the two brain regions. f, Violin plot showing the expression of Ast-2 conserved marker genes shared with putamen Ast-2. g–j UMAP visualization of subclusters of astrocytes colored by (g,h) disease diagnosis or (i,j) individual identity. k, The distribution of cells from each astrocyte subpopulation in different diagnostic groups (left) and the distribution of cells from each diagnostic group in the astrocyte subpopulations (right) of the Mathys et al., 2019 data. Each dot represents an individual. Conserved marker genes were genes whose expression is significantly higher than its expression in other cell clusters in all diagnostic groups determined by FindConservedMarkers using Wilcoxon Rank Sum test and metap R package with meta-analysis combined P value < 0.05. Red asterisks (*) indicate statistical significant conserved marker genes. Extended Data Fig. 4 |. Characterization and comparison of the three astrocytes subpopulations from the putamen (pu), entorhinal cortex (ec), and prefrontal cortex (pfc) of Lau et al. data. Extended Data Fig. 4 | [427]Open in a new tab a, Upset plot showing the overlap between putamen conserved marker genes of Ast-0, Ast-1 and Ast-2 astrocyte with marker genes of mouse DAA and Gfap-high astrocytes from Habib et al., 2020. b, Violin plots showing the expression level distributions of orthologous genes of murine DAA and Gfap-high astrocyte marker genes in the putamen astrocytes. c, PCA plot using murine DAA and Gfap-high astrocyte marker gene logFC of gene expression (comparing murine DAA and Gfap-high astrocyte with Gfap-low astrocytes, downloaded from Habib et al., 2020) and the logFC of the human orthologous genes (comparing putamen Ast-1 and Ast-2 with Ast-0 astrocytes). d,e, Violin plots showing the expression level distributions of reactive astrocyte marker genes in astrocytes from the (d) putamen and (e) prefrontal cortex. f, Violin plots showing the expression level distributions of A1-, A2-specific activated astrocyte markers and JAK-STAT3 pathway genes. g, Top 10 GO terms in the Biological Process category enriched in the astrocyte subpopulation signature genes (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query genes). Conserved marker genes plotted in panel (b), (d) and (e) were determined by FindConservedMarkers using Wilcoxon Rank Sum test and metap R package with meta-analysis combined P value < 0.05 comparing gene expression in the given cluster with the other cell clusters for AD (n = 4), PD (n = 4) and the controls (n = 4). Genes plotted in (f) were not statistically significantly higher in any of the astrocyte subpopulations. Extended Data Fig. 5 |. Comparison of differentially expressed genes (DEGs) of the three astrocyte subpopulations from the putamen (pu), entorhinal cortex (ec), and prefrontal cortex (pfc) from the Lau et al., data. Extended Data Fig. 5 | [428]Open in a new tab a, UpSet plot showing the number of overlapping up- and downregulated DEGs among the three astrocyte subpopulations for AD (left) and PD (right) astrocytes. b, Venn diagram showing the overlap of up- and downregulated DEG between AD and PD in each putamen astrocyte subpopulation (hypergeometric test). c, UpSet plot showing the overlap of DEGs that were up- or downregulated in AD between putamen (pu), entorhinal cortex (ec) and prefrontal cortex (pfc) astrocyte subpopulations. d, Disease-related Gene Ontology (GO) terms enriched in the astrocyte DEGs (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query genes). UP: upregulated in disease samples. Down: downregulated in disease samples. Extended Data Fig. 6 |. Identification of immune cells and validation of microglia subpopulations. Extended Data Fig. 6 | [429]Open in a new tab a,b, Violin plots showing the expression level distributions of marker genes for (a) PVM and (b) activated microglia. c, Distribution of percentage of cells from each subject in each immune cell cluster of the putamen (pu), entorhinal cortex (ec) from the Grubman et al. data and prefrontal cortex (pfc) from the Lau et al. data (one-way ANOVA or student’s t-test). Each dot represents a subject except the ec data. d,e, Immunohistochemistry staining (brown) of microglia marker protein P2RY12 and RNAscope in situ hybridization (ISH) analysis (red) of (d) AIF1 and (e) APOC1 transcript expression in the internal capsule tissue of the same subjects shown in [430]Fig. 1. Hematoxylin-positive cell nuclei are shown in blue. For all data, the experiment was performed once. f–h, UMAP visualization of only microglia subpopulations from (f) pu, (g) ec and (h) pfc. UMAPs were generated using a dimensionality of 10 and resolution of 0.15. i–k, Violin plots showing the expression level distributions of conserved microglial subpopulation marker genes in putamen (i), entorhinal cortex (j) and preprontal cortex (k) microglia subpopulations. Conserved marker genes plotted in panel (a), and HLA-DRA, HLA-DPB1, FTL and CD14 plotted in panel (b) were determined by FindConservedMarkers using Wilcoxon Rank Sum test and metap R package with meta-analysis combined P value < 0.05 comparing gene expression in the given cluster with the other cell clusters for AD (n = 4), PD (n = 4) and the controls (n = 4). Extended Data Fig. 7 |. Four distinct immune cell populations in (A-D) the prefrontal cortex (pfc) of the Mathys et al., and (E-H) the anterior cingulate cortex (acc) data of the Feleke et al. data. Extended Data Fig. 7 | [431]Open in a new tab a,e, UMAP visualization of subclusters of immune cells colored by cell cluster (left) or disease diagnosis (right). UMAPs were generated using parameters of dimensionality of 40 and resolution of 0.5 for the Mathys et al. data (AD n = 24, controls n = 24) and dimensionality of 20 and resolution of 0.15 for the Feleke et al. data (n = 7 each for the control, DLBD, PD and PDD samples). Violin plots showing the expression level distributions of genes for (b, f) T cell, microglia and PVM shared markers and PVM unique markers; (c, g) microglia-specific markers, and microglia subpopulation markers; (d, h) Micr-0 marker and activated microglia markers. The color code is the same as in (a) and (e), respectively. The conserved marker genes were determined by FindConservedMarkers using Wilcoxon Rank Sum test and metap R package with meta-analysis combined P value < 0.05 comparing gene expression in the cells of given cluster with that of the other cells. PVM: perivascular macrophage; CycM: cycling microglia. Extended Data Fig. 8 |. Comparison of microglial pseudotime DEGs. Extended Data Fig. 8 | [432]Open in a new tab a, Venn diagram showing the overlap between pseudotime DEGs of control, AD and PD microglia with AD-risk genes (hypergeometric test). Pseudotime DEGs are genes whose expression significantly associated with pseudotime progression (generalized addictive model, FDR-adjusted P value < 0.05). b, UpSet plot showing the overlap between control, AD and PD microglial pseudotime gene coexpression modules from putamen microgla. c, Heatmap showing pseudotime DEGs shared by human activated microglia from the putamen (pu) of cognitively normal controls, AD and PD samples, from prefrontal cortex (pfc) of the control and AD samples and from the entorhinal cortex (ec) of the control and AD samples. d, GO terms related to immune functions enriched in the microglia pseudotime DEGs. e, Heatmap showing pseudotime DEGs shared by the mouse activated microglia DAM and ARM. f, Top 5 GO terms in the biological process category enriched in the microglia pseudotime DEGs. Pathways with FDR-adjusted P value < 0.05 (hypergeometric test) and at least five query genes were considered statistically significant. DAM: Disease-associated microglia; ARM: activated response microglia. UP: upregulated during pseudotime progress (module 2 and 3 genes). Down: downregulated during pseudotime progress (module 1 genes). Mod 1: module 1 genes; Mod 2 + 3: module 2 and 3 genes combined. Extended Data Fig. 9 |. Microglia transcriptomic changes in disease contributed to Aβ pathology, tauopathy and neuronal death. Extended Data Fig. 9 | [433]Open in a new tab a–d, Volcano plots showing significant DEGs in Micr-0 and Micr-1 comparing cells from AD (left panels) or PD (right panels) with cells from the controls (CTRL). The x-axis specifies the logFC and the y-axis specifies the negative logarithm to the base 10 of the FDR-adjusted P values. Magenta and cyan dots represent genes expressed at significantly higher or lower levels respectively in disease samples (Wilcoxon Rank Sum test, FDR-adjusted P value < 0.05, absolute logFC > 0.25) comparing AD (Micr-0 = 440, Micr-1 = 299 cells) or PD (Micr-0 = 329, Micr-1 = 201 cells) microglia to the control (Micr-0 = 264, Micr-1 = 198 cells) microglia. Violin plots showing the expression level distributions of example DEGs that were (b) downregulated in both AD and PD microglia, (c) uniquely downregulated in AD or (d) uniquely upregulated in PD. e, GO terms related to neuron death, Aβ pathology and tauopathy enriched in microglial DEGs (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query genes). f, Heatmaps showing the logFC of expression level of significant DEGs for GWAS AD- and PD-risk genes; GWAS genes differentially expressed in at least two subpopulations were plotted for visualization. UP: upregulated in disease samples. Down: downregulated in disease samples. Extended Data Fig. 10 |. Comparison of microglia DEGs. Extended Data Fig. 10 | [434]Open in a new tab a, Venn diagram demonstrating overlap between AD and PD DEGs in the Micr-0 and Micr-1 cells for DEGs upregulated (left) or downregulated (right) in the disease samples. b,c, Scatter plots showing pair-wise correlations of genome-wide gene expression logFC (b) between Micr-0 and Micr-1 in AD (left) or PD (right) samples and (c) between AD and PD samples in Micr-0 (left) or Micr-1 (right) cells respectively. d, Top 5 GO terms in the biological process category enriched in the DEGs of the microglia subpopulations from the putamen (pu), entorhinal cortex (ec), prefrontal cortex (pfc) (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query genes). e, Bar plot showing the number of DEGs for each subpopulation of microglia from the three brain regions (Wilcoxon Rank Sum test, FDR-adjusted P value < 0.05 and absolute logFC >0.25). UP: upregulated in disease samples. Down: downregulated in disease samples. Supplementary Material Supplementary Information [435]NIHMS1880699-supplement-Supplementary_Information.pdf^ (19.9MB, pdf) Supplementary Table [436]NIHMS1880699-supplement-Supplementary_Table.xlsx^ (2.2MB, xlsx) Acknowledgements