Abstract

   The commonalities and differences in cell-type-specific pathways that
   lead to Alzheimer disease (AD) and Parkinson disease (PD) remain
   unknown. Here, we performed a single-nucleus transcriptome comparison
   of control, AD and PD striata. We describe three astrocyte
   subpopulations shared across different brain regions and evolutionarily
   conserved between humans and mice. We reveal common features between AD
   and PD astrocytes and regional differences that contribute toward
   amyloid pathology and neurodegeneration. In contrast, we found that
   transcriptomic changes in microglia are largely unique to each
   disorder. Our analysis identified a population of activated microglia
   that shared molecular signatures with murine disease-associated
   microglia (DAM) as well as disease-associated and regional differences
   in microglia transcriptomic changes linking microglia to
   disease-specific amyloid pathology, tauopathy and neuronal death.
   Finally, we delineate undescribed subpopulations of medium spiny
   neurons (MSNs) in the striatum and provide neuronal transcriptomic
   profiles suggesting disease-specific changes and selective neuronal
   vulnerability.
     __________________________________________________________________

   Neurodegenerative disorders such as AD, PD, Huntington disease and
   amyotrophic lateral sclerosis are all characterized by the aggregation
   and deposition of abnormal proteins^[56]1,[57]2. However, the
   composition of protein aggregates is unique and distinct to each
   disorder. For example, intracellular neurofibrillary tangles (NFTs)
   formed by hyperphosphorylated tau proteins and extracellular Aβ plaques
   generated from amyloid precursor protein (APP) are hallmarks of AD,
   whereas Lewy bodies resulting from misfolded α-synuclein proteins occur
   in PD. These observations suggest there are both common and divergent
   mechanisms of neurodegenerative pathogenesis, but the identity of these
   pathways remains mysterious.

   Damage to the basal ganglia occurs in many neurodegenerative diseases,
   including AD, PD and Huntington disease^[58]3, yet little is known
   about the underlying molecular mechanisms. The striatum, which
   comprises of the caudate and putamen, is the main input structure of
   the basal ganglia, which is crucial for motor learning and a variety of
   cognitive functions. Striatal Aβ plaques and NFT deposits, and a
   reduction in striatal volume, are common features of AD and PD.
   However, cortical amyloid plaques appear early in the disease and are
   present in many nondemented older adults, whereas striatal plaques
   usually occur at later stages of AD and largely after dementia
   onset^[59]4–[60]6. Furthermore, the neuronal heterogeneity of the human
   striatum has not been characterized in detail, and the vulnerability of
   neuronal populations remains unclear. To answer these questions, we
   used single-nucleus RNA sequencing (snRNA-seq) to compare
   transcriptomes from postmortem striata of well-characterized
   cognitively normal controls, AD and PD cases and with previously
   published transcriptomes from postmortem entorhinal cortex (ec)^[61]7,
   anterior cingulate cortex (acc)^[62]8 and prefrontal cortex
   (pfc)^[63]9,[64]10. We identified evolutionarily conserved astrocyte
   and microglia subpopulations shared across different disease conditions
   and multiple brain regions. We further describe unique human astrocyte
   and microglia activation states, their regional differences in
   transcriptomic changes in disease conditions and their contributions to
   Aβ pathology, tauopathy and neuronal death. Finally, we observed
   greater striatal MSN heterogeneity than previously shown^[65]11,[66]12
   and neuronal transcriptomic profiles that indicate disease-specific
   changes and selective neuronal vulnerability.

Results

The single-nucleus transcriptomes of human AD and PD brains

   We selected the precommissural putamen postmortem brains of four
   patients with AD, four patients with PD and four control cases matched
   for sex, age (range 69–85.4 years) and postmortem interval (Methods,
   [67]Extended Data Fig. 1a,[68]b and [69]Supplementary Table 1). We
   obtained 30,908 high-quality single-nucleus gene expression profiles
   after quality filtering and doublets removal with a comparable number
   of cells, genes and transcripts across diagnostic groups ([70]Extended
   Data Fig. 1b). We performed unsupervised clustering using Seurat^[71]13
   and mapped clusters to six major cell types by comparing conserved
   marker genes (markers that are conserved among the groups of AD, PD and
   controls) with the expression patterns of known cell-type-specific
   markers, including astrocytes (AQP4, SLC1A2), endothelial cells and
   pericytes (FLT1, RGS5), immune cells (CSF1R, PTPRC, RUNX1), neurons
   (RBFOX1, RBFOX3, SYT1), oligodendrocytes (MBP, PLP1) and
   oligodendrocyte precursor cells (PCDH15, MEGF11 and VCAN) ([72]Fig.
   1a,[73]b and [74]Extended Data Fig. 1c,[75]d). Clustering was not
   driven by experimental batch or individual samples, and the percentage
   of cells from each case that make up each cluster was not statistically
   different across the groups ([76]Extended Data Fig. 1e–[77]g).

Fig. 1 |. Characterization of six major cell types and three distinct
astrocyte subpopulations.

   Fig. 1 |
   [78]Open in a new tab

   a, Unsupervised clustering of snRNA-seq data and UMAP (Uniform Manifold
   Approximation and Projection) plot of all cells from putamen (pu)
   colored by cluster identity. UMAP plots were generated using default
   parameters except reduction = ’pca’, dims = 1:20. b, UMAP plot of all
   cells colored by marker gene expression levels. c–e, UMAP visualization
   of astrocyte subpopulations colored by cluster identity for putamen (c;
   total nuclei: control 1,203, AD 1,642, PD 1,433), ec (d; control 1,660,
   AD 702) and pfc (e; control 6,109, AD 7,144) astrocytes. f–h, UMAP
   visualization of astrocyte subpopulations colored by conserved marker
   gene expression levels for putamen (f), ec (g) and pfc astrocytes (h).
   i,j, Dot plot of conserved marker genes (i) and CD44 and TNC expression
   levels (j) in Ast-0, Ast-1 and Ast-2 astrocytes from the three brain
   regions. k, Venn diagram demonstrating overlap of conserved marker
   genes among the three brain regions for each astrocyte subpopulation.
   l,m, Violin plot showing the expression of Ast-2 conserved marker genes
   CD44 (l) and TNC (m) measured by snRNA-seq. n,o, CD44 (n) and TNC (o)
   expression validated by RNAScope in situ hybridization together with
   AQP4 immunohistochemistry staining in the putamen of control, AD and PD
   samples. CD44 and TNC, red; AQP4, tan. For all data, the experiment was
   performed once. FindConservedMarkers using Wilcoxon rank sum test and
   metap R package with meta-analysis combined P value < 0.05. Scale bars,
   100 μm. CTRL, control; immune, immune cell; Ast, astrocyte; EP,
   endothelial cell and pericyte; OLIGO, oligodendrocytes; OPC,
   oligodendrocyte precursor cell.

Conserved existence of three distinct astrocyte subtypes

   We first examined if there were distinct populations of astrocytes in
   our samples. We selected the parameters resulting in the most stable
   clustering (dimensionality = 15, resolution = 0.25, adjusted rand index
   = 0.96; [79]Extended Data Fig. 2a), for all downstream analyses.
   Subclustering analysis of all astrocyte nuclei revealed three
   subpopulations: Ast-0 (n = 2,301), Ast-1 (n = 1,338) and Ast-2 (n =
   638) ([80]Fig. 1c). Twenty-four genes for Ast-0 (for example GPC5,
   NRXN1); 261 genes for Ast-1, including AD risk genes APOE, CLU and
   APOC1 and 160 genes for Ast-2 (for example, DPP10, GFAP) were
   identified to be conserved marker genes for the given cell types
   ([81]Fig. 1f,[82]i,[83]k and [84]Supplementary Tables 2–4). CD44 and
   TNC were uniquely enriched in Ast-2 cells ([85]Fig. 1j,[86]l,[87]m).
   AQP4 immunohistochemistry staining combined with CD44 or TNC mRNA
   RNAscope in situ hybridization in the adjacent tissue sections of
   control, AD and PD samples analyzed by snRNA-seq confirmed the
   existence of the Ast-2 astrocytes in both the putamen ([88]Fig.
   1n,[89]o) and the white matter tissue of the internal capsule in all
   three groups ([90]Extended Data Fig. 2f,[91]g). The conservation of
   these marker genes among all groups suggested there are three distinct
   astrocyte subpopulations regardless of disease status.

   To determine whether astrocyte subpopulations identified in the
   striatum existed in other human brain regions, we analyzed data from
   previously published snRNA-seq studies using the same parameters.
   Grubman et al.^[92]7 sampled the ec of six patients with AD and six
   matched controls. Three astrocyte subpopulations were detected
   ([93]Fig. 1d) with 4/8 (50%), 20/23 (87%, P value = 6.1 × 10^−31),
   19/23 (82.6%, P value = 2.2 × 10^−40, hypergeometric test) conserved
   marker genes overlapped with those identified in the putamen ([94]Fig.
   1g,[95]i,[96]k and [97]Supplementary Tables 5–7). Lau et al.^[98]9
   sampled pfc tissues from 12 patients with AD and nine matched controls
   and identified four astrocyte subpopulations ([99]Fig. 1e). The first
   three most abundant populations had 5/11 (45%), 46/67 (68.7%, P value
   =1.1 × 10^−77), 20/22 (90.9%, P value = 2.7 × 10^−44, hypergeometric
   test) conserved marker genes ([100]Supplementary Tables 8–[101]10) that
   overlapped with those identified in the putamen ([102]Fig.
   1h,[103]i,[104]k). We further validated the existence of the three
   astrocyte subpopulations in the pfc of a much larger AD cohort with 24
   patients with AD and 24 matched controls (referred to as ‘AD pfc
   Mathys’ data)^[105]10 as well as in the acc samples from a Lewy body
   disease (LBD) cohort of 28 cases, including healthy controls, PD, PD
   dementia and dementia with Lewy bodies (n = 7 per group, referred to as
   ‘LBD acc Feleke’ data)^[106]8. Most of the conserved marker genes were
   shared by the astrocyte subpopulations in these two datasets
   ([107]Extended Data Fig. 3a–[108]f), although some genes were not
   detected or did not reach statistical significance in the pfc Mathys
   data, likely due to the low cell numbers and sequencing depth
   ([109]Supplementary Table 11). Therefore, the three astrocyte
   populations identified in the putamen also present in the other three
   brain regions. Cell clustering was not driven by experimental batch or
   by individual samples, and the percentage of cells from each sample was
   not statistically different between AD and controls for any of the
   datasets ([110]Extended Data Figs. 2b–[111]e and
   [112]3g,[113]i,[114]k).

   The three human astrocyte subpopulations resembled the Gfap-low,
   disease-associated astrocyte (DAA) and Gfap-high astrocytes identified
   in the mouse hippocampi^[115]14. Gfap-low astrocytes and human Ast-0
   share the marker gene features GPC5 and NRXN1^[116]14 ([117]Fig.
   1f,[118]i). Analysis of DAA and Gfap-high astrocyte marker gene
   expression in the putamen astrocytes and the principal-component
   analysis (PCA) using all distinguishing marker genes ([119]Extended
   Data Fig. 4a–[120]c) indicated that although human and mouse astrocytes
   differ substantially, Ast-1 was more similar to DAA, whereas Ast-2 was
   more similar to the Gfap-high astrocytes. These results demonstrate
   homology among human Ast-0, Ast-1 and Ast-2 astrocytes with murine
   Gfap-low, DAA and Gfap-high astrocytes, respectively.

Three astrocyte subsets represent distinct activation states

   The expression levels of reactive astrocyte markers^[121]15 S100B, VIM,
   MT2A, MT1E, CRYAB and MT1G were significantly higher in putamen Ast-1
   than in the other two cell populations in all three diagnostic groups
   ([122]Extended Data Figs. 4b,[123]d and [124]Supplementary Table 3).
   Meanwhile, GFAP, CD44, C3, SYNM and MAOB were more highly expressed in
   Ast-2 compared to the other two populations in all groups ([125]Fig.
   1i,[126]l and [127]Extended Data Fig. 4d). These observations suggested
   that Ast-1 and Ast-2 both represented activated astrocytes with
   distinct activation states whereas Ast-0 represented homeostatic
   astrocytes. VIM, MT2A and MT1E were also highly expressed in the pfc
   Ast-1 in both AD and controls, suggesting shared astrocyte activation
   features in different brain regions ([128]Extended Data Fig. 4e and
   [129]Supplementary Table 9). There were no consistent expression
   differences between cell clusters across diagnostic groups for murine
   A1-and A2-specific reactive astrocyte marker or signaling pathway genes
   ([130]Extended Data Fig. 4f)^[131]16,[132]17, suggesting that Ast-1 and
   Ast-2 represent astrocyte activation states distinct from A1 or A2
   states.

Regional divergence of the astrocyte transcriptomes

   The identification of homologous astrocyte subpopulations provides an
   opportunity to compare gene expression patterns across different brain
   regions. Some genes had conserved expression patterns between putamen
   and one brain region but not the other ([133]Fig. 2a,[134]b). Gene
   Ontology (GO) and KEGG (Kyoto Encyclopedia of Genes and Genomes)
   pathway enrichment analysis on the conserved cluster marker genes
   revealed unique enrichment of GO terms for each putamen astrocyte
   subpopulation ([135]Extended Data Fig. 4g), suggesting distinct
   functions of each. Pathways including the regulation of apoptotic
   signaling and gliogenesis were shared among Ast-1 from all three brain
   regions ([136]Fig. 2d and [137]Extended Data Fig. 4g), supporting their
   common functionality. However, the majority of top 10 KEGG terms and
   disease-related GO terms were highly enriched for the putamen Ast-1
   marker genes only ([138]Fig. 2c,[139]d), such as multiple
   neurodegenerative disease pathways, amyloid fibril formation, tau
   protein binding, ferroptosis, regulation of inflammatory response and
   neuron death pathways. These pathways include AD and PD risk genes such
   as APOE and PARK7 and genes encoding proteins such as metallothionein
   protein MT3, superoxide dismutase 1 (SOD1) and stress-inducible heat
   shock protein HSP90AB1. These proteins can be secreted extracellularly
   by astrocytes to protect neurons from the toxic effect of Aβ, dopamine
   quinone neurotoxicity or oxidative stress^[140]18–[141]21 suggesting a
   potential neuroprotective role of Ast-1. In summary, our results
   suggest that homologous astrocyte subpopulations from different brain
   regions may share certain functionalities but also have differences
   that may contribute to regional differences in neuronal vulnerability,
   amyloid pathology and tauopathy.

Fig. 2 |. Transcriptomic comparison of astrocyte subpopulations.

   Fig. 2 |
   [142]Open in a new tab

   a,b, Violin plots showing genes with conserved expression patterns in
   the putamen and ec (a) or the putamen and pfc (b) in
   FindConservedMarkers using Wilcoxon rank sum test and metap R package
   with meta-analysis combined P value < 0.05). c,d, KEGG pathway terms
   (c) and disease-related GO terms (d) enriched in the subcluster
   conserved marker genes (false discovery rate (FDR)-adjusted P value <
   0.05, hypergeometric test, ≥ 5 query genes). e–j, Volcano plots showing
   significant DEGs comparing cells from AD (e–g, Ast-0 = 834, Ast-1 =
   553, Ast-2 = 255 cells) or PD (h–j, Ast-0 = 784, Ast-1 = 427, Ast-2 =
   222 cells) with cells from the controls (CTRL, Ast-0 = 683, Ast-1 =
   358, Ast-2 = 161 cells). The x-axis specifies the log fold changes
   (logFCs), and the y-axis specifies the negative logarithm to the base
   10 of the adjusted P values (−log[10](P[adj])). Magenta and cyan dots
   represent genes upregulated and downregulated in disease brains,
   respectively (Wilcoxon rank sum test, FDR-adjusted P value < 0.05 and
   absolute logFC > 0.25 using natural logarithm (ln)). k–m, Violin plots
   showing the expression level distributions of example DEGs of Ast-0
   (k), Ast-1 (l) and Ast-2 (m). n, Violin plots showing F3 gene
   expression in all major cell types in the putamen. o, Representative
   images of RNAScope in situ hybridization analysis of F3 transcript
   expression in the putamen. p, Single-cell F3 in situ hybridization
   signal from four images each for four subjects from each group were
   quantified, AD (n = 863 cells), PD (n = 387 cells) and control (CTRL, n
   = 1,120 cells) using one-way analysis of variance with Tukey’s multiple
   comparisons test, ***P < 0.001, *P < 0.05, AD versus CTRL P value <
   0.001, PD versus CTRL P value < 0.001, PD versus AD P value = 0.016).
   Data are presented as mean values ± standard deviation (s.d.). Minima =
   3.69, maxima = 5.41, mean CTRL = 4.63, AD = 4.38, PD = 4.27. The lower
   and upper hinges correspond to the 25th and 75th percentiles. The
   upper/lower whisker extends from the hinge to the largest/smallest
   value no further than 1.5× interquartile range from the hinge. Down:
   downregulated; Up: upregulated; NotSig: not statistically significant;
   log[10](IntDen + 1): logarithm to the base 10 of the integrated
   density.

Shared astrocytic transcriptomic changes between AD and PD

   To understand the functional significance of astrocyte subpopulations
   in pathogenesis, we compared astrocyte gene expression in disease
   samples to that in controls within each population and detected 124 to
   668 differentially expressed genes (DEGs) ([143]Figs. 2e–[144]m and
   [145]3a and [146]Supplementary Tables 12–[147]17). Interestingly, all
   putamen astrocyte subpopulations had nearly 2/3 or more DEGs
   upregulated in AD and PD, including CRYAB, HSPB1, multiple
   metallothionein (MT) family genes, EGFR and TLR4 ([148]Fig. 2e–[149]m),
   consistent with observed expression increase in patients with AD and
   PD^[150]22–[151]27. Downregulated DEGs include the AD risk gene APOE
   ([152]Fig. 2f,[153]h,[154]i,[155]l), consistent with previous snRNA-seq
   studies^[156]7,[157]10. Tissue factor (F3) expression was significantly
   downregulated in AD and PD as validated using RNAScope in situ
   hybridization and quantification ([158]Fig. 2e–[159]j,[160]m–[161]p).
   Within each disease condition, astrocyte subpopulations shared
   extensive transcriptomic changes (p < 0.01, hypergeometric test) with
   100% concordance in the directions of the gene expression change
   ([162]Extended Data Fig. 5a). Gene expression fold changes of all genes
   in the genome were highly correlated in all pair-wise correlation
   analyses (Pearson’s correlation coefficient r = 0.718 to 0.894,
   FDR-adjusted P value < 0.01; [163]Fig. 3b). Pathway analysis revealed
   concordant changes of multiple pathways across all putamen astrocyte
   subpopulations ([164]Fig. 3c). Together, these results demonstrate
   shared transcriptomic changes among the three astrocyte subpopulations
   in each disease condition.

Fig. 3 |. Regional differences in astrocytic transcriptomic changes in
disease.

   Fig. 3 |
   [165]Open in a new tab

   a, Bar plot showing the number of up- and downregulated differentially
   expressed genes (DEG) in the three astrocyte subpopulations from the
   putamen (pu), ec and pfc (Wilcoxon rank sum test, FDR-adjusted P value
   < 0.05 and logFC > 0.25 using natural logarithm (ln)). Number of
   subjects: AD (n = 4), PD (n = 4) controls (n = 4). b, Heatmap of
   Pearson’s correlation coefficient of genome-wide gene expression logFC
   among the three astrocyte subpopulations from the pu, ec and pfc. The
   color represents the correlation’s directionality, and the shade of
   color represents the significant levels. Only significant correlations
   were plotted (FDR-correlated P value < 0.05). c, Top two biological
   process pathways enriched in the DEGs. d, Neurodegenerative
   disease-related KEGG pathways enriched in the DEGs (hypergeometric
   test, FDR-adjusted P value < 0.05, ≥ 5 query gene). e, Heatmaps showing
   the logFC of significant DEGs for GWAS AD- and PD-risk genes; GWAS
   genes differentially expressed in at least two subpopulations were
   plotted for visualization. Upregulated: upregulated in disease samples.
   Downregulated: downregulated in disease samples.

   Our analysis indicated that transcriptomic changes in astrocytes are
   highly concordant between AD and PD samples. For each astrocyte
   subpopulation, AD and PD samples had significant overlap and concordant
   change of DEGs (P < 0.01, hypergeometric test), a significant
   correlation of genome-wide gene expression level changes (r =
   0.542–0.795, P < 0.001), and common pathways dysregulated in disease
   conditions ([166]Fig. 3b,[167]c and [168]Extended Data Fig. 5b,[169]d).
   Interestingly, upregulated DEGs in both AD and PD astrocytes were
   significantly enriched for multiple neurodegenerative disease pathways
   ([170]Fig. 3d). However, discordant transcriptomic changes between AD
   and PD were also detected, especially for the downregulation of the
   amyloid-beta (Aβ) binding pathway in PD astrocytes ([171]Extended Data
   Fig. 5d). Thus, our results suggest a possible link between astrocytic
   transcriptomic changes to neurodegenerative diseases and reveal common
   and unique dysregulated genes in AD and PD, possibly linking to
   differential Aβ pathology between the two diseases.

Regional differences in astrocytic transcriptomic changes

   Next, we compared astrocytic gene expression changes in disease
   conditions across different brain regions. We detected 181–263 and
   90–123 DEGs for the three ec and pfc astrocyte populations,
   respectively ([172]Fig. 3a and [173]Supplementary Tables 18–[174]23).
   Although numbers of downregulated genes in each brain region were
   comparable, over 90% of DEGs were downregulated for all cortical
   astrocytes (91.1–97.6%), in contrast to fewer than 1/3 in putamen
   astrocytes ([175]Fig. 3a). Additionally, DEGs overlapped appreciably
   among cortical astrocyte subpopulations and between the two brain
   regions but were largely non-overlapping with, or regulated in opposite
   directions to putamen astrocyte DEGs ([176]Extended Data Fig. 5c).
   Furthermore, genome-wide gene expression changes of the cortical
   astrocytes were significantly correlated (r = 0.51 to 0.68,
   FDR-adjusted P value < 0.01) but were not with that of the putamen
   astrocytes (r ≤ 0.14; [177]Fig. 3b). Many pathways were differentially
   regulated between cortical and putamen astrocytes, including multiple
   neurodegenerative disease pathways ([178]Fig. 3c,[179]d). We therefore
   examined AD and PD risk genes^[180]7,[181]28–[182]30 identified by
   genome-wide association studies (GWASs), and found ~10% of these genes
   differentially expressed in at least two clusters ([183]Fig. 3e).
   Expression changes of AD risk genes in the putamen were concordant
   between AD and PD but largely in opposite directions compared with the
   cortex, except for APOE and CLU ([184]Fig. 3e). Many PD risk genes were
   differentially expressed in putamen astrocytes, but few were affected
   in cortical astrocytes. In summary, our results demonstrate that
   cortical astrocytes had similar transcriptomic changes in AD but were
   distinct from putamen astrocytes.

Four distinct immune cell populations in the human brain

   We next investigated immune cell heterogeneity in the putamen and
   identified four subpopulations ([185]Fig. 4a,[186]b). Conserved marker
   genes of T cell cluster included T cell-specific markers BCL11B, CD247
   and SKAP1^[187]31,[188]32, and were enriched for T cell-specific
   functions ([189]Fig. 4c,[190]e). Perivascular macrophages (PVMs) and
   microglia shared macrophage markers CSF1R and C1QB but can be
   distinguished by the expression of PVM-specific markers such as MRC1,
   LYVE1, CD163 and F13A1 ([191]Fig. 4c and [192]Extended Data Fig. 6a)
   and microglia-specific markers such as P2RY12 and
   CX3CR1^[193]33–[194]35. The two microglia subpopulations, referred to
   as Micr-0 and Micr-1 herein, can be distinguished by the unique
   enrichment of AIF1 and APOC1 in Micr-1 cells as validated using
   immunohistochemistry staining of P2RY12 combined with in situ
   hybridization for AIF1 or APOC1 ([195]Fig. 4d,[196]f,[197]g and
   [198]Extended Data Fig. 6d,[199]e). Decreased P2RY12 expression and
   higher AIF1, CD14, FTL and MHC-II gene expression in Micr-1 suggested
   that Micr-0 and Micr-1 represented homeostatic and activated microglia,
   respectively. Consistently, Micr-1 signature genes were enriched for
   pathways associated with activated microglia^[200]36 ([201]Fig. 4e).
   Each cluster contained a similar percentage of cells from all cases of
   all diagnostic groups ([202]Extended Data Fig. 6c). The conservation of
   marker genes in control, AD and PD brains indicated these distinct
   immune cell populations existed independent of disease status.

Fig. 4 |. Four distinct immune cell populations.

   Fig. 4 |
   [203]Open in a new tab

   a,b, UMAP visualization of subclusters of immune cells (total nuclei:
   control 558, AD 827, PD 619) colored by cell cluster (a) or disease
   diagnosis (b). c,d, Violin plots showing the expression level
   distributions of genes for T cell, microglia and PVM shared markers and
   PVM unique markers (c); microglia-specific markers and microglia
   subpopulation markers (d). The color code is the same as in panel a. e,
   Subcluster signaturegene enriched GO terms in the Biological Process
   category (hypergeometric test, FDR-adjusted P value < 0.05, ≥ 5 query
   genes). f,g, Immunohistochemistry staining (brown) of marker protein
   P2RY12 and RNAscope in situ hybridization analysis (red) of AIF1 (f)
   and APOC1 (g) transcript expression in the adjacent tissue sections
   from the putamen tissue of a control, AD or PD brain.
   Hematoxylin-positive cell nuclei are shown in blue. For all data, the
   experiment was performed once. UMAP were generated using default
   parameters except reduction = ’pca’, dims = 1:30. Cell cluster were
   defined using resolution = 0.2. Conserved marker genes were determined
   by FindConservedMarkers using Wilcoxon Rank Sum test and metap R
   package with meta-analysis combined P value < 0.05. Number of subjects:
   AD (n = 4), PD (n = 4) and the controls (n = 4). Scale bars, 100 μm.

   To determine whether immune cell subpopulations identified in the
   striatum existed in other human brain regions, we analyzed the four
   published snRNA-seq data independently^[204]7–[205]10. All four immune
   cell subpopulations were detected in the Mathys-pfc^[206]10 and the
   Feleke-acc^[207]8 data with most of the conserved marker genes shared
   by the immune cells in these two datasets ([208]Extended Data Fig. 7).
   However, some genes were not detected or statically insignificant in
   the Mathys-pfc data likely due to the low cell numbers and the
   sequencing depth ([209]Supplementary Table 11). Both microglia
   subpopulations were also present in the ec and Lau-pfc data. FRMD4A and
   ST6GALNAC3 were identified as the marker genes of Micr-0, whereas
   activated microglia markers such as APOE, HLA-DRA, HLA-DPB1, FTH1 and
   FTL were identified as the marker genes of Micr-1, which were shared
   across all five datasets and among the control, AD, PD, PD dementia and
   dementia with Lewy bodies samples ([210]Extended Data Figs. 6f–[211]k
   and [212]7d,[213]h). These results suggest that the two microglia
   subpopulations identified in the putamen were also present in all the
   other brain regions, irrespective of disease status.

Micr-1 share transcriptomic signatures with murine DAM

   Interestingly, 32 of the 83 Micr-1 conserved marker genes overlapped
   with signature genes of murine DAM^[214]37 (P value = 6.46 × 10^−33,
   hypergeometric test; [215]Fig. 5a and [216]Supplementary Table 24).
   When we compared Micr-1 with homeostatic Micr-0 microglia from AD
   samples, we identified more DAM signature genes and a greater
   significant overlap with murine DAM gene signatures ([217]Fig. 5a; 40
   out of 111 genes, P value = 3.03 × 10^−40, hypergeometric test). The
   upregulation of APOE, B2M and TYROBP in Micr-1 cells compared to
   homeostatic Micr-0 microglia was observed across all diagnostic groups
   ([218]Fig. 5b), which was similar to the reported expression changes
   during murine DAM activation^[219]37. In particular, TYROBP-APOE
   signaling is implicated in the initiation of DAM phenotypes independent
   of TREM2^[220]38, whereas TREM2 is critical for the transition of DAM
   from an intermediate state to a fully activated state^[221]37,[222]39.
   Consistent with this, TREM2 expression was more enriched in Micr-1
   cells in AD and PD brains ([223]Fig. 5b,[224]e), suggesting a possible
   transition from an intermediate activated state in controls to a fully
   activated state in the disease conditions^[225]37. Thus, Micr-1
   represents human activated microglia and shares similar transcriptomic
   changes with murine DAM, suggesting a general microglia activation
   response to central nervous system (CNS) challenge conserved in the
   murine models^[226]37.

Fig. 5 |. Characterization of the human activated microglia.

   Fig. 5 |
   [227]Open in a new tab

   a, Venn diagram demonstrating overlap (hypergeometric test) between
   murine DAM marker genes and the conserved marker genes of human
   activated microglia (Micr-1) of AD, PD and controls (left, P value =
   6.46 × 10^−33) or marker genes of AD-only human activated microglia
   (right, P value = 3.03 × 10^−40). b–d, Violin plots showing the
   expression level distributions of (b) TREM2, APOE, B2M and TYROBP; (c)
   M1- and M2- microglia markers; (d) M1- and M2- microglia regulatory
   transcription factors (TF). e, Immunohistochemistry staining (brown) of
   marker protein P2RY12 and RNAscope in situ hybridization analysis (red)
   of TREM2 transcript expression in the adjacent tissue sections from the
   putamen of an AD and a PD case. For all data, the experiment was
   performed once. Hematoxylin-positive cell nuclei are shown in blue. f,
   Heatmap of Pearson’s correlation coefficient of M1- and M2- microglia
   marker gene expression for Micr-0 and Micr-1 microglia respectively.
   The shade of the color represents the significance levels
   (FDR-correlated P value < 0.05). The color represents the
   directionality of correlation. APOE, B2M and TYROBP were determined to
   be conserved cluster marker for Micr-1 by FindConservedMarkers using
   Wilcoxon rank sum test and metap R package with meta-analysis combined
   P value < 0.05 comparing gene expression in Micr-1 cluster with the
   other cell clusters for AD (n = 4), PD (n = 4), and the controls (n =
   4). Scale bars, 100 μm.

Undescribed activation states of human microglia

   To define the activation states of human microglia, we investigated the
   expression of canonical M1 and M2 marker genes. M1 markers, such as
   IL-18 and CD86, and M2 markers, such as IL4R and TGFB1^[228]40, were
   expressed in both microglia subpopulations at similar levels across all
   diagnostic groups ([229]Fig. 5c). We calculated pairwise expression
   correlation among all known M1 and M2 marker genes^[230]36,[231]40
   expressed in at least 20% of Micr-0 and Micr-1 cells. Most of these
   genes were expressed in the same cells and were positively correlated
   irrespective of being an M1 or M2 marker ([232]Fig. 5f). STAT1, PPARG,
   STAT3 and MEF2C, transcription factors critical for M1 or M2
   polarization^[233]41,[234]42, were expressed at similar levels in both
   populations ([235]Fig. 5d). Therefore, the activation state of human
   microglia could not be distinguished using known markers, suggesting
   that human microglia may have an activation state distinct from what
   had been observed in vitro and in animal models.

Co-expression modules shared by control, AD and PD microglia

   To better understand gene expression dynamics during human microglia
   activation, we reconstructed microglia activation trajectories
   ([236]Fig. 6a). We identified 549, 516 and 437 genes whose expression
   changes were significantly associated with pseudotime progression
   (pseudotime DEG) for control, AD and PD samples, respectively
   ([237]Supplementary Tables 25–[238]27). Interestingly, pseudotime DEGs
   were significantly enriched for AD risk genes^[239]43,[240]44
   ([241]Extended Data Fig. 8a; FDR-adjusted P value < 0.05,
   hypergeometric test), resembling findings in an App knockin mouse
   model^[242]45, with APOC1, APOE, HLA-DRB1, INPP5D and MEF2C shared by
   all diagnostic groups. PD risk genes CHCHD2, FBXO7 and
   PARK7^[243]28,[244]29 also overlapped with pseudotime DEGs. Consensus
   k-means partitioning identified three co-expression modules for each
   diagnostic group ([245]Fig. 6d and [246]Supplementary Tables
   25–[247]27). Module 1 consisted of genes downregulated during microglia
   activation. Modules 2 and 3 were comprised of genes upregulated in
   early and late activated microglia. Many genes were shared among all
   three conditions with highly concordant expression within each module
   and between different conditions ([248]Fig. 6g,[249]j and [250]Extended
   Data Fig. 8b). Top-ranking genes in modules 2 and 3 (APOE, B2M, FTH1,
   FTL and CD74; [251]Fig. 6j) are involved in microglia transition from a
   homeostatic to an activated stage associated with neurodegeneration in
   humans and mice^[252]37,[253]46,[254]47. We defined the set of genes
   that were present in all three conditions in module 1 (26 genes,
   downregulated) or modules 2 + 3 (179 genes, upregulated) as the core
   gene co-expression modules. Upregulated core genes were enriched for
   hallmark pathways of activated microglia ([255]Fig. 6k), confirming
   that our pseudotime analysis captured the true transcriptomic dynamics
   associated with microglia activation.

Fig. 6 |. Transcriptome transition during microglia activation.

   Fig. 6 |
   [256]Open in a new tab

   a–c, UMAP visualization of pseudotemporal trajectory for microglia of
   putamen (a), ec (b) and pfc (c). UMAP were generated using default
   parameters except reduction = ’pca’, dims = 1:10. Cell cluster were
   defined using resolution = 0.15. d–f, Heatmap of the three pseudotime
   DEG co-expression modules in the microglia of putamen (d), ec (e) and
   pfc (f) for each condition. Pseudotime DEGs are genes whose expression
   significantly associated with pseudotime progression (generalized
   additive model, FDR-adjusted P value < 0.05). g–i, Gene expression
   changes along pseudotime trajectory for example genes in module 1, 2 or
   3 in the microglia of putamen (g), ec (h) and pfc (i) for each
   condition. LOESS Regression were performed using loess() function in R
   with 95% confidence intervals plotted. Only statistically significant
   pseudotime DEGs (FDR-adjusted P value < 0.05) were shown. j, Expression
   dynamics along the pseudotime trajectory of pseudotime DEGs shared by
   the control, AD and PD samples in module 1, 2 or 3. k,
   Microglia-activation-related gene ontology terms enriched in the core
   gene-co-expression module genes in module 1 (Mod1) or module 2 and 3
   combined (Mod2+3) (hypergeometric test, FDR-adjusted P value < 0.05, ≥
   5 query genes).

Core modules shared by diverse brain regions and disorders

   We took two different approaches to investigate whether the microglia
   activation-associated core modules were shared by activated microglia
   in other human studies. First, we analyzed pfc microglia and identified
   835 and 819 pseudotime DEGs for control and AD microglia, respectively
   ([257]Fig. 6f and [258]Supplementary Tables 29 and [259]30). AD risk
   genes APOC1, APOE, HLA-DRB1, INPP5D and MEF2C were associated with
   pseudotime progression in both AD and control samples ([260]Fig. 6i),
   similar to their expression changes in putamen microglia. Analysis of
   ec microglia identified only 163 and 56 pseudotime DEGs likely due to
   the limited number of cells ([261]Fig. 6e and [262]Supplementary Tables
   31 and [263]32). However, our analysis was sufficiently powered to
   identify APOE and APOC1 as pseudotime DEGs of modules 2 and 3,
   respectively ([264]Fig. 6h), as in the other two brain regions. One
   hundred and two genes were shared by six out of the seven datasets with
   concordant direction of changes across all datasets detected
   ([265]Extended Data Fig. 8c; P value < 0.001). Second, we retrieved
   pseudotime DEGs reported by Sankowski et al.^[266]48, which examined
   glioma-associated microglia (GAM) from the temporal or frontal lobes of
   glioblastoma multiforme patients. Out of the 545 reported pseudotime
   DEGs 79 were shared by seven out of the eight datasets with concordant
   direction of change across all datasets, including many genes commonly
   upregulated during microglia activation ([267]Fig. 7a). To investigate
   whether the core modules are present in mice, we compared the
   pseudotime DEGs of DAMs^[268]37 and ARMs (activated response
   microglia)^[269]45 from AD mouse models. Only a partial list of ARM
   pseudotime DEGs is available^[270]45, and limited similarities were
   observed between DAMs and ARMs ([271]Extended Data Fig. 8e). However,
   APOE and TYROBP were shared between human and mouse with concordant
   expression changes, and TYROBP-APOE signaling has been implicated in
   the initiation of DAM phenotypes^[272]38. In conclusion, human
   activated microglia, including Micr-1 subpopulations identified in our
   study, the previously identified GAM, and likely other reported
   activated microglia subpopulations, share a core gene co-expression
   module regardless of disease status and brain region. The shared
   TYROBP-APOE signaling also suggests that some core changes are likely
   evolutionarily conserved.

Fig. 7 |. Comparison of genes and pathways of microglia activation-associated
transcriptome changes.

   Fig. 7 |
   [273]Open in a new tab

   a, Pseudotime DEGs (generalized additive model, FDR-adjusted P value <
   0.05) shared by human activated microglia isolated from the putamen of
   cognitively normal controls (CTRL), AD and PD samples, pfc of control
   and AD samples, ec of control and AD samples and temporal or frontal
   lobes (tc) of glioblastoma multiforme samples reported by Sankowski et
   al. b, Cell death-related GO terms enriched in the pseudotime DEGs. c,
   GO terms related to Aβ pathology, tauopathy, and autophagy that were
   enriched in the pseudotime DEGs. d, Top KEGG pathways enriched in the
   pseudotime DEGs. Pathways with FDR-adjusted P value < 0.05
   (hypergeometric test) and at least five query genes were considered
   statistically significant. Up: upregulated; down: downregulated;
   NotDEG: not pseudotime DEG.

Common and disease-only changes during microglia activation

   Next, we compared pathways that were dynamically regulated during
   microglia activation between control, AD and PD, as well as between
   different brain regions. We focused our pathway analysis on the putamen
   and pfc microglia and GAMs as they have sufficient numbers of
   pseudotime DEGs. The most prominent observation was the upregulated
   pathways shared among all conditions and across all brain regions. Many
   of the pathways are known to change during microglia activation in
   animal studies, such as the regulation of the intrinsic apoptotic
   signaling pathway, positive regulation of cytokine production including
   interferon-γ, response to interferon-γ, nuclear factor κB signaling,
   phagocytic capacity and antigen processing and
   presentation^[274]33,[275]49(module 2 + 3; [276]Fig. 7b,[277]d and
   [278]Extended Data Fig. 8d,[279]f). These results suggest shared
   changes in major metabolic states and immune properties during human
   microglia activation across all conditions and different brain regions.

   Interestingly, there were also major differences indicating
   diseasespecific transcriptome changes associated with human microglia
   activation. First, pathways downregulated during microglia activation
   were largely unique to each condition and brain region (module 1;
   [280]Fig. 7d and [281]Extended Data Fig. 8f). Second, some pathways
   were uniquely linked to disease pathology. For example, the
   ‘transcriptional misregulation in cancer’ pathway was specifically
   downregulated in the GAMs, whereas multiple neurodegenerative disease
   pathways were uniquely upregulated in the putamen and pfc microglia
   ([282]Fig. 7d). Third, although the positive regulation of cytokine
   production pathway was shared by all microglia there were significant
   differences in the specific cytokines being regulated in different
   conditions and brain regions ([283]Extended Data Fig. 8d). Lastly, the
   Aβ-binding pathway was upregulated in most datasets ([284]Fig. 7c)
   providing in vivo evidence linking microglia activation and Aβ
   pathology in humans, a phenomenon widely-observed in animal
   studies^[285]50. Interestingly, many more Aβ-related pathways and the
   tau protein binding pathway were uniquely regulated during GAM
   activation ([286]Fig. 7c), matching with the speculated role of GAM in
   reducing the risk of developing AD^[287]51. Whether and how these
   disease-specific transcriptome changes associated with human microglia
   activation could contribute to disease pathogenesis remain to be
   elucidated. The robust core gene signature of microglia activation
   defined in our study will help reveal disease-specific roles of
   microglial activation in various CNS diseases.

Microglia and Aβ pathology, tauopathy and neuron death

   Comparing microglia from AD and PD brains with those of controls
   identified 112–245 DEGs with the majority (65.3–75.4%) being
   downregulated in the diseased brains ([288]Extended Data Fig. 9a and
   [289]Supplementary Tables 33–[290]36). Micr-0 and Micr-1 shared many
   DEGs and their genome-wide transcriptional changes were significantly
   correlated ([291]Extended Data Figs. 9a,[292]b and [293]10a,[294]b).
   Global gene expression changes between AD and PD were highly correlated
   sharing many downregulated DEGs (45.8% and 35.1%) and a few upregulated
   DEGs (7.4% and 8.0%) ([295]Extended Data Fig. 10a,[296]c). Pathway
   analyses revealed many disease-related changes in AD and PD microglia.
   For example, the Aβ binding pathway was downregulated in both diseases
   including MSR1 and CST3 ([297]Extended Data Fig. 9a,[298]b,[299]e). In
   contrast, multiple protein folding pathway components including genes
   encoding stress-inducible heat shock protein HSP90AA1 and HSP90AB1 and
   their co-chaperone, FKBP4, were uniquely upregulated in PD microglia
   ([300]Extended Data Figs. 9d and [301]10d). Hsp90 regulates tauopathy
   through co-chaperone complexes, and overexpression of FKBP4 prevents
   the accumulation of tau^[302]52. Therefore, our data suggest that
   microglia may play a beneficial role in preventing tau pathology,
   consistent with the less severe tauopathy in patients with PD.
   Additionally, multiple neuronal death pathways were downregulated in AD
   ([303]Extended Data Fig. 9e). Gene expression analysis of GWAS AD and
   PD risk genes demonstrated largely distinct up-regulation of disease
   risk genes between AD and PD microglia ([304]Extended Data Fig. 9f).

   We detected 14–56 DEGs in the ec and pfc microglia subpopulations
   ([305]Extended Data Fig. 10e), which was much fewer than that of the
   putamen microglia. Nonetheless, gene expression analysis of GWAS AD and
   PD risk genes demonstrated largely distinct regulation of disease risk
   genes between AD and PD microglia and between cortical and subcortical
   brain regions ([306]Extended Data Fig. 9f). Some pathways were
   regulated in the opposite direction between the putamen and cortical
   microglia ([307]Extended Data Fig. 10d). Our results suggest
   significant regional differences in microglial responses which may
   contribute to differential regional vulnerability.

Neuronal diversity of the human putamen

   The human striatum neuronal heterogeneity has not been characterized at
   the single-cell level. We identified 13 transcriptionally distinct
   neuronal populations through subclustering analysis: eight MSN and five
   interneuron clusters ([308]Fig. 8a and [309]Supplementary Tables
   38–[310]50), revealing a greater MSN heterogeneity than previously
   suggested in the mouse and the non-human primate (NHP) striata. The
   expression of PPP1R1B and MEIS2, MSN marker genes^[311]53,[312]54,
   versus ELAVL2^[313]55, broadly distinguished the MSNs from the
   interneurons ([314]Fig. 8b and [315]Supplementary Fig. 1a).
   Conventionally, the co-expression of dopamine receptor DRD1 with
   neuropeptide TAC1 versus DRD2 with proenkephalin PENK has been used to
   delineate D1 direct-pathway neurons from D2 indirect-pathway neurons,
   respectively^[316]11,[317]56. Four MSN subpopulations displayed
   conventional combinatorial expressions patterns, whereas the other four
   subpopulations had distinct combinatorial patterns. The co-expression
   of DRD1 with TAC1 by ‘mD1’ and ‘pD1’ neurons is consistent with their
   D1 neuron identity. The expression of STXBP6 and EPHA4 in mD1 neurons
   indicates their matrix compartment localization, whereas the expression
   of KCNIP1, BACH2 and the D1-patch– specific marker PDYN in pD1 neurons
   indicates patch compartment localization ([318]Fig. 8c and
   [319]Supplementary Fig. 1a)^[320]11,[321]57. In contrast, ‘ncD1’
   neurons expressed DRD1 but had a very low levels of TAC1 expression
   (thus named ‘non-canonical D1’). Moreover, ncD1 expressed multiple
   unique marker genes, such as RXFP2, FAM210B and MPP3, supporting its
   unique identity ([322]Fig. 8b and [323]Supplementary Fig. 1a).
   Similarly, the co-expression of DRD2 with PENK by ‘mD2’ and ‘pD2’
   neurons is consistent with their D2 neuron identity, and the
   matrix/patch marker genes indicate matrix versus patch localization of
   mD2 and pD2 respectively ([324]Fig. 8b,[325]c). ‘ncD2,’ the
   non-canonical D2 MSN, expressed DRD2 but had a low expression level of
   PENK. Instead, ncD2 exhibited aberrantly high levels of TAC1, the
   canonical D1 MSN marker gene ([326]Fig. 8b and [327]Supplementary Fig.
   1a). Furthermore, ncD2 expressed multiple unique marker genes, such as
   POU6F2, LGR5 and FHAD1, supporting its unique identity. The third type
   of non-canonical MSN exhibited expression of both dopamine receptor
   genes and both TAC1 and PENK, named hybrid MSN (hMSN) herein. hMSN
   expressed unique conserved marker genes RXFP1, MKX and JAG1, indicating
   it is a distinct neuronal subtype rather than an artifact of doublets
   ([328]Fig. 8b and [329]Supplementary Fig. 1a). The D1/D2 ‘hybrid’ MSN
   has been reported in NHP striatum^[330]11 and human nucleus
   accumbens^[331]12, with the same co-expression of D1 and D2 markers and
   unique RXFP1 expression. In contrast to these undescribed MSN subtypes
   having their own unique conserved marker genes, cluster ‘sMSN’ did not
   have any well-characterized marker genes uniquely expressed in the
   cluster. Instead, the conserved marker genes of this cluster had unique
   enrichment for many stress- and unfolded protein response-related
   pathways ([332]Fig. 8e,[333]f). We thus referred to this cluster as
   stressed MSN (sMSN).

Fig. 8 |. Characterization of neuronal subpopulations and gene expression
changes in disease conditions.

   Fig. 8 |
   [334]Open in a new tab

   a, UMAP visualization of neuron subpopulations colored by cluster
   identity. b, Dot plot of neuronal subpopulation conserved marker gene
   expression (FindConservedMarkers using Wilcoxon rank sum test and metap
   R package with meta-analysis combined P value < 0.05). c, Violin plot
   showing the expression of matrix- and patch-compartment marker gene
   expression in mD1, pD1, mD2 and pD2 neurons (FindConservedMarkers using
   Wilcoxon rank sum test and metap R package with meta-analysis combined
   P value < 0.05). d–f, GO terms enriched in the conserved cluster marker
   genes of each neuronal subpopulation related to cell death (d), stress
   response (e), amyloid and tau metabolism and unfolded protein response
   pathways (f). g, Neurodegeneration and addiction-related KEGG pathway
   terms enriched in the conserved cluster marker genes of each neuronal
   subpopulation. h, UMAP visualization of MSN neuron subpopulations with
   sMSN colored by cells expressing only DRD1 (left), only DRD2 (middle)
   or both DRD1 and DRD2 (right). i, Top five GO terms in the Biological
   Process category enriched in the DEGs of each neuronal subpopulation in
   AD and PD. Only cell subpopulations with enriched GO terms are shown.
   Pathways with FDR-adjusted P value < 0.05 (hypergeometric test) and at
   least five query genes were considered statistically significant.

   Five interneuron subtypes were identified, accounting for 9.32% of all
   neurons in the precommissural putamen, as reported^[335]58. The
   ‘in.PDGFD’ neurons had specific and robust expression of PDGFD, OPN3
   and PTHLH ([336]Fig. 8b and [337]Supplementary Fig. 1b), similar to the
   co-expression of Opn3 with Pthlh in mouse striatal
   interneurons^[338]59. Cluster ‘in.CALB2’ co-expressed calretinin
   (CALB2) and TAC3. Striatal CALB2^+ and TAC3^+ interneurons have been
   described in rodents, NHP^[339]59 and human striata^[340]11,[341]61.
   The ‘in.TH’ neurons robustly expressed tyrosine hydroxylase (TH),
   dopamine reuptake transporter SLC6A3, vesicular monoamine transporter-2
   SLC18A2, dopaminergic neuron-specific transcription factor BNC2 and
   ALDH1A1 (ref.^[342]61), which defines a dopaminergic neuron
   subpopulation in the substantia nigra pars compacta, suggesting their
   dopaminergic identity. The ‘in.SST’ neurons expressed SST, NPY and NOS1
   as described in the mammalian striatum^[343]11,[344]59,[345]60. The
   ‘in.SLC5A7’ cluster expressed the high-affinity choline uptake
   transporter SLC5A7, choline acetyltransferase CHAT and the key
   cholinergic neuronal function regulator LHX8, which are characteristic
   features of cholinergic neurons^[346]59.

Neuronal transcriptome and selective neuronal vulnerability

   Pathway enrichment analysis on the conserved marker genes in each
   cluster found extensively shared pathways across different neuronal
   subpopulations related to neuronal functions ([347]Supplementary Fig.
   1c). Many neuronal clusters were uniquely enriched for specific
   pathways ([348]Supplementary Fig. 1d), suggesting they represent
   neurons with distinct functions. Multiple neuronal death,
   neurodegenerative disease and amyloid- and tau-related pathways were
   selectively enriched in the hMSN, sMSN, in.PDGFD, in.CALB2 and in.SST
   neurons ([349]Fig. 8d,[350]f,[351]g), suggesting a possible link to
   selective neuronal vulnerability in the putamen. The enrichment of
   multiple addiction pathways across many neuronal clusters is consistent
   with the known involvement of the striatum in addictive
   behavior^[352]63.

   Analysis of MSN subtype marker gene expression revealed that sMSN
   represents a heterogenous population composed of both D1 and D2 MSNs,
   as well as a population of unknown identity. Of its 755 MSNs, 42.5%
   express DRD1 receptors only, 36.8% express DRD2 receptors only, and
   13.5% co-expressed DRD1 and DRD2 but lacked any of the MSN
   subtype-specific markers. The conserved marker genes were uniquely
   enriched for multiple stress-related pathways, amyloid and tau protein
   binding, response to unfolded protein binding, chemical
   carcinogenesis-reactive oxygen species and negative regulation of
   endoplasmic reticulum stress-induced intrinsic apoptotic signaling
   pathway ([353]Fig. 8d–[354]g), suggesting that sMSN may represent MSNs
   that had responded to various types of stress and altered their
   original cellular identity or function. Interestingly, HPCA, PSAP and
   DKK3 are conserved marker genes overexpressed in sMSN ([355]Fig. 8b and
   [356]Supplementary Fig. 1a), all of which have neuroprotection function
   against various types of stress such as endoplasmic reticulum
   stress-induced neurodegeneration, heat stress, Aβ susceptibility and
   oxidative stress^[357]64–[358]67. These results suggest that this
   heterogenous group of MSNs shares the same stress responses
   irrespective of disease.

   To understand neuronal damage in AD and PD, we compared the
   transcriptomic responses of each neuronal cluster between disease and
   control conditions and detected 23–652 DEGs per cluster
   ([359]Supplementary Fig. 1e and [360]Supplementary Tables 51–[361]69).
   AD neurons generally had more upregulated genes, whereas PD neurons had
   more downregulated genes. Pathway analyses showed broad disturbance of
   pathways critical to normal neuronal function in MSNs, but not in
   interneurons ([362]Fig. 8i). Regulations of long-term synaptic
   potentiation, synaptic plasticity, neurotransmitter transport and
   learning or memory pathways were uniquely downregulated in D1 neurons
   from PD samples ([363]Fig. 8i). This finding is consistent with
   previous findings that long-term synaptic potentiation is selectively
   impaired in dSPNs (D1 neurons) in the parkinsonian
   state^[364]68,[365]69, which is a critical cellular and circuit
   mechanism associated with PD pathophysiology. Interestingly, pD2 and
   hMSN from AD samples were enriched for the APP metabolic process
   pathway, signifying a differential predisposition to Aβ-related
   pathology in these neurons.

Discussion

   Astrocytes and microglia exert many essential actions that are crucial
   for neuronal survival and function in healthy CNS tissues. Mounting
   evidence demonstrates how glial cells play a key role in
   neurodegenerative disease^[366]33,[367]44,[368]70. However, most
   current knowledge is derived from in vitro systems or animal models
   whose relevance to human disease remains under debate^[369]71. Using
   unbiased snRNA-seq technology, we identified three astrocyte and two
   microglia subpopulations in human putamen that are conserved across
   multiple brain regions and different disease conditions, as well as
   between humans and mice. Importantly, our study revealed common
   microglia activation-associated genes and pathways shared by
   cognitively normal controls and diverse human disease conditions such
   as AD, PD and glioblastoma multiforme, irrespective of the brain region
   being studied. These analyses also revealed diseasespecific
   microglia-activation-associated transcriptomic changes linking human
   microglia activation to Aβ pathology, neuroinflammation and
   neurodegeneration. Our study discovered regional differences in
   pathology-associated transcriptomic changes in astrocytes and
   microglia, which may underlie selective regional vulnerability of the
   cortex and striatum. Finally, we described striatal MSN heterogeneity
   and neuronal transcriptomic profiles indicating disease-specific
   changes and selective neuronal vulnerability.

   Our analyses revealed regionally distinct astrocytic transcriptomic
   changes relevant to neurodegenerative diseases, including differences
   related to Aβ pathology, neurodegeneration, neuroinflammation and
   synapse organization. DEG overlap examination, genome-wide gene
   expression change correlation analysis, pathway analysis and GWAS AD
   and PD risk gene expression analysis together provide strong support
   for distinct transcriptomic changes in AD between putamen and cortical
   astrocytes. Although cortical astrocytes were examined at a lower
   sequencing depth than the putamen astrocytes ([370]Supplementary Table
   11), the directionality of gene expression changes, the proportion of
   genes being dysregulated in the disease condition, the genome-wide gene
   expression change correlation analysis and the GWAS risk gene analyses
   results are independent of sequencing depth supporting the biological
   relevance of the findings rather than being sequencing artifacts.
   Regional heterogeneity in glia response to aging and injury has been
   described in mice and humans^[371]72. It is well documented that
   cortical amyloid plaques appear early in the disease and present in
   many nondemented older adults, whereas striatal plaques occur only at
   later histopathological stages of AD, and largely after dementia
   onset^[372]4–[373]6. The observed region-specific astrocytic and
   microglial disease-pathologyrelated transcriptomic alterations might
   contribute to these spatial variations in disease pathology and
   neuronal vulnerability.

   Previous comparison of signature genes of reactive microglial
   populations revealed a heterogenous microglial response to AD pathology
   in mice and humans^[374]73. By comparing pseudotime DEGs, we observed a
   high concordance of a set of microglia activation-associated genes
   shared by diverse human disease conditions and brain regions ([375]Fig.
   7a), supporting the hypothesis that reactive microglia share a common
   microglial response to CNS pathology, irrespective of the disease
   etiology^[376]37,[377]74. The number of cells, sequencing depth,
   temporal dynamic nature of gene transcription regulation and the end
   stage at sampling all could contribute to the discrepancy of signature
   genes, whereas pseudotime DEGs are less affected by those factors,
   which may explain the differences between the two comparison methods.
   APOE and TYROBP were shared between human and mice with concordant
   expression changes in all 10 datasets (probability of 10^−6), and
   TYROBP-APOE signaling has been implicated in the initiation of DAM
   phenotypes^[378]38 suggesting evolutionally conserved microglial
   activation mechanism. We also discovered that microglia of varying
   disease conditions elicit a disease-specific response relevant to
   disease pathology ([379]Fig. 7d). The robust core gene signature
   identified in this study will help to define disease-specific roles of
   microglial activation in various CNS diseases.

   Our study reveals shared astrocytic transcriptome changes between AD
   and PD but largely distinct changes for microglia, suggesting common
   and divergent cell-type-specific mechanisms of pathogenesis.
   Interestingly, the ‘amyloid precursor protein metabolic process’ was
   uniquely upregulated in AD MSNs, whereas ‘long-term synaptic
   potentiation’ was uniquely downregulated in PD neurons. Furthermore,
   multiple neurodegenerative disease pathways were dysregulated in
   astrocytes, but not in neurons. Whether disease-specific glial
   transcriptomics changes are causal to disease-specific neuronal changes
   needs future investigation. A better understanding of how neurons and
   glia communicate as well as whether and how those pathways in
   corresponding glial cells contribute to disease pathogenesis may offer
   new ways to combat neurodegenerative diseases.

Limitations of the study

   The sample size of our study is limited. However, using the same
   approaches, we validated our findings in four independent datasets for
   both astrocytes and microglia subpopulations, suggesting that our
   findings are representative. All snRNA-seq studies used samples from
   postmortem brain tissues and shared the limitations associated with all
   postmortem tissue studies.

Methods

Subjects

   In agreement with local ethical committee requirements, patients
   provided written informed consent before cognitive impairment, or the
   next of kinsperson supplied consent antemortem or postmortem
   (Washington University Institutional Review Board, Washington
   University School of Medicine, St Louis, MO). Clinically and
   neuropathologically well-defined human brain tissues were collected
   from the Charles F. and Joanne Knight Alzheimer Disease Research Center
   (Knight ADRC) and The Movement Disorders Center (MDC) Brain Bank at
   Washington University School of Medicine. (The clinical information and
   pathological characteristics are summarized in [380]Supplementary Table
   1.) The use of tissue for genetics, autoradiography and biochemistry
   research was approved by the Knight ADRC and MDC Leadership Committees
   (ethics approval reference number T1705). Dementia level was assessed
   by Clinical Dementia Rating (CDR)^[381]75 according to the CDR criteria
   for diagnosing dementia in PD. Individuals with a CDR ≥ 1 were taken.
   AD pathological changes were classified using Braak staging^[382]76.
   Braak stages of Aβ accumulation use letter ratings: (A) the initial
   deposits in the basal neocortex, (B) deposits that extend into the
   adjacent areas of the neocortex and (C) heavy deposition throughout the
   entire cortex. Stages of neurofibrillary pathology were characterized
   as transentorhinal (I-II), limbic (III-IV) and neocortical (V and VI).
   All the cases were previously comprehensively examined by the MDC and
   Knight ADRC neuropathology core for their pathologic stages for Aβ
   (plaque), tau (NFT) and α-synucleinopathy (Lewy body) before being
   released to research laboratories. The systematic neuropathological
   characterization and AD grading were conducted and previously reported
   by our team^[383]77, and α-synucleinopathy with Lewy bodies was rated
   according to the scheme proposed by McKeith et al.^[384]78 and modified
   by Burack et al.^[385]77. Histologic stains include hematoxylin and
   eosin. Pathological staining was performed using the related antibodies
   Aβ (10D5, Elan Pharmaceuticals), tau (PHF-1, Abcam) and α-synuclein
   (LB- 509, Zymed), as we have reported^[386]79–[387]83. The selected
   cases had highly similar pathological profiles for each group in this
   study. The AD cases had Braak Aβ stage of C (n = 4), NFT stage V (n =
   1) and VI (n = 3) and Lewy body stage of 0 (n = 4). The PD cases had
   Braak Aβ stage of C (n = 2), 0 (n = 1) and B (n = 1); NFT stage I (n =
   2), II (n = 1), and III (n = 1); and Lewy body stage of 6 (n = 4). The
   cognitively healthy control cases had Braak Aβ stage of 0 (n = 4), NFT
   stage I (n = 1), II (n = 1) and III (n = 2), and Lewy body stage of 0
   (n = 4). Although there are unavoidable pathological overlaps in Aβ and
   NFT in these aged patients between the three groups, the AD group had
   higher average stages of Aβ and NFT than the control and PD groups. In
   addition, the PD group had a higher Lewy body stage than AD and
   controls. The average age and postmortem interval time did not
   significantly differ across groups. The tissues harvested were as
   follows: four AD (two males, two females) aged 71–82 (mean: 75 ± 2)
   years at death, four PD (two males, two females) aged 69–77 (mean: 73 ±
   2) years at death, and four age-matched healthy control cases (two
   males, two females) aged 73–85 (mean: 79 ± 2) years at death. No
   statistical methods were used to predetermine sample sizes, but our
   sample sizes are similar to those reported in previous
   publications^[388]7,[389]8.

Sample processing and nuclei isolation

   Brains were obtained at the time of the autopsy. The right hemisphere
   was coronally sectioned and snap-frozen using liquid nitrogen vapor.
   The tissue blocks were preserved at −80 °C until use. The putamen
   tissue (~50 mg) was carefully dissected out with a scalpel at −20 °C
   using the autoradiography images of dopaminergic
   biomarkers^[390]80–[391]84, and the tissue was then homogenized using a
   glass Dounce grinder in 4 ml ice-cold homogenization buffer (HB;
   consisting of 0.1 mM DTT (Promega, P1171), 1X Protease inhibitor
   cocktail (Promega, G6521), 0.2 U μl^−1 RNasin Plus RNase Inhibitor
   (Promega, N2615), and 0.1% Triton X-100 (Sigma, T8787) in nuclei
   isolation media (consisting of 10 mM Tris buffer pH 8.0 (ThermoFisher
   Scientific, AM9856), 250 mM sucrose (VWR 97061), 25 mM KCl
   (ThermoFisher Scientific, AM9640G) and 5 mM MgCl[2] (ThermoFisher
   Scientific, AM9530G) in molecular biology grade water)). An additional
   2 ml HB was added to the solution, and it was incubated on ice for five
   min. Then, the homogenized solutions were passed through a 70 μm cell
   strainer first and then filtered for an additional time with a 30 μm
   cell strainer. The double-strained homogenate was centrifuged (900 g,
   for 10 min at 4 °C), and the supernatant was removed. The remaining
   nuclei pellet was resuspended in 3 ml blocking buffer (BB; consisting
   of 1X PBS (Life Technologies, AM9625), 1% BSA (Sigma, 126625), and 0.2
   U μl^−1 RNasin Plus RNase Inhibitor (Promega, N2615) in molecular
   biology grade water). Then, 30 μl myelin remove beads was added to the
   nuclei suspension, and the solution was incubated at 4 °C for 15 min
   after resuspension. After incubation, an additional 3 ml BB was added
   to the solution, which was centrifuged (500 g, for five min at 4 °C),
   and then the supernatant was removed. Then, the resulting pellet was
   resuspended in 3 ml BB and incubated on a Dynamag magnet (ThermoFisher
   Scientific, 12301D) for 15 min at 4 °C refrigerator. Finally, the
   supernatant was removed and filtered through a 30 μm cell strainer. For
   accurate quantification, the nuclei-enriched supernatant was stained
   with DAPI (1:1,000) (ThermoFisher Scientific, D1306), and the nuclei
   were counted with a Countess II Automated Cell Counter (ThermoFisher
   Scientific, AMQAX1000).

Library construction and sequencing

   Purified nuclei were delivered to the McDonnell Genome Institute (MGI)
   at Washington University School of Medicine to generate 10x Genomics
   libraries using Chromium Single Cell 3′ V3 Reagent Kits according to
   the 10x Genomics protocol. The generated libraries were sequenced on
   the NovaSeq S4 platform (Illumina). Sequencing saturation ranged from
   60.8% to 94.8%. MGI demultiplexed raw base sequence calls generated
   from the sequencer into sample-specific FASTQ files.

Mapping snRNA-seq data to the reference genome and cell quality control

   CellRanger 3.0.2 (10x Genomics, [392]https://support.10xgenomics.com/)
   was used to align FASTQ files to the human GRCh38 pre-mRNA reference
   genome. The aligned reads were traced back to individual cells, and the
   gene expression level of individual genes was quantified based on the
   number of UMIs (unique molecular indices) detected in each cell. The
   filtered gene-cell barcode matrices generated with CellRanger were used
   for further analysis with the R package Seurat v3.0 (ref.^[393]13),
   4.0.5 (ref.^[394]85). R version 4.0.2, 4.0.4, 4.1.1 was used for
   statistical analysis and plotting (R Core Team (2013). R: A language
   and environment for statistical computing. R Foundation for Statistical
   Computing, Vienna, Austria; [395]http://www.R-project.org/).

   Quality control was implemented as the first step in data analysis. We
   first filtered out genes that were detected in less than five nuclei.
   Nuclei that were doublets or low quality were further filtered out by
   two criteria. First, nuclei with less than 500 genes, more than 10%
   mitochondrial content or an extremely high number of detected genes or
   UMIs were filtered out. Cutoffs for UMI and gene number were determined
   on the basis of a scatter plot showing the number of genes as a
   function of the number of UMI per cell. A cutoff of 500–70,000 UMI and
   500–9,000 genes was applied. Next, after unsupervised clustering, cell
   clusters with a mixed expression of markers from different cell types
   and clusters with low quality were removed, including clusters with a
   high percentage of mitochondrial genes and clusters without marker
   genes conserved across control, AD and PD groups. The initial dataset
   contained 38,929 cells. After these quality control procedures, we
   obtained 30,908 high-quality single-nucleus gene expression profiles
   (an average of 2,576 cells/subject) and detected a median of 2,187
   genes and 4,363 transcripts per nucleus.

Cell clustering and cell-type identification

   For integrative analysis, we followed the workflow described in the
   Seurat guided analysis for ‘Performing integration on datasets
   normalized with SCTransform^[396]86‘. We use SCTransform to normalize
   gene expression levels and to regress out variations from mitochondrial
   gene expressions. To integrate the single-cell data from individual
   donor samples, we used function SelectIntegrationFeatures (nfeatures =
   3,000) to identify highly variable genes. Functions PrepSCTIntegration,
   FindIntegrationAnchors (normalization.method = ’SCT’) and IntegrateData
   (normalization.method = ’SCT’) from Seurat v3.0 were implemented. The
   top 3,000 most variable genes were selected as integration features and
   used for integration anchor selection. Principal component analysis was
   performed using the top 30 PCAs. UMAP analysis was performed with the
   top 20 dimensions. Clusters were identified with the functions
   FindNeighbors (dims = 1:20) and FindClusters (resolution = 0.1). A
   resolution of 0.1 was selected for the downstream analysis because
   clusters were clearly separated and matched visual inspection. Default
   parameters were used unless noted.

Cell-type identification

   The function FindConservedMarkers (assay = ’SCT’, slot = ’data’, test.
   use = ’wilcox’, min.pct = 0.2, grouping.var = ’Genotype’, only.pos =
   TRUE) implemented in Seurat v3.0 was used for identifying marker genes
   that were conserved in the control, AD and PD groups. This function
   first calculates differentially expressed genes of each cluster against
   all other clusters for each condition using the function FindMarkers,
   and then the metap R package with default meta-analysis method
   metap::minimump to perform meta-analysis of P values (significance
   values) to generate a combined P value. max_pval < 0.05 was used as a
   cutoff to determine the conserved marker genes for each cluster,
   identifying positive markers for a given cluster that are shared by
   control, AD and PD. Identified markers were compared with
   celltype-specific markers from mouse striatum^[397]87 and human
   brains^[398]7–[399]10,[400]88. Cell type was manually annotated based
   on the expression for the following known marker genes: astrocytes
   (GFAP, AQP4, SLC1A2, ALDH1L1, GJA1, SLC1A3), endothelial cells and
   pericytes (FLT1, CLDN5, RGS5, PDG-FRB), immune cells and microglia
   (PTPRC, C1QB, CSF1R, CD74, CX3CR1, P2RY12, HLA-DRA, ITGAM, RUNX1),
   neurons (SYT1, SNAP25, RBFOX1, RBFOX3, GRIK2, GRIA1, GRIN2B, GAD1,
   GAD2, GRIN1), oligodendrocytes (MOG, MBP, MOBP, PLP1, CLDN11, SOX10,
   OLIG1, OLIG2) and oligodendrocyte precursors (VCAN, PCDH15, MEGF11,
   SOX10, OLIG1, OLIG2).

Astrocyte subcluster analysis

   We first isolated nuclei of the astrocyte cluster (control = 1,203, AD
   = 1,642, PD = 1,433 nuclei) from the original Seurat object using the
   subset function. The data were split into individual samples based on
   the subject identity. Then we performed data integration and
   unsupervised clustering following the procedure similar to that used
   for our initial cell-type clustering using default parameters except
   noted below: SCTransform^[401]86 normalization (vars.to.regress =
   c(‘nCount_ RNA’, ‘percent.mt’), SelectIntegrationFeatures (nfeatures =
   3,000), PrepSCTIntegration(anchor.features = selected.features),
   FindIntegrationAnchors (normalization.method = ’SCT’, anchor.features =
   selected.features, normalization.method = ’SCT’, reduction = ’cca’,
   k.filter = 170), IntegrateData (anchorset = selected.anchors,
   normalization.method = ’SCT’), RunPCA(npcs = 30). Additional
   low-quality cell clusters, including cell clusters that showed mixed
   expression of markers of astrocyte with markers from other cell types,
   which likely represent doublets, and cell clusters containing a high
   percentage of reads mapped to mitochondrial genes, were further
   filtered. To measure the effect of parameters on clustering results and
   determine the most stable cell population structure, a total of 48
   different combinations of parameters for dimensionality (5, 10, 15, 20,
   25, 30) and resolution (0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.35, 0.4)
   were used to perform cell clustering. The concordance of cluster
   identity was measured using the adjusted Rand index (ARI) calculated
   using the adjustedRandIndex function implemented in the mclust R
   package. To measure the concordance of cell identity between using all
   cells (cells from control, AD and PD cases) for clustering and using
   only cells from AD and controls for clustering, we used different
   combinations of parameters to perform cell clustering in each
   situation, and ARI for shared cells were calculated. The clustering
   results were stable across a wide range of parameter combinations. With
   a dimensionality of 15 and resolution of 0.25, the parameters resulted
   in the highest ARI (0.96) between the two situations and were used for
   all the downstream analyses for astrocytes. The same approach and
   parameters were used for analyzing the Grubman et al.^[402]7, Lau et
   al.^[403]9 and Feleke et al.^[404]8 datasets whereas different
   parameters were used for Mathys et al.^[405]10 data (dimensionality =
   30, resolution = 0.5).

Immune cell subcluster analysis

   We first isolated nuclei of the immune cell cluster (control = 558, AD
   = 827, PD = 619 nuclei) from the original Seurat object using the
   subset function. The data were split into individual samples based on
   the subject identity, and we performed data integration and
   unsupervised clustering following the procedure similar to that used
   for our initial cell-type clustering using default parameters except
   noted below. Due to the small number of cells in one subject, some
   parameters were adjusted. SCTransform normalization (vars.to.regress =
   c(‘nCount_ RNA’, ‘percent.mt’), SelectIntegrationFeatures (nfeatures =
   3000), PrepSCTIntegration(anchor.features = selected.features),
   FindIntegrationAnchors (normalization.method = ’SCT’, anchor.features =
   selected. features, normalization.method = ’SCT’, dims = 1:20, k.anchor
   = 5, k.filter = 20, k.score = 20, max.features = 200), IntegrateData
   (anchorset = selected.anchors, normalization.method = ’SCT’),
   RunPCA(npcs = 30). Cell clusters were defined using resolution = 0.2.
   To perform microglia pseudotime analysis, we isolated microglia nuclei
   using the subset function and determined the best parameters for
   obtaining the most stable microglia subpopulation structure as
   described above for the astrocyte subpopulation analysis. A
   dimensionality of 10 and resolution of 0.15 were selected for our
   microglia data analyses, and for the Grubman et al.^[406]7 and Lau et
   al.^[407]9 datasets whereas different parameters were used for the
   Feleke et al.^[408]8 (dimensionality = 20, resolution = 0.15) and the
   Mathys et al.^[409]10 (dimensionality = 40, resolution = 0.5) data.

Neuronal cell subcluster analysis

   We first isolated the nuclei of the neuronal clusters (control = 2259,
   AD = 3343, PD = 3286 nuclei) from the original Seurat object using the
   subset function. The data were split into individual samples based on
   the subject identity, Then we performed data integration and
   unsupervised clustering following the procedure similar to that used
   for our initial cell-type clustering using default parameters except
   noted below. SCTransform normalization (vars.to.regress =
   c(‘nCount_RNA’, ‘percent.mt’)), SelectIntegrationFeatures (nfeatures =
   3,000), PrepSCTIntegration(anchor.features = selected.features),
   FindIntegrationAnchors (normalization.method = ’SCT’, anchor. features
   = selected.features, normalization.method = ’SCT’, dims = 1:30),
   IntegrateData (anchorset = selected.anchors, normalization. method =
   ’SCT’), RunPCA(npcs = 30). Additional low-quality cell clusters,
   including cell clusters that showed mixed expression of markers of
   neurons with markers from other cell types, which likely represent
   doublets, and cell clusters that containing a high percentage of reads
   that mapped to mitochondrial genes, were further filtered. Cell
   clusters were defined using a dimensionality of 20 and a resolution of
   0.3. Cluster ‘ncD1,’ a cluster discretely separate from the other
   clusters but clustered with ‘sMSN,’ was manually selected using the
   CellSelector function of the Seurat R package. Conserved marker
   analysis of the cluster confirmed it was transcriptionally distinct,
   with its unique marker genes. The neuronal subtypes were manually
   annotated using the following markers: MSNs (PPP1R1B, MEIS2),
   interneurons (ELAVL2, PDGFD, OPN3, CALB2, TAC3, TH, SLC6A3, SST, NPY,
   SLC5A7, LHX8), D1 and D2 (DRD1, DRD2), classical D1 (TAC1), classical
   D2 (PENK), matrix MSNs (STXBP6, EPHA4) and patch MSNs (KCNIP1, BACH2).

Analysis of gene differential expression

   FindConservedMarkers function (assay = ’SCT’, slot = ’data’, test. use
   = ’wilcox’, min.pct = 0.2) was used to determine statistically
   significant cluster marker genes that were conserved in the control, AD
   and PD cases. A gene with meta-analysis combined P value < 0.05
   determined by the default function metap was determined to be
   statistically significant. FindMarkers function (assay = ’SCT’, slot =
   ’data’, test.use = ’wilcox’, min. pct = 0.2) was used to determine
   differentially expressed genes (DEGs) in disease conditions compared to
   the controls. We used base = exp(1) (the default parameter for
   calculating fold change in the DEG analysis implemented in Seurat v3.0)
   for all the comparisons to keep the analysis consistent within the
   manuscript. A gene with a Benjamini–Hochberg (FDR) adjusted P value <
   0.05 and a natural logarithm of fold change > 0.25 or < −0.25 was
   determined to be statistically significant. An absolute value of logFC
   > 0.25 (natural logarithm of fold change, Seurat v3.0 default
   parameter) is equivalent to 1.28-fold.

Gene set enrichment analysis and comparison with previously published
microglia-activation-associated pseudotime DEGs

   GO and KEGG^[410]89 pathway enrichment analyses were performed using
   the R package clusterProfiler v3.16.1 (ref.^[411]90). Results with
   FDR-corrected P value < 0.05 and at least five query genes were
   reported as significantly enriched pathways. We performed GO term
   enrichment analysis under the following three sub-ontologies:
   biological process, molecular function and cellular component. Gene
   signatures of DAM were obtained from Keren-Shaul et al.^[412]37. Gene
   signatures of disease-associated astrocyte (DAA) were obtained from
   Habib et al.^[413]14. Signature gene enrichment was evaluated using the
   hypergeometric test implemented in the phyper function in the R
   Hypergeometric package with lower.tail= FALSE. The total number of
   features that were detected at least once in the cell population being
   analyzed was used as the background gene set in GO and KEGG pathway
   enrichment analysis.

   To compare with previously published pseudotime DEGs of activated
   microglia populations, pseudotime DEGs and direction of change as the
   cells transition from homeostatic to activated states were retrieved
   from the following publications: From Sankowski et al.^[414]48,
   ‘[415]supplementary table 11‘ was downloaded. From Keren-Shaul et
   al.^[416]37, [417]Supplementary Table 7 was downloaded. Gene names and
   gene expression direction changes were obtained Sala Frigerio et
   al.^[418]45 ([419]Figure S5 and main text; this is not a complete list
   of pseudotime DEGs, as these authors only reported AD risk genes in
   [420]Figure S5). The data were plotted using R package ComplexHeatmap
   version 2.12.0 (ref. ^[421]91).

Statistics and reproducibility

   No statistical methods were used to predetermine sample sizes, but our
   sample sizes are similar to those reported in previous publications.
   Wilcoxon rank sum test is a nonparametric test, which does not require
   normal distribution of the data. The study participants were allocated
   into groups based on their clinical diagnoses. We selected one case
   from each group for RNAscope mRNA in situ hybridization combined with
   immunohistochemistry assays. We were not blinded to allocation during
   experiments and outcome assessment, although the F3 RNAScope in situ
   hybridization signal quantifications were conducted blind to the
   conditions of the experiments. The sample of PD subject 1654 was
   replaced with the sample of PD subject 5212 in F3 RNAScope in situ
   hybridization signal quantifications experiment because not enough
   tissue from subject 1654 was available for the experiment. Our findings
   were replicated in four independent datasets for both astrocytes and
   microglia subpopulations, suggesting that our findings are
   representative.

Reporting summary

   Further information on research design is available in the Nature
   Portfolio Reporting Summary linked to this article.

Extended Data

Extended Data Fig. 1 |. snRNA-seq profiling and characterization of major
cell types.

   Extended Data Fig. 1 |
   [422]Open in a new tab

   a, Brain region analyzed with snRNA-seq. Created with BioRender. com.
   b, Comparison of age, postmortem interval (PMI), number of cells, the
   median number of transcripts and median number of genes per nucleus
   among control, AD and PD groups. c, Heatmap of the relative expression
   level of top 10 marker genes for each cell type. d, Violin plots of
   gene expression levels of known cell-type-specific marker genes. e,
   UMAP plot colored by experimental batch or individual label. UMAP were
   generated using the same parameters as described in [423]Fig. 1. f,g,
   Percentage of cells from (f) each disease group or (g) individuals of
   each disease group in each of the major cell type. Ast: Astrocyte; EP:
   Endothelia cell and pericyte; Immune: Immune cell including microglia;
   OLIGO: Oligodendrocyte; OPC: Oligodendrocyte precursor cell. Conserved
   marker genes were determined by FindConservedMarkers using Wilcoxon
   Rank Sum test and metap R package with meta-analysis combined P value <
   0.05 comparing gene expression in the given cluster with the other cell
   clusters for AD (n = 4), PD (n = 4) and the controls (n = 4).

Extended Data Fig. 2 |. Identification and validation of the three astrocytes
subpopulations.

   Extended Data Fig. 2 |
   [424]Open in a new tab

   a, Heatmap plot of the adjusted rand index (ARI) of pair-wise
   clustering result comparison using all cells with a range of
   dimensionality (5–30) and resolution (0.05–0.35). The black star
   indicates the parameter selected for all downstream analyses including
   analyses of entorhinal and prefrontal cortex astrocytes (dimensionality
   = 15, resolution = 0.25). The black lines delineate the range of
   parameters that generated high ARIs. b,c, UMAP visualization of
   subclusters of astrocytes colored by (b) disease diagnosis or (c)
   individual identity. d, Distribution of cells from each diagnostic
   group in the astrocyte subpopulations. Each dot represents an
   individual except entorhinal cortex data where each dot represents
   samples from two subjects that were processed together. e, Distribution
   of cells from each astrocyte subpopulation in different diagnostic
   groups. f,g, RNAscope in situ hybridization (ISH) analysis of Ast-2
   conserved marker genes CD44 (f) and TNC (g) transcript expression (red)
   and immunohistochemistry staining (brown) of AQP4 in the internal
   capsule tissue sections of the same subjects of the control (CTRL), AD
   and PD groups shown in [425]Fig. 1. For all data, the experiment was
   performed once. Hematoxylin-positive cell nuclei are shown in blue.
   Scale bar = 100 μm.

Extended Data Fig. 3 |. Characterization of astrocyte subpopulations in the
prefrontal cortex (pfc) of Mathys et al., 2019 and the anterior cingulate
cortex (acc) of the Feleke et al. 2021 data.

   Extended Data Fig. 3 |
   [426]Open in a new tab

   a,b, UMAP visualization of astrocyte subpopulations colored by cluster
   identity for (a) prefrontal cortex and (b) anterior cingulate cortex
   astrocytes. c,d UMAP visualization of astrocyte subpopulations colored
   by conserved marker gene expression levels for (c) prefrontal cortex
   and (d) anterior cingulate cortex. e, Dot plot of conserved marker gene
   expression levels in Ast-0, Ast-1 and Ast-2 astrocytes from the two
   brain regions. f, Violin plot showing the expression of Ast-2 conserved
   marker genes shared with putamen Ast-2. g–j UMAP visualization of
   subclusters of astrocytes colored by (g,h) disease diagnosis or (i,j)
   individual identity. k, The distribution of cells from each astrocyte
   subpopulation in different diagnostic groups (left) and the
   distribution of cells from each diagnostic group in the astrocyte
   subpopulations (right) of the Mathys et al., 2019 data. Each dot
   represents an individual. Conserved marker genes were genes whose
   expression is significantly higher than its expression in other cell
   clusters in all diagnostic groups determined by FindConservedMarkers
   using Wilcoxon Rank Sum test and metap R package with meta-analysis
   combined P value < 0.05. Red asterisks (*) indicate statistical
   significant conserved marker genes.

Extended Data Fig. 4 |. Characterization and comparison of the three
astrocytes subpopulations from the putamen (pu), entorhinal cortex (ec), and
prefrontal cortex (pfc) of Lau et al. data.

   Extended Data Fig. 4 |
   [427]Open in a new tab

   a, Upset plot showing the overlap between putamen conserved marker
   genes of Ast-0, Ast-1 and Ast-2 astrocyte with marker genes of mouse
   DAA and Gfap-high astrocytes from Habib et al., 2020. b, Violin plots
   showing the expression level distributions of orthologous genes of
   murine DAA and Gfap-high astrocyte marker genes in the putamen
   astrocytes. c, PCA plot using murine DAA and Gfap-high astrocyte marker
   gene logFC of gene expression (comparing murine DAA and Gfap-high
   astrocyte with Gfap-low astrocytes, downloaded from Habib et al., 2020)
   and the logFC of the human orthologous genes (comparing putamen Ast-1
   and Ast-2 with Ast-0 astrocytes). d,e, Violin plots showing the
   expression level distributions of reactive astrocyte marker genes in
   astrocytes from the (d) putamen and (e) prefrontal cortex. f, Violin
   plots showing the expression level distributions of A1-, A2-specific
   activated astrocyte markers and JAK-STAT3 pathway genes. g, Top 10 GO
   terms in the Biological Process category enriched in the astrocyte
   subpopulation signature genes (hypergeometric test, FDR-adjusted P
   value < 0.05, ≥ 5 query genes). Conserved marker genes plotted in panel
   (b), (d) and (e) were determined by FindConservedMarkers using Wilcoxon
   Rank Sum test and metap R package with meta-analysis combined P value <
   0.05 comparing gene expression in the given cluster with the other cell
   clusters for AD (n = 4), PD (n = 4) and the controls (n = 4). Genes
   plotted in (f) were not statistically significantly higher in any of
   the astrocyte subpopulations.

Extended Data Fig. 5 |. Comparison of differentially expressed genes (DEGs)
of the three astrocyte subpopulations from the putamen (pu), entorhinal
cortex (ec), and prefrontal cortex (pfc) from the Lau et al., data.

   Extended Data Fig. 5 |
   [428]Open in a new tab

   a, UpSet plot showing the number of overlapping up- and downregulated
   DEGs among the three astrocyte subpopulations for AD (left) and PD
   (right) astrocytes. b, Venn diagram showing the overlap of up- and
   downregulated DEG between AD and PD in each putamen astrocyte
   subpopulation (hypergeometric test). c, UpSet plot showing the overlap
   of DEGs that were up- or downregulated in AD between putamen (pu),
   entorhinal cortex (ec) and prefrontal cortex (pfc) astrocyte
   subpopulations. d, Disease-related Gene Ontology (GO) terms enriched in
   the astrocyte DEGs (hypergeometric test, FDR-adjusted P value < 0.05, ≥
   5 query genes). UP: upregulated in disease samples. Down: downregulated
   in disease samples.

Extended Data Fig. 6 |. Identification of immune cells and validation of
microglia subpopulations.

   Extended Data Fig. 6 |
   [429]Open in a new tab

   a,b, Violin plots showing the expression level distributions of marker
   genes for (a) PVM and (b) activated microglia. c, Distribution of
   percentage of cells from each subject in each immune cell cluster of
   the putamen (pu), entorhinal cortex (ec) from the Grubman et al. data
   and prefrontal cortex (pfc) from the Lau et al. data (one-way ANOVA or
   student’s t-test). Each dot represents a subject except the ec data.
   d,e, Immunohistochemistry staining (brown) of microglia marker protein
   P2RY12 and RNAscope in situ hybridization (ISH) analysis (red) of (d)
   AIF1 and (e) APOC1 transcript expression in the internal capsule tissue
   of the same subjects shown in [430]Fig. 1. Hematoxylin-positive cell
   nuclei are shown in blue. For all data, the experiment was performed
   once. f–h, UMAP visualization of only microglia subpopulations from (f)
   pu, (g) ec and (h) pfc. UMAPs were generated using a dimensionality of
   10 and resolution of 0.15. i–k, Violin plots showing the expression
   level distributions of conserved microglial subpopulation marker genes
   in putamen (i), entorhinal cortex (j) and preprontal cortex (k)
   microglia subpopulations. Conserved marker genes plotted in panel (a),
   and HLA-DRA, HLA-DPB1, FTL and CD14 plotted in panel (b) were
   determined by FindConservedMarkers using Wilcoxon Rank Sum test and
   metap R package with meta-analysis combined P value < 0.05 comparing
   gene expression in the given cluster with the other cell clusters for
   AD (n = 4), PD (n = 4) and the controls (n = 4).

Extended Data Fig. 7 |. Four distinct immune cell populations in (A-D) the
prefrontal cortex (pfc) of the Mathys et al., and (E-H) the anterior
cingulate cortex (acc) data of the Feleke et al. data.

   Extended Data Fig. 7 |
   [431]Open in a new tab

   a,e, UMAP visualization of subclusters of immune cells colored by cell
   cluster (left) or disease diagnosis (right). UMAPs were generated using
   parameters of dimensionality of 40 and resolution of 0.5 for the Mathys
   et al. data (AD n = 24, controls n = 24) and dimensionality of 20 and
   resolution of 0.15 for the Feleke et al. data (n = 7 each for the
   control, DLBD, PD and PDD samples). Violin plots showing the expression
   level distributions of genes for (b, f) T cell, microglia and PVM
   shared markers and PVM unique markers; (c, g) microglia-specific
   markers, and microglia subpopulation markers; (d, h) Micr-0 marker and
   activated microglia markers. The color code is the same as in (a) and
   (e), respectively. The conserved marker genes were determined by
   FindConservedMarkers using Wilcoxon Rank Sum test and metap R package
   with meta-analysis combined P value < 0.05 comparing gene expression in
   the cells of given cluster with that of the other cells. PVM:
   perivascular macrophage; CycM: cycling microglia.

Extended Data Fig. 8 |. Comparison of microglial pseudotime DEGs.

   Extended Data Fig. 8 |
   [432]Open in a new tab

   a, Venn diagram showing the overlap between pseudotime DEGs of control,
   AD and PD microglia with AD-risk genes (hypergeometric test).
   Pseudotime DEGs are genes whose expression significantly associated
   with pseudotime progression (generalized addictive model, FDR-adjusted
   P value < 0.05). b, UpSet plot showing the overlap between control, AD
   and PD microglial pseudotime gene coexpression modules from putamen
   microgla. c, Heatmap showing pseudotime DEGs shared by human activated
   microglia from the putamen (pu) of cognitively normal controls, AD and
   PD samples, from prefrontal cortex (pfc) of the control and AD samples
   and from the entorhinal cortex (ec) of the control and AD samples. d,
   GO terms related to immune functions enriched in the microglia
   pseudotime DEGs. e, Heatmap showing pseudotime DEGs shared by the mouse
   activated microglia DAM and ARM. f, Top 5 GO terms in the biological
   process category enriched in the microglia pseudotime DEGs. Pathways
   with FDR-adjusted P value < 0.05 (hypergeometric test) and at least
   five query genes were considered statistically significant. DAM:
   Disease-associated microglia; ARM: activated response microglia. UP:
   upregulated during pseudotime progress (module 2 and 3 genes). Down:
   downregulated during pseudotime progress (module 1 genes). Mod 1:
   module 1 genes; Mod 2 + 3: module 2 and 3 genes combined.

Extended Data Fig. 9 |. Microglia transcriptomic changes in disease
contributed to Aβ pathology, tauopathy and neuronal death.

   Extended Data Fig. 9 |
   [433]Open in a new tab

   a–d, Volcano plots showing significant DEGs in Micr-0 and Micr-1
   comparing cells from AD (left panels) or PD (right panels) with cells
   from the controls (CTRL). The x-axis specifies the logFC and the y-axis
   specifies the negative logarithm to the base 10 of the FDR-adjusted P
   values. Magenta and cyan dots represent genes expressed at
   significantly higher or lower levels respectively in disease samples
   (Wilcoxon Rank Sum test, FDR-adjusted P value < 0.05, absolute logFC >
   0.25) comparing AD (Micr-0 = 440, Micr-1 = 299 cells) or PD (Micr-0 =
   329, Micr-1 = 201 cells) microglia to the control (Micr-0 = 264, Micr-1
   = 198 cells) microglia. Violin plots showing the expression level
   distributions of example DEGs that were (b) downregulated in both AD
   and PD microglia, (c) uniquely downregulated in AD or (d) uniquely
   upregulated in PD. e, GO terms related to neuron death, Aβ pathology
   and tauopathy enriched in microglial DEGs (hypergeometric test,
   FDR-adjusted P value < 0.05, ≥ 5 query genes). f, Heatmaps showing the
   logFC of expression level of significant DEGs for GWAS AD- and PD-risk
   genes; GWAS genes differentially expressed in at least two
   subpopulations were plotted for visualization. UP: upregulated in
   disease samples. Down: downregulated in disease samples.

Extended Data Fig. 10 |. Comparison of microglia DEGs.

   Extended Data Fig. 10 |
   [434]Open in a new tab

   a, Venn diagram demonstrating overlap between AD and PD DEGs in the
   Micr-0 and Micr-1 cells for DEGs upregulated (left) or downregulated
   (right) in the disease samples. b,c, Scatter plots showing pair-wise
   correlations of genome-wide gene expression logFC (b) between Micr-0
   and Micr-1 in AD (left) or PD (right) samples and (c) between AD and PD
   samples in Micr-0 (left) or Micr-1 (right) cells respectively. d, Top 5
   GO terms in the biological process category enriched in the DEGs of the
   microglia subpopulations from the putamen (pu), entorhinal cortex (ec),
   prefrontal cortex (pfc) (hypergeometric test, FDR-adjusted P value <
   0.05, ≥ 5 query genes). e, Bar plot showing the number of DEGs for each
   subpopulation of microglia from the three brain regions (Wilcoxon Rank
   Sum test, FDR-adjusted P value < 0.05 and absolute logFC >0.25). UP:
   upregulated in disease samples. Down: downregulated in disease samples.

Supplementary Material

   Supplementary Information
   [435]NIHMS1880699-supplement-Supplementary_Information.pdf^ (19.9MB,
   pdf)
   Supplementary Table
   [436]NIHMS1880699-supplement-Supplementary_Table.xlsx^ (2.2MB, xlsx)

Acknowledgements