Abstract Ischemic stroke constitutes a multifaceted neurological affliction that spans various cellular types. Lack of dynamic chromatin accessibility data after stroke is one of the obstacles to understanding this process. To gain insights into the variations in transcriptional regulation among various cell types subsequent to a stroke, we employed single-nucleus ATAC-seq to curate a chromatin accessibility compendium from the cerebral cortex of mice subjected to middle cerebral artery occlusion/reperfusion (MCAO/R). Tissue samples were collected at various time points including 0, 6, 12, 24 hours, and 7, 14 days post-reperfusion, in addition to Sham control group. We obtained 99,271 high-quality nuclei across nine cell types, thereby establishing the single-nucleus chromatin accessibility atlas. This atlas provides data for interpreting the regulatory mechanisms that pervade the continuum of ischemic stroke. The data presented herein constitutes a valuable resource for the comprehension of regulatory interplays within the pathology-afflicted cerebrum. Subject terms: Stroke, Stroke Background & Summary Ischemic stroke is the second leading cause of death and disability worldwide, typically resulting in motor and cognitive impairments^[48]1,[49]2. Due to the intricate nature of brain structure and the diverse array of cell types present, our understanding of the molecular mechanisms underlying stroke remains incomplete. To decipher the biochemical regulatory alterations and interactions during stroke, researchers have employed a variety of methods including transcriptomic and proteomic sequencing^[50]3,[51]4. Single-cell RNA sequencing (scRNA-seq) has emerged as a transformative technology in stroke research, providing valuable insights into the activation of immune response, multicellular interactions, new potential biomarkers, and therapeutic targets^[52]5,[53]6. However, these single-cell studies have predominantly focused on transcriptomics, leaving the regulatory mechanisms that control gene expression largely unexplored. The coordination between chromatin structure and transcription plays a crucial role in gene regulation, influencing processes from embryonic development to the progression of disease^[54]7,[55]8. Chromatin accessibility is essential in regulating gene expression and cellular identity, and alterations in accessibility have been implicated in the occurrence and development process of neurological diseases^[56]9,[57]10. Single-nucleus epigenomic profiling has been applied across a range of diseases, including coronary artery disease and neurodegenerative diseases. By utilizing single-nucleus assay for transposase-accessible chromatin with high-throughput sequencing (snATAC-seq), researchers can pinpoint cell type-specific, disease-related candidate cis-regulatory elements (CREs) and their candidate target genes, underscoring the significance and potential of this technique in the realm of neurological diseases^[58]11. The identification of emerging CREs unveils cell-state-specific modifications, aiding in the discovery of novel putative biomarkers and therapeutic targets^[59]12. However, the existing literature lacks a comprehensive view of the dynamic landscape of global chromatin accessibility of stroke. To investigate the CREs and their dynamic within specific cell types following ischemic stroke, we selected cortical tissues from middle cerebral artery occlusion/reperfusion (MCAO/R) mice at seven different time points after reperfusion: 0 hours (h), 6 h, 12 h, 24 h, 7 days (d), and 14 d, as well as Sham group. We performed snATAC-seq analysis on chromatin accessibility of 99,271 nuclei and constructed a single-nucleus chromatin accessibility atlas, annotating cell types with their respective putative target genes and functions for CREs. The delineation of transcriptional regulatory networks offers a valuable resource for potentially identifying key cell subtypes, transcription factors (TFs) and dynamic alterations pertinent to ischemic stroke, guiding the development of more targeted and efficacious therapeutic interventions for this disease. Methods Animal experiments Male C57BL/6 mice (age: 7-8 weeks, weight: 22–24 g) were employed for this study. All animal procedures were performed in compliance with guidelines for the care and use of animals and were approved by the University Huazhong Agriculture Institutional Animal Care and Use Committee (approval number: HZAUMO-2021-0011). Mice were assigned into groups subjected to MCAO/R with various recovery periods, or into Sham operation. MCAO/R model establishment Ischemic stroke was established with MCAO/R surgery on male C57BL/6 mice according to the laboratory’s preliminary experimental methods^[60]13. The mice were anesthetized with 2% isoflurane (RWD, R510-22-10), and were placed in the supine position. To induce focal cerebral ischemia, a silicon-coated filament (RWD, MSMC21B120PK50) was inserted into the left middle cerebral artery (MCA) to block blood flow. After 60 minutes later, the filament was extracted for reperfusion. Rectal temperature was maintained at 37 °C during the entire procedure. The cerebral blood flow (rCBF) was detected using a laser doppler flowmetry (Perimed, PERIMED005). A 55% decrease in the rCBF of the ipsilateral hemisphere, as compared to contralateral hemisphere, was considered the threshold for successful establishment of cerebral ischemia. Mice of the sham-operated group were performed the same as the MCAO/R procedure without filament insertion. Brain tissue dissection Following the MCAO/R operation, the mice were allowed recovery periods of 0 h, 6 h, 12 h, 24 h, 7 d and 14 d. Both MCAO/R mice and sham-operated mice were euthanized via cervical dislocation. The brains of mice were promptly and meticulously excised. Ophthalmic forceps were used to carefully separate the left cerebral cortex from the rest of the brain without compromising its integrity. All samples were placed on ice and quickly dissected, frozen in liquid nitrogen, and stored at −80 °C until further processing. Nuclei isolation for snATAC-seq Nuclei were isolated from cerebral cortex of mice for snATAC-seq according to the mechanical extraction method with slight modification^[61]14. In brief, the frozen tissues were thawed from −80 °C storage and 2 mL pre-chilled homogenization buffer (Sigma, CELLYTPN1-1KT) was added to each sample. Each 1 mL mixture was transferred to a 2 mL Dounce homogenizer (Kimble, NO.885300-0002) on ice, and then homogenized with 10–20 strokes of the tight pestle (the B pestle). Next, the homogenized mixture was filtered into a 1.5 mL tube through 40 µm cell strainers (Falcon, 352340), followed by centrifugation at 1,200 g for 5 minutes at 4 °C to pellet the nuclei, and then suspended in a blocking buffer containing 1% bovine serum albumin (BSA, Sangon Biotech, 68551-06-4) and 0.2 U/μL RNase inhibitor in 1 × PBS. Finally, the nuclei were pelleted for a second time by centrifugation at 500 g for 5 minutes at 4 °C, and resuspended in 1 × PBS containing 1% BSA for following library preparation. snATAC-Seq library preparation and sequencing For snATAC-Seq library preparation, DNBelab C Series Single-Cell ATAC Library Prep Set (MGI, 1000021878) was employed. In summary, libraries constructed as the user guide involved in transposition, droplet generation, PCR pre-amplification, emulsion breakage, capture beads collection, DNA amplification, and eventually purification for sequencing on the MGI DNBSEQ-T1 platform, with which we can acquire 109 bp for read 1, 50 bp for read 2, and 10 bp for the sample index. For quality control all of these snATAC-Seq libraries were assayed by Qubit ssDNA Assay Kit (Thermo Fisher Scientific, [62]Q10212) to make sure enough DNA molecule and concentrations for sequencing and assessed the fragment size distribution using Agilent Bioanalyzer 2100 (Agilent Technologies, G2939A). snATAC-seq raw data processing First, the raw reads were separated and filtered using PISA (version 1.1)^[63]15, then aligned to mouse genome (mm10) with the software BWA (version 0.7.17-r1188)^[64]16. The obtained BAM files were further processed by bap2 (version 0.6.2)^[65]17 to identify reads with the same cell barcode and create fragment files for downstream bioinformatic analysis. Quality control ArchR (version 1.0.2)^[66]18 was employed to filter nucleus with low quality based on the following parameters: for each library the value of log10 (unique nuclear fragments) ≥ 3 and transcriptional start site (TSS) enrichment score ≥ 4. Subsequently, the potential doublets were removed from the datasets by applying the ‘addDoubletScores’ function of ArchR with the parameters of filterRatio = 2. The outlier samples are removed based on UMAP integration results. Pearson correlation coefficients between technical or biological replicates across tissues were calculated based on the PeakMatrix. An overview of quality control parameters for the snATAC-seq profiles seen in Table [67]1. Table 1. An overview of quality control parameters for the snATAC-seq profiles established in samples from 6 different time points and Sham group. Time points Sham M0 h M6 h M12 h M24 h M7 d M14 d Library Number 11 5 8 8 4 2 4 Cell Number 22951 16201 20113 17689 9051 4835 8431 Doublet Enrichment 0.73 0.62 0.50 0.70 0.60 0.60 0.68 Median Fragment 2624 3992 5704 4567 3913 5485 4100 TSS Enrichment 7.09 8.25 8.32 7.66 9.09 12.28 9.80 Peak Number 353870 322618 317837 309113 274276 265104 248302 [68]Open in a new tab Monitoring time: 0 hours (M0 h), 6 hours (M6 h), 12 hours (M12 h), 24 hours (M24 h), 7 days (M7 d), and 14 days (M14 d) post-stroke, and Sham group (Sham). Latent semantic indexing (LSI) clustering of snATAC-seq data Considering the sparsity of snATAC-seq data, iterative latent semantic indexing was adopted with ‘addIterativeLSI’ function of ArchR for dimensionality reduction, then library integration was performed by ‘addHarmony’ function and unsupervised clustering was employed using the ‘addClusters’ function on the basis of the Seurat Leiden clustering algorithm^[69]19. The gene activity scores were calculated by ‘addGeneScorelatriy’ function. Marker genes were identified by ‘getMarkerFeatures’ function (parameters: useMatrix = ‘GeneScoreMatix’, groupBy = ‘Clusters’) with metrics of FDR ≤ 0.05 & Log2FC ≥ 0.25, where the statistical test for differential genes was the rank-sum test. Further, cell types was annotated for each cluster in accordance with well-established cell type-specific markers. Additionally, scCODA^[70]20 employs a Bayesian model primarily for analyzing differences in cell composition across different sample groups. Consequently, we used the scCODA package to compute the proportions of different cell types at various time points and performed significance testing. In the model construction process (using the function sccoda. utl. cpuana. mod. PositionalAnalysis), the “formula” parameter is set to the mouse injury time “time”. Set the reference_cellotype parameter to “automatic”. Then use the sample_ hmc() function to perform HMC sampling inference and view the results through summay(). The “Final Parameter” displays significance. If the value is 0, the effect is considered statistically unreliable; otherwise, it is considered reliable. In addition, logFC is used to indicate a decrease or increase in abundance (relative to the reference cell type). Pseudo-bulk: verification of data authenticity DESeq2^[71]21,[72]22 is an R package used for differential expression analysis of RNA-seq data. It employs an empirical Bayesian method to estimate the prior values of log2 fold changes and variances, and calculates the posterior values of these statistics. DEseq2 was used to calculate the differentially expressed genes in microglia (MG) cells at different time points. Specifically, we extracted MG and added peaks with identical biological replicates to form pseudo-bulk data. We then used DESeq2 to calculate the differential pseudo-bulk peaks between M24h and M14d. Subsequently, we conducted functional enrichment analysis of the genes associated with pseudo-bulk peaks in biological processes (BP) pathways. This functional enrichment was used to interpret the functions that MG perform at M24h and M14d, thereby reflecting the authenticity of the data. Peak calling of snATAC-seq Given that per-cell snATAC-seq data is essentially binary, we created pseudo-bulk replicates using the ‘addGroupCoverages’ function for each cell type and the ‘addReproduciblePeakSet’ function was executed to call peak with the parameters set as groupBy = ‘annotation’, pathToMACS = pathToMacs2, maxCells = 20,000. Further analysis involved the acquisition of the marker peaks was achieved by the ‘getMarkerFeatures’ function and ‘getMarkers’ functions. Motif enrichment analysis With the intention of exploration what motifs enriched in peaks we performed cell type-specific and time-specific motif enrichment analysis through ‘addMotifAnnotations’ function of ArchR to add motif annotations first, specifying the motifSet as ‘cisbp’ and assigning the name ‘Motif’, and then using the ‘peakAnnoEnrichment’ function for enrichment of various motifs, whereby the top 7 cell type-specific and time-specific motifs were shown in the heatmap. TF footprinting analysis For the purpose of determining the precise binding location of the TF the ‘getPositions’ function was utilised to obtain the positions of the relevant motifs with default parameters. Afterwards the ‘addGroupCoverages’ function and ‘getFootprints’ function were used to accurately profile TF footprints which were visualized by ‘plotFootprints’ function. Pathway enrichment analysis Metascape bioinformatics tool ([73]http://metascape.org)^[74]18 was leveraged to perform Gene Ontology (GO) and Kyoto Encylopedia of Genes and Genomes (KEGG) pathway enrichment analyses with the metrics of P value < 0.05, enrichment factor of >1.5 and a minimum count of 3. We have provided a sheet (Table [75]2) highlighting the detailed parameters used for each datasets. Table 2. The detailed parameters used for each dataset. Function Description Parameters createArrowFiles Create ArrowFiles from input files. filterTSS = 4; filterFrags = 1000 addDoubletScores Independently assign inferred doublet information to each cell. k = 10; knnMethod = “UMAP”; LSIMethod = 1 filterDoublets Filter doublets. filterRatio = 1.5 addIterativeLSI Compute an iterative LSI dimensionality reduction. useMatrix = “TileMatrix”; iterations = 2; varFeatures = 25000; dimsToUse = 1:30 addHarmony Add the Harmony batch-corrected reduced dimensions to an ArchRProject. reducedDims = “IterativeLSI”; groupBy = “time” addClusters Identify clusters from a reduced dimensions object. method = “Seurat”; resolution = 0.8 addUMAP Compute a UMAP embedding. nNeighbors = 30; minDist = 0.5; metric = “cosine”; reducedDims = “Harmony” getMarkers Identify Markers and return a List of Features or a GRangesList for each group of significant marker features. “FDR <  = 0.01 & Log2FC >  = 1.25” addGroupCoverages Merge cells within each designated cell group for the generation of pseudo-bulk replicates and then merge these replicates into a single insertion coverage file. groupBy = “Clusters2” addReproduciblePeakSet Get insertions from coverage files, call peaks, and merge peaks to get a “Union Reproducible Peak Set”. peakMethod = “Tiles”; method = “p” addMotifAnnotations Adds information about which peaks contain motifs to a given ArchRProject. For each peak, a binary value is stored indicating whether each motif is observed within the peak region. motifSet = “cisbp” peakAnnoEnrichment Perform hypergeometric enrichment of a given peak annotation within the defined marker peaks. peakAnnotation = “Motif”; cutOff = “FDR <  = 0.1 & Log2FC >  = 0.5” [76]Open in a new tab Data Records The data (BAM files and fragment files) generated in this study by snATAC-seq (0 h, 6 h, 12 h, 24 h, 7 d, 14 d and Sham) have been deposited to CNGB Nucleotide Sequence Archive (CNSA)^[77]23 of China National GeneBank DataBase (CNGBdb)^[78]24 [79]https://db.cngb.org/search/project/CNP0005552/: CNP0005552 (2024)^[80]25 and NCBI Sequence Read Archive [81]https://identifiers.org/ncbi/insdc.sra:SRP548281 (2024)^[82]26. Additionally, the gene score matrix, cell-type peak matrices and metadata had also uploaded to Figshare (10.6084/m9.figshare.25558785.v4)^[83]27. Technical Validation In the present study, we collected the identical left cerebral cortex from mice at seven distinct time points (0 h, 6 h, 12 h, 24 h, 7 d, and 14 d) after MCAO/R, as well as from sham-operated mice (Fig. [84]1). To ensure data reliability, we generated three biological replicates (comprising 6–12 libraries) for each post-reperfusion time point as well as the Sham group to construct the snATAC-seq library (Methods, Table [85]1). Fig. 1. [86]Fig. 1 [87]Open in a new tab An overview of the experimental and data analysis workflow. Schematic shows the collection of samples, sequencing procedures and downstream bioinformatic analyses. Samples from mice at 7 different time points (0 hours [M0h], 6 hours [M6h], 12 hours [M12h], 24 hours [M24h], 7 days [M7d], and 14 days [M14d] post-stroke) and Sham group (Sham) were collected from cerebral cortex for single-nucleus assay for transposase-accessible chromatin with high-throughput sequencing (snATAC-seq). Initially, we applied Pearson correlation coefficient analysis to the 42 libraries to evaluate the similarity among the libraries at different time points post-stroke. The heatmap revealed that replicates originating from the same time points exhibited the highest correlation, affirming the high reproducibility of both biological and technical replicates (Fig. [88]2a). For quality control, we excluded cells with a low TSS enrichment score and a small number of unique nuclear fragments (Fig. [89]2b). Further doublet removal was performed to mitigate their substantial effects on clustering by ArchR addDoubletScores with a filterRatio parameter of 5 (Fig. [90]2c). After quality control, we obtained 99,271 high-quality cells for across the 7 groups from MCAO/R and sham-operated mice, encompassing 42 snATAC-seq libraries. We calculated the TSS enrichment fraction and the number of unique fragments for each cell nucleus under different time points after injury. The TSS enrichment fraction was mainly within the range of 4–20, while the number of unique fragments was mainly 1,000–32,000 (Fig. [91]2d and [92]2e, Table [93]1). The nuclei numbers of libraries at each time point were counted for further analyses (Fig. [94]2f). Additionally, the TSS enrichment profiles showed a clear peak around the TSS sites and a smaller shoulder peak in the right of TSS sites, as well as nucleosome periodicity, consistent with previous studies (Fig. [95]2g)^[96]7,[97]18,[98]28. Fig. 2. [99]Fig. 2 [100]Open in a new tab snATAC-seq data quality control and features. (a) Heatmap showing the similarity among the 42 libraries. The heatmap for the seven time points, from top to bottom, represents Sham group (Sham), 0 hours (M0h), 6 hours (M6h), 12 hours (M12h), 24 hours (M24h), 7 days (M7d), and 14 days (M14d) post-stroke. (b) Scatter plots show bivariate distributions of transcriptional start site (TSS) enrichment vs log10 (unique fragments) of different time points after stroke and Sham group. (c) Uniform manifold approximation and projection (UMAP) of snATAC-seq data shows the doublet enrichment of different time points after stroke. (d) Violin Plots show the unique fragment numbers of 7 different time points and Sham group. (e) Violin Plots show the TSS enrichment scores of 7 different time points and Sham group. (f) Histogram shows the filtered nuclei numbers of 7 different time points and Sham group. (g) Plots show the enrichment of snATAC-seq fragments around TSS. Upon completion of the quality control process, the nuclei of all groups were integrated and clustered, with gene activity scores utilized to estimate gene expression and annotate cell types. We identified eight different cell types (Fig. [101]3a and [102]3b), including Astrocytes (AST, marker gene: Aqp4 and Fgfr3), Endothelial cells (EC, marker gene: Pecam1 and Cldn5)^[103]29,[104]30, Fibroblast (FB, marker gene: Col1a2 and Dcn), Microglia (MG, marker gene: P2ry12 and Cx3cr1), Oligodendrocytes (OL, marker gene: Plp1 and Mag), Oligodendrocyte progenitor cells (OPC, marker gene: Pdgfra and Cspg4), Excitatory neurons (Ex_Neuron, marker gene: Rbfox3 and Satb2) and GABAergic neurons (GABA_Neuron1, marker gene: Rbfox3 and Meis2; GABA_Neuron2, marker gene: Rbfox3 and Gad1). Rbfox3 serves as a marker for mature neurons, distinguishing neuronal cells from glial cells^[105]31. Subsequently, we identified eight differential accessibility regions (DARs) with cell types specificity (Fig. [106]3c). Visual inspections on a genome browser of 9 DARs showed that each cell cluster exhibited a unique signature of snATAC-seq peaks. For example, signal peaks detected in the TSS region of marker genes Fgfr3 are specifically enriched in AST, but not observed in other cell types (Fig. [107]3c). Similarly, the signal peaks detected around Meis2 are significantly enriched in GABA_Neuron1 than in other cell types (Fig. [108]3c). Furthermore, GO analysis on the differential peaks specific to each cell type revealed a significant correlation between the identified cell types and their corresponding GO pathways (Fig. [109]3d). Vasculature development related pathways were enriched in EC, while the leukocyte activation and synapse immune response pathways exhibited enrichment in MG. In addition, myelin assembly-related pathways show enrichment in OL (Fig. [110]3d). Fig. 3. [111]Fig. 3 [112]Open in a new tab Clustering and annotation of chromatin accessibility in the cerebral cortex of MCAO/R mice. (a) Clustering visualization and annotations of 9 clusters identified by snATAC-seq data. (b) UMAP visualization of cell type–specific gene activity scores. Each dot represents an individual cell colored by cluster assignment. (c) The genome browser view shows aggregated chromatin accessibility profiles of cell type-specific gene loci. (d) The bar plot displays representative enriched pathways specific to the cell types. (e) The stacked chart illustrates the composition of cell proportions in the cerebral cortex for seven different time points of MCAO/R and Sham group, with colours indicate the cell types. (f) GO enrichment pathway analysis of genes highly expressed at M24h in MG. (g) GO enrichment pathway analysis of genes highly expressed at M14d in MG. By calculating the proportions of each cell population, we observed dynamic changes of nine cell types within the cerebral cortex after MCAO/R. Compared to sham-operated mice, there was an significant increase in the proportion of Ex_Neuron during the acute stage (post-MCAO/R 24 hours, M24h)^[113]32, which was accompanied by the significant decrease in the proportion of AST, GABA_Neuron, and OPC (Fig. [114]3e). As the stroke progresses into the chronic phase (post-MCAO/R 7 days-14days, M7d-M14d)^[115]32, the composition of different cell types (Fig. [116]3e) continues to shift over time. Specifically, there was a significant changes between M7d and M4d in GABA_Neuron1. The proportion of Ex_Neuron exhibits a complex pattern of initial decline, subsequent increase, and eventual decrease. We used scCODA to statistically analyze the proportions of different cell types at various time points. The results indicated significant changes in cell proportions for Ex_Neuron at M24h (logFC = 0.448) and for GABA_Neuron2 at 7d (logFC = −1.187). At M14d, subpopulations such as FB (logFC = 1.742), GABA_Neuron1 (logFC = −4.272), MG (logFC = 2.571), and GABA_Neuron2 (logFC = −1.370) showed significant changes in cell proportions. To verify the authenticity of the data analysis, we used DEseq2 to calculate the differentially expressed genes in MG cells at different time points. To verify the authenticity of the data, we extracted MG and added peaks with identical biological replicates to form pseudo-bulk data. We then used DESeq2 to calculate the differential pseudo-bulk peaks between M24h and M14d (see Methods section), identifying a total of 503 differential peaks. Subsequently, pathway enrichment analysis was performed on the genes associated with these peaks to validate the accuracy of the results. The results showed that the differentially expressed genes at M24h were enriched in migration and immune pathways (Fig. [117]3f), which is consistent with the function of microglia in acute injury phase post-stroke^[118]33. The differentially upregulated peak-related genes at M14d were enriched in pathways related to adhesion and wound healing (Fig. [119]3g), which is also consistent with the function of MG in the chronic phase post-stroke^[120]34. Additionally, differential peaks were observed in genes such as Serpinf1, Cxcr4, Ctsb and Siglech, all of which have been reported to be involved in the activation of microglia and secretion of inflammatory factors after stroke^[121]35–[122]37. Therefore, these pieces of evidence demonstrate that our data reflect authentic biological phenomena. These patterns in cell type proportions highlight potential molecular mechanisms that underlie the functionality of different cell types in the context of MCAO/R-induced ischemic stroke. The TFs binding sites of different cell types (Fig. [123]4a) and time points (Fig. [124]4b) are closely related to their corresponding cell functions. TFs related to neurodevelopment and function, such as Neurog1 and Neurod2, are enriched in Ex_Neuron. TFs related to lymphocyte proliferation and differentiation, such as Elf1 and Sfpi1 families, are enriched in MG. OL are specifically enriched with Sox9^[125]38–[126]40. At distinct time points (Fig. [127]4b), binding motifs associated with Fos, a transcriptional regulator implicated in the modulation of oxidative stress responses and maintenance of mitochondrial homeostasis, exhibited specific enrichment during the acute phase^[128]41. Fig. 4. [129]Fig. 4 [130]Open in a new tab Motif enrichment in each cell type and time point. (a) Heatmaps of motifs enrichment in marker peaks of each cell type. (b) The heatmap illustrates the time point–specific enrichment of motifs. The color represents the normalized motif enrichment score calculated in ArchR using HOMER with the hypergeometric test. In order to validate the identified motifs, we performed the TF footprinting analysis to confirm the binding of TFs to DNA. The TF footprinting analysis also demonstrated that the identified motifs exhibited active TF binding activity in the corresponding cell types (Fig. [131]5)^[132]42–[133]44. In summary, our data reproduce the enrichment patterns of key cell type-specific TFs reported in previous studies, indicating the accuracy of our cell types identification and the high quality of our data. Fig. 5. [134]Fig. 5 [135]Open in a new tab TF footprinting analysis identifies representative cell type-specific TFs activities in snATAC-seq data. (a) Cell type-specific TFs binding motifs sequence logo of cell types (top), and representative cell type-specific TFs footprint profiles (bottom). Taken together, our datasets provide a valuable resource for exploration of the epigenetic regulation mechanisms after ischemic stroke in mice. Usage Notes The pipeline of the snATAC-seq data processing, including the read mapping, low-quality cells filtering, unsupervised clustering and peak calling were run on the Linux operating system. All R source codes with the optimized parameters used for the downstream data analyses and visualization are provided online (10.6084/m9.figshare.25558785)^[136]27. Acknowledgements