Abstract The mammalian liver is a central hub for systemic metabolic homeostasis. Liver tissue is spatially structured, with hepatocytes operating in repeating lobules, and sub-lobule zones performing distinct functions. The liver is also subject to extensive temporal regulation, orchestrated by the interplay of the circadian clock, systemic signals and feeding rhythms. However, liver zonation was previously analyzed as a static phenomenon, and liver chronobiology at tissue level resolution. Here, we use single-cell RNA-seq to investigate the interplay between gene regulation in space and time. Using mixed-effect models of mRNA expression and smFISH validations, we find that many genes in the liver are both zonated and rhythmic, most of them showing multiplicative space-time effects. Such dually regulated genes cover key hepatic functions such as lipid, carbohydrate and amino acid metabolism, but also previously unassociated genes, such as protein chaperones. Our data also suggest that rhythmic and localized expression of Wnt targets could be explained by rhythmically expressed Wnt ligands from non-parenchymal cells near the central vein. Core circadian clock genes are expressed in a non-zonated manner, indicating that the liver clock is robust to zonation. Together, our scRNA-seq analysis reveals how liver function is compartmentalized spatio-temporally at the sub-lobular scale. Introduction The liver is a vital organ maintaining body physiology and energy homeostasis. The liver carries out a broad range of functions related to carbohydrate and lipid metabolism, detoxification, bile acid biosynthesis and transport, cholesterol processing, xenobiotics biotransformation, and carrier proteins secretion. Notably, the liver performs catabolic and anabolic processing of lipids and amino acids and produces the majority of plasma proteins^[39]1. Liver tissue is highly structured on the cellular scale, being heterogeneous in both cell-type composition and microenvironment^[40]2. In fact, liver tissue is made up of millions of repeating anatomical and functional subunits, called lobules, which in mice contain hepatocytes arranged in about 15 concentric layers with a diameter of about 0.5mm^[41]3,[42]4. On the portal side of the lobule, blood from the portal vein and the hepatic arteriole enters small capillaries called sinusoids and flows to the central vein. This is accompanied with gradients in oxygen concentration, nutrients and signaling along the portal-central axis, with the latter notably involving the Wnt pathway^[43]5,[44]6. Due to this polarization, hepatocytes in different layers perform separate functions. This is accompanied with gradients of gene expression along the portal-central axis, with some genes expressed more strongly near the central vein, and vice versa for portally expressed genes. This phenomenon is termed liver zonation ^[45]1,[46]7. Recently, Halpern et al. combined single-cell RNA-sequencing (scRNA-seq) of dissociated hepatocytes and single-molecule RNA fluorescence in situ hybridization (smFISH) to reconstruct spatial mRNA expression profiles along the portal-central axis^[47]8. This analysis revealed an unexpected breadth of spatial heterogeneity, with ~50% of genes showing spatially non-uniform patterns. Among them, functions related to ammonia clearance, carbohydrate catabolic and anabolic processes, xenobiotics detoxification, bile acid and cholesterol synthesis, fatty acid metabolism, targets of the Wnt and Ras pathways, and hypoxia-induced genes were strongly zonated. In addition to its spatial heterogeneity, liver physiology is also highly temporally dynamic. Chronobiology studies showed that temporally gated physiological and metabolic programs in the liver result from the complex interplay between the endogenous circadian liver oscillator, rhythmic systemic signals, and feeding/fasting cycles^[48]9,[49]10,[50]11. An intact circadian clock has repeatedly been demonstrated as key for healthy metabolism, also in humans^[51]12. In addition, the hepatocyte clock has specifically been shown to play a major role in the physiological coordination of nutritional signals and cell-cell communication (including non-hepatocytic cells) controlling rhythmic metabolism^[52]13. Temporal compartmentalization can prevent two opposite and incompatible processes from simultaneously occurring, for example, glucose is stored as glycogen following a meal and is later released into the blood circulation during fasting period to maintain homeostasis in plasma glucose levels. Functional genomics studies of the circadian liver were typically performed on bulk liver tissue^[53]14. In particular, several studies showed how both the circadian clock and the feeding fasting cycles pervasively drive rhythms of gene expression in bulk, impacting key sectors of liver physiology such as glucose homeostasis, lipid and steroid metabolism^[54]15–[55]18. Here, we asked how these spatial and temporal regulatory programs interact on the level of individual genes and liver functions more generally. In particular, can zonated gene expression patterns be temporally modulated on a 24 h time scale? And conversely, can rhythmic gene expression patterns observed in bulk samples exhibit sub-lobular structure? More complex situations may also be envisaged, such as time-dependent zonation patterns of mRNA expression (or, equivalently, zone-dependent rhythmic patterns), or sublobular oscillations that would escape detection on the bulk level due to cancelations. On the physiological level, it is of interest to establish how hepatic functions might be compartmentalized both in space and time. To study both the spatial and temporal axes, we performed scRNA-seq of hepatocytes at four different times along the 24 h day, extending a previous approach^[56]8,[57]19 to reconstruct spatial profiles at each time point. The resulting space-time patterns were statistically classified using a mixed-effect model describing both spatial and temporal variations in mRNA levels. In total, ~5000 liver genes were classified based on their spatio-temporal expression profiles, and a few representative profiles were further analyzed with smFISH. Overall, this approach revealed the richness of space-time gene expression dynamics of the liver and provides a comprehensive view on how spatio-temporal compartmentalization is utilized at the sub-lobular scale in the mammalian liver. Results Single-cell RNA-seq captures space-time gene expression patterns in mouse liver To investigate spatio-temporal gene expression patterns in mouse liver, we sequenced mRNA from individual liver cells obtained via perfusion from 10 ad libitum fed mice at 4 different times of the day (ZT = 0h, 6h, 12h and 18h, two to three replicates per time point). The interval between ZT0 and ZT12 has the light on and corresponds to the fasting period in mice, while feeding happens predominantly between ZT12 and ZT0. We here focused on hepatocytes by enrichment of cells according to size and in silico filtering, yielding a total of 19663 cells ([58]Methods). To validate that the obtained scRNA-seq data captured the expected variability in both spatial and temporal mRNA levels, we generated a clustering analysis of all cells using a standard 2-dimensional t-SNE dimensionality reduction ([59]Methods) and colored cells either by their positions along the central-portal axis (the a posteriori assigned layers, see below) ([60]Figure 1a) or time ([61]Figure 1b). The clustering revealed that portally and centrally expressed landmark transcripts, such as the cytochrome P450 oxygenases Cyp2f2 and Cyp2e1 involved in xenobiotics metabolism, mark cells in opposite regions of the projections ([62]Figure 1c-d). Likewise, time-of-day gene expression varied along an orthogonal direction ([63]Figure 1b), as shown for the fatty acid elongase Elovl3 peaking at ZT0 ([64]Figure 1e). Figure 1. A scRNA-seq approach to space-time gene expression in mouse liver. [65]Figure 1 [66]Open in a new tab (a-e) Global gene expression varies in both space and time, as shown using t-SNE visualizations of the scRNA-seq (n=19663 hepatocytes examined over ten independent animals). Each dot represents one cell. Individual cells are colored by the (a posteriori assigned) lobule layer (a), Zeitgeber time (b), expression levels of the zonated genes Cyp2f2 and Cyp2e1 (c-d), or the temporally-regulated and centrally zonated gene Elovl3 (e). (f-h) Reconstructed spatial profiles (lobule layers 1-8) of selected zonated genes (f, top: Glul pericentrally (PC) expressed, bottom: Ass1 periportally (PP) expressed); rhythmic but non-zonated genes (g, top: core-clock gene Bmal1 peaking at ZT18-0, bottom: clock-controlled Dbp, peaking at ZT6-12); zonated and rhythmic genes (h, Top: Elovl3, bottom: Pck1). Expression levels correspond to fraction of total UMI per cell in linear scale. Log-transformed profiles are in [67]Extended Data Figure 1. Dots in f-h represent data points from the individual mice. Lines represent the mean expression per time point. Shaded areas represent one standard deviation (SD) across the mice. For the scRNA-seq we used n=2 (ZT6, ZT18) or n=3 mice (ZT0, ZT12) ([68]Methods). To obtain spatial mRNA expression profiles for each gene along the central-portal axis, we here introduced eight lobule layers, to which we assigned each individual cell. For this, we adapted a previous method that uses expression levels of landmark zonated genes to define a central-portal coordinate^[69]19, with the modification that only landmark transcripts that were sufficiently expressed and that did not vary across mice and time points were used (27 central and 28 portal landmark genes, [70]Methods). The resulting reconstructed (binned) mRNA expression profiles yielded 80 (8 layers over 10 mice) data points for each transcript. Although our resolution is lower compared with the typically 12-15 hepatocyte layers found in the liver^[71]3,[72]4, these reconstructions faithfully captured reference zonated genes, with both central, and portal, expression ([73]Figure 1f). Two examples of such genes are the centrally expressed glutamine synthethase (Glul), and the portally expressed urea cycle gene argininosuccinate synthetase (Ass1), showing mutually exclusive expression along the lobule^[74]8. The reconstruction also successfully identified transcripts of the core circadian clock, such as the master transcription factor Bmal1 (also named Arntl), whose mRNA peaked between ZT18-ZT0 ([75]Figure 1g)^[76]20. In addition to core clock genes, important clock outputs such as the PAR bZip transcription factor Dbp, which is a direct transcriptional target of BMAL1 regulating detoxification enzymes, peaked between ZT6-12 ([77]Figure 1g)^[78]21. Finally, genes showing both zonated and rhythmic mRNA accumulation were found ([79]Figure 1h), for example elongation of very long chain fatty acids 3 (Elovl3) is centrally expressed and peaks near ZT0, while phosphoenolpyruvate carboxykinase 1 (Pck1) regulating gluconeogenesis during fasting is expressed portally and peaks shortly before ZT12. Since most of the zonated profiles showed exponential shapes, and gene expression changes typically occur on a log scale^[80]22, we log-transformed the data for further analysis ([81]Methods, [82]Figure 1f and [83]Extended Data Figure 2a-b-c). Together, these examples indicate that the obtained gene expression profiles reliably capture spatial and temporal regulation of hepatocyte gene expression. mRNA expression profiles categorized according to zonation and rhythmicity To gain a systematic understanding of the space-time gene expression profiles, we next investigated if zonated gene expression patterns could be dynamic along the day, or conversely whether temporal expression patterns might be zone-dependent. To select a reliable set of reconstructed mRNA expression profiles for subsequent analyses, we filtered out lowly expressed genes as well as genes with significant biological variability across replicate liver samples, although this may be at the expense of a potentially decreased sensitivity ([84]Methods). This yielded 5058 spatio-temporal gene expression profiles ([85]Extended Data Figure 2d). An exploratory analysis of variance clearly identified zonated genes, rhythmic genes, and fewer genes showing variability along both axes, with known zonated and rhythmic genes distributed as expected ([86]Figure 2a). Figure 2. Space-time mRNA expression profiles categorized with mixed-effect models. [87]Figure 2 [88]Open in a new tab (a) Spatial and temporal variations for mRNA transcript profiles, calculated as standard deviations (SD) of log2 expression along spatial or temporal dimensions. Colored dots correspond to reference zonated genes (orange) and reference rhythmic genes (blue) ([89]Methods). (b) Extended harmonic regression model for spatio-temporal expression profiles describing a static but zonated layer-dependent mean μ(x), as well as layer-dependent harmonic coefficients (a(x) and b(x)). All layer-dependent coefficients are modeled as second order polynomials; i denotes the biological replicates. Temporal dependency is modelled with 24h-periodic harmonic functions. μ[i] are random effects needed due to the data structure hierarchy ([90]Methods). (c) Schema illustrating the different categories of profiles. Depending on which coefficients are non-zero ([91]Methods), genes are assigned to: F/N (flat or noisy, not represented), Z (zonated), R (rhythmic), Z+R (additive zonation and rhythmicity), ZxR (interacting zonation and rhythmicity). Graphs emphasize either zonation (top), with the x-axis representing layers, or rhythmicity (bottom), with the x axis representing time (ZT). Right side of the panel: two examples of fits (Elovl3 and Cyp7a1, respectively Z+R and ZxR). (d) Number of transcripts in each category. (e) Boxplot of the mean expression per category shows that zonated genes (Z, Z+R and ZxR) are more expressed than rhythmic (R) or flat/noisy (F/N). ZxR genes are the most expressed according to median expression (orange line). Box limits are lower and upper quartile, whiskers extend up to the first datum greater/lower than the upper/lower quartile plus 1.5 times the interquartile range. The number of genes per category is indicated. Remaining points are outliers. KW stands for the Kruskal-Wallis and MW for the Mann-Whitney (two-sided) tests. To identify possible dependencies between spatial and temporal variations, we built a mixed-effect linear model^[92]23 for the space-time mRNA profiles, which extends harmonic regression to include a spatial covariate ([93]Figure 2b). In this model, rhythms are parameterized with cosine and sine functions, while spatial profiles are represented with (up to second order) polynomials. In its most complex form, the model uses nine parameters describing spatially modulated oscillations, and one intercept per mouse ([94]Methods). When some of the parameters are zero, the model reduces to simpler mRNA profiles, for example purely spatial or purely temporal expression profiles ([95]Figure 2c). We then used model selection^[96]24 to identify the optimal parameterization and category for each gene ([97]Methods). In fine, we classified each mRNA profile into one of five types of patterns ([98]Figure 2c). If only the intercept is used, the profile will be classified as flat or noisy (F/N, [99]Methods). If only time-independent zonation parameters are retained, the predicted profile will be purely zonated (Z). If only layer-independent rhythmic parameters are retained, the predicted profile will be purely rhythmic (R). If only layer-independent rhythmic parameters and time-independent zonation parameters are retained, the profile is classified as independent rhythmic-zonated (Z+R). If at least one layer-dependent rhythmic parameter is selected, the profile will be termed interacting (ZxR). This classification revealed that, overall, about 30% of the mRNA profiles were zonated (Z, Z+R and ZxR) and about 20% were rhythmic (R, Z+R and ZxR) ([100]Figure 2d). The peak times of these rhythmic transcripts were highly consistent with bulk chronobiology data^[101]25 ([102]Extended Data Figure 2e). The entire analysis can be browsed as a web-app resource along with the corresponding data ([103]https://czviz.epfl.ch). Interestingly, we found that 7% of the analyzed genes in the liver were both zonated and rhythmic. Such dually regulated transcripts represent 25% of all zonated transcripts, and 36% of all rhythmic transcripts, respectively. For example, the previously shown Elovl3 transcript, involved in fatty acid elongation, and Pck1, a rate limiting enzyme in gluconeogenesis, are prototypical Z+R genes ([104]Figure 1h, [105]Extended Data Figure 2c). Gluconeogenesis is an energetically-demanding task^[106]3. As mice are in a metabolically fasted state requiring glucose production towards the end of the light phase (~ ZT10) and oxygen needed for ATP production is most abundant portally^[107]26, this process is indeed both spatially and temporally regulated. The dual regulation of zonated-rhythmic genes may therefore ensure optimal liver function under switching metabolic conditions. Dually regulated genes were mostly Z+R, with only a minority of ZxR patterns. The average expression across categories showed that rhythmic genes are significantly less expressed on average than genes in zonated categories, likely reflecting shorter half-lives ([108]Figure 2e and [109]Extended Data Figure 2f). Surprisingly, we find few highly expressed flat genes. Together, we found that mRNA expression of many zonated genes in hepatocytes is not static, and is in fact compartmentalized both in space and time. Properties of dually zonated and rhythmic mRNA profiles The majority of dually regulated genes are Z+R, which denotes additive (in log) space-time effects, or dynamic patterns where slopes or shapes of spatial patterns do not change with time ([110]Figure 2c). On the other hand, interacting patterns (ZxR) are rare. Comparing the proportions of central, mid-lobular (peaking in the middle of the portal-central axis) and portal genes among the purely zonated genes (Z), and independently zonated and rhythmic genes (Z+R), did not reveal significant differences ([111]Figure 3a), suggesting that rhythmicity is uncoupled with the direction of zonation. Similarly, comparing the phase distribution among the purely rhythmic genes (R) and the Z+R genes did not show a significant difference ([112]Figure 3b), indicating that zonation does not bias peak expression time. Moreover, oscillatory amplitudes were uncorrelated with the zonation slopes in Z+R genes ([113]Figure 3c). Finally, for ZxR genes with potentially more complex space-time patterns, we investigated the spreads in amplitudes and peak times across the layers ([114]Figure 3d). For wave-like pattern (phase modulated profiles), the phase difference across the lobule was up to 3h, which corresponds to a difference in time between neighboring hepatocytes on the order of 10 minutes for lobules of about 15 cell layers. On the other hand, amplitude modulated patterns showed up to 2-fold difference in oscillatory amplitude across the lobule. Figure 3. Properties of dually zonated and rhythmic mRNA profiles. [115]Figure 3 [116]Open in a new tab (a) Proportions of pericentral (green) and periportal (blue) transcripts are similar in Z and Z+R. Mid-lobular genes (orange) are rare (<2%). The number of genes are N = 484, 14, 628, and 157, 6, 184 for the portal, mid-lobular, and central genes in Z and Z+R, respectively. KS stands for the two-sided Kolmogorov-Smirnov test. (b) Peak time distributions of rhythmic transcripts are similar in R and Z+R categories (two-sample Kuiper test). (c-d) Effect sizes of zonation (slope) vs. rhythmicity (fold-change amplitude, log2 peak-to-trough) in Z+R genes (c). Magnitude of time shifts (delta phase, in hours) vs. fold-change amplitude gradient (delta amplitude, in log2) along the central-portal axis in ZxR genes (d). Genes for which the protein is also rhythmic (p<0.05, harmonic regression, F-test) in bulk data are indicated in dark red with the corresponding label (represented in [117]Extended Data Figure 3). To assess the potential physiological role of dually zonated and rhythmic transcripts, we asked if protein levels of the identified Z+R and ZxR genes accumulated rhythmically in a previous proteomics experiment^[118]27. In general, proteins rhythms are fewer, damped, and time-delayed compared to mRNA rhythms due to protein half-lives^[119]14,[120]27,[121]28 (Discussion). However, while R transcripts were twice more frequent than Z+R transcripts, the proportions were inverted for rhythmic proteins. Indeed, we found that among 65 rhythmic proteins (with q-value<0.2 in ref.^[122]27), 18 corresponded to Z+R and 10 to R transcripts. Moreover, the identified Z+R and ZxR genes with rhythmic protein accumulation cover key hepatic and zonated functions ([123]Extended Data Figure 3, for a functional interpretation, see below) and include rate limiting enzymes. For example, for Z+R transcripts ([124]Extended Data Figure 3a), PCK1 (rate limiting for gluconeogenesis), LPIN2 (Lipin2, catalyzes the conversion of phosphatidic acid to diacylglycerol during triglyceride, phosphatidylcholine and phosphatidylethanolamine biosynthesis), POR (cytochrome P450 oxidoreductase, required to activate P450 enzymes), DNAJA1 (HSP40 co-chaperone), ALAS1 (rate-limiting for heme biosynthesis), GNE (rate-limiting in the sialic acid biosynthetic pathway), THRSP (biosynthesis of triglycerides from medium-length fatty acid chains), show robust rhythms at the protein level. Similarly, for ZxR proteins, CYP7A1 (rate limiting enzyme in bile acid synthesis), CYP2A5 (coumarin 7-hydroxylase), SLC1A2 (high-affinity glutamate transporter), and multidrug resistance protein ABCC2 show rhythms on the protein level ([125]Extended Data Figure 3b). Moreover, the protein rhythms accompanying those Z+R and ZxR transcripts peak with an expected delay of maximally about 6 hours^[126]28 compared to the mRNA peak times ([127]Extended Data Figure 3c). smFISH analysis of space-time mRNA counts To substantiate the RNA-seq profiles, we performed RNA single molecule fluorescence in situ hybridization (smFISH) experiments on a set of selected candidate genes with diverse spatio-temporal patterns. smFISH provides a sensitive and independent, albeit low-throughput, measurement of mRNA expression. Purely zonated genes (Z) were already well studied with smFISH^[128]8. To analyze the core-clock, we measured two genes peaking at different times, Bmal1 and Per1, which were classified as R in the RNA-seq analysis. Bmal1 (~ZT0) and Per1 (~ZT12) phases were nearly identical in both experiments, and the rhythms did not depend on the lobular position consistent with R genes ([129]Figure 4a). We analyzed three genes classified as Z+R: Pck1 was indeed both portally biased and rhythmic in RNA-seq and smFISH ([130]Figure 4b); Elovl3 is both centrally biased and rhythmic in RNA-seq and smFISH, even though the amplitude of the oscillations was damped on the portal side in the FISH experiment ([131]Extended Data Figure 4a); and for Arg1 (Arginase 1) the portal RNA-seq and smFISH profiles matched well ([132]Extended Data Figure 4b). Finally, Acly showed a pattern in smFISH data which validates its classification as ZxR, with a lower amplitude on the portal side, where the transcript is more highly expressed ([133]Figure 4c). Thus, overall, the reconstructed scRNA-seq and smFISH profiles were consistent, with minor discrepancies. Figure 4. smFISH analysis of rhythmic and zonated transcripts. [134]Figure 4 [135]Open in a new tab (a) smFISH (RNAscope, [136]Methods) of the core clock genes Bmal1 and Per1 (both assigned to R) in liver slices sampled every 3 hours. Left: representative images at ZT0, ZT06, ZT12 and ZT18 for Bmal1. Central vein (CV) and a portal node (PN) are marked. Scale bar is 50μm. Endothelial cells lining the PC and cholangiocytes surrounding the PP were excluded from the quantification. mRNA transcripts and nuclei were detected in PN and PC zones ([137]Methods). Right: temporal profiles of Bmal1 and Per1 from smFISH at 8 time points from ZT0 to ZT21, every 3 hours (the line shows the mean number of quantified mRNAs, shaded area indicate SD across 5-8 images from one animal per time point), and scRNA-seq (same representation as in [138]Figure 1f-h, PC is layer 1, PN is layer 8), both in PN and PC regions. (b-c) smFISH (Stellaris, [139]Methods) for Pck1 (Z+R) and Acly (ZxR) in liver slices sampled every 12 hours. Left: representative images at ZT0, ZT12 for Pck1 (b) or Acly (c). Central vein (CV) and a portal node (PN) are marked. Scale bar - 20μm. Right: quantified profiles for each gene in the two time points from smFISH (the line shows the mean number of quantified mRNAs, shaded areas indicate SD across at least 10 independent images taken from at least 2 mice per time point) and scRNA-seq (same representation as in [140]Figure 1f-h). As above, the scRNA-seq used n=2 (ZT6, ZT18) or n=3 mice (ZT0, ZT12). Space-time logic of hepatic functions We next used our classification to explore the spatio-temporal dynamics of hepatic functions and signaling pathways in the liver. Given the prevalence of zonated gene expression profiles, we first analyzed if the circadian clock is sensitive to zonation. We found that profiles of reference core-clock genes ([141]Extended Data Figure 5) were assigned to the rhythmic only category (R), except for Cry1 and Clock that were assigned to Z+R, but with high probabilities also for R ([142]Supplementary Table 2). This suggests that the circadian clock is largely non-zonated, as also seen in the smFISH ([143]Figure 4a), and therefore robust to the heterogenous hepatic microenvironment. We then systematically explored enrichment of biological functions in the zonated category by querying the KEGG pathways database ([144]Supplementary Table 3, [145]Figure 5, [146]Methods). In addition to recapitulating well documented zonated liver functions^[147]8,[148]29, which we do not discuss here, this analysis highlighted Z and Z+R functions which, to our knowledge, had not been linked with liver zonation. For instance, we found that cytosolic chaperones accumulate centrally, while the endoplasmic reticulum (ER) chaperones linked with protein secretion accumulate portally ([149]Figure 5b,d, [150]Extended Data Figure 6). Both groups of chaperones peak during the activity/feeding phase, probably due to body temperature rhythms peaking during the active phase^[151]30,[152]31, and likely reflect increased needs of protein folding during times of high protein synthesis. Also, we found that mRNAs of ribosomal protein genes accumulate centrally (Z), as do proteasome components ([153]Figure 5a), the latter also containing rhythmic members (Z+R). In the liver, ribosomal proteins are rate-limiting for the synthesis of ribosomes, which themselves are rate-limiting for the synthesis of proteins^[154]32. Therefore, the overall protein synthesis rate is likely higher in these hepatocytes. Conversely, transcripts encoding components of the proteasome are involved in protein degradation. Together, these observations suggest that protein turnover is higher in centrally located hepatocytes, which are exposed to an environment with high concentrations of xenobiotics and hypoxic stress^[155]33. Figure 5. Space-time logic of compartmentalized hepatic functions for Z+R genes. [156]Figure 5 [157]Open in a new tab (a) KEGG analysis of the Z and Z+R genes. For clarity, only KEGG pathways (second column) with adjusted p-value < 0.01 (standard Enrichr^[158]67 output test) are presented ([159]Supplementary Table 3 for all enriched functions). The percentage of central genes is represented by a blue-red gradient. *: appears central due to the KEGG annotation system; however, fatty acid elongation is biased portally (see text). (b-c-d) KEGG analysis of Z+R genes. Representations of genes in central (b-c) and portal (d) enriched Z+R categories ([160]Supplementary Table 3). Polar representation: peak expression times are arranged clockwise (ZT0 on vertical position) and amplitudes (log2, values indicated on the radial axes) increase radially. The radius coordinate of genes with an amplitude >0.9 is halved (indicated with a black circle around the colored dot). In addition, while many mitophagy genes are expressed centrally (Z), some of those also show robust temporal rhythms peaking during the fasting period (Z+R), in particular two gamma-aminobutyric acid receptor-associated proteins (Gabarap and Gabarapl1) with an important function in autophagosome mediated autophagy^[161]34. Consistent with this temporal regulation, we had previously reported that the nuclear abundance of the fasting-dependent regulators of autophagy TFEB and ZKSCAN3 peaked near ZT6^[162]35. Moreover, the centrally and synchronously accumulating Ubiquitin B mRNA (Ubb, Z+R) may contribute to triggering mitophagy^[163]36. Thus, centrally biased mitophagy may both participate in removal of damaged mitochondria in the stressed central environment. Similarly, genes involved in bile acid synthesis and bile secretion are known to show zonated expression patterns^[164]8 ([165]Figure 5c). Here, we found that while the rate-limiting enzyme in the bile acid biosynthetic pathway Cyp7a1 (ZxR, [166]Figure 2c) is known to be clock-controlled, with its mRNA^[167]37,[168]38 and protein^[169]39 ([170]Extended Data Figure 7d) expressed maximally early during the feeding period, the ABC transporters Sterolin-1 and 2 (ABCG5/ ABCG8, both identified as Z+R) which excrete most of the biliary cholesterol^[171]40 peak towards the end of the fasting period near ZT9. Many detoxification enzymes of the cytochromes P450 (CYPs) superfamily are known to be centrally zonated in the liver^[172]8 and several of those are found in the Z+R category. In particular, the Flavin-containing monooxygenases Fmo1, 2, and 5, which are NADPH-dependent monooxygenases involved in drug and xenobiotics detoxification, exhibit Z+R mRNA patterns with peaks near ZT16 ([173]Figure 5c). Also, the FMO5 protein accumulates rhythmically ([174]Figure 3c, [175]Extended Data Figure 3). Moreover, the rate limiting enzyme ALAS1 producing the P450 cofactor heme was found as a centrally zonated Z+R transcript with peak mRNA at ZT13, showing also a robust rhythm in protein expression ([176]Extended Data Figure 3). To substantiate the above finding on heat shock genes, we examined the space-time behavior of temperature-regulated genes. To this end, we considered targets (bound in ChIP-seq) of the heat shock transcription factor HSF1 (Chip-Atlas^[177]41, includes our own liver data^[178]16). This showed that several targets, known to peak during the active phase^[179]30,[180]31, are also zonated ([181]Extended Data Figure 6). Notably, the cytoplasmic Hsp90aa1 (Hsp90A chaperone) and its interactor Dnaja1 (HSP40), respectively mitochondrial Hspd1 (HSP60), chaperones are expressed centrally where protein turnover is high. On the contrary, endoplasmic reticulum (ER) located chaperones: Hspb1 (Hsp90B) and interactor Pdia3, as well as Hspa5 (HSP70) and ER-resident Dnajc3 and Dnajb11 (DNAJ/HSP40) are expressed portally, consistent with their role in folding proteins in the secretory pathway (secretion is known to occur portally^[182]29). On the other hand, the analysis of cold induced genes (i.e. CIRBP and analogous, taken from ref.^[183]42) did not show zonated gene expression. Finally, we note that among all KEEG pathway related to lipids, a majority show central enrichment ([184]Figure 5a, [185]Supplementary Table 3). Inspection of the genes involved shows that this is due to the large number of genes related to peroxisomal β-oxidation, i.e. lipid catabolism, which are incidentally also listed in biosynthesis KEGG pathways ([186]Table S3). However, fatty acid synthesis is biased portally, as supported by key portally expressed genes such as Fasn, Srebf1, Acly, Acaca, Elovl2, Elovl5, and which is consistent with the fact that oxygen needed for mitochondrial β-oxidation is most abundant portally^[187]26. On the other hand, Elovl3, which is known to be transcriptionally controlled by the peroxisome regulator PPARα and atypically regulated among Elov family fatty acid elongases^[188]43 is expressed centrally. Space-time logic of activity of signaling pathways Signaling pathways that include Wnt, Ras and hypoxia have been shown to shape hepatocyte zonation^[189]8. We therefore examined the space-time activities of these pathways, extracted from the behavior of canonical target genes. We mainly focused on Wnt, as it is often considered the master regulator of liver zonation^[190]44. To systematically investigate spatio-temporal WNT/β-Catenin activity in liver, we extracted a set of Wnt targets derived from an APC KO in mouse liver^[191]8,[192]45. We found that rhythmic transcripts (in the R, Z+R and ZxR categories) are enriched among targets of the Wnt pathway, showing a proportion that increases with the strength of the targets, with the strongest Wnt targets containing 80% of rhythmic transcripts ([193]Extended Data Figure 7a). Positive Wnt targets were pericentral^[194]8 and peaked between ZT9 and ZT12, whereas negative Wnt targets were periportal^[195]8 and peaked between ZT21 and ZT3 ([196]Figure 6a). Figure 6. Rhythmic activity of Wnt signaling. [197]Figure 6 [198]Open in a new tab (a) Enrichment/depletion at different times (window size: 3h), of both positive (N=471) and negative (N=149) Wnt targets (background: all R and Z+R genes). Colormap shows p-values (two-tailed hypergeometric test): red (blue) indicates enrichment (depletion). (b) Heatmaps representing scRNA-seq profiles of the top 50 Wnt targets (according to the Apc-KO fold change, Lgr5 was also added) showing rhythmic mRNA in bulk (p<0.01, harmonic regression, F-test, data from ref.^[199]25). The profiles are computed in three different zones of the central-portal axis: central (layers 1-2), mid-lobular (layers 3-4-5) and portal (layers 6-7-8). Gene profiles (log2) are mean-centered in each zone. An enrichment of the phases around ZT8-14 can be observed, in agreement with [200]Figure 6a. (c) Nuclear protein abundance from ref.^[201]35 of the Wnt effector TCF4 (encoded by the Tcf7l2 gene) in mouse liver shows a rhythm (p=0.003, harmonic regression, n=2, sampled every 3h) peaking at ZT7.5, consistent with the accumulation of mRNA targets a few hours later (panel a). (d) mRNA profiles from bulk RNA-seq (top) and scRNA-seq (bottom). Top five targets with the highest Apc-KO fold change, and rhythmicity in the bulk data (p<0.01, harmonic regression, F-test, data from ref.^[202]25). Rhythmicity (indicated above each panel) is also computed in three different zones for the scRNA-seq data. As above, the scRNA-seq used n=2 (ZT6, ZT18) or n=3 mice (ZT0, ZT12). To obtain a temporal view of Wnt activity, we considered the top 50 Wnt pathway targets (according to the liver APC KO data) in the liver and analyzed the temporal profiles from high temporal resolution bulk liver mRNA and from our scRNA-seq binned in three different zones: central (layers 1-2), mid-lobular (layers 3-4-5) and portal (layers 6-7-8)([203]Figure 6b,d, [204]Extended Data Figure 7b-c). This analysis confirmed that the peak times of rhythmic Wnt targets is preferentially between ZT9 and ZT12, and that the bulk and single cell data are consistent with each other, despite of the lower temporal sampling of the scRNA-seq. Further evidence of rhythmic WNT/β-Catenin activity was provided by our previous proteomics data^[205]35 showing that the potent Wnt effector TCF4 (encoded by the Tcf7l2 gene) has rhythmic nuclear abundance in mouse liver with a peak phase at ZT7.5 ([206]Figure 6c), and hence explains the accumulation of its mRNA targets a few hours later. Among the rhythmic genes detected in bulk RNA-seq, the five strongest Wnt targets were, in decreasing order: Axin2, Glul, Slc1a2, Tuba8, Rnf43, with the latter showing the largest amplitude ([207]Figure 6d); all but Tuba8 peaked in the morning. Note that Glul, an important marker of central zonation and canonical Wnt target (just like Axin2 and Rnf43), was assigned to the Z category, but with second highest probability for ZxR ([208]Supplementary Table 2), peaking at ZT12. In addition to the mRNA rhythms, we found that several of the Z+R or ZxR Wnt targets showed clear rhythms in bulk proteomics with the characteristic phase delays. The strongest five targets with rhythmic proteins ([209]Extended Data Figure 7d) included the rate-limiting enzyme in the bile acid biosynthetic pathway CYP7A1, the NADPH-dependent monooxygenases involved in drug and xenobiotics detoxification FMO5, the P450 detoxification enzymes coumarin 7-hydroxylase (Cyp2a5), which may protect mice from dietary coumarin-induced toxicity^[210]46, the high-affinity glutamate transporter Slc1a2 (Eaat2), and the multidrug resistance protein ABCC2. All these proteins showed high amplitude protein rhythms peaking during the feeding phase between ZT12 and ZT18. Thus, Wnt transcription activity is clearly rhythmic in the liver, and this rhythm can propagate to protein expression. We next asked whether the temporal oscillations in the expression of Wnt-activated genes might correlate with temporal oscillations in Wnt morphogens produced by pericentral non-parenchymal liver cells. To this end, we performed smFISH experiments and quantified the expression of the Wnt ligand Wnt2 ^[211]5 and of Rspo3 ^[212]6,[213]47, a critical facilitator of Wnt signaling, as well as the Wnt antagonist Dkk3 ^[214]19 ([215]Figure 7a-b). We found that both Wnt2 and Rspo3 in liver non-parenchymal cells (NPCs) exhibit non-uniform expression around the clock, with significantly higher mRNA levels at ZT0 (p=4x10^-5 for Wnt2, and p=2x10^-8 for Rspo3, Kruskal-Wallis, [216]Figure 7b). Given various delays between mRNA accumulation of ligands and expression of the Wnt targets, this timing is compatible with the peak nuclear accumulation of the TCF4 (Tcf7l2 gene) transcription factor observed at ZT7.5 ([217]Figure 6c) and with the peaks in Wnt-activated genes between ZT6-12 ([218]Figure 6a-b). Differences in Dkk3 expression were not significant (p=0.053). Thus, production of Wnt morphogens by central non-parenchymal liver cells might underlie the observed rhythmic Wnt pathway activity. Figure 7. Wnt targets could be explained by rhythmically expressed Wnt ligands from non-parenchymal cells. [219]Figure 7 [220]Open in a new tab (a) Representative smFISH images of Wnt2, Dkk3, and Rspo3 expression at ZT0 (left) and ZT18 (right), shown in green. Markers of non-parenchymal cells (NPCs) are shown in red ([221]Methods). Nuclei are stained in blue (DAPI). Scale bars, 2 μm. (b) Violin plots representing quantitative analysis of smFISH images (n=1420 cells from 189 central veins of at least two mice per time point). Wnt2, Dkk3, and Rspo3 transcripts are quantified in NPCs lining the central vein ([222]Methods). mRNA expression is in smFISH dots per μm^3. P-values are obtained from Kruskal-Wallis test. Ras signalling and hypoxia are two additional pathways that have been implicated in shaping hepatocyte zonation^[223]8. In agreement with ref.8, we found that the negative targets of Ras were enriched in central genes, whereas the positive Ras targets were enriched in portal genes (not shown). The rhythmic targets (R and Z+R) of hypoxia showed a pattern of temporal compartmentalization similar with those of Wnt ([224]Extended Data Figure 7e): the negative targets were enriched around ZT0 (dark-light transition) and underrepresented around ZT14, while the positive targets were enriched around ZT10 and underrepresented around ZT3. Ras targets, positive or negative, did not exhibit significant temporal bias. Discussion Recent genome-wide analyses of zonated gene expression in mouse and human liver^[225]8,[226]48,[227]49 uncovered a rich organization of liver functions in space at the sublobular scale, while chronobiology studies of bulk liver tissue revealed a complex landscape of rhythmic regulatory layers orchestrated by a circadian clock interacting with feeding-fasting cycles and systemic signals^[228]35,[229]50–[230]52. Here, we established how these two regulatory programs combine to shape the daily space-time dynamics of gene expression patterns and physiology in adult liver by extending our previous scRNA-seq approach^[231]8. We found that liver uses gene expression programs with many genes exhibiting compartmentalization in both space and time. In this study, we chose to focus on the parenchymal cells in the liver, the hepatocytes, for which smFISH data on landmark zonated genes was readily available, which enabled reconstructing spatio-temporal mRNA profiles from scRNA-seq^[232]8. Zonation profiles of other cell types in the liver may be obtained as well; in fact, static zonation mRNA expression profiles have been obtained for liver endothelial cells, using a paired-cell approach^[233]19 in which mRNA from pairs of attached mouse cells were sequenced and gene expression from one cell type was used to infer the pairs’ tissue coordinates. In addition, ab initio reconstruction methods such as diffusion pseudo time48 or novoSpaRc^[234]53, in which a zonation coordinate is inferred by assuming that the major axis of variability for a cell type reflects transcriptome-wide gene expression changes associated with zonation, could be used for spatially sparse cell types with no available zonated marker genes, e.g. stellate or resident immune Kupffer cells. Moreover, it was recently found that rhythmic gene expression and metabolism in non-hepatocyte cells can be driven both by clocks in hepatocytes via cell-cell communication as well as feeding cycles^[235]13. Our computational framework for analyzing space-time logic of gene expression could be widely applicable in such future studies. To study whether the observed space-time expression profiles may be regulated by either liver zonation, 24h rhythms in liver physiology, or both, we developed a mixed-effect model, combined with model selection. This enabled classifying gene profiles into five categories representing different modes of spatio-temporal regulation, from flat to wave-like. To validate these, we performed smFISH in intact liver tissue, which showed largely compatible profiles although some quantitative differences were observed. These differences most likely reflect the lower sensitivity of RNA-seq, uncertainties in the spatial analysis of smFISH in tissues, as well as known inter-animal variability in the physiologic states of individual livers, notably related to the animal-specific feeding patterns^[236]25. Together, this temporal analysis confirms that a large proportion of gene expression in hepatocytes is zonated^[237]8 or rhythmic^[238]17, and in addition reveals marked spatio-temporal regulation of mRNA levels in mouse liver (Z+R and ZxR genes, comprising 7% of all detected genes according to our criteria). This means that zonated gene expression patterns can be temporally modulated on a circadian scale, or equivalently, that rhythmic gene expression profiles can exhibit sub-lobular structure. The dominant pattern for dually regulated gene was Z+R, which corresponds to additive effects of space and time in log, or multiplicate effects of gene expression levels, and describes genes expression profiles that are compartmentalized in both space and time. In other words, such patterns are characterized by shapes (in space) that remain invariant with time, but whose magnitudes are rhythmically rescaled in time. Or equivalently, the oscillatory amplitude (fold change) and phases are constant along the lobular coordinate, but the mean expression is patterned along the lobule. Such multiplicative effects could reflect the combined actions of transcriptional regulators for the zone and rhythm on promoters and enhancers of Z+R genes. Indeed, gene expression changes induced by several regulators combine multiplicatively^[239]22. Note that though the (relative) shape of Z+R patterns is invariant in time, threshold-dependent responses that would lie downstream of such genes would then acquire domain boundaries which can shift in time. Similar phenomena are expected for interacting profiles (phase and amplitude modulated) (ZxR) that we observed for a smaller number of genes. As shown by us and others^[240]27,[241]28, rhythms at the protein level are typically damped and phase-delayed compared the cognate mRNA rhythms, depending on the protein half-lives. Indeed, longer protein half-lives imply smaller oscillatory amplitudes and longer delays between mRNA and protein accumulation^[242]14. Analysis of liver proteomes in bulk showed that the number of cyclic proteins is significantly lower than the number of cyclic mRNAs, with delays approaching the predicted maximum of six hours. Here, we found genes from both Z+R and ZxR that exhibited rhythmic accumulation in bulk proteomics experiments, including genes encoding rate-limiting enzymes, suggesting that dually space-time regulated patterns have a physiological role in the liver. Moreover, phase delays between the mRNA and protein profiles were as expected. Future studies will utilize emerging spatial proteomics approaches to reconstruct a space-time liver proteomic atlas^[243]54. In addition to previously discussed zonated liver functions^[244]8, a systematic querying of KEGG pathways highlighted Z+R functions not previously associated with rhythmic liver zonation. The roles and profiles of the corresponding genes allowed us to better understand the spatiotemporal logic of the identified pathways. For instance, we found that the expression levels of both ribosome protein genes (rate-limiting for protein synthesis^[245]32) and proteasome components (involved in protein degradation) were higher in central hepatocytes. Since the central environment is subject to high concentrations of xenobiotics and hypoxic stress, this could indicate an elevated protein turnover in this region, which would ensure that damaged proteins are rapidly exchanged with new, undamaged proteins. This interpretation is corroborated by the observed increased levels of cytosolic and ER chaperones during the feeding phase, to assist protein synthesis and secretion, thereby counteracting such protein stress. It was previously shown that Wnt signaling can explain the zonation of up to a third of the zonated mRNAs^[246]7. Wnt ligands are secreted by pericentral non-parenchymal cells, mostly endothelial cells^[247]5,[248]6, forming a graded spatial morphogenetic field. As a result, and as observed in our enrichment analysis, Wnt-activated genes were pericentrally-zonated. Moreover, both the scRNA-seq data and previous bulk mRNA and protein measurements showed that Wnt activity is rhythmic in the liver. Our smFISH analysis suggested that temporal fluctuations in the expression of Wnt2 and Rspo3, two key Wnt ligands, secreted by pericentral non-parenchymal cells might underlie oscillatory and zonated expression of Wnt targets at times near the fasting/feeding transition. In summary, we demonstrate how liver gene expression can be quantitatively investigated with spatial and temporal resolution and how liver functions are compartmentalized along these two axes. Our approach could be used to reconstruct spatio-temporal gene expression patterns in other zonated tissues such as the intestine and kidney^[249]3. Methods Animals and ethics statement All animal care and handling were approved by the Institutional Animal Care and Use Committee of WIS and by the Canton de Vaud laws for animal protection (authorization VD3197.b). Male (C57BL/6JOlaHsd) mice aged of 6 weeks, housed under reverse-phase cycle and under ad libitum feeding (Teklad Global 18% Protein Rodent Diet) were used to generate sc-RNA-seq data of hepatocytes and single-molecule RNA-FISH (smFISH). Littermate controls were used for the ZT6 and ZT18 time points. Male mice between 8 to 10 weeks old (C57BL/6J), housed under 12:12 light-dark cycle, and having access to food only during the night (Kliba Nafag 3242 Breeding, Vitamin-fortified, irradiated > 25 kGy) were used for smFISH of circadian clock genes (Reporting Summary). Hepatocytes isolation and single-cell RNA-seq Liver cells were isolated using a modified version of the two-step collagenase perfusion method of Seglen^[250]55. The tissue was digested with Liberase Blendzyme 3 recombinant collagenase (Roche Diagnostics) according to the manufacturer instructions. To enrich for hepatocytes, we applied a centrifuge step at 30g for 3 min to pull down all hepatocytes while discarding most of the non-parenchymal cells that remained in the sup. We next enriched for live hepatocytes by 2 cycles of percoll gradient, hepatocytes pellet was resuspended in 25 ml of PBS, percoll was added for a final concentration of 45% and mixed with the hepatocytes. Dead cells were discarded after a centrifuge step (70g for 10min) cells were resuspended in 10x cells buffer (1x PBS, 0.04% BSA) and proceeded directly to the 10x pipeline. The cDNA library was prepared with the V2 chemistry of 10X genomics Chromium system according to manufactures instructions and sequencing was done with Illumina Nextseq 500 at estimated depth of 40,000 reads per cell. Overall, independent libraries were prepared at ZT0 (n=3 biological replicates from individual mice), ZT6 (n=2), ZT12 (n=3) and ZT18 (n=2). Conceivably, the dissociation of liver tissue into individual cells and the purification of hepatocytes are relatively lengthy and may thus lead to alterations in mRNA expression. While it has been shown that mRNA levels do not change much during the purification and 24-hour cultivation of hepatocytes^[251]56, transcription rates on the other hand can be diminished by 8-fold to nearly 100-fold during this process^[252]57. This difference between nascent transcript and mature mRNA levels can be explained by the relatively long half-lives of liver-specific RNAs. In our case, the time needed from the dissociation of the tissue until the cell lysis is approximately 1 hour and the cell are not placed in culture. Since we are measuring mature transcripts, with half-lives in the range of typically 1-5 hours ([253]Extended Data Figure 2f), the changes in mRNA levels due to the protocol will remain contained. In particular, the scRNA-seq of cells carry the typical hepatocyte gene expression signatures, for example, genes such as Alb or Apoa2 rank at 2nd and 5th position genome-wide. As further validation, we compared our reconstructed gene expression zonation profiles with the zonation profiles from the massively validated zonation study of ref.^[254]8, revealing a near-perfect agreement ([255]Extended Data Figure 1f). Filtering of raw scRNA-seq data The initial data analysis was done in R v3.4.2 using Seurat v2.1.0^[256]58. Each expression matrix was filtered separately to remove dead, dying and low-quality cells. We firstly only kept genes that were expressed in at least 5 cells in any of the ten samples. We then defined a set of valid cells with more than 500 expressed genes and between 1000 and 10000 unique molecular identifiers (UMIs) and secondly an additional expression matrix with cells having between 100 and 300 UMIs which was used for background estimation ([257]Extended Data Figure 1a). Other UMI-filters have been tried, but yielded equal or less reliable profiles. The mean expression of each gene was then calculated for the background dataset and subtracted from the set of valid cells. This was subsequently filtered to only include hepatocytes by removing cells with expression of non-parenchymal liver cell genes. Next, the cells were filtered based on the fraction of mitochondrial gene expression. First, expression levels in each cell were normalized by the sum of all genes excluding mitochondrial and major urinary protein (Mup) genes. Indeed, as mitochondria are more abundant in periportal hepatocytes, the expression of mitochondrial genes is higher in this area^[258]59; and since these genes are very highly expressed, including them would reduce the relative expression of all other genes based on the cell’s lobular location. Mup genes are also highly abundant and mapping their reads to a reference sequence is unreliable due to their high sequence homology^[259]60. Moreover, Mup genes encode for pheromones that vary greatly between individuals to facilitate individual recognition^[260]61. Mitochondrial content is often used to remove non-viable cells^[261]62. The mitochondrial content of sequenced hepatocytes exhibited a bi-modal behavior ([262]Extended Data Figure 1b). To identify the range of mitochondrial fractions that included viable hepatocytes we used an intrinsic property of hepatocytes, which is the anti-correlation of the pericentral landmark gene Cyp2e1 and the periportal landmark gene Cyp2f2 ^[263]7 ([264]Extended Data Figure 1c). We found that hepatocytes with mitochondrial fraction in the range of 9-35% exhibited an almost perfect anticorrelation between Cyp2e1 and Cyp2f2 ([265]Extended Data Figure 1d,e), suggesting that these are the best quality, and we consequently kept hepatocytes within this range of mitochondrial content for further analysis. t-SNE clustering To validate that the expected spatial and temporal axes of variation are present in the scRNA-seq data, we generated a low-dimensional representation of all cells using the standard t-SNE (t-distributed stochastic neighbor embedding)^[266]63, a nonlinear dimensionality reduction technique that embeds high-dimensional data on a 2-dimensional plane such that points that are similar in high-dimensional space are close together on the 2-dimensional representation. We then colored cells either by their position along the central-portal axis, or by time of day. Spatial reconstruction of zonation profiles from scRNA-seq data Choice of landmark genes The reconstruction algorithm relies on a priori knowledge about the zonation of a small set of landmark genes to infer the location of the cells. Ref.^[267]8 used smFISH to determine the zonation pattern in situ of 6 such landmark genes and used them to reconstruct the spatial profiles of all other genes at a single time point. Since we here aimed at reconstructing zonation profiles at different time points, we could not rely on those landmark genes, which might be subject to temporal regulation. Therefore, we used an alternative strategy where we selected landmark zonated genes from ref.^[268]8 (q < 0.2), with the additional constraints that those should be highly expressed (mean expression in fraction UMI of more than 0.01% and less than 0.1%), and importantly vary little across mice and time. Specifically, we calculated the variability in the mean expression (across all layers) between all mice for every gene and removed genes with >= 10% variability. This yielded 27 central (Akr1c6, Alad, Blvrb, C6, Car3, Ccdc107, Cml2, Cyp2c68, Cyp2d9, Cyp3a11, Entpd5, Fmo1, Gsta3, Gstm1, Gstm6, Gstt1, Hpd, Hsd17b10, Inmt, Iqgap2, Mgst1, Nrn1, Pex11a, Pon1, Psmd4, Slc22a1, Tex264); and 28 portal (Afm, Aldh1l1, Asl, Ass1, Atp5a1, Atp5g1, C8a, C8b, Ces3b, Cyp2f2, Elovl2, Fads1, Fbp1, Ftcd, Gm2a, Hpx, Hsd17b13, Ifitm3, Igf1, Igfals, Khk, Mug2, Pygl, Sepp1, Serpina1c, Serpina1e, Serpind1, Vtn) landmark genes. Reconstruction algorithm The reconstruction algorithm is based on the algorithm in ref.^[269]8 and was used in the modified version from ref.^[270]19. Briefly, the expression of each landmark gene was normalized to its maximal expression over all cells. Then, for each cell, we divided the summed expression of all portal landmark genes by the summed expression of all portal and central landmark genes, resulting in a value μ[i] between 0-1, which we used as the cell location in the liver lobule. We then compared the obtained μ[i] with the distributions of cell locations from each of the layers from ref.^[271]8, which yielded a matrix in which every cell has a given probability to belong to a given layer. These weights were then used to compute the mean expression of all genes in a particular layer. The procedure was applied independently on each mouse, yielding ten spatial gene expression profiles for each gene, given as fraction of UMI per cell. Spatiotemporal analysis of liver gene expression profiles Data Each profile for the 14678 genes includes 8 layers from the pericentral to the periportal zone and 4 time points: ZT0 (n=3 biological replicates from individual mice), ZT6 (n=2), ZT12 (n=3) and ZT18 (n=2). The expression levels (noted as x) are then log-transformed as follows: [MATH: y=log2(x+Δ)B :MATH] (i) The offset Δ = 10^−4 buffers variability in lowly expressed genes, while the shift B = −log [2](11 × 10^−5) changes the scale so that y = 0 corresponds to about 10 mRNA copies per cell (we expect on the order of 1M mRNA transcripts per liver cell). Reference genes For ease of interpretation ([272]Figure 2 and [273]Extended Data Figure 2), we used a set of reference circadian genes and a set of reference zonated genes, highlighted in several figures. The reference core circadian clock and clock output genes are the following: Bmal1, Clock, Npas2, Nr1d1, Nr1d2, Per1, Per2, Cry1, Cry2, Dbp, Tef, Hlf, Elovl3, Rora, Rorc. The reference zonated genes are the following: Glul, Ass1, Asl, Cyp2f2, Cyp1a2, Pck1, Cyp2e1, Cdh2, Cdh1, Cyp7a1, Acly, Alb, Oat, Aldob, Cps1. Gene expression variance in space and time To analyze variability in space and time ([274]Figure 2a) we computed, for each gene, the spatial variance V[x] and the temporal variance V[T]. Let y[x,t,j] represent the expression profile, with j the replicate index, t ∈ {1,2,…, N[t]} the time index, and x ∈ {1,2,…, N[x]} the layer index. Then, V[x] and V[T] are computed as follows: [MATH: VX=1Ntt x[j(yx, t,j1 Nx< mi>xyx,t,j)]2Nj2Nx :MATH] (ii) [MATH: VT=1Nxx t[j(yx, t,j1 Nt< mi>tyx,t,j)]2Nj2Nt :MATH] (iii) Thus, the spatial variance V[x] is computed along the space (and averaged over the replicates) for each time condition, and then averaged over time. The procedure is similar, symmetrically, for V[t]. Gene filtering For the analyses in [275]Figure 2, we selected transcripts that were reproducible between replicates, as well as sufficiently highly expressed (see scatterplot in [276]Extended Data Figure 2d). To assess reproducibility across replicates, we computed the average relative variance of the spatiotemporal profiles over the replicates: [MATH: VJ=1NxNtx,t< /mrow>[1Njj(yx,t ,j1Njj< /msub>yx,t,j)2]1Nx< /msub>NtNjx,t,j[(yx, t,j1 NxNtNj x,t,j< mi>yx,t,j)2] :MATH] (iv) We considered genes with values below 50% ([277]Extended Data Figure 2). To filter lowly expressed genes, we required the maximum expression level across layers and time points to exceed 10^-5 (fraction of UMIs) which corresponds to y = 0 or about 10 copies of mRNA per cell. While this was quite more permissive than previous scRNA-seq studies it allowed to keep most reference circadian and zonated genes. However, scRNA-seq has still limited sensitivity and some potentially important genes may have been removed in the filtering process. In the end, our filters kept 5085 genes (1437 were removed due to low expression, 4733 due to high variance, and 3543 due to both), which were then used for subsequent analyses. Mixed-effect model for spatiotemporal mRNA profiles Since the data is longitudinal is space (8 layers measured in each animal), modelling the space-time profiles require the use of mixed-effect models. To systematically analyze the spatiotemporal mRNA profiles, we used a parameterized function. Specifically, the model uses sine and cosine functions for the time, and polynomials (up to degree 2) for space. Possible interaction between space and time are described as space-dependent oscillatory functions, or equivalently, time-dependent polynomial parameters. Our model for the transformed mRNA expression y reads: [MATH: yx,t,i=μi+μ(x)+a(x)cos(ωt)+b(x)sin(ωt)+εx,< /mo>t,i :MATH] (v) Here t is the time, x the spatial position along the liver layers, and i ∈ {1,2,…,10} the animal index. This function naturally generalizes harmonic regression, often used for analysis of circadian gene expression^[278]25, by introducing space-dependent coefficients: [MATH: {μ(x)=μ0+μ1P1(x)+μ2P2(x)a(x)=a0+a1P1(x)+a2P2(x)b(x)=b0+b1P1(x)+b2P2(x) :MATH] Here, P [1] and P [2] are the Legendre polynomials of degrees 1 and 2, respectively; μ [0], μ [1] and μ [2] represent the static zonation profile, a [0] and b [0] represent the global (space-independent) rhythmicity of the gene, while a [1], a [2], b [1], b [2] represent layer-dependent rhythmicity. ε[x,t,j] is a Gaussian noise term with standard deviation σ. In addition to the fixed-effect parameters described so far, we also introduced a mouse-specific random-effect μ[i](with zero mean). This parameter groups the dependent layer measurements (obtained in the same animal) and thereby properly adjusts the biological sample size for the rhythmicity analysis. Phases φ (related to peak times t through t = φ * 24/2π) and amplitudes A, for each profile can then be computed for any layer from the coefficients a(x) and b(x): [MATH: φ(x)=arctan2(b(x),a(x))A(x)=a(x)2+b(x)2 :MATH] (vi) Note that peak-to-trough difference is 2A(x). The peak-to-trough ratio or fold change of the original expression levels is then 2^2A(x). We also note that an equivalent writing of the model formulates the problem in terms of time-dependent zonation parameters instead of space-dependent rhythmicity: [MATH: yx,t,i=μi+μ0(t)+μ1(t)P1(x)+μ2(t)P2(x)+εx,< /mo>t,i :MATH] (vii) where: [MATH: {μ0(t)=μ0+a0 cos(ωt)+b0s in(ωt)μ1(t)=μ1+a1 cos(ωt)+b1s in(ωt)μ2(t)=μ2+a2 cos(ωt)+b2s in(ωt) :MATH] In this study, we fixed [MATH: ω=2π2 4h :MATH] since the animals were entrained in a 24 h light-dark cycle and the low time resolution would prevent us from studying ultradian rhythms. The model parameters, including the variance of the random effects and Gaussian noise strength σ, are estimated for each gene using the fit function from the Python library StatsModels (version 0.9.0). Nelder-Mead was chosen as the optimization method, and the use of a standard likelihood was favored over the REML likelihood to allow for model comparison^[279]64. To prevent overfitting of the gene profiles, we added a noise offset σ [0] = 0.15 [log [2]] to the estimated noise σ, in the expression of the likelihood function used in the mixed-effect model optimization. Depending on the gene, the model presented in ([280]v) and ([281]vii) may be simplified by setting all or some of the (fixed) parameters to 0. For example, a non-oscillatory gene profile would normally have non-significant a[j] and b[j] parameters. In practice, considering the fixed effects, 2^9 sub-models of various complexity can be generated. However, we added a few reasonable requirements to reduce the number of models. First, the intercept μ [0] must be present in every model. Similarly, the parameters a [0] and b [0], providing a global rhythm, must be present in every rhythmic model. Finally, the parameters a[j] and b[j] for j=0,1,2 must be paired to ensure a proper phase definition ([282]vi). The models can then be classified in different categories, depending on the retained (non-zero) parameters ([283]Figure 2c): * The model comprising only the intercepts μ [0] and μ[i], termed flat or noisy (F/N). * The models comprising only the intercepts and zonation parameters: μ [1] and/or μ [2], termed purely zonated (Z). * The models comprising only the intercepts and rhythmic parameters: a [0] and b [0], termed purely rhythmic (R). * The models comprising only the intercepts, zonated parameters and rhythmic parameters: μ [1] and/or μ [2], and a [0], b [0], termed independent (Z+R). * The models comprising interaction parameters: a[j] and b[j] for j=1,2, termed interacting (ZxR). Note that we only plot the fixed effects in the predicted gene profiles. The Bayesian Information Criterion (BIC) is then used for model selection, enabling to choose the most parsimonious model for each gene. Consequently, the F/N class also contains noisy profiles, since genes that are not well fitted with any complex model will then be assigned to the simplest model. Additionally, it appears that, for some profiles, several competing models can result in close BIC values (see e.g. the discussion on Clock and Cry1 in the Results). Therefore, when assigning hard classes, if some models have a relative difference of less than 1% in their BIC, we systematically keep the most complex model. Moreover, we also assigned probabilities to the different categories (F, Z, R, Z+R and ZxR), computed as Schwartz BIC weights^[284]65, which is useful in case of ambiguous classification ([285]Supplementary Table 2). All best fits with their parameter values are listed in [286]Supplementary Table 1. Bulk RNAseq dataset A bulk liver RNA-seq dataset was obtained from ref.^[287]25. These data was obtained from a sampling every 2 hours for 24 hours, with 4 replicates per time condition. For [288]Extended data Figure 2e, we only compared genes that for which rhythmicity is not changing across layers, viz. the R and Z+R categories. Note that since the scRNA-seq data has a lower temporal resolution and fewer replicates per time point, we found overall less rhythmic genes. To assess gene rhythmicity, we used harmonic regression on the log-transformed profiles. Using the same notation as above, we define the two following models: [MATH: {yt, i=μ+ε(ix)yt, i=μ+acos(ωt)+bsin(ωt)+ε(x) :MATH] We then fit eq. ([289]viii) and eq. ([290]ix) to every transcript. Depending on the figure, we either keep the model with the lowest BIC ([291]Extended data Figure 2e, for which we also compute the circular correlation coefficient^[292]66) or the ones having a significant rhythmicity according to a F-test ([293]Figure 6). KEGG pathway Enrichment analysis Functional annotation clustering from Enrichr^[294]67 for the categories F/N, Z and Z+R (which is then subdivided in central and portal), Zc+R (central zonated and rhythmic), Zp+R (portal zonated and rhythmic) and finally R was ran ([295]https://maayanlab.cloud/Enrichr/) with standard parameters, using the standard KEGG 2019 Mouse set of pathways. The enriched pathways (adjusted p-value < 0.1, standard Enrichr output test) were then further annotated to compute, e.g. the number of central/portal genes in each category and the phase of each gene. This analysis is available [296]Supplementary Table 3. smFISH Analysis of Z+R and ZxR genes (Stellaris smFIH probes) Preparation of probe libraries, hybridization procedure and imaging conditions were previously described^[297]19. Briefly, smFISH probe libraries were coupled to TMR, Alexa594 or Cy5. Cell membranes were stained with alexa fluor 488 conjugated phalloidin (Rhenium A12379) that was added to GLOX buffer^[298]68. Portal node was identified morphologically on DAPI images based on bile ductile, central vein was identified using smFISH for Glul in TMR, included in all hybridizations. Images were taken as scans spanning the portal node to the central vein. Images were analyzed using ImageM^[299]68. Quantification of zonation profiles in different circadian time points were generated by counting dots and dividing the number of dots in radial layers spanning the portal-central axis by the layer volume. Temporal analysis of circadian genes (RNA scope smFISH probes) smFISH of R genes were done on fresh-frozen liver cryosections (8μm) embedded in O.C.T Compound (Tissue-Tek; Sakura-Finetek USA), sampled every three hours (ZT0 to ZT21). RNAscope® probes for Bma1l mRNA (Mm-Arntl, catalog #: 438748-C3) and Per1 mRNA (Mm-Per1, catalog #: 438751) were used, according to the manufacturer’s instructions for the RNAscope Fluorescent Multiplex V1 Assay (Advanced Cell Diagnostics). To detect the central vein, an immunofluorescence of Glutamine Synthetase (ab49873, Abcam, diluted 1:2000 in PBS/BSA 0.5%/Triton-X0.01%) was done together with smFISH. Nuclei were counterstained with DAPI and sections were mounted with ProLong™ Gold Antifade Mountant. Liver sections were imaged with a Leica DM5500 widefield microscope and an oil-immersion x63 objective. Z-stacks were acquired (0.2μm between each Z position) and mRNA transcripts were quantified using ImageJ, as described previously in ref.^[300]50. Pericentral (PC) and Periportal (PP) veins were manually detected based on Glutamine Synthetase IF or on bile ducts (DAPI staining). The Euclidean distance between two veins and the distance from the vein of each mRNA transcript were calculated. mRNA transcripts were assigned to a PP or PC zone if the distance from the corresponding vein was smaller than one-third of the distance between the PP and PC veins (ranging from 50 to 130μm). Wnt2, Rspo3 and Dkk3 expression in non-parenchymal cells (NPCs) (Stellaris smFIH probes) Preparation of probe libraries, hybridization protocol and imaging conditions were previously described^[301]19. The Aqp1, Igfbp7 and Ptprb probe libraries were coupled to TMR, the Wnt2 library was coupled to Alexa594 and the Dkk3 or Rspo3 library were coupled to Cy5. Cell membranes were stained with alexa fluor 488 coupled to phalloidin (Rhenium A12379) that was added to GLOX buffer^[302]68. The central vein was identified based on morphological features inspected in the DAPI and Phalloidin channels and presence of Wnt2-mRNA (detected by smFISH). Central vein niche NPCs were identified by co-staining of Aqp1, Igfbp7 and Ptprb. The central vein area was imaged and the images were analyzed using ImageM^[303]68. We counted dots of Wnt2, Rspo3 and Dkk3 expression (corresponding to single mRNA molecules) in NPCs lining the central vein and removed background dots larger than 25 pixels. We then divided the dot count by the segmented cell volume. In total 1420 NPCs from 189 central veins of at least 2 mice per time point (ZT0,6,12,18) were imaged and a Kruskal-Wallis test based on the mean mRNA dot concentration in each cell was performed to compare the ZT0 and ZT18 time points. Extended Data Extended Data Fig. 1. scRNA-seq pre-processing. [304]Extended Data Fig. 1 [305]Open in a new tab (a) Histogram of number of UMIs per cell barcode for each mouse. Red patches mark the cells used for background estimation (100-300 UMI/cell barcode), gray patches mark the cells used for downstream analysis (1000-10000 UMI/cell barcode). (b) Histogram of fraction of all UMIs mapping to mitochondrial genes. Filter used for downstream analysis in grey (0.09-0.35). (c) smFISH staining of a liver lobule with probes against Cyp2e1 (red) and Cyp2f2 (green). CV = central vein, PN = portal node. Overall, the data combine 10 images from from two mice. (d) Expression of Cyp2e1 and Cyp2f2 in cells with different fraction of mitochondrial expression. Three different filters for the fraction of UMIs mapping to mitochondrial genes (0-0.09, 0.09-0.35, 0.35-1) were applied, the data of all mice merged and the resulting datasets visualized as t-SNE plots. (e) Violin plots for the correlations between Cyp2e1 and Cyp2f2 expression in single hepatocyte populations with different filters for fractions of mitochondrial expression. Each dot represents one mouse (n=10 mice for each distribution) and the shape of the violin represents the density of points. (f) Comparison of the zonation profiles of Z and Z+R genes obtained in our current study and the previous reconstruction from Halpern et al. 2017. Profiles were interpolated to fit 15 layers, where 1 is pericentral and 8 is periportal. Dots indicate the center of mass (expression-weighted lobule layer) of the Z and Z+R genes computed in both datasets, for gene having an average expression of at least 10^-5 in Halpern et al. r is the correlation coefficient, p the corresponding p-value from a standard linear regression. Extended Data Fig. 2. Log-transformed reconstructed profiles, pre-filtering of the genes and comparison with external datasets. [306]Extended Data Fig. 2 [307]Open in a new tab (a-c) Expression levels of the reconstructed profiles for the genes from [308]Figure 1F-H after log-transformation ([309]Methods). Dots in represent data points from the individual mice. Lines represent mean expression per time point. Shaded areas represent one standard deviation (SD) across the mice (n=2 or n=3 depending on the time point, [310]Methods). (d) Biological variability of gene profiles across independent replicate liver samples, quantified in terms of the average relative replicate variance. 0 shows perfectly reproducible profiles while 1 the most variable genes ([311]Methods). Genes inside the bottom-right box (x-cutoff at 10^-5; y-cutoff at 0.5) are selected and contain all but one of the reference genes. Colored dots show reference zonated genes (blue) and reference rhythmic genes (orange). (e) Comparison of the peak times for rhythmic genes in R and Z+R, with the dataset from Atger et al, PNAS 2015. Circular correlation coefficient is 0.746 ([312]Methods). (f) Boxplot of the mRNA half-lives (data from Wang, J. et al, 2017) shows that R genes as a group (median, orange line) are the shortest-lived. Box limits are lower and upper quartiles, whiskers extend up to the first datum greater/lower than the upper/lower quartile plus 1.5 times the interquartile range. Remaining points are marked. MW stands for the two-sided Mann-Whitney test, and KW stands for Kruskal-Wallis test. Extended Data Fig. 3. Z+R and ZxR transcripts with corresponding rhythmic protein accumulation in bulk mass spectrometry data. [313]Extended Data Fig. 3 [314]Open in a new tab (a-b) Rhythmic proteins corresponding to Z+R (a) and ZxR (b) transcripts were selected from Robles et al., 2014 (from original [315]Table S2), and fitted with harmonic regression (p-value of rhythmicity from F-tests are indicated above the plot). Only proteins having a p-value<0.01 are shown. (c) Scatter plot of the phase of the fits from the transcripts (x-axis) against the phase of the fits from the proteins (y-axis). The diagonal is indicated with a dashed grey line, the theoretical upper bound (6h) for the delay between mRNA and protein is indicated with a dashed red line. All rhythmic proteins (q<0.2 in the original analysis) are represented.} Extended Data Fig. 4. Additional validations for the Z+R category. [316]Extended Data Fig. 4 [317]Open in a new tab (a-b) smFISH (Stellaris, [318]Methods) for Elovl3 (Z+R) and Arg1 (Z+R). smFISH quantifications were made for ZT0 and ZT12 ([319]Methods). Left: representative images at ZT0, ZT12 for Elovl3 (a) or Arg1 (b). Pericentral veins (CV) and a periportal node (PN) are marked. Scale bar - 20μm. Right: quantified profiles for each gene at the two time points from smFISH (top, line plot is the mean number of mRNAs, shaded area indicate SD across twelve images), and scRNA-seq data (bottom, line plot is mean expression, shaded area is SD across mice, n=2 or n=3 depending on the time point, [320]Methods). Extended Data Fig. 5. The core circadian-clock is not zonated. [321]Extended Data Fig. 5 [322]Open in a new tab (a) Spatial and temporal profiles and fits for circadian core-clock genes. Peak times are indicated on the temporal representation. For the genes Cry1 and Clock, additional dashed lines represent fits for the R model, as the Schwartz BIC weights from the R and Z+R models were close ([323]Supplementary Table 2). (b) Amplitudes and peak times of the core-clock circadian genes in a polar coordinate representation (clock-wise ZT times are indicated, distance from the center corresponds to the amplitude) show the expected organization of core clock transcript in the liver. Extended Data Fig. 6. Spatio-temporal mRNA expression profiles for heat-responsive genes. Represented genes correspond to the top 200 targets from the ChIP-Atlas list for mouse HSF1. [324]Extended Data Fig. 6 [325]Open in a new tab (a) Polar plot representation of the transcripts that are R, Z+R, or ZxR among the HSF1 targets from the ChIP-Atlas ([326]http://chip-atlas.org/). Genes involved in chaperone functions (chaperones, co-chaperones, or chaperone facilitators) are named. Color indicates zonation, while grey dots show purely rhythmic genes. (b) Spatial representation of the transcripts corresponding to proteins involved in chaperone functions, separated in central (left, cytoplasmic function) and portal (right, endoplasmic reticulum function) zonation. Extended Data Fig. 7. Rhythmicity of Wnt targets in bulk RNA-seq, and proteomics liver time series data. [327]Extended Data Fig. 7 [328]Open in a new tab (a) Enrichment of rhythmic genes (R, Z+R and ZxR) among the targets of the Wnt pathway, computed on the bulk dataset (Atger et al., 2015). Targets above a given percentile (x-axis) of Apc-KO fold change are considered. The percentage of rhythmic genes in the whole Atger et al. dataset is indicated by a dashed blue line. (b) Bulk mRNA (coming from Atger et al. dataset) rhythmicity profiles of Wnt targets among the top-50 targets with highest Apc-KO fold change. Gene profiles are centered around their mean. An enrichment of the phases around ZT8-14 is observed, in agreement with [329]Figure 6a. (c) Polar plot representation of the individual gene phases and amplitudes represented in panel b (bulk data). (d) Temporal representation of selected genes profiles from the scRNA-seq (top, n=2 or n=3 animals depending on the time point, [330]Methods) and bulk proteomics (bottom, data from Robles et al, 2014, n=2 replicates per time point sampled every 3h) data. Represented profiles are ones with (1) the highest Apc-KO fold change, (2) a significantly rhythmic protein (p < 0.05, standard harmonic regression, F-test), and (3) belonging to the Z+R or ZxR category. (e) Enrichment/depletion at different times (window size: 3h), of both positive and negative Ras (N=31 and N=33, respectively) and Hypoxia (N=73 and N=41, respectively) targets (background: all R and Z+R genes). Colormap shows p-values (two-tailed hypergeometric test): red (blue) indicates enrichment (depletion). Supplementary Material Supplementary table 1 [331]EMS114743-supplement-Supplementary_table_1.xlsx^ (449.3KB, xlsx) Supplementary table 2 [332]EMS114743-supplement-Supplementary_table_2.xlsx^ (400.8KB, xlsx) Supplementary table 3 [333]EMS114743-supplement-Supplementary_table_3.xlsx^ (64.4KB, xlsx) Acknowledgments