Abstract Human cytomegalovirus (HCMV) is a betaherpesvirus which rarely presents problems in healthy individuals, yet may result in severe morbidity in immunocompromised patients and in immune-naïve neonates. HCMV has a large 235 kb genome with a coding capacity of at least 165 open reading frames (ORFs). This large genome allows complex gene regulation resulting in different sets of transcripts during lytic and latent infection. While latent virus mainly resides within monocytes and CD34^+ progenitor cells, reactivation to lytic infection is driven by differentiation towards terminally differentiated myeloid dendritic cells and macrophages. Consequently, it has been suggested that macrophages and dendritic cells contribute to viral spread in vivo. Thus far only limited knowledge is available on the expression of HCMV genes in terminally differentiated myeloid primary cells and whether or not the virus exhibits a different set of lytic genes in primary cells compared with lytic infection in NHDF fibroblasts. To address these questions, we used Illumina next generation sequencing to determine the HCMV transcriptome in macrophages and dendritic cells during lytic infection and compared it to the transcriptome in NHDF fibroblasts. Here, we demonstrate unique expression profiles in macrophages and dendritic cells which significantly differ from the transcriptome in fibroblasts mainly by modulating the expression of viral transcripts involved in immune modulation, cell tropism and viral spread. In a head to head comparison between macrophages and dendritic cells, we observed that factors involved in viral spread and virion composition are differentially regulated suggesting that the plasticity of the virion facilitates the infection of surrounding cells. Taken together, this study provides the full transcript expression analysis of lytic HCMV genes in monocyte-derived type 1 and type 2 macrophages as well as in monocyte-derived dendritic cells. Thereby underlining the potential of HCMV to adapt to or influence different cellular environments to promote its own survival. Introduction Human cytomegalovirus (HCMV) is a ubiquitous species-specific betaherpesvirus that infects between 60–100% of the population depending on age, gender, demographics and socio-economic status [[34]1]. The 235kb HCMV genome is characterized by unique long (UL) and unique short (US) regions, each flanked by terminal (TRL and TRS) and internal (IRL and IRS) inverted repeats [[35]2]. Transcription of the HCMV genome is complexly regulated and different sets of transcripts are expressed during productive and latent infection. During latency, the major immediate-early (IE) promotor, which drives IE gene expression, is epigenetically blocked. However, early studies in experimental latency models detected several sense and antisense CMV latency-associated transcripts (CTLs) expressed from the major immediate-early region of the genome, although these CTLs are controversial [[36]3–[37]5]. Since then four transcripts i.e. UL81ast/LUNA [[38]6–[39]10], UL111A [[40]10–[41]12], UL138 [[42]8, [43]10, [44]13, [45]14] and, depending on the virus used and the presence of GATA transcription binding sites, UL144 [[46]15] were characterized thoroughly as latency-associated transcripts. Alternatively, during productive infection, the full array of genes is expressed and gene expression follows a temporal cascade starting with immediate-early gene expression, succeeded by early and late gene expression [[47]16–[48]22]. The transcriptome consists of both polyadenylated protein coding and polyadenylated non-protein coding RNAs. The latter RNA species is comprised of four different long non-coding RNAs (lnc RNAs) which do not overlap with any of the protein coding regions. These four lnc RNAs have been reported to be responsible for 65% of the total polyadenylated RNA pool [[49]23, [50]24]. In addition, HCMV also expresses non-coding RNA in the form of anti-sense transcripts which often overlap with protein coding genes, and microRNAs which have been attributed a regulatory function [[51]23–[52]25]. Currently, the transcriptome of HCMV has not been entirely unraveled. It has long been assumed that the genome of clinical isolates contained about 170 genes [[53]23, [54]24]. However, recently a new study revealed an unexpected complexity when Stern-Ginossar et al. identified hundreds of new ORFs through ribosome profiling and transcript analysis in lytically infected fibroblasts [[55]26]. Currently, the functions of these newly discovered transcripts remain largely unknown. Only recently, transcriptomics technology platforms have been used to study the influence of HCMV infection on host and viral gene expression in different cell lines and cell types. Towler et al. showed that, depending on the cell type, the temporal expression of viral genes varies; but also that, compared to fibroblasts, the transcriptome in epithelial cells and astrocyte cell lines is differentially regulated [[56]27]. CD34^+ progenitor cells and CD14^+ monocytes are currently the only confirmed sites of latency [[57]28, [58]29]. Upon differentiation to macrophages or dendritic cells, the virus reactivates and lytic replication is enabled, possibly promoting viral spread after reactivation [[59]17, [60]30]. The transcriptome in latently infected CD34^+ progenitor cells and CD14^+ monocytes is slowly being unraveled [[61]8, [62]10, [63]12]. However, only one study aimed at delineating the HCMV transcriptome in permissive myeloid cells [[64]31]. It remains unclear how the viral transcriptome adapts to infection in macrophages (MΦs) and dendritic cells (DCs) and if the virus shows a different transcriptional profile in pro-inflammatory DCs or macrophages type 1 (MΦ1) or anti-inflammatory macrophages type 2 (MΦ2). Therefore, we used Illumina next generation sequencing to characterize how HCMV modulates gene expression specifically in terminally differentiated monocyte-derived MΦ1, MΦ2 and DCs in comparison to fibroblasts. The data reported here is, to our knowledge, the first next generation sequencing transcriptome study investigating HCMV transcription in cell types which are relevant for in vivo viral spread. We show that HCMV follows a specific transcriptional program in primary cells in which genes involved in virion composition, cell tropism or cell-specific replication, viral spread and immunomodulation are expressed at different levels compared to fibroblasts. This not only highlights the potential of HCMV to adapt to or influence different cellular environments to promote its own survival. This also reflects the significant differences in gene expression between cell types used in the laboratory environment and in primary cell types. Materials and Methods Ethics statement Peripheral blood mononuclear cells (PBMCs) and primary monocytes were isolated from whole blood. Blood was obtained via the Antwerp Blood Transfusion Center of the Red Cross ([65]www.redcross.be). Donors gave written consent for their samples to be used for scientific research. Cell lines ARPE-19 cells (CRL-2302, ATCC) and neonatal NHDF fibroblasts (CC-2509, Lonza) were maintained and propagated in DMEM:F12 with L-glutamine (Biowhittaker) and MEM (Life Technologies) respectively, containing 10% heat-inactivated FCS (HI-FCS; Life Technologies) and 0.04% gentamicine. All infection experiments were done in 24-well plates (Costar). Culturing of clinical isolates TB40/E High epithelial tropic stocks of the HCMV BAC4-TB40/E [[66]32] were generated in ARPE-19 cells. In brief, ARPE-19 cells were seeded 24 hours before infection to reach 40–60% confluence at the time of infection (3.5 x 10^6 cells/T175). Incubation with TB40/E at MOI 0.02 was done for 3 hours at room temperature on a rocking platform. The virus was harvested until the cells detached from the flasks. Harvests were spun down for 10 minutes at 600xg to remove cellular debris. All harvests were titrated (see below) and only the 6 harvests with the highest titers were used for concentration. Viral stocks were prepared by centrifugation (1 hour at 3000xg) and ultracentrifugation (1 hour at 5800xg). Concentrated virus was resuspended in 5 ml MEM/10% FCS and kept at -80°C for long term storage. Virus titration using immediate-early protein immunofluorescent staining Virus yield was determined on NHDF and ARPE-19 cells by an immune-fluorescence assay adapted from Chou et al. [[67]33]. NHDF cells and ARPE-19 were used to determine TB40/E titers or epithelial tropism, respectively. In brief, 15000 cells (NHDF or ARPE-19 cells) per well were seeded in a 96-well plate in 75 μl MEM/2% HI-FCS or DMEM:F12/2% HI-FCS. Then, quadruplicates of a ten-fold dilution series of the virus was added to the cells (25μl input). Three days post infection, the viral replication was determined after staining for immediate-early antigen expression in an immune-fluorescence assay (see below). Wells were scored positive if they contained at least one colony of 2–5 IE-positive cells. After scoring, the TCID[50]/ml was calculated based on the Spearman-Kärber method. Immediate-early protein staining The medium of infected cells was removed and the cells were washed once with phosphate buffered saline (PBS). Subsequently, the cells were fixed using 100% ice-cold methanol (10 minutes at -20°C). To remove residual methanol, the cells were washed twice with PBS. Staining with the primary antibody was carried out for 20 minutes at room temperature (RT) with an anti-immediate-early antigen antibody (anti-I.E.A., 11–003, Argene) diluted 1/100 in PBST (PBS containing 0.05% tween-20). Prior to incubation with anti-mouse antibody conjugated to Alexa Fluor 488 (Alexa Fluor 488 goat anti-mouse IgG, A-11001, Life Technologies), the cells were washed twice with PBS. After 20 minutes incubation at RT, the secondary antibody was removed and the cells were washed twice with PBS. Afterwards, the cells were stored at 4°C in 200μl PBS until readout. Isolation of primary CD14+ monocytes and differentiation to HCMV permissive cell types Peripheral blood mononuclear cells (PBMCs) were isolated from buffy coats using ficoll-paque. In brief, 60 ml blood was added to 100 ml ice-cold PBS. Twenty-five ml diluted blood was carefully applied on 15 ml ficoll-paque. After centrifugation (800xg, 20 minutes, no brake), the PBMC fraction was collected and divided over four 50 ml tubes. Each tube was filled with ice-cold PBS to a total volume of 50 ml and the cells were spun down for 10 minutes at 300xg. This washing procedure was repeated twice. The total number of PBMCs was determined by manual counting under the light microscope with a counting chamber. Subsequently, monocytes were isolated using magnetic CD14 microbeads (Miltenyi, 130-050-201) as instructed by the manufacturer. The monocytes were counted manually under a light microscope using a counting chamber, resuspended at 1.5 x 10^6 cells/ml and 1.5 x 10^6 cells were seeded per well in a 24-well plate (tissue treated, Costar). The CD14^+ monocytes were allowed to adhere for 3 hours at 37°C/5% CO[2]. After attachment, the PBS was removed and 200μl X-Vivo-15 medium (Lonza) without supplements was added and the cells were left overnight at 37°C/5% CO[2]. Subsequently, monocytes were differentiated to monocyte-derived dendritic cells (DC), monocyte-derived type 1 macrophages (MΦ1) and monocytes-derived type 2 macrophages (MΦ2) in X-Vivo-15 medium containing 2.5mM L-glutamine and 10% HI-FCS supplemented with 100ng/ml GMCSF and 100ng/ml IL4 (adapted from [[68]34, [69]35]), 100ng/ml GMCSF or 100ng/ml MCSF (adapted from [[70]34]), respectively, for 7 days (all cytokines from Peprotech). The medium was refreshed on day 4 after the start of differentiation. Activation of terminally differentiated cells was done by adding 500mg/ml LPS (Sigma). On day 8, the cells were infected for 3 hours at room temperature with TB40/E at MOI 5 in 0.5 ml X-Vivo-15 medium containing 2.5mM L-glutamine and 10% HI-FCS. After infection, the virus inoculum was removed and 1 ml fresh medium without cytokines was added. The cells were incubated for 72 hours at 37°C/5% CO[2] before samples were harvested. FACS analysis of surface markers on monocyte-derived cell types Monocytes were seeded in a 96-well plate at 3x10^5 cells per well and differentiated to MΦ1, MΦ2 and DCs as described above. After 7 days, the cells were challenged with 500ng/ml LPS (Sigma) and surface markers were determined. Therefore, the cells were detached from the plate using 5mM EDTA (20 minutes at 37°C). The cells were spun down at 300xg, the EDTA was discarded and the cells were incubated with a mix of anti-CD14-APC-Cy7, anti-CD209-PerCP-Cy5.5, anti-CD163-PE and anti-CD80-PE-Cy7 (5μl of each antibody, all antibodies from BD). The antibodies were incubated for 20 minutes at 4°C. After incubation, 200μl PBS was added and the cells were spun down at 300xg. The supernatant was discarded, 200μl PBS was added once more and the cells were collected by centrifugation. Readout was done on a BD FACS CANTO II and subsequent analysis were done using BD DIVA software. Plots were made in Sigmaplot (Systat Software). RNA sequencing using the Illumina next generation sequencing platform RNA samples were prepared from cells obtained from five 24-wells using a Qiagen RNeasy Plus kit with an additional on-column DNase digestion step according to the manufacturer’s instructions. RNA and DNA concentrations were measured using a ND-1000 spectrophotometer (Nanodrop). RNA samples were processed for Illumina sequencing using the TruSeq stranded mRNA Library Prep kit (Illumina) according to manufacturer’s instructions. In brief, the workflow involved purifying the poly-A containing mRNA molecules using poly-T oligo-attached magnetic beads. Following purification, the mRNA was fragmented into small pieces using divalent cations at elevated temperature. The cleaved RNA fragments were copied into first strand cDNA using reverse transcriptase and random primers. This was followed by second strand cDNA synthesis using DNA Polymerase I and RNase H. These strand-specific cDNA fragments went through an end repair process to a single ‘A’ base, followed by ligation of the adapters. The products were purified and enriched with PCR to create the final cDNA library. Finally, samples were pooled and sequenced for 147 cycles on a GAIIx (Illumina) using the TruSeq SBS Kit v5 kit (Illumina). The raw signal was processed into individual sequencing reads per sample with the Illumina software (Casava 1.8.2). All sequence reads per sample were aligned versus the publicly available reference genome for TB40/E (acc. No. [71]EF999921; 2008). This reference genome is annotated with 168 coding sequences (cds) and was manually updated with 9 additional coding sequences from accession number [72]KF297339 (2013). In addition, also four long non-coding RNA’s (RNA1.2, RNA2.7, RNA4.9 and RNA 5.0) were added from accession number [73]KF297339 (2013) [[74]24]. In this study, the TB40/E-BAC4 was used which misses US1-US6 compared to the wild type virus [[75]32], for the analysis the lack of this region was taken into account. Mapping of the sequence reads and calculating the total read counts per cds were performed using the RNA-seq analysis module of CLC bio’s CLC Genomics Workbench using default parameters. All raw data was uploaded to a public repository ([76]http://www.ebi.ac.uk/ena/data/view/PRJEB15199). For the statistical analysis, the Bioconductor suite was used [[77]36]. Count data were preprocessed (normalized and variance stabilized transformed for visualization purposes) using the DESeq package [[78]37]. For each individual transcript, differential expression between cell types was performed from normalized counts using a generalized linear model hypothesizing a negative binomial distribution. Individual p-values were adjusted for multiple testing with the Benjamini-Hochberg procedure [[79]38]. Finally, a pathway enrichment analysis was performed using the MLP algorithm [[80]39, [81]40]. Results HCMV infection of monocyte-derived cell types and percentage HCMV reads We isolated CD14^+ monocytes from six independently processed blood donors (A-F in [82]Fig 1) and differentiated these cells to pro-inflammatory DCs and MΦ1; and anti-inflammatory MΦ2. DCs were characterized as CD14^-/CD163^-/DCSIGN^++ and upon LPS treatment they showed upregulation of CD80; MΦ1 were CD14^+/CD163^-/DCSIGN^+ whereas MΦ2 showed a CD14^+/CD163^+/DCSIGN^+ marker profile with poor upregulation of CD80 upon LPS treatment ([83]S1 Fig). Fig 1. Percentage of mapped reads to the HCMV transcriptome in primary cell types. [84]Fig 1 [85]Open in a new tab Six different blood donors (A-F) were processed to obtain monocytes which were differentiated to DCs, MΦ1 and MΦ2. Subsequently, these primary cell types were infected with TB40/E at MOI5. As a comparison, we infected NHDF cells (n = 3) at MOI 0.16, MOI 0.5 and MOI 5. Given in the top panel are the percentages of mapped reads on the HCMV genome for each cell type. Subsequently, these cell-types were infected with highly epithelial tropic TB40/E HCMV strain at MOI 5. In this study, the infection was carried out for 3 hours without subsequent inactivation of the remaining surface-bound virus e.g. with citrate buffer to avoid any damage or any changes in host transcriptome of the cells thereby attempting to keep the cell-virus interplay as authentic as possible. In contrast to fibroblasts where 100% of the cells could be infected, 3.11% (2.91–4.12%) of the DCs, 1.20% (1.01–1.36%) of the MΦ1 and 6.27% (3.5–8.16%) of the MΦ2 could be infected ([86]S2 Fig). Illumina RNA sequencing resulted in an average of almost 23.3 million (9.4–36.4 x 10^6) sequences per sample covering both HCMV and host mRNA. In order to perform a comparative analysis of HCMV gene expression between fibroblasts and monocyte-derived cells, we argue that a more representative control is provided by infecting NHDF cells at a lower MOI thereby mimicking the actual percentage of HCMV reads in DCs and MΦs ([87]Fig 1). Infection of DCs, MΦ1 and MΦ2 resulted in a median percentage of mapped reads of 0.63%, 0.04% and 0.67%, respectively. There was some variability in infectivity between the cells derived from different donors. We infected NHDF cells at three different MOIs and show that infection of NHDF fibroblasts at MOI 0.16 reflects better the percentage of reads mapped to the HCMV transcriptome (2.97%) in primary cells compared to MOI 0.5 (10.62%) and MOI 5 (32.12%) ([88]Fig 1). First, we evaluated the expression levels of each HCMV gene in NHDF cells at MOI 0.16. Expression levels are represented by the reads per kilobase of transcript per million (RPKM) value, which represents the number of mapped sequences per gene normalized for the gene length and the total amount of sequences from the experiment (accounting for experimental variability). We plotted the binned logs of these RPKM values on the HCMV genome annotation (acc.no. [89]EF999921) used for this study ([90]Fig 2). Fig 2. Expression levels of the HCMV genes transcriptome in fibroblasts (MOI 0.16). [91]Fig 2 [92]Open in a new tab Log(10) of the RPKM values per gene were binned in six categories and mapped on the reference genome of TB40/E. The highest expressing genes in this study during lytic infection at low MOI in NHDF fibroblasts were RNA4.9, RNA2.7, RNA1.2, UL22A, US18, RL12, RL13-RL14, UL4, UL5, UL40, UL41A, UL42, UL132 and UL148. On the other end of the spectrum UL12, UL38, UL80.5, UL121, UL123, UL127, truncated UL141, and vMIA are only expressed at low levels (10–100). In contrast, no reads were mapped for UL60 and UL90 in this study. Differentially regulated genes and gene clusters in primary cell types Next, we compared HCMV gene expression in monocyte-derived DCs, MΦ1 and MΦ2 with HCMV gene expression in NHDF cells ([93]Fig 3). In general, the regulation of the US region was striking in primary cell types. Especially US26, US33 (not in MΦ1), US33A, US34 and US34A were found to be expressed at higher levels in primary cells (p<0.01). Also in the US region, US7-US9, US10 (only significant in MΦ1), US11 (only significant in DCs), US23 (only significant in MΦ2) and US23 (only significant in MΦ2) were expressed to higher levels in primary cells. Striking was the significantly lower expression of US28 (in DCs) and US29 (in MΦ2) compared to NHDF cells. Fig 3. Differential regulation of HCMV gene expression in monocyte-derived DCs, MΦ1 and MΦ2 (MOI5) compared to lytic infection in NHDF fibroblasts (MOI0.16). [94]Fig 3 [95]Open in a new tab On the graph is the fold increase or decrease (reflecting genes which are expressed higher/lower in fibroblasts than in primary cell types). Red bars represent significant changes in gene expression between the indicated cell type and NHDF fibroblasts (adjusted p<0.05), black bars indicate modulated genes which did not reach the significance threshold (adjusted p≥0.05). In particular in DCs, the regulation of HCMV genes was clustered. RL11-RL12-RL13, UL41A-UL42 and (UL1)UL4-UL11 were present in lower amounts in primary cells while, besides the aforementioned genes in the US region, UL120-UL121 and UL148D-UL149 were found to be expressed higher. In MΦ1, except for the US region, this clustering was not as clear. Individual genes such as UL25, UL100, UL141, RNA5.0, UL80, UL83 and UL86 were detected at higher expression levels in contrast with the lower expression of UL17, UL20, UL78, UL105 and UL123. UL80, UL83 and UL86 may be part of a cluster. In MΦ2, RL10-RL11 and a cluster from UL82 to UL86 (p>0.05 for UL84) as well as individual genes such as RNA5.0, UL1, UL8, UL42, UL52, UL100, UL115 and UL146 were found to be lower expressed than in NHDF fibroblasts. Also genes in MΦ2 were more difficult to cluster, only UL2-UL2 and UL148A-UL149 were significantly higher expressed neighboring genes in addition to individual genes as UL12, UL17, UL20, UL29, UL78 and UL137. In a head to head comparison between the different primary cell types, the only significant difference were the lower amounts of UL4-UL5, US7 and US9 in DCs compared to MΦ1. This suggests that the virus regulates its gene expression similarly in both macrophage subtypes, despite their different inflammatory character and infectability. Functional analysis of differentially regulated genes Each HCMV gene has a specific function during the lytic or latent part of the viral life cycle. By annotating each gene with its function [[96]40], we were able to explore if any functional groups were significantly enriched ([97]S3 Fig). This means that the p-values of each functional group were compared between two cell types. The enrichment of a functional groups can be driven either by the differential expression of many genes with p-values between 0.05 and 0.01; or by a small number of highly significant genes (p<0.001). This pathway analysis showed that in DCs and MΦ1, compared to NHDF fibroblasts, the most affected functional group consisted of genes involved in immunomodulation. In DCs, the second and third functional groups which were enriched contained genes which have not been annotated with a function or genes involved in cell tropism and cell-dependent replication, respectively. In MΦ1, genes with a function in viral spread and entry contributed to the enrichment of the similarly named functional classes. Finally, in MΦ2 only two functional groups were significantly affected, being genes involved in cell tropism and cell-specific replication; and genes involved in viral spread ([98]Fig 4). Fig 4. Enrichment of functional groups in DCs, MΦ1 and MΦ2 compared to NHDF. [99]Fig 4 [100]Open in a new tab Genes involved in cell tropism, viral spread and immunomodulation are shown to be the most enriched groups in DCs, MΦ2 and MΦ1. Given are the p-vales of each individual gene. The group of immune-modulatory genes is the largest (38 tested genes) and the presence of many significant p-values (p<0.05) demonstrates the importance of the regulation of genes in this group in all examined monocyte-derived cells. In DCs, RL11-RL13, UL141, UL11, UL111A and UL20 were the most important differentially regulated genes involved in immunomodulation (p<0.01). To a lesser extent (0.01
0,05) between all cell types (Mo vs MΦ1, Mo vs MΦ2, Mo vs DC; MΦ1 vs DC; MΦ2 vs DC), except between MΦ1 and MΦ2 (p>0.05). (Panel B) DC marker DCSign was expressed significantly higher on DCs and all other cell types (p<0.05; Mo vs DC, MΦ1 vs DC, MΦ2 vs DC). DCSign was also expressed more on both types of MΦ compared to monocytes (p<0.05). No statistical difference was observed between both types of macrophages (p>0.05). (Panel C) CD163 was only expressed on MΦ2 and poorly on all other cell types (p<0.05; MΦ2 vs Mo, MΦ2 vs MΦ1, MΦ2 vs DC). (Panel D) To assess CD80 response, all cell types were challenged with 500ng/ml LPS for 24h. MΦ2 macrophages showed a significantly lower response in C80 upregulation compared with the other cell types (p<0.05; MΦ2 vs Mo, MΦ2 vs MΦ1, MΦ2 vs DC). CD80 expression between all other cell types was comparable (p>0.05). (TIF) [171]Click here for additional data file.^ (310.2KB, tif) S2 Fig. A representative image (donor A) of the number of IE expressing cells in TB40/E and mock (UV-inactivated TB40/E) infected primary cell types. In each representative image, we indicated the average number and the range of IE expressing cells for three independently processed donors. (TIF) [172]Click here for additional data file.^ (201.8KB, tif) S3 Fig. Distribution of functions of DCs, MΦ1 and MΦ2 compared to NHDF fibroblasts and of primary cells compared to each other. The most enriched functions are represented by the highest bars. The cut off to determine the most significantly enriched functions was based on the point where the box plot curves flatten out (marked by a red box). (TIF) [173]Click here for additional data file.^ (1.3MB, tif) Data Availability All relevant data are within the paper and its Supporting Information files. Raw sequencing data was uploaded to a public repository. Funding Statement All authors work for Janssen Phamaceutica NV and receive a salary. The work was funded by internal resources without grants. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. References