Abstract

   microRNAs (miRNAs) repress target genes by destabilizing mRNAs and/or
   by inhibiting translation. The best known factor for target recognition
   is the so called seed – a short continuous region of Watson-Crick base
   pairing between nucleotides 2–7 of the miRNA and complementary
   sequences in 3′ untranslated regions of target mRNAs. The miR-34 family
   consists of three conserved members with important tumor suppressor
   functions linked to the p53 pathway. The family members share the same
   seed, raising the question if they also have the same targets. Here, we
   analyse the effect of miR-34a and miR-34c on protein synthesis by
   pSILAC. Despite significant overlap, we observe that the impact of both
   family members on protein synthesis differs. The ability to identify
   specific targets of a family member is complicated by the occurrence of
   * strand mediated repression. Transfection of miR-34 chimeras indicates
   that the 3′end of the miRNA might be responsible for differential
   regulation in case of targets without a perfect seed site. Pathway
   analysis of regulated proteins indicates overlapping functions related
   to cell cycle and the p53 pathway and preferential targeting of several
   anti-apoptotic proteins by miR-34a. We used luciferase assays to
   confirm that Vcl and Fkbp8, an important anti-apoptotic protein, are
   specifically repressed by miR-34a. In summary, we find that miR-34a and
   miR-34c down-regulate distinct subsets of targets which might mediate
   different cellular outcomes. Our data provides a rich resource of
   miR-34 targets that might be relevant for clinical trials that want to
   implement the miR-34 family in cancer therapy.

Introduction

   Animal microRNAs (miRNAs) are a class of small endogenous, non-coding
   RNAs mediating posttranscriptional gene silencing [25][1], [26][2].
   miRNAs have a widespread impact on regulation of gene expression and
   evolution and are thought to affect over 50% of all human genes
   [27][2], [28][3], [29][4], [30][5]. Their function is not restricted to
   normal organism development: miRNAs also play a vital role in diseases
   such as cancer, where they can act as oncogenes or tumor suppressors
   [31][6], [32][7].

   miRNAs are transcribed as longer hairpin molecules that are processed
   over several steps until they are cut by DICER into duplexes of their
   final 22–23nt length [33][8]. As a last step, one strand of the miRNA
   duplex (“mature strand”) is incorporated into the RNA-induced silencing
   complex (RISC) while the other, so called star (*) strand is supposedly
   degraded [34][9]. Once integrated into the RISC miRNAs repress target
   mRNAs via either direct mRNA cleavage or translational regulation
   associated with mRNA degradation [35][2], [36][10], [37][11]. The
   overall role of mRNA degradation and translational repression for
   miRNA-mediated regulation is not entirely clear.

   One of the most important questions is how miRNAs recognize their
   target mRNAs. The best understood factor for target recognition are
   so-called “seed” sites: stretches of perfect Watson-Crick base pairing
   between nucleotide 2–7 of the miRNA and complementary sequences in the
   3′ untranslated region (3′UTR) of target mRNAs. The correlation between
   target repression and 3′UTR seeds had been found early in the
   exploration of miRNAs [38][12], [39][13]. While the seed is generally
   considered to be the most important sequence feature for target
   recognition, it is important to note that it is neither necessary nor
   sufficient. For example, some miRNA targets are down-regulated despite
   missing a complete seed match [40][14]. Others are dependent on so
   called “centered” seeds spanning miRNA nucleotides 4 to 15 [41][15].
   Finally, many mRNAs which contain a 3′UTR seed match are not repressed
   by over-expression of the corresponding miRNA. Collectively, these
   observations indicate that the seed is not the only factor involved in
   target recognition.

   Since members of miRNAs families usually share the same seed site but
   differ in their remaining sequence they present a natural setup to
   study target selection independent of seed differences [42][16],
   [43][17]. Differential targeting of family members should be mediated
   by variations aside the seed site and be more physiological than
   artificial mutations of miRNAs. In fact, it has been proposed that
   miRNA families do have different targets depending on their 3′end
   sequence [44][14]. However, only few studies investigated target
   selection of miRNA families by over-expression of individual family
   members so far. Two microarray studies on the miR-16 and miR-34
   families came to the conclusion that members of both families show
   functional redundancy [45][18], [46][19]. The miR-34 family is a
   particularly interesting example as one of the few families that are
   also conserved in Drosophila and C. elegans [47][20]. While
   invertebrates only possess one miR-34 gene, the miR-34 family consists
   of three members in vertebrates encoded at two different gene loci
   [48][21], [49][22]. While miR-34a and miR-34c are perfectly conserved
   in sequence between human, mouse and chicken, miR-34b shows slight
   nucleotide alterations between the three species [50][23].

   The miR-34 family is part of the p53 stress and DNA damage response
   pathway and has widespread regulatory effects on the cell [51][24].
   Activation of p53 by genotoxic stress activates expression of miR-34
   family members [52][19], [53][23]. In turn, miR-34a has been shown to
   up-regulate p53 activity via a positive feedback loop involving Sirtuin
   1 leading to apoptosis [54][25]. Several targets of the miR-34 family
   mediate cell progression and block apoptosis, suggesting that by
   repressing these targets miR-34 acts as a tumor suppressor [55][22],
   [56][24], [57][26], [58][27], [59][28], [60][29]. Validated targets
   include Cdk4, Cdk6, Hmga2, c-Met and Akt. Most of these targets have
   been validated for miR-34a while the two other family members are less
   well studied. Interestingly, despite the obvious links between the
   miR34 family and p53, recent in vivo studies showed that mice lacking
   all family members have normal p53-dependent responses [61][30],
   [62][31]. Ectopic expression of miR-34 within mouse tumor models,
   however, can significantly reduce tumor growth in mice and treatment
   with miR-34 is currently even considered for clinical trials [63][32].

   Whether different members of the miR-34 family have different targets
   is still an open question. Despite the fact that differential targeting
   between the miR-34 members has been reported, recent reviews of the
   miR-34 family come to the conclusion that they are redundant in
   function [64][27], [65][28]. However, so far studies mainly focused
   only on mRNA levels or individual selected targets. These approaches
   cannot cover the effect of differential targeting miRNA family members
   at the protein level on a global scale. Studies have shown that the
   degree of translational repression by miRNAs can amount to a large part
   of regulation [66][3], [67][11]. In fact, some specific targets of the
   miR-34 family such as c-Myc have been shown to be only translationally
   repressed [68][33]. Therefore, differences between family members may
   only become apparent at the protein level.

   We developed pulsed stable isotope labeling by amino acids in cell
   culture (pSILAC) to quantify relative changes in protein synthesis on a
   global scale [69][3], [70][34]. pSILAC has since been applied to assess
   translational regulation in several examples, including regulation by
   miR-34a [71][29], [72][35]. Here, we combine pSILAC and mRNA
   quantification by microarray to assess the effect of miR-34a and
   miR-34c on gene expression in HeLa cells. We focused on these two
   members since they show the biggest differences in sequence and
   conserved from chicken to human while miR-34b shows some sequence
   divergence between these species [73][23]. In addition, we also
   generated artificial chimeras between miR-34a and miR-34c to assess if
   target specificity depends on the 5′ or 3′ end. While we found
   considerable overlap, our results also indicate that both family
   members target distinct subsets of genes, suggesting non-redundant
   cellular functions.

Materials and Methods

miRNA design

   Fully complement siRNA duplexes for miR-34 members and chimeras were
   purchased from Dharmacon in annealed, desalted and 2′-deprotected form
   for direct use. Full complement duplexes were designed as follows
   (sense and antisense 5′-3′):

   miR-34a: UGGCAGUGUCUUAGCUGGUUGU/ AACCAGCUAAGACACUGCCAUA

   miR-34c: AGGCAGUGUAGUUAGCUGAUUGC/ AAUCAGCUAACUACACUGCCUGG

   miR-34ac: UGGCAGUGUAGUUAGCUGAUUGC/ AAUCAGCUAACUACACUGCCAUA

   miR-34ca: AGGCAGUGUCUUAGCUGGUUGU/ AACCAGCUAAGACACUGCCUGG

Cell culture and Transfection of HeLa cells with double-stranded RNAs

   HeLa (LGC Promochem) cells for mass spectrometry experiments were grown
   at 37°C with 5% CO2 in Dulbecco’s Modified Eagle’s Medium (DMEM) High
   Glucose (4.5 g/l) (PAA, custom preparation) supplemented with 10%
   sterile-filtered dialyzed fetal bovine serum (dFBS, Sigma-Aldrich), 4
   mM stable Glutamine (l-alanyl-l-glutamine, PAA), light L-arginine (84
   mg/l) and L-lysine (40 mg/l) [74][36]. The cells were transfected and
   processed as described before [75][3]. In short, transfection of
   synthetic RNAs (Dharmacon) of a final concentration of 100 nM was done
   according to the manufacturers protocol using DharmaFECT1 (Dharmacon).
   For transfection HeLa cells were plated on 10 cm dishes in
   antibiotic-free light (L) SILAC medium at a confluence of ∼70–80%. A
   mock control transfected with ddH[2]O instead of RNA was prepared for
   each RNA transfected sample. 8h after transfection, cells were washed
   twice and the medium for RNA transfected samples was changed to
   medium-heavy (M) SILAC medium (84 mg/l ^13C[6]-L-arginine and 40 mg/L
   ^2H[4]-L-lysine), while mock transfections were transferred to heavy
   (H) SILAC medium (84 mg/l ^13C[6] ^15N[4]-L-arginine and 40 mg/l
   ^13C[6]^15N[2]-L-lysine). 24h after pulse labeling cells were scraped
   off the plates, combined with the matching control, lysed using RIPA
   buffer and subjected to one-dimensional SDS-PAGE as described below. In
   addition to the original miR-34a and miR-34c transfection experiment,
   two replicates of miR-34a and one replicate of miR-34c were done in an
   independent transfection as were the miR-34 chimera RNA transfections.

Determination of transfection efficiency

   To ensure delivery of our synthetic siRNA duplexes we did transfections
   of double stranded, fluorescently labeled RNA oligomers (“BLOCK-IT”,
   Invitrogen) prior to further transfection experiments. The oligomers
   were transfected as described above. 8h after transfection cells were
   washed with 1x PBS (Gibco) and fixated in 4% paraformaldehyde (PFA) in
   D-PBS. Transfection efficiency was compared via the fluorescence of
   transfected versus non-transfected cells on a fluorescence microscopy
   (Keyence Biozero).

SDS-PAGE and tryptic digestion of samples

   About 100 μg of mixed protein samples were loaded on a 4–12% NuPage™
   Bis-Tris gradient gels (Invitrogen) and separated according to the
   manufacturer’s instructions. Gels were subjected to fixative solutions
   and colloidal Coomassie Brilliant Blue G-250 (Invitrogen) and single
   protein lanes were subsequently cut into 12-15 slices. Destaining,
   washing and tryptic digestion was done as described before [76][37].
   Before mass spectrometry samples were extracted and desalted using
   StageTips [77][38].

LC-MS/MS measurement

   LC – MS/MS analysis was performed as described before [78][3]. Peptides
   were analyzed using online reversed-phase liquid chromatography
   (rpHPLC) connected to an electrospray ion source (Proxeon) of a
   LTQ-Orbitrap mass spectrometer. rpHPLC was done using either the
   Agilent HPLC 1200 or Eksigent NanoLC – 1D Plus system. miR-34a and
   miR-34c samples were measured on Orbitrap classic and XL instruments
   while the chimera samples (miR-34ac and miR-34ca) were analysed on an
   Orbitrap Velos (Thermo Fisher). For HPLC separation we used fritless
   C18 microcolumns (75 m ID packed with ReproSil-Pur C18-AQ 3-µm resin,
   Dr. Maisch GmbH), manually produced as describe before [79][39].
   Peptide were loaded onto the column using a flow rate of 500 nl/min
   (Agilent HPLC 1200) or 250 nl/min (Eksigent/Proxeon HPLC). Gradients
   were run and subsequently eluted with a flow rate of 200 nl/min with a
   10 to 60 % acetonitrile gradient of 155min or 240min in 0.5% acetic
   acid. The Orbitraps were operated in a top5 or 10 mode using data
   dependent acquisition of MS/MS scans as essentially described before
   [80][40]. In this mode, every full MS scan in the Orbitrap (m/z
   300–1700; resolution 60,000; target value 1×10^6) is followed up by 5
   or 10 consecutive MS/MS scans in the LTQ isolating and fragmenting the
   5 or 10 top most intense ions (charge > 1; target value 5000;
   monoisotopic precursor selection enabled) by collision induced
   dissociation (CID; 35% normalized collision energy and wideband
   activation enabled). Dynamic exclusion of 60sec was used to minimize
   repeated fragmentation of the same ions.

Processing of MS data

   Mass spectrometry data were processed using the MaxQuant software
   version 1.0.13.13 [81][41] using the MASCOT search engine (version 2.2,
   MatrixScience). To facilitate data integration all raw files were
   processed together. Labels were set to medium-heavy (Arg6 and Lys4) and
   heavy (Arg10 and Lys8) with a maximum of three labeled amino acids per
   peptide (top 6 MS/MS peaks per 100 Da; Quant.exe). The resulting peak
   lists were submitted to the MASCOT engine and searched for matches with
   an in-house curated concatenated target-decoy database consisting
   forward and reversed proteins (supplemented with a fasta file for
   identification of common contaminants). Version 3.64 of the human IPI
   database (84,054 entries) was used for our analysis. Tryptic
   specificity with a maximum of two missed cleavages was required. The
   mass tolerance was set to 0.5 Da for fragment ions. For precursor ions,
   individual mass tolerances were assigned by MaxQuant as described
   [82][41]. Accepting thresholds for individual spectra were defined
   based on the target decoy database search strategy implemented in the
   MaxQuant software. Variable modifications were set to oxidation of
   methionine and acetylation of the protein N-terminus, while
   carbamidomethylation of cysteine was selected as fixed modification.
   For protein assembly only peptides with a minimum length of 6 amino
   acids were considered and per protein group at least one peptide was
   required. A maximum false discovery rate (FDR) of 1% (peptide and
   protein level) was allowed which was calculated by matches to reversed
   sequences in the concatenated target-decoy database. Only unique and
   “razor” peptides (non-unique peptides of to the protein group with the
   highest number of peptides) with a minimum ratio count of two were used
   for protein quantification. Normalization of data was done by MaxQuant
   under the assumption that most protein ratios do not change upon miRNA
   transfection. After removal of reverse hit and contaminants, we matched
   Reseq NP identifier of the MaxQuant output table with a list of Refseq
   NM IDs containing the number of mature or *seed sites in the 3′UTR of
   the respective gene. This list was curated using a list of human gene
   3′UTR sequences downloaded from the UCSC Genome Browser
   ([83]http://genome.ucsc.edu, gene list update from February 2009). This
   list of 3′UTRs was also the basis for all further studies (Sylarray,
   Sequence motifs analyses). The script also mapped PicTar
   ([84]http://pictar.bio.nyu.edu/cgibin/new_PicTar_mouse.cgi) predictions
   for all miR-34 members to our protein data. As a last step, log2 fold
   changes were calculated from the normalized H/M ratios of each sample.
   The resulting table was merged with the microarray data.

Luciferase cloning and assays

   The 3′UTRs of Fkbp8 ([85]NM_012181) and Prkar2a ([86]NM_004157) were
   synthesized and cloned into the pRL-TK Cxcr4 vector with prior removal
   of the Cxcr4 4x target site by Not1/Xho1 digestion and verified by
   sequencing (SINA Science Services GmbH). The Vcl ([87]NM_014000) vector
   was a kind gift of Dr. Markus Kaller cloned into the pGL3-control-MCS
   vector [88][29]. As positive control we used the known target of
   miR-34a c-Met ([89]NM_000245) from previous studies of our laboratory
   [90][3].

   The sequence of the Fkbp8 3′UTR without poly-A signal used for cloning
   is:

   5′CCACCTAGGTGGCTGCCACCCCCTCTGCACACCATGGACCCTGCCCTGCGCTCCCCAACTCCCCCAGGC
   TCCCTGTCCACTGCCCTCCCTGGTCTGGCCCCCTCCTCCGGGTTAGGGGAGCAAGGATTGGGGGTCGTGCA
   GCCCAGCCAGCAGGAGGGACTGAGGCCCTCTAGGAGGAAAGCCCAGAGGGAGGGGGCCCTCATTCCTTCAG
   ACCCAGTTTTCCCCCACCCTCCTTACCCCGCTGGGCTAGGTCTCCGCCAGGGCTGGCCTCAGTTTCTCCTC
   AACAGGCCTGGGGGCAGCCCTTCCCCTGCCTAGTCCCCGCCTGAGTGCCAGCCCCCCACCCCGCCTGCCGC
   CCCCTGTCCAGGTTCCCTCCCCGCCACAGTGAAATAAAGCATCCCACCCTGCAGTTTĆ3

   The sequence of the Prkar2a 3′UTR without poly-A signal used for
   cloning is:

   5′GTGTGCCACACCCCAGAGCCTTCTTAGTGTGACACCAAAACCTTCTGGTCAGCCACAGAACACATACAG
   AAAACAGACATGACAGAACTGTTCCTGCCGTTGCCGCCACTGCTGCCATTGCTGTGGTTATGGGCATTTAG
   AAAACTTGAAAGTCAGCACTAAAGGATGGGCAGAGGTTCAACCCACACCTCCACTTTGCTTCTGAAGGCCC
   ATTCATTAGACCACTTGTAAAGATTACTCCAACCCAGTTTTTATATCTTTGGTTCAAAACGGCATGTCTCT
   CCAACAATTTAAGTGCCTGATACAAAGTCCAAAGTATAAACATGCTCCTTTCCTCTCTTGCTGCTACTCTT
   GCTTTTGGAAGTTACCACAGGGTCTGCAGAAACCTGTTGTATAACTGTAGACACTCTCTAATGGTTCTCAA
   AGGAGGAAATGTAGCCTTCAGTCTCCTCATTTGTCCTTTGAGGAAGTCCACATTTGTTCACAGTTGCAGCC
   TTTGGTTTTACAGTGGGAAATGGTGGTGGATGATATGGACATATGTAGCCCAGTGGCATTGTACTTTCTGC
   TGACAGCTGCACACATTACAGCTGTCTCCAAACCCACAGTGATGCTTAGGGAAAGACCCTGCTCAGGACCC
   AGCAGGTCAGCACCCCAGAGCAGACTGATAGGTCCGTGGGACCCATGTTAGAGCAGAAAATTTGGGCTCAG
   CACATTTTACTGTTAGTAGAGAGCCAGGAAACGTTTTCTGGGTTGGGGATTTTGTGGGATTTTTTAATTTT
   TTTAGTAGGTTTTGTTTAACCTCTGTGCAGTTTGTATGAATGAATTGCTATACATTTATAAGGAGCCAGGG
   TCTGGAGGGTTGCTATCACTTTGTCCAGCCCAAATACCTTCCTGGGCAACTCCTACCATTTGTTTGCAGTT
   GCCT3

   Luciferase assays were performed as described previously [91][3]. In
   short, HeLa cells were seeded in 24-well plates in light (L) SILAC
   medium (1×10^5cells/well) the day before transfection which guaranteed
   a confluence of 90% the next day. The Fkbp8 or Prkar2a luciferase
   reporters were transfected into HeLa cells together with the respective
   miR-34 members and the pGL3 control vector (Promega) using
   Lipofectamine 2000 (Invitrogen) according to the manufacturer’s
   instructions. For the pGL3- Vcl reporter we used pRL-TK as a control
   plasmid (Promega). For transfection 180 ng of the reporter, 20 ng of
   control plasmid and 100 nM siRNA (final concentration) diluted in
   serum-free DMEM were used. All transfections were done in triplicates
   and each measurement was done three times. miR-16 was used as control
   miRNA that did not affect the synthesis of the examined genes as
   determined by MS (data from Selbach et al., 2008). The day after
   transfection the medium was changed and 48h after transfection cells
   were prepared and measured using the Luciferase Reporter assay system
   (Promega) according to manufacturer’s instructions. Fluorescence was
   measured on a MicroLumat Plus LB 96V luminometer (Berthold
   Technologies) and processed using MikroWin 2000 (Mikrotek Laborsysteme
   GmbH). Renilla luciferase activity of the reporter constructs was
   normalized using the activity of the firefly luciferase of the pGL3
   control plasmid (Promega) (or vice versa for the Vcl reporter).
   Evaluation of the measurement error was done by calculating the
   relative error of the three biological replicates of the respective
   reporter along with its control and adding it up according to the law
   of error propagation. The relative error was used as base for computing
   absolute errors of the normalized expression values. To assess the
   pSILAC error, the standard deviation of two replicates of the miR-34a
   transfections (miR-34a1 and miR-34a2.1) was used. Errors are displayed
   as +/– two standard deviations.

Data analysis

   All data analysis was done using perl or R scripts, including spearman
   correlation coefficients (pairs), correlation plots, cumulative
   distributions (ecdf) and hypergeometric tests (dhyper). Seeds to miR-34
   and * strand seeds were annotated using an in-house perl script based
   on 3′ UTR sequences in downloaded from the UCSC Genome Browser
   ([92]http://genome.ucsc.edu, gene list update from February 2009).
   Input for “Sylarray” analysis
   ([93]http://www.ebi.ac.uk/enright-srv/sylarray/)[94][42] were Refseq NM
   identifiers of one transfection experiment sorted from down- to
   up-regulated together with a background of all human gene 3′UTR
   sequences downloaded from the UCSC Genome Browser
   ([95]http://genome.ucsc.edu, gene list update from February 2009). The
   options “use all available words” and “use non-redundant sequences”
   were selected. For gene ontology analysis and clustering we produced
   lists of Refseq NM Ids according to the different conditions tested as
   input for the online David Gene Ontology tool
   ([96]http://david.abcc.ncifcrf.gov/). Refseq NM Identifiers of all
   proteins identified in our experiments were used as background for
   enrichment calculation. Output KEGG and GO biological process (GO_BP)
   terms were downloaded and only terms with at least p < 0.05 and 3 gene
   counts in one of the input datasets were used for comparison. Log2 fold
   changes for miR-34a expression in SW480 were extracted from Kaller et
   al., 2011 and mapped to our data via IPI identifiers using R. Mature
   miR-34 and *strand seeds were mapped. Proteins sorted according to the
   requirement given in the respective columns are marked with “True” or
   “False” if they comply with the requirement. We did not filter for *
   strand seeds for this analysis as this would reduce the number of
   shared proteins considerably.

   The significance of the differences in Spearman rank correlation
   coefficients was computed using the Fisher r-to-z transformation with
   an online tool ([97]http://vassarstats.net/rdiff.html). We treated
   Spearman coefficients as though they were Pearson coefficients since
   this procedure is more robust with respect to Type I error than either
   ignoring the non-normality and computing Pearson coefficients or
   converting the Spearman coefficients to Pearson equivalents prior to
   transformation [98][43].

Results

Experimental setup

   Transfection of HeLa cells was performed using double-stranded RNAs
   mimicking miR-34a and miR-34c in a pulsed SILAC (Stable Isotope
   Labeling of Amino Acids in Cell Culture) approach as described before
   [99][3], [100][34], [101][44]. To enable measurement of changes due to
   miR-34 over-expression, it was ensured that none of the miR-34 members
   is detectably expressed in HeLa cells [102][45]. Double-stranded RNAs
   were designed as mature miR-34 mimics, 22–23nt in length and with the
   3′strand designed as perfect complement to the mature 5′strand. A mock
   transfection control was prepared in parallel for each miR-34
   transfected sample. Cells were cultivated on light SILAC medium and
   transfected with the miRNA via Dharmafect1. 8h after the transfection,
   we transferred the miRNA transfected cells to medium-heavy (“M”) and
   control cells onto heavy (“H”) SILAC medium, incubated for another 24h
   hours and subsequently harvested them. Differentially treated cells
   were combined with the mock control and analyzed by high resolution
   LC-MS/MS ( [103]FIG 1A ). A transfection efficiency of over 90% in HeLa
   cells was determined using fluorophore-conjugated dsRNA prior to the
   experiment (FIG S1).

Figure 1. Experimental setup.

   [104]Figure 1
   [105]Open in a new tab

   A, Each member of the miR-34 family is transfected individually into
   HeLa cells in light SILAC medium. In parallel, a mock transfected
   control sample is prepared for each member. After 8h of transfection
   the samples are transferred to different SILAC medium, heavy (“H”) for
   the control and medium-heavy (“M”) for the miRNA transfected cells.
   After 24h of pulse labeling corresponding sample are combined and
   processed for mass spectrometry. The resulting peaks for one peptide
   are shown as an example. Peptides produced before pulse labeling appear
   as light peaks and can be disregarded. Differences in protein synthesis
   between control and miRNA-transfected samples can be read from the H/M
   ratio of the respective peptides. B, Nucleotide sequences of the miR-34
   family members miR-34a and miR-34c. To investigate the importance of 5′
   versus 3′ends two miRNA chimeras were constructed swapping head (nt
   1-9) and tail of miR-34a and miR-34c respectively. Differences in the
   nucleotide sequence are marked in blue. The seed is labeled red.

   To control for biological and technical variability we performed
   biological replicates of the miR-34a and miR-34c transfection
   experiments on a different day in a completely independent manner. In
   addition, one miR-34a transfection experiment was performed twice on
   the same day to have a comparison of biological replicates from both
   the same and different days. We also designed two chimeras of miR-34a
   and miR-34c comprising the first nine nucleotides of the 5′end of
   either miR-34a or miR-34c paired to the remaining 3′end nucleotides of
   the respective other miRNA. The two chimeras miR-34ac and miR-34ca
   (first letter indicates parent 5′, second parent 3′ end) were processed
   the same way as the miR-34 members. Sequences of all miR-34 family
   members and chimeras can be seen [106]FIG 1B.

miR-34a and miR-34c induced changes in protein synthesis

   Mass spectrometry lead to the identification of overall 6,241 and
   quantification of 5,435 proteins in all experiments. We required at
   least two quantified peptide evidences in each experiment, resulting in
   about 2,400 to 4,800 quantified proteins in each individual
   transfection experiment at a false discovery rate of 1%. The complete
   set of quantified proteins is given in [107]Table S1. Several targets
   of the miR-34s described in literature were down-regulated in our data
   as well ([108]FIG 2A). Note, that we removed all proteins that we were
   not able to map to a specific mRNA from all further analysis.

Figure 2. MiR-34a and miR-34c repress synthesis of many proteins.

   [109]Figure 2
   [110]Open in a new tab

   (A) Known targets of the miR-34 family are down-regulated in our
   dataset (error bars indicate standard deviations from two or three
   experiments). (B) Cumulative distribution plots show that synthesis of
   proteins with miR-34 seed matches in their mRNA 3′UTRs is repressed by
   transfecting miR-34a (n = 4612). (C) The same holds true for the
   miR-34c transfection (n = 4094). (D) When selecting for the seed of
   miR-1 this correlation between seed and down-regulation is not visible
   (n = 4612). (E) Enrichment of seed matches in down-regulated proteins
   is significant even at mild log2FC cut-offs (hypergeometric test,
   dashed line: log2FC cut-off -0.3, dotted line: significance threshold
   p = 0.05, n = 4612). (F) Sylarray analysis [111][42] of miR-34a
   proteins sorted from down- (left) to up-regulated (right) renders the
   mature miR-34a seed (ACTGCC) as enriched nucleotide motif; however,
   also the *seed of our siRNA duplex (GCTGGT) is enriched in
   down-regulated proteins. (G) Similar observations are made for miR-34c.
   (H) Overview of the numbers of quantified as well as regulated proteins
   in miR-34a and miR-34c.

   Since the seed match within 3′UTRs is important for miRNA target
   selection, direct miR-34 targets should be enriched for 3′ UTR seeds.
   We therefore investigated if proteins with a seed match in their
   corresponding mRNAs are down-regulated at the protein level. This can
   be visualized using cumulative distribution plots of the miR-34
   transfections. Here, the distribution of log2 fold changes for proteins
   with or without a 3′UTR seed match are plotted. “Seed proteins” clearly
   showed reduced protein synthesis compared to non-seed proteins
   ([112]FIG 2 B,C). This effect is not observed with a seed of a
   different miRNA such as miR-1 ([113]FIG. 2D). We conclude that many
   proteins in our pSILAC data are directly repressed by miR-34.

   Next, we sought to determine a cut-off value to define targets of
   miR-34. To this end, we calculated how significant the enrichment of
   proteins with 3′ UTR seed matches is at different cut-offs using the
   hypergeometric test ([114]FIG 2E). Proteins down-regulated by miR-34a
   were highly significantly enriched in miR34 seeds even at mild cut-offs
   (for example, p = 6×10^−16 for proteins with log2FC < 0). To minimize
   false positives we used a more stringent cut-off of –0.3
   (p = 3.8×10^−23, dashed line). To obtain an estimate of the actual
   number of direct targets identified at this cut-off we asked how many
   of the down-regulated proteins can be explained by the seed. 655 and
   687 proteins had a log2FC smaller than –0.3 in the miR-34a and miR-34c
   experiments, respectively. Of these down-regulated proteins, 275 (42%)
   and 257 (37%) had a 3′ UTR seed match for miR-34a and c. The background
   seed frequency of non-regulated proteins (absolute log2FC<0.1) was 23%
   in both cases. Therefore, about 19% (miR-34a) and 14% (miR-34c) of
   down-regulated proteins with a seed match are expected to be direct
   targets. This amounts to 52 targets for miR-34a and 36 for miR-34c. It
   should be noted that these estimates only include targets with 3′ UTR
   seed matches. Seed matches in the coding sequence or targets without
   seed matches are not included. Thus, the true number of direct targets
   is probably higher.

   A nucleotide motif enrichment analysis employing the online tool
   “Sylarray” [115][42] revealed that not only the signal for the mature
   miRNA but also the *strand seed of the respective miR-34 member was
   detectable ([116]FIG 2F,G). Recent studies suggest that the
   incorporation of the *strand seed might be a common trait for miRNAs
   and physiologically important [117][46], [118][47], [119][48]. However,
   since the transfected RNAs were designed as perfect duplexes, the
   sequence of the *strand we used in our experiments differs from the
   endogenous version, most notably in the *seed region. To minimize the
   impact of the artificial *seed in our data we excluded all proteins
   with any of the *seed sequences in their 3′UTRs. This reduces the
   number of quantified proteins to 2419 in the miR-34a and miR-34c
   transfection experiments (1204 proteins in all replicates). [120]FIG.
   2H gives an overview of the regulation of proteins by miR-34a and
   miR-34c. [121]Table S1 shows all quantified proteins and mRNA abundance
   for the miR-34 transfections for genes not containing a *strand seed
   site in their 3′UTR. Further data analysis was done using the two
   miR-34 experiments and the 2419 proteins quantified unless stated
   otherwise.

Correlation and differences in protein regulation by miR-34a and miR-34c

   Next, we compared pSILAC data for miR-34a and miR-34c. Log2 fold
   changes for both miRNAs were clearly correlated ([122]FIG 3A, rho  = 
   0.45). However, the scatter is higher than in typical biological
   replicates with the same miRNA, suggesting that targets of both family
   members are overlapping but not identical. To assess the experimental
   variability in our data we performed two parallel miR-34a experiments.
   Indeed, these experiments showed considerably higher correlation
   ([123]FIG 3B, rho  =  0.71). Of note, even two miR-34a experiments
   performed on different days correlated better with each other than the
   miR-34a and miR-34c data derived from parallel experiments on the same
   day ([124]FIG 3C). Next, we computed whether the observed differences
   in Spearman correlation coefficients are statistically significant
   using the Fisher r-to-z transformation [125][43]. We found that the
   correlation between miR-34a and miR-34c is significantly lower than the
   correlation between two miR-34a replicates performed on the same day (p
   < 0.0001). Even the correlation of miR-34a replicates performed on
   different days is significantly better than the correlation between
   miR-34a and miR-34c (p ≤ 0.0017). This analysis strongly suggests that
   the impact of both family members on protein synthesis is not
   identical. Interestingly, miR-34a and miR-34c display the biggest
   differences in down-regulated proteins, a hint that they might mainly
   differ in their putative direct targets. In fact, less than half of the
   seed proteins which are down-regulated by miR-34a are also
   down-regulated by miR-34c (log2FC < –0.3, [126]FIG 3D). This indicates
   that despite the similarities between miR-34a and miR-34c on protein
   regulation, each family member down-regulates a distinct set of
   putative targets. To minimize the impact of biological variability we
   focused our subsequent analysis on the miR-34a and miR-34c experiments
   performed on the same day.

Figure 3. Proteomic comparison of miR-34a and miR-34c targets.

   [127]Figure 3
   [128]Open in a new tab

   (A) The correlation of log2 fold changes between miR-34a and miR-34c in
   the same transfection experiment (n = 2419) show a lower Spearman
   correlation than the two replicates of miR-34a (n = 1404) (B). This
   holds also true when comparing miR-34a experiments from different days
   (n = 1777) (C). Spearman coefficients for all proteins are marked in
   black, while seed containing proteins are indicated in red. (D) The
   overlap of common targets between miR-34a and miR-34c is rather low.
   (E) The overlap of miR-34a targets (–0.3 log2 FC) from SW480 cells
   [129][29] is bigger with miR-34a than with miR-34c targets in our HeLa
   dataset. Venn diagrams show the overlap of the 81 down-regulated
   proteins quantified in both the Sw480 and our HeLa dataset. Numbers in
   Venn diagrams depict total number of proteins down-regulated by log2 <
   –0.3 for one miR-34 or shared by two miR-34 members. The percentage of
   down regulated SW480 proteins that are also down regulated in HeLa
   cells is given above the diagrams. (F) The overlap with miR-34a targets
   in SW480 cells is more significant for miR-34a than for miR-34c in HeLa
   cells (hypergeometric test).

   On one hand, transfecting cells with dsRNAs mimicking mature miRNAs is
   advantageous since it avoids possible differences in miRNA processing
   between family members. On the other, our artificial over-expression
   system might also induce unspecific effects. We therefore compared our
   data with results obtained by expressing the full precursor of miR-34a
   in SW480 cells [130][29]. In this study, an episomal pRTS-miR-34a
   plasmid was induced for 16h and pSILAC labeled for 24h. Of the 1,206
   quantified proteins in this study 946 could be mapped to our HeLa
   dataset (of which 212 have a seed match). 81 of the shared proteins
   were down-regulated in SW480 cells (log2 fold change < –0.3). Among
   these, 32 and 21 were down-regulated by miR-34a and miR-34c in our HeLa
   data, respectively ([131]FIG 3E). Hence, miR-34a in SW480 cells shares
   more potential targets with miR-34a than with miR-34c in HeLa cells.
   This observation also holds true when only proteins with a seed match
   in their 3′UTR are considered. The overlap for miR-34a in both datasets
   is highly significant with p-values of 8.4*10^−15 and 5.3*10^−07 for
   non-seed and seed proteins, respectively (hypergeometric test,
   [132]FIG. 3F). For miR-34c the overlap is less significant
   (p = 7.4*10^−05 and p = 0.009). As a control, we also compared the
   overlap of our miR-34 data with pSILAC data obtained for a different
   miRNA (miR-1, dataset taken from Selbach et al., 2008). In this
   biologically unrelated control the p-values exceed the significance
   threshold (p = 0.15, data not shown). The fact that the overlap for
   miR-34a is considerably higher than for miR-34c indicates that the
   observed differences between miR-34a and miR-34c are not an artifact of
   our experimental approach. Instead, the observation that our data can
   be replicated in a different cell type by a different group, using a
   different miRNA expression system and different pulse labeling times
   strongly suggests that results are robust and meaningful beyond our
   specific experimental conditions. In addition, it is in accordance with
   our observation that miR-34a and miR-34c down-regulate different sets
   of targets.

Pathway analysis of miR-34a and miR-34c affected proteins

   Having shown that miR-34a and miR-34c have overlapping but not
   identical targets, we next asked if these differences might reflect
   different biological functions. We therefore employed the online DAVID
   gene ontology tool to look for enriched KEGG pathways. Both miRNAs
   affected the pathways “cell cycle”, “p53 pathway” and “terpenoid
   backbone synthesis” (“both down”, [133]FIG. 4). Hence, pathway analysis
   indicates functional redundancies of both miRNAs. One pathway mostly
   enriched in miR-34a is “DNA replication” which includes the Mcm
   proteins (Mcm3/5/6/7) and Pold1 (DNA polymerase delta catalytic subunit
   1). The miR-34a specific enrichment is also visible when only exclusive
   targets of the two miRNAs are considered (miR-34a and miR-34c
   exclusive, [134]Fig. 4). Some of the involved proteins are also mildly
   regulated by miR-34c which actually has been reported to regulate “DNA
   replication” as well [135][33]. However, in our study miR-34a
   outnumbers miR-34c concerning targets in this KEGG pathway. In
   addition, important genes necessary for nucleotide synthesis and thus
   DNA replication such as Impdh1 are specifically repressed by miR-34a.
   In summary, comparison at the level of pathways suggests overlapping
   functions with a seemingly stronger impact of miR-34a on “DNA
   replication”. More details about pathway enrichment and names of
   corresponding proteins are available in [136]Table S2.

Figure 4. Functional enrichment analysis of miR-34a and miR-34c.

   [137]Figure 4
   [138]Open in a new tab

   KEGG pathway enrichment for subsets of miR-34a and miR-34c targets (for
   all proteins down-regulated log2 < -0.3 by one member, shared by both
   and exclusive targets of one miR-34 member). Enrichment is depicted as
   the -log of the respective p-value of the enriched term.

miR-34a and miR-34c chimeras exhibit specific 5' or 3′end co-regulation of
exclusive targets-

   To assess sequence specificity in target selection we compared which
   exclusive targets of miR-34a or miR-34c were down-regulated in their
   respective 5′ or 3′end chimera ([139]FIG. 5). The exclusive targets of
   both miR-34 members are only partly co-regulated by the chimera miRNAs,
   especially for targets without a seed site. This is expected since
   genes lacking 3′ UTR seeds are more likely indirect targets.
   Interestingly, miR-34a and miR-34c show different chimera preferences.