Large-scale approaches for studying the regulatory effects of microRNAs (miRNAs) have revealed important insights into target recognition and function. These approaches include computational analysis of the selective maintenance or avoidance of miRNA complementary sites during evolution1, 2, 3, 4, 5, 6, 7, 8 and experimental identification of messages destabilized or those preferentially associated with argonaute proteins in the presence of a miRNA7, 8, 9, 10, 11, 12, 13, 14, 15. Despite their utility, none of these approaches directly measures the influence of a miRNA on protein output, which is the most relevant readout of its regulatory effects. The influence of miRNAs on protein output has instead been limited to single-protein analyses, primarily immunoblotting and reporter assays, and a medium-size proteomics analysis with detection of 504 proteins16.
Proteomic consequences of added miRNAs
To acquire data sufficient to investigate the effects of miRNA regulation on the proteome, we applied a quantitative-mass-spectrometry-based approach using SILAC (stable isotope labelling with amino acids in cell culture)17 to investigate the influence of specific miRNAs on the levels of many proteins (Supplementary Figs 1 and 2). We first measured the effects of introducing miR-124, a brain-specific miRNA, into HeLa cells. To include proteins from a broad expression spectrum, this experiment focused on nuclear-localized proteins. Out of 2,120 proteins detected, the analysis considered 1,544 that mapped to our non-redundant mRNA data set and were each quantified by at least two independent measurements that passed our quality thresholds (Supplementary Data 1 and 5).
Because this and all subsequent SILAC analyses were performed with two technical replicates, and because different peptides from the same protein and different charge states from the same peptide also provided the opportunity for independent measurements, most proteins were quantified by many more than two independent measurements (median of 12 for the 1,544 quantified proteins). The high reproducibility when comparing technical replicates and when comparing different peptides representing the same protein illustrated the quantification accuracy (r2 = 0.72 and 0.65, respectively, Spearman's correlation; Supplementary Fig. 3).
Messages for proteins that decreased the most relative to the mock-transfection control were compared to the messages of the other quantified proteins (cutoff, 85th percentile), searching for motifs over-represented in their open reading frames (ORFs) or untranslated regions (UTRs). When considering all 16,384 possible 7-nucleotide motifs and the different regions of the mRNA, the only one significantly enriched after Bonferroni correction for multiple hypothesis testing was the GUGCCUU heptanucleotide in the 3' UTR (P < 10-7, Fisher's exact test). This heptanucleotide motif comprised the 6-nucleotide match to the seed of miR-124 (underlined) supplemented by a match to miRNA nucleotide 8, and is named the 7mer-m8 seed-matched site (Fig. 1a). It was the same motif that is most associated with 3' UTRs of messages destabilized after introduction of miR-124 (ref. 9). The other sites consistently associated both with preferential conservation and with mRNA destabilization after miRNA introduction are named the 6mer, 7mer-A1 and 8mer seed-matched sites2, 7, 8 (Fig. 1a). A more directed search for the seed-matched sites revealed that most of the robustly repressed proteins derived from messages with at least one 7–8mer 3'-UTR site (Fig. 1b). For example, 24 out of the 40 proteins repressed by at least 50% had at least one 7–8mer 3'-UTR site, with only 3 of these 24 attributed to chance (Fig. 1b, repression cutoff of 50%). Less stringent repression cutoffs yielded many additional proteins from messages with 7–8mer sites, even after subtracting those expected by chance. The overall enrichment of seed-matched sites in messages of downregulated proteins indicated that miR-124 recognition of mRNAs for repression of protein output used, more than any other type of site, seed-matched sites in 3' UTRs.
Figure 1: The impact of transfected miRNAs on protein output.

a, Canonical miRNA seed-matched sites2. b, The fraction of repressed proteins deriving from messages with miR-124 3'-UTR sites (filled orange bar). At each repression cutoff, the number of repressed proteins from messages without 3'-UTR sites (indicated in the open bar) was used to calculate the additional fraction expected by chance to have a site (dashed line, with the corresponding number of repressed proteins indicated below the dashed line). Above the dashed line is the surplus number of repressed proteins deriving from messages with sites. c, Response of proteins from messages with single miR-124 3'-UTR sites. Plotted is the fraction of proteins that change at least to the degree indicated on the x axis. Proteins from messages with multiple 3'-UTR sites were not considered. 6mer sites that were part of larger sites were not included in the 6mer distribution, and 7mers that were part of 8mers were not included in the 7mer distributions. d, Efficacy of single 3'-UTR sites when pooling data from miR-124, miR-1 and miR-181 transfections, plotted as in c. e, ORF and 3'-UTR targeting efficacy. Plotted is the average change (
standard error) of protein and corresponding mRNA for quantified proteins from messages with at least one 8mer in the ORF (n = 83) or 3' UTR (n = 87) corresponding to the transfected miRNA (excluding messages with sites in both ORF and 3' UTR).
Slides may be downloaded for educational use, according to the terms described in Nature Publishing Group's licensing policy.
To survey the efficacy of the different seed-matched sites, we plotted the response of proteins from messages with 3' UTRs possessing single sites (Fig. 1c). Proteins from messages with single 7–8mer sites had a significant propensity to be downregulated when compared to those from messages without 3'-UTR sites (P = 0.02, 0.0008 and 0.02 for 8mer, 7mer-m8 and 7mer-A1, respectively, Kolmogorov–Smirnov test).
We performed analogous SILAC experiments with two additional miRNAs: miR-1 and miR-181, for which 2,312 and 1,774 proteins, respectively, mapped to our non-redundant mRNA data set and passed our quantification quality cutoffs (Supplementary Data 2, 3 and 5). The motifs associated with messages of the most downregulated proteins mirrored those observed for miR-124; for miR-1, the 7mer-m8 match was the most confidently enriched heptanucleotide motif in the 3' UTRs of downregulated proteins (P = 0.0004), and, for miR-181, the 7mer-A1 match was among the top two motifs (P = 0.007), slightly less confidently enriched than an unrelated motif, CUGCCCC (P = 0.006, Fisher's exact test with Bonferroni correction).
When pooling the data from all three miRNA transfections, thereby combining 5,630 independent protein quantifications, proteins from messages with single 7mer or 8mer sites matching the cognate miRNA had a significant propensity to be downregulated (Fig. 1d, P < 10-14 overall, P < 10-4 for each site separately, Kolmogorov–Smirnov test). Vertical displacement from the no-site distribution demonstrated that at least 16% of the proteins from messages with single 7–8mer 3'-UTR sites responded to the miRNA (Fig. 1d). The response of proteins from messages with a 6mer site closely tracked that from messages with no site, indicating that in this system 6mer recognition was generally insufficient for detectable protein downregulation (Fig. 1d).
Analysis of site conservation, site depletion, argonaute pull-downs and reporter assays all indicate that targeting can occur in protein-coding regions2, 5, 13, 18. Analysis of mRNA destabilization concurs that targeting occurs in coding regions, but indicates that these sites are generally much less effective than those in 3' UTRs7. However, monitoring mRNA destabilization would understate the influence of sites in coding regions if these sites, by virtue of falling in the path of the ribosome, had a disproportionate effect on translation compared to mRNA destabilization. To address this possibility, we examined our data monitoring protein output and found that sites in coding regions were generally less effective that those in UTRs (Fig. 1e).
Proteomic consequences of disrupting mir-223
Measuring the effects of ectopic miRNA addition can provide generic insights into miRNA target recognition, but the responsive proteins are not necessarily the endogenous targets, and the magnitude and kinetics of mRNA and protein changes are not expected to match those of endogenous targeting (Supplementary Discussion). To obtain data relevant to endogenous miRNA–target interactions, with pertinent information on the degree of repression, we examined the effects of the mir-223 gene knockout in mouse neutrophils. mir-223 is preferentially expressed in myeloid haematopoietic cells, with high expression in neutrophils and their progenitors19, 20. To obtain labelled samples suitable for the quantitative proteomics experiment, we isolated bone marrow haematopoietic progenitors from wild-type and mir-223-deficient mice21 and developed a protocol for their proliferation in SILAC media and differentiation into mature neutrophils in vitro (Fig. 2a and Supplementary Fig. 4a, b). By day 8, the surviving cells had descended from progenitors that had undergone multiple cell divisions in the presence of SILAC media (Supplementary Fig. 4c), which resulted in >99% heavy isotope incorporation. RNA blots confirmed that both the progenitors and the differentiating neutrophils expressed mir-223 (Fig. 2b). Array experiments demonstrated that the effect of miR-223 on messages with cognate sites was analogous to that observed for neutrophils isolated directly from mice, although somewhat less robust (Fig. 2c), perhaps in part because the neutrophils differentiated in vitro accumulate
35% less miR-223 (Fig. 2b).
Figure 2: The proteomic impact of deleting mir-223 in mouse neutrophils.

a, Schematic of neutrophil labelling and analysis. Haematopoietic progenitors were isolated from wild-type (WT) or mir-223-/Y (KO) male mice and cultured in SILAC media containing granulocyte colony-stimulating factor (G-CSF) and stem cell factor (SCF) for six days. To enhance differentiation, SCF was withdrawn over the next 42 h. Mature neutrophils were mixed, and proteins were size-fractionated for quantitative MS analysis. mRNA was also collected from the cultures and directly from mice for expression profiling. b, mir-223 expression detected with RNA blots probing for miR-223. One blot analysed total RNA from sorted subpopulations of cells cultured in vitro (left, with sorting profiles shown at the far left). The other blot analysed total RNA from cells cultured in vitro for eight days and from neutrophils isolated directly from bone marrow (right). As a loading control, blots were re-probed for U6 small nuclear RNA. Below each blot are the relative expression levels, normalized using the loading control. Radio-labelled RNA markers (M) are also shown. c, Analysis of neutrophils isolated directly from mice (left) and those derived in vitro from haematopoietic precursors (right), monitoring the effects of miR-223 loss on messages with single miR-223 sites in their 3' UTRs. Plotted is the fraction of messages that changed at least to the degree indicated on the x axis, otherwise as in Fig. 1c. d, The fraction of upregulated proteins deriving from messages with miR-223 7–8mer 3'-UTR sites, plotted as in Fig. 1b. e, The impact of deletion of mir-223 on neutrophil proteins, considering proteins from messages with single miR-223 sites in their 3' UTRs, plotted as in Fig. 1c. f, Targeting efficacy in ORFs (n = 69) and 3' UTRs (n = 50), plotted as in Fig. 1e. g, Efficacy of single 7–8mer sites and multiple sites, plotted as in f.
High resolution image and legend (227K)Download Power Point slide (783K)Slides may be downloaded for educational use, according to the terms described in Nature Publishing Group's licensing policy.
Analysis by mass spectrometry of both nuclear and cytoplasmic fractions provided quantitative information for 5,019 proteins, 3,819 of which mapped to our mRNA data set and passed our quality cutoffs (Supplementary Data 4 and 5). The effects of removal of endogenous miR-223 on neutrophil protein levels were essentially the reciprocal of those observed when ectopically adding individual miRNAs, except more of the targeting trends were statistically significant, presumably because more proteins were quantified. For instance, derepressed proteins derived from messages with strong enrichment for 6–8mer seed-matched motifs in 3' UTRs (but not 5' UTRs or coding regions), with high confidence for all four site types, even after Bonferroni correction (Supplementary Table 1). The fraction of responsive proteins from messages with 3'-UTR sites (Fig. 2d) resembled that observed for ectopic miR-124 delivery (Fig. 1b). Proteins from messages with a single 7–8mer site tended to be derepressed (Fig. 2e, P < 10-5, P < 10-6 and P < 10-4 for 8mer, 7mer-m8 and 7mer-A1, respectively, Kolmogorov–Smirnov test). The apparent hierarchy of site efficacy observed when monitoring protein output (Fig. 2e, 8mer > 7mer-m8 > 7mer-A1 > 6mer) matched that obtained when monitoring mRNA effects7, 8. Evidence for modest ORF targeting was again observed (Fig. 2f). The 33 quantified proteins from messages with multiple sites tended to be more responsive (Fig. 2g), but the increased output did not exceed that expected from each site acting independently. This independent, non-cooperative response was in agreement with results monitoring mRNA destabilization and reporter assays, which indicate that cooperative action of sites tends to occur only for those sites falling within 8–40 nucleotides of each other7. Taken together, our results demonstrated experimentally that targeting principles elucidated from ectopically added miRNAs apply also to endogenous miRNA targeting, and in particular to endogenous targeting at the level of protein downregulation.
Endogenous response of predicted miRNA targets
The perturbation of endogenous targeting provided the opportunity to test sets of target predictions. When considering current predictions from miRBase Targets22, miRanda23, 24, PicTar4, 25, PITA26 and TargetScan2, 7, all of which use site conservation as a prediction criterion, those from TargetScan and PicTar performed the best (Fig. 3a). Predictions from TargetScan and PicTar are primarily those messages with at least one 3'-UTR 7–8mer site conserved among mammals, operationally defined as those sites preserved in orthologous locations of human, mouse, rat and dog UTRs2, 4. Their enhanced performance over the set of messages with any 3'-UTR 7–8mer sites demonstrated that considering site conservation not only enriches for sites with presumed functional roles but also enriches for those that are more effective. All of the other algorithms include many sites with least one mismatch or wobble to the seed, which seems to have compromised their performance. For example, the predictions of miRBase Targets had been generated using the miRanda algorithm23 with updated parameters, searching for conserved sites with more stringent seed pairing but still allowing one mismatch or wobble to the seed22. Analysis of the seed-matched and seed-mismatched predictions separately revealed that any benefit gained in searching for site conservation was offset by the inclusion of many poorly performing predictions with seed mismatches (Supplementary Fig. 5a). Despite the relative success of TargetScan and PicTar, two-thirds of their predicted targets appeared to be nonresponsive to miR-223 loss in neutrophils, indicating a false-positive rate within the range of that inferred from estimates of chance conservation of the target sites of this miRNA (Supplementary Fig. 5b).
Figure 3: Correspondence between computational target predictions and observed protein changes.

Analogous results were observed using the transfection data sets (Supplementary Fig. 8). a, Performance of programs that consider site conservation. Plotted is the average protein derepression (
standard error) of genes with
1 conserved or non-conserved 7–8mer 3'-UTR site (grey) and of genes predicted to be miR-223 targets. The number of quantified proteins in each set is in parenthesis. b, Recognition of an adenosine (A) opposite the first nucleotide of the miRNA. The cumulative plot of protein changes after miR-181 transfection compares proteins from messages with no seed-matched 3'-UTR site to those from messages with the indicated single 3'-UTR site. c, Relationship between the scores of predicted targets and protein derepression. Predictions corresponding to quantified proteins were divided into three equal-size bins according to the scores proposed to indicate the quality of the prediction or degree of repression. Statistically significant differences between the bottom and top third are indicated (asterisk, P < 0.01, Mann–Whitney U-test). d, Response of the top 29 predictions of each algorithm, plotted as in a. e, Performance of programs that do not consider site conservation, displayed as in a and c. Also shown is the response of quantified proteins from messages with only non-conserved 7–8mer 3'-UTR sites, binned by total context score.
Slides may be downloaded for educational use, according to the terms described in Nature Publishing Group's licensing policy.
The similar performance of PicTar and TargetScan was expected for miR-223, which begins with a U, but might not have been expected for those miRNAs that do not begin with a U. TargetScan rewards an A across from position 1, whereas PicTar (and similar algorithms3, 27) rewards a Watson–Crick match at this position. Therefore, for miRNAs that begin with A, C or G, only one of the two heptanucleotide matches (the 7mer-m8) is the same for the algorithms2, 4 and thus about half of the predicted targets are expected to differ. To investigate which type of heptanucleotide match is most associated with decreased protein output, we examined the proteomics data from the experiment transfecting miR-181, which does not begin with a U. Plotting the response of proteins from messages with single sites revealed that the 7mer-A1 match was more effective than the Watson–Crick 1–7 match (Fig. 3b, P = 0.009, Kolmogorov–Smirnov test). Moreover, the Watson–Crick 1–7 match was no more effective than were 6mer sites with G or C mismatches across from position 1 (Fig. 3b). We conclude that the recognition of an A across from miRNA nucleotide 1 favours miRNA-mediated protein downregulation, which explains the preferential conservation of an A at this position, even when it cannot participate in a Watson–Crick interaction2.
Target prediction sets are typically ranked, with the assertion that the better scoring predictions are more likely to be authentic or effective. Recent TargetScan predictions (release 4) are ranked by 'total context score', which is based on site type, site number and site context7. This ranking correlated with protein downregulation, with the top third significantly more responsive than the bottom third (Fig. 3c). For the other algorithms, the predictions scoring in the top third were not significantly more responsive than those in the bottom third (Fig. 3c, P > 0.05, Mann–Whitney U-test). Despite their poor overall performance, the more inclusive algorithms might still have utility when considering only their top few predictions. To investigate this possibility, we considered only the top 29 predictions of each algorithm, choosing 29 because the most restrictive set (that of PicTar) includes this number of predictions. At this stringent cutoff, the performances of the more inclusive algorithms approached that of PicTar (resulting in difference that was no longer statistically significant, P > 0.05), but remained lower than that of TargetScan (P < 0.05, Fig. 3d). Interestingly, the top 29 quantified proteins ranked only by the total context score of their respective 3' UTRs, without any regard to site conservation, were at least as responsive as the top 29 TargetScan predictions (Fig. 3d).
Analysis of the evolutionary impact of miRNAs and analysis of messages that are upregulated in miRNA-deficient animals both indicate that many non-conserved sites mediate repression in vivo5, 6, 10, 11. We also found evidence for widespread non-conserved targeting among natural miR-223–target interactions. In an attempt to predict non-conserved targets, RNA22 (ref. 28) and a more permissive version of PITA26 do not consider site conservation. When evaluated using our miR-223 data, these algorithms performed no better than did a simple search for messages with 7–8mer seed-matched sites (Fig. 3e). A more effective tool was the total context score, which correlated with derepression when considering only those messages with non-conserved 7–8mer sites (that is, sites missing or mutated in orthologous positions of human, rat or dog 3' UTRs), with the top third of non-conserved predictions significantly more effective than the bottom third (Fig. 3e). Indeed the top third of non-conserved predictions (Fig. 3e, context score) appeared as effective as the bottom two-thirds of conserved predictions (Fig. 3c, TargetScan), and because proteins from non-conserved predictions outnumbered those from conserved ones by 6 to 1, the non-conserved predictions with favourable context scores were a bountiful source of biological targets.
The success of the total context score in ranking both conserved and non-conserved predictions was due in part to its consideration of site type (Fig. 2e) and the number of sites (Fig. 2g). To isolate its third component (site context) we considered only those quantified proteins deriving from messages with single 7mer-m8 3'-UTR sites and still observed a significant correlation between context score and protein response (P = 0.001, Spearman's correlation test). Predicted 3'-UTR structure and other features of site context are reported to influence site accessibility and efficacy7, 8, 26, 27, 29, 30, 31. The context score combines some of these features, including high local AU nucleotide composition (which accounts for effects of predicted 3'-UTR structure on site accessibility), proximity to residues that can pair to miRNA nucleotides 13–16, and positioning away from the centre of long UTRs7. As anticipated from analyses of mRNA destabilization data7, the most influential component was local AU composition, which when examined in isolation significantly correlated with protein response (P = 0.01, Spearman's correlation test).
Response of proteins compared to that of mRNAs
Because previously used high-throughput methods were unable to determine the amount of protein repression, the relative contributions of mRNA destabilization and translational repression during miRNA-mediated regulation has been of intense interest. Our miR-223 data was informative for addressing this issue because it examined the response, at both the mRNA and the protein level, of removing an endogenous miRNA, without the confounding influences of exogenous targeting mediated by an ectopically delivered miRNA. The near steady-state nature of our miR-223 system also avoided quantification caveats inherent to transient transfection, such as variable transfection efficiencies and pre-steady-state complexities especially acute when comparing effects on an mRNA to those on its protein because messages and their proteins can have very different intrinsic stabilities. Note that our mRNA quantification used standard array platforms, which include oligo(dT) priming during detection, and thus the mRNA destabilization we observed encompassed the conversion of the message into a form that was unsuitable for translation because it lacked a poly(A) tail.
To achieve greater quantification accuracy in this analysis of individual proteins, we narrowed our focus to the 2,773 proteins quantified with
6 independent measurements. Plotting protein changes as a function of mRNA changes indicated a strong positive correlation for messages with 7mer or 8mer 3'-UTR sites (Fig. 4a; r2 = 0.45 and 0.63, P < 10-33 and P < 10-12, respectively) and weaker correlation for messages without sites (Fig. 4b; r2 = 0.15, P < 10-11, Pearson's correlation test). Proteins in both plots displayed some scatter around the origin; however, when normalizing to those without sites, many more of those from messages with sites increased in response to miR-223 loss (Fig. 4a, b and Supplementary Fig. 6). Immunoblots probing for three of the more responsive proteins confirmed protein derepression in mir-223-/Y neutrophils differentiated in vitro as well as in those isolated directly from mice (Supplementary Fig. 7).
Figure 4: Comparison of protein and mRNA changes accompanying miR-223 loss.

a, Protein and mRNA changes for quantified proteins deriving from messages with at least one 8mer 3'-UTR site (blue, n = 55) or at least one 7mer (orange, an additional 250 proteins). The least-squares best fit to the 8mer data are shown (blue line), as are reference lines (grey), which both have slope of 1.0. Vertical error bars indicate 25th and 75th percentiles for independent measurements of protein changes. Horizontal error bars indicate standard errors of mRNA changes from three biological replicates, one of which was also used for the SILAC experiments. b, Protein and mRNA changes for quantified proteins deriving from messages without 7–8mer 3'-UTR sites, plotted as in a. One of seven random cohorts is plotted here; the other six are in Supplementary Fig. 6. c, Distribution of the indicated reference-set mRNAs and quantified proteins with respect to mRNA expression, as indicated by the array signals from cultured neutrophils. d, Response of quantified proteins and their respective mRNAs to mir-223 deletion, considering those messages with 7–8mer 3'-UTR sites, grouped by mRNA expression.
High resolution image and legend (125K)Download Power Point slide (508K)Slides may be downloaded for educational use, according to the terms described in Nature Publishing Group's licensing policy.
Two of the three most responsive proteins derived from messages with single, non-conserved 7mers (Table 1)—sites that on their own would not be expected to impart such a robust response. Previous work has shown that sites falling within 8–40 nucleotides of sites to co-expressed miRNAs typically act cooperatively, which increases the effect of loosing interactions at particular sites7. We performed high-throughput sequencing to identify miRNAs co-expressed in cultured neutrophils (Supplementary Table 2) and found that both of the highly responsive 7mers fell near to sites matching a co-expressed miRNA, with intersite spacing favouring a cooperative response (Table 1). The site in Ctsl was near a site for the miR-26 family, one of five families sequenced more frequently than miR-223, whereas the site in Gns fell near a site to the miR-103/107 family, sequenced about a third as often as miR-223 (Supplementary Table 2).
If protein changes merely reflected mRNA changes, with no additional repression at the translational level, then the points would fall on the diagonal (Fig. 4a, grey line). Although many were on the diagonal or very close to it, least-squares linear regression yielded a positive y-intercept (+0.053 and +0.079 for 7mer and 8mer data, respectively). These modest yet statistically significantly positive y-intercept values (P = 0.0002 and P = 0.042, t-test) suggested that a cohort of genes were modestly derepressed at the protein level with little or no change at the mRNA level. The messages of such genes were each good candidates for targets affected only at the translational level, although some might have derived from genes undergoing non-miRNA-mediated transcriptional repression as a compensatory feedback response to the loss of miR-223 targeting.
Despite evidence for some translation-only repression, all proteins derepressed by more than 50% (log2 > 0.58) derived from messages that displayed detectable increases (Fig. 4a and Table 1). Moreover, only five points were more than 0.58 units (log2) above the diagonal (Fig. 4a, upper dashed line; Table 1, indicated with §). Note that a 33% repression by miR-223 in wild-type neutrophils would correspond to a 50% (+0.58 log2) derepression in mutant neutrophils. Thus, in wild-type neutrophils only 5 of the 305 quantified proteins from messages with 7–8mer 3'-UTR sites appeared to undergo translational repression by more than 33%. We conclude that, although in some instances translational repression produces a substantial amount of endogenous miRNA-mediated repression, this occurred for surprisingly few of the many inferred targets. Substantial translational repression appeared so rarely because targets repressed only at the level of translation were repressed quite modestly (<33%); for targets undergoing more robust repression, the major component of the repression was usually mRNA destabilization (Table 1). Further study is required to determine whether those mRNA molecules undergoing miRNA-mediated repression might experience translational repression as a prelude to destabilization, but our results show that mRNA destabilization can explain most of the endogenous miR-223-mediated repression.
Our proteomics data were limited to the confidently quantified proteins, which were expected to be those that were both soluble and more highly expressed in neutrophils. To consider how the expression bias might have influenced our results, we plotted the distributions of mRNAs and quantified proteins as a function of mRNA expression in neutrophils, considering all mRNAs of our non-redundant data set (including those without detectable expression), as well as those with 3'-UTR sites (Fig. 4c). The messages with conserved or non-conserved 3'-UTR sites displayed the full range of expression values, with a distribution matching that of messages more generally. As anticipated, more quantified proteins derived from highly expressed messages (Fig. 4c). However, the distribution of quantified proteins from messages with sites (conserved or non-conserved) closely matched that of those without sites. Moreover, we found no evidence that the greater representation of proteins from more highly expressed messages underrepresented the impact of miRNAs on protein output; if anything, proteins from more highly expressed messages tended to respond more robustly than did those from lowly expressed messages (Fig. 4d). An analysis using Gene Ontology terms32 came to similar conclusions (data not shown). Therefore, although our experiment monitored the impact on only a portion of the neutrophil proteome and thus missed many miR-223 targets (including some conserved targets, such as Mef2c; refs 2, 21), we found no reason to suspect that undetected targets respond more robustly.
The proteins from the least abundant mRNAs appeared to respond without detectable mRNA changes (Fig. 4d,
6.5 bin). Apparent dominance of the translational component might have been a consequence of less reliable array signals for these messages, many of which fell within background signals from non-expressed messages. A more intriguing possibility is that very efficient translation of these messages (inferred from the ability to quantify proteins from such lowly expressed messages) makes them more susceptible to greater translational repression.
The regulatory function of miR-223
Some of the most strongly derepressed proteins from messages with miR-223 sites provided potential explanations for the pro-inflammatory phenotype observed in mir-223–/Y neutrophils21. Cathepsin L and cathepsin Z (Ctsl and Ctsz, listed first and fourteenth in Table 1) are cysteine proteases associated with chronic inflammatory conditions, in which they can act as mediators of tissue destruction33, 34. Another potentially relevant target, the insulin-like growth factor receptor 1 (Igf1r, listed sixth), is crucial for the priming and activation of mature neutrophils35, 36.
To examine whether repression begins before neutrophil maturation, we profiled mRNA levels in sorted progenitors and neutrophils (Supplementary Data 4). Messages of most of the highly responsive proteins were derepressed already at the progenitor stage, although usually to a lower degree than in neutrophils, which accumulate more miR-223 (Table 1 and Fig. 2b).
The profiles of miR-223-deficient progenitors and neutrophils provided the opportunity to examine the regulation of putative miR-223 targets in the absence of miR-223 to determine whether miR-223-mediated repression predominantly acts coherently with (that is, in the same direction as) the other gene-regulatory processes acting on these genes. During differentiation from progenitor to neutrophil, putative targets increased and decreased in similar numbers (Table 1 and Supplementary Data 4). This result revealed a proportion of incoherent regulatory relationships larger than that observed for other miRNAs5, 6, 11 but nonetheless consistent with the miR-223 loss-of-function phenotype; this phenotype indicates that miR-223 dampens progenitor proliferation and neutrophil differentiation and activation21—functions opposite of those expected for coherent regulatory interactions involving a miRNA preferentially expressed in neutrophils.
Because the miR-223 proteomics experiment detected targeting potentially missed by other high-throughput methods, particularly non-conserved targets influenced (albeit modestly) at the level of translation, it provided the clearest picture so far of the scope and magnitude of endogenous miRNA targeting. The vertical displacement from the no-site distribution in Fig. 2e indicated that at least 18.4% of the 426 proteins from messages with 7–8mer 3'-UTR seed-matched sites underwent increased protein output attributable to the sites, thereby implicating messages for at least 78 out of the 3,819 quantified proteins as direct targets. These 78 included
33% of those quantified proteins from messages with conserved 3'-UTR sites and
16% of those from messages with nonconserved 3'-UTR sites. Assuming that only about one-third of the proteome was quantified, we estimate that miR-223 has >200 targets in neutrophils (3
78). These would not include any targets undergoing fail-safe regulation (targeting of messages for proteins not normally expressed at all in neutrophils), which are invisible in derepression experiments. Despite the broad scope of miR-223 targeting, each interaction had only a modest effect, even when observed at the protein level. Many miR-223-responsive targets also have sites for other miRNAs, some of which are also expressed in neutrophils, and thus the aggregate impact of miRNAs on these targets is presumably greater than that observed for miR-223 alone. Nonetheless, the targeting by other miRNAs is not expected to obscure the effect of removing miR-223 because multiple non-overlapping sites to co-expressed miRNAs typically act independently7, 8, and in the rare cases in which they do not act independently, they act cooperatively, which would boost rather than decrease the effect of loosing a single miRNA7. The widespread scope but low magnitude of endogenous miR-223-mediated repression indicates that this miRNA often acts as a rheostat to adjust protein output.
Methods Summary
HeLa cells were grown in media containing either regular (light) Lys and Arg or 13C6-labelled (heavy) Lys and Arg. Light cells were transfected with miRNA, and heavy cells were mock-transfected. After 24 h some cells were harvested for mRNA expression profiling. After 48 h the remaining cells were harvested, and equal numbers from both populations were mixed and enriched for soluble nuclear proteins. Neutrophil culture was as outlined in Fig. 2a. Protein mixtures were separated by SDS–PAGE, and fractions were digested with trypsin. Peptides were analysed by liquid chromatography–tandem mass spectrometry (LC–MS/MS), which identified peptides and quantified the relative amounts of isotopic pairs of the same peptide. To prevent double-counting of any targeting interactions, peptides were mapped to a non-redundant complementary DNA data set (Supplementary Data 5), and targeting analyses were as performed previously on mRNA destabilization data7. To compare to target-prediction algorithms, predictions by TargetScan (release 4.1)2, 7, PicTar (human, chimp, mouse, rat, dog)4, 25, miRanda (January 2008 release)23, 24, miRBase Targets (version 5)22, RNA22 (ref. 28) and PITA26 were obtained from their respective websites, using the most recent predictions publicly available as of March 2008.
Full methods accompany this paper.
regulates human granulopoiesis
g) was reduced (5 mM DTT in 50 mM ammonium bicarbonate, pH 8.2, at 56 °C, 30 min) and alkylated (15 mM iodoacetamide, in 50 mM ammonium bicarbonate in the dark at room temperature, 20–22 °C, 25 min), and then separated into 10 fractions (HeLa samples) or 16 fractions (neutrophils) by SDS–PAGE. Each fraction was in-gel digested with trypsin (5 ng
l-1 in 50 mM ammonium bicarbonate, pH 8.2, at 37 °C, 16 h). Peptides were extracted in 50% acetonitrile (ACN) and 5% formic acid (FA), and then dried down and desalted by reverse-phase (C18 StageTip). Peptide mixtures were resuspended in 5% ACN and 4% FA, and 20% of each mixture was analysed by LC–MS/MS in duplicate. Peptides were separated across a 55-min gradient ranging from 7% to 30% ACN in 0.1% FA in a microcapillary (125
