Natural selection on human microRNA binding sites inferred from SNP data


A fundamental problem in biology is understanding how natural selection has shaped the evolution of gene regulation. Here we use SNP genotype data and techniques from population genetics to study an entire layer of short, cis-regulatory sites in the human genome. MicroRNAs (miRNAs) are a class of small noncoding RNAs that post-transcriptionally repress mRNA through cis-regulatory sites in 3′ UTRs. We show that negative selection in humans is stronger on computationally predicted conserved miRNA binding sites than on other conserved sequence motifs in 3′ UTRs, thus providing independent support for the target prediction model and explicitly demonstrating the contribution of miRNAs to darwinian fitness. Our techniques extend to nonconserved miRNA binding sites, and we estimate that 30%–50% of these are functional when the mRNA and miRNA are endogenously coexpressed. Because we show that polymorphisms in predicted miRNA binding sites are likely to be deleterious, these polymorphisms are candidates for causal variants of human disease. We believe that our approach can be extended to studying other classes of cis-regulatory sites.
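As a rough illustration of the kind of comparison summarized above, the following minimal sketch computes derived allele frequencies (DAFs) for SNPs and compares how strongly two classes of sites are skewed toward rare derived alleles, which is a common signature of negative selection. It is not the authors' pipeline: the function names, the 10% frequency threshold, and the toy allele data are all assumptions made for illustration.

```python
def derived_allele_frequency(alleles, ancestral):
    """Fraction of sampled chromosomes carrying a non-ancestral allele.

    `alleles` holds one allele character per sampled chromosome;
    "N" marks a missing call and is ignored (an assumed convention).
    """
    called = [a for a in alleles if a != "N"]
    if not called:
        raise ValueError("no called alleles at this SNP")
    derived = sum(1 for a in called if a != ancestral)
    return derived / len(called)


def daf_histogram(dafs, n_bins=10):
    """Normalized histogram of DAF values over equal-width bins on [0, 1]."""
    counts = [0] * n_bins
    for f in dafs:
        # A DAF of exactly 1.0 is placed in the last bin.
        idx = min(int(f * n_bins), n_bins - 1)
        counts[idx] += 1
    return [c / len(dafs) for c in counts]


def low_frequency_fraction(dafs, threshold=0.1):
    """Share of SNPs whose DAF lies below `threshold` (hypothetical cutoff)."""
    return sum(1 for f in dafs if f < threshold) / len(dafs)


if __name__ == "__main__":
    # Toy data, invented for illustration: (alleles, ancestral allele) per SNP.
    site_snps = [("AAAAAAAAAAAAAAAAAAAG", "A"), ("CCCCCCCCCCCCCCCCCCTT", "C")]
    control_snps = [("AAAAAGGGGG", "A"), ("CCCCCCCCCCCCCCCCCCCT", "C")]

    site_dafs = [derived_allele_frequency(a, anc) for a, anc in site_snps]
    control_dafs = [derived_allele_frequency(a, anc) for a, anc in control_snps]

    # A larger share of rare derived alleles in one class than in the other
    # would be consistent with stronger negative selection on that class.
    print("binding sites:", low_frequency_fraction(site_dafs))
    print("controls:     ", low_frequency_fraction(control_dafs))
```

In a real analysis the two classes would contain many SNPs each, and the difference between their DAF distributions would be assessed with a statistical test rather than by comparing a single summary fraction.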

Figure 1: SNP density in conserved miRNA sites.
Figure 2: Derived allele frequency (DAF) distributions in conserved miRNA sites suggest stronger negative selection compared with other conserved 7-mers in 3′ UTRs.
Figure 3: DAF distributions in nonconserved miRNA sites coexpressed with the miRNAs.


We thank P. Andolfatto, R. Borowsky, E. Halperin, N. Hübner and M. Siegal for helpful discussions. We also thank E. van Nimwegen and R. Nielsen for critical readings of a preliminary version of the manuscript. This research was supported in part by a Howard Hughes Medical Institute grant to New York University through the Undergraduate Biological Sciences Education Program.

