Abstract
Global-scale patterns of human population structure may be influenced by the rate of migration among populations that is nearly eight times higher for females than for males. This difference is attributed mainly to the widespread practice of patrilocality, in which women move into their mates' residences after marriage1. Here we directly test this hypothesis by comparing global patterns of DNA sequence variation on the Y chromosome and mitochondrial DNA (mtDNA) in the same panel of 389 individuals from ten populations (four from Africa and two each from Europe, Asia and Oceania). We introduce a new strategy to assay Y-chromosome variation that identifies a high density of single-nucleotide polymorphisms, allows complete sequencing of all individuals rather than relying on predetermined markers and provides direct sequence comparisons with mtDNA. We found the overall proportion of between-group variation (ΦST) to be 0.334 for the Y chromosome and 0.382 for mtDNA. Genetic differentiation between populations was similar for the Y chromosome and mtDNA at all geographic scales that we tested. Although patrilocality may be important at the local scale2,3, patterns of genetic structure on the continental and global scales are not shaped by the higher rate of migration among females than among males.
Main
Differences in migration rates between males and females can be inferred from analyses of within- and between-group variation (e.g., FST, a measure of genetic differentiation among populations) for the maternally inherited mtDNA and paternally inherited Y chromosome. Using this approach, one study pieced together data sets from published mtDNA, Y-chromosome and autosomal studies and found that the Y chromosome had larger between-group differences than did other portions of the genome1. But these data sets varied considerably with regard to sample sizes, populations represented and method used to assay genetic variation4, making any comparison of the extent of genetic differentiation between populations difficult. This problem is exacerbated for Y chromosomes because they lack sequence diversity5. To accommodate this difficulty, Y-chromosome researchers have adopted a strategy6,7,8,9 to estimate levels of between-group variation using single-nucleotide polymorphisms (SNPs) discovered in small panels of globally diverse males and then genotyped in much larger population samples (e.g., the global data set10 analyzed in ref. 1). This nonrandom sampling of SNPs can result in an ascertainment bias that has an unknown effect on estimates of FST values. Several researchers have suggested that an extensive study of mtDNA and Y-chromosome diversity in the same samples, using the same method to assay variation, is necessary before one can make firm conclusions regarding the relative degree of mtDNA and Y-chromosome differentiation4,11.
Here, we directly compare 6.7 kb of Y-chromosome sequence and 770 bp of the gene mitochondrial cytochrome c oxidase 3 (MTCO3) in a sample of 389 individuals representing ten globally distributed human populations. We assayed Y-chromosome variation using DNA sequences that encompass recently inserted Alu retrotransposons (e.g., the Y family of Alu elements and its subfamilies12; Table 1), because these elements may have a higher mutation rate than other noncoding DNA as a result of their high density of CpG dinucleotides13. Focusing on these regions, we uncovered a much higher density of SNPs than reported in previous surveys. SNP density was 3.15 times higher in our data set than in a study of a pseudogene and other noncoding regions on the Y chromosome5 (comparing data from 73 individuals that overlap between the two studies). Given the value of Watterson's θ (an estimator of the quantity 2Neμ, where Ne is the effective population size and μ is the mutation rate) in our present Alu data set, we estimated the probability of observing a SNP density equal to or lower than that observed in the non-Alu data set to be P = 0.007 using coalescent simulations. Thus, there is a significant increase in DNA sequence variation in regions encompassing Alu elements over the background level of noncoding diversity on the Y chromosome. Using this rich source of polymorphism, we can directly compare sequence variation between the Y chromosome and mtDNA in a manner that is free of ascertainment bias for both loci. We observed 43 Y-chromosome SNPs and 68 mtDNA SNPs (Supplementary Tables 1 and 2 online).
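As a rough illustration of the estimator mentioned above, Watterson's θ can be computed from the number of segregating sites S and the sample size n as S divided by the harmonic sum 1 + 1/2 + ... + 1/(n − 1). The sketch below plugs in the figures quoted in the text (43 Y-chromosome SNPs, 389 chromosomes, 6.7 kb). The function names are hypothetical, and the resulting numbers are illustrative only: the value of θ used in the coalescent simulations may have been computed on a different subset of the data.

```python
def harmonic_number(k: int) -> float:
    """Return the sum 1/1 + 1/2 + ... + 1/k."""
    return sum(1.0 / i for i in range(1, k + 1))


def watterson_theta(segregating_sites: int, n_sequences: int, n_sites: int = 1) -> float:
    """Watterson's estimator S / a_n, optionally scaled to a per-site value.

    a_n is the harmonic sum over 1 .. n_sequences - 1.
    """
    if n_sequences < 2:
        raise ValueError("need at least two sequences")
    a_n = harmonic_number(n_sequences - 1)
    return segregating_sites / a_n / n_sites


# Figures quoted in the text; treating them as one pooled sample is an assumption.
theta_locus = watterson_theta(43, 389)
theta_site = watterson_theta(43, 389, n_sites=6700)
print(f"theta per locus: {theta_locus:.2f}")
print(f"theta per site:  {theta_site:.2e}")
```

Because the Y chromosome is haploid and paternally inherited, this quantity estimates 2Neμ, as stated in the text, rather than the 4Neμ that applies to autosomes.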
We found no evidence that the Y chromosome has a higher level of differentiation between populations than does mtDNA. Using an analysis of molecular variance (AMOVA), we calculated the overall value of ΦST (which approximates the quantity 1/(1 + Nem) assuming an equilibrium island model of population structure14, where m is the migration rate between populations) to be 0.334 for the Y chromosome and 0.382 for mtDNA. We examined distinct geographic regions individually and observed the same pattern of slightly higher ΦST values for mtDNA than for the Y chromosome (Table 2). The between-groups component of genetic variation was higher for mtDNA than for the Y chromosome in every region except Asia, where the Y-chromosome ΦST value slightly exceeded that of mtDNA (by 0.002). An AMOVA that incorporated a hierarchical grouping of populations within continents yielded similar results (Table 3). Values of ΦSC (between-group, within-continent variation) and ΦCT (between-continent variation) were similar for mtDNA and the Y chromosome, though slightly higher for mtDNA.
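To make the island-model approximation concrete, the relation ΦST ≈ 1/(1 + Nem) can be inverted to give Nem ≈ 1/ΦST − 1. The following minimal sketch applies this to the two overall ΦST values reported above. The function name is hypothetical, and the resulting Nem values are not reported in the paper; they hold only under the equilibrium island model assumed in the text.

```python
def nem_from_phi_st(phi_st: float) -> float:
    """Invert Phi_ST ~ 1 / (1 + Ne*m) to get the implied Ne*m."""
    if not 0.0 < phi_st <= 1.0:
        raise ValueError("phi_st must lie in (0, 1]")
    return 1.0 / phi_st - 1.0


# Overall Phi_ST values quoted in the text.
phi_st_values = {"Y chromosome": 0.334, "mtDNA": 0.382}

nem = {locus: nem_from_phi_st(value) for locus, value in phi_st_values.items()}
for locus, value in nem.items():
    print(f"{locus}: implied Ne*m = {value:.2f}")
```

Under this approximation the slightly higher ΦST for mtDNA corresponds to a slightly lower implied Nem for the maternal lineage, which is the opposite of what a large female bias in migration would produce.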
In addition to the overall similarity of between-group components of variation for the Y chromosome and mtDNA, there was a strong and statistically significant correlation of ΦST values between pairs of populations for these loci (Fig. 1; Mantel correlation coefficient = 0.688; P < 0.001). This result has several important implications. First, it indicates that putative gene flow among populations is relatively symmetrical for females and males. The fact that ΦST values for these loci covary so strongly suggests that population-specific processes, such as variation in rates of migration for females versus males, do not influence our divergence data. Second, it indicates that there is no obvious trend towards different rates of divergence for mtDNA versus the Y chromosome. Although the nonindependence of the data points in Figure 1 precludes conventional statistical analyses, we noted that the slope of the regression line suggests a faster increase in divergence between populations for mtDNA than for the Y chromosome, contrary to the pattern that would be expected if females had a higher rate of migration. Finally, the strong correlation between ΦST values for these two compartments of the genome indicates that demographic, rather than locus-specific, evolutionary forces are the primary determinants of genetic distance between the populations we surveyed. Positive directional selection, for instance, operating in a subset of populations on either the Y chromosome or mtDNA would tend to uncouple ΦST values between loci. We see no evidence for such a process in our data.
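The logic of a Mantel test can be sketched in a few lines: correlate the off-diagonal entries of two distance matrices, then permute the population labels of one matrix many times to build a null distribution for that correlation. The code below is a minimal, one-sided sketch in plain Python, not the Arlequin implementation used in the study. The function names and the two 4 × 4 matrices are hypothetical and are not data from the paper.

```python
import random
from math import sqrt


def upper_triangle(matrix):
    """Flatten the entries above the diagonal of a square matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def mantel_test(a, b, permutations=999, seed=0):
    """Return (observed r, one-sided permutation P value) for two distance matrices."""
    rng = random.Random(seed)
    n = len(a)
    b_flat = upper_triangle(b)
    observed = pearson(upper_triangle(a), b_flat)
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        rng.shuffle(order)
        # Permute rows and columns of `a` together, i.e. relabel the populations.
        permuted = [[a[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if pearson(upper_triangle(permuted), b_flat) >= observed - 1e-12:
            hits += 1
    # Count the observed arrangement itself as one of the permutations.
    return observed, (hits + 1) / (permutations + 1)


# Hypothetical pairwise Phi_ST matrices for four populations (not data from the paper).
y_dist = [[0.0, 0.10, 0.40, 0.50],
          [0.10, 0.0, 0.30, 0.45],
          [0.40, 0.30, 0.0, 0.20],
          [0.50, 0.45, 0.20, 0.0]]
mt_dist = [[0.0, 0.15, 0.35, 0.55],
           [0.15, 0.0, 0.30, 0.50],
           [0.35, 0.30, 0.0, 0.10],
           [0.55, 0.50, 0.10, 0.0]]

r, p = mantel_test(y_dist, mt_dist)
print(f"Mantel r = {r:.3f}, P = {p:.3f}")
```

Permuting whole rows and columns, rather than individual matrix entries, is what accounts for the nonindependence of pairwise distances that share a population.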
Our survey of the Y chromosome and mtDNA found between-group components of variation that differ markedly from those reported in previous global studies. Some Y-chromosome studies rely on predetermined SNPs and find that between-population components of genetic variation are much higher than we estimated1,11. Our data suggest that ascertainment biases associated with the use of particular SNPs may result in overestimates of genetic distance between populations. In contrast, previous studies of mtDNA often focus on hypervariable portions of the control region (where high mutation rates may cause a downward bias in estimates of between-group variation15) rather than coding DNA11. When we compared mtDNA coding sequences with hypervariable regions in the same panel of individuals, the coding sequences yielded much higher estimates of between-group variation (Supplementary Note online).
Our interpretation of the between-group components of genetic variation for the Y chromosome and mtDNA in terms of rates of migration relies on the assumption that the effective population sizes of the sexes (and thus of the Y chromosome and mtDNA) are equal. Among human populations, forces that skew the breeding sex ratio probably do so by increasing the number of females relative to males (e.g., owing to the widespread practice of polygyny and rarity of polyandry among cultures16,17, a higher variance in male lifetime reproductive success18 or higher rates of male mortality19); the magnitude of this skew is not known. If the effective size of the human female population is indeed somewhat larger than that of males, then our observation of roughly equal between-group components of variation for the Y chromosome and mtDNA implies a lower rate of migration for females than for males among the widely spaced populations we surveyed.
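The argument in this paragraph can be made explicit with a small calculation. Under the island-model approximation ΦST ≈ 1/(1 + Nem), the ratio of female to male migration rates equals the ratio of the implied Nem values divided by the ratio of female to male effective sizes. The sketch below uses the overall ΦST values reported in this study; the function name is hypothetical, and the effective-size ratios are hypothetical inputs chosen for illustration, since the magnitude of the skew is not known.

```python
def migration_rate_ratio(phi_mt: float, phi_y: float, female_to_male_ne: float = 1.0) -> float:
    """Return m_female / m_male implied by two Phi_ST values.

    Uses Ne*m ~ 1/Phi_ST - 1 for each locus, then divides out the assumed
    ratio of female to male effective population sizes.
    """
    nem_female = 1.0 / phi_mt - 1.0
    nem_male = 1.0 / phi_y - 1.0
    return (nem_female / nem_male) / female_to_male_ne


# Overall Phi_ST values from the text; the Ne ratios are hypothetical.
for ne_ratio in (1.0, 1.5, 2.0):
    ratio = migration_rate_ratio(0.382, 0.334, female_to_male_ne=ne_ratio)
    print(f"Nf/Nm = {ne_ratio:.1f} -> mf/mm = {ratio:.2f}")
```

Any female-to-male effective-size ratio above 1 pushes the implied migration ratio further below 1, which is the direction of the inference drawn in the paragraph above.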
We did not detect the signature of a higher migration rate among populations for females than for males in our global survey, but this does not contradict the evidence for patrilocality effects at local scales. For instance, in a comparison of genetic variation in Northern Thailand2, patrilocal villages were characterized by lower levels of variation for the Y chromosome than for mtDNA and higher Y-chromosome genetic distances between villages, whereas the opposite was true among matrilocal groups. Similar patterns were observed among patrilocal Bedouin tribes from the Sinai Peninsula3. One of the outstanding questions raised by studies such as these is the extent to which local cultural practices influence genetic patterns at the regional and global scale4. At present, there are too few studies that specifically examine these issues of scale with respect to Y-chromosome versus mtDNA differentiation to draw firm conclusions. But our results, taken together with several regional-scale studies that did not detect a genetic signal of increased migration among females versus males20,21,22, suggest that broader-scale genetic patterns may not always reflect the sum of local cultural processes. This may be because other demographic events (e.g., long-distance migrations) become proportionately more important at larger geographic scales, or because behavioral customs of individual populations do not have the temporal or geographic stability necessary to influence global patterning. Although we are unable to distinguish among these hypotheses, our results suggest that the role of female migration is no more important than that of male migration at the continental and global scales.
Note: Supplementary information is available on the Nature Genetics website.
Methods
Population samples.
We examined mtDNA and Y-chromosome variation in the same panel of individuals from ten globally distributed populations, as follows (the number of individuals sampled is indicated in parentheses): Africa: Bakola from Cameroon (25), Dogon from Mali (37), Bantu speakers from South Africa (47), Khoisan from Namibia and South Africa (25); Europe: Dutch (47) and Italians (47); Asia: Mongolian Khalks (46) and Sri Lankans (43); Oceania: highland Papua New Guineans (24) and Baining from New Britain (48). All samples were obtained with informed consent using protocols approved by the Human Subjects Protection Program at the University of Arizona.
DNA sequencing.
Our study focused on a 770-bp region in the gene MTCO3 of the mtDNA. We chose MTCO3 rather than the hypervariable portions of mtDNA to mitigate to the greatest extent possible the degree to which homoplasy would downwardly bias our estimates of population differentiation23. Our survey spanned 13 separate regions from the nonrecombining portion of the Y chromosome. We chose these regions on the basis of three criteria: (i) they fall within introns of single-copy genes24; (ii) they contain at least one element from the Y family (including subfamilies) of Alu retrotransposons12; and (iii) Alu insertions were fixed in our sample. We determined the family affiliation of Alu elements using RepeatMasker. To ensure priming specificity, we located amplification primers in unique regions flanking Alu elements. We then directly sequenced both flanking and Alu DNA. Sequences of amplification and sequencing primers, as well as reaction conditions, are available on request.
Data analysis.
We apportioned diversity within and between populations using an AMOVA, implemented in the program Arlequin v. 2.000 (ref. 25). The resulting ΦST values are especially sensitive to differences in mutation rate23. To minimize biases associated with a higher mutation rate for mtDNA, we calculated genetic distances using a Tamura-Nei distance with high among-site rate heterogeneity (γ = 0.22; data not shown). This measure accommodates homoplasy that may differentially occur in the mtDNA data set. For the Y chromosome, we calculated genetic distance using a Jukes-Cantor model of nucleotide substitution. All results for both loci are insensitive to choice of substitution model. There was evidence for a single recurrent mutation in our Y-chromosome SNP data. Parsimony analysis of the 40 haplotypes observed in our sample of 389 chromosomes resulted in a single tree of 44 steps with a consistency index of 0.977 (Supplementary Fig. 1 online). An A → C transversion occurs twice on the tree; however, because it occurs on separate branches, we were able to identify both mutational events. For the mtDNA MTCO3 locus, we analyzed the entire data set (S = 68), as well as a subset of the data including only synonymous sites (S = 49). Analyses of synonymous sites, which are presumably under less selective constraint than nonsynonymous sites, produced similar results (data not shown) to analyses of the entire data set. We implemented a Mantel test (100,000 permutations) comparing pairwise distances between populations for mtDNA and the Y chromosome in Arlequin.
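For readers unfamiliar with the calculation, the following is a minimal sketch of a single-level AMOVA that uses Jukes-Cantor distances, following the general formulation of ref. 25. It is not the Arlequin implementation. It assumes that each pairwise distance is used directly as the squared-distance term in the sums of squared deviations, and it omits the hierarchical (within-continent) level and permutation testing. The function names and the toy sequences are hypothetical.

```python
from math import log


def jukes_cantor(seq_a: str, seq_b: str) -> float:
    """Jukes-Cantor distance d = -(3/4) ln(1 - 4p/3), p = proportion of differing sites."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    p = sum(x != y for x, y in zip(seq_a, seq_b)) / len(seq_a)
    if p >= 0.75:
        raise ValueError("distance undefined for p >= 0.75")
    return -0.75 * log(1.0 - 4.0 * p / 3.0)


def phi_st(populations: dict) -> float:
    """Single-level AMOVA Phi_ST for a mapping {population name: [aligned sequences]}."""
    samples = [(name, seq) for name, seqs in populations.items() for seq in seqs]
    sizes = [len(seqs) for seqs in populations.values()]
    n_total, n_pops = len(samples), len(sizes)

    total = 0.0
    within = {name: 0.0 for name in populations}
    for i in range(n_total):
        for j in range(i + 1, n_total):
            d = jukes_cantor(samples[i][1], samples[j][1])
            total += d
            if samples[i][0] == samples[j][0]:
                within[samples[i][0]] += d

    # Sums of squared deviations, then mean squares.
    ssd_total = total / n_total
    ssd_within = sum(within[name] / len(seqs) for name, seqs in populations.items())
    msd_among = (ssd_total - ssd_within) / (n_pops - 1)
    msd_within = ssd_within / (n_total - n_pops)

    # Average sample size corrected for unequal population sizes.
    n0 = (n_total - sum(n * n for n in sizes) / n_total) / (n_pops - 1)
    var_among = (msd_among - msd_within) / n0
    denominator = var_among + msd_within
    if denominator == 0:
        raise ValueError("no variation in the sample")
    return var_among / denominator


# Hypothetical toy alignment: two populations of three sequences each.
toy = {
    "pop1": ["ACGTACGT", "ACGTACGT", "ACGTACGA"],
    "pop2": ["ACGAACGT", "ACGAACGT", "ACGAACGA"],
}
print(f"Phi_ST = {phi_st(toy):.3f}")
```

Estimates from this formula can be negative when populations share the same haplotypes in similar proportions; such values are conventionally read as no detectable differentiation.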
URLs.
RepeatMasker is available at http://www.repeatmasker.org/. Arlequin v. 2.000 is available at http://lgb.unige.ch/arlequin/.
GenBank accession numbers.
mtDNA and Y-chromosomal sequences from the 389 individuals in our study, AY714986–AY720431.
References
Seielstad, M.T., Minch, E. & Cavalli-Sforza, L.L. Genetic evidence for a higher female migration rate in humans. Nat. Genet. 20, 278–280 (1998).
Oota, H. et al. Human mtDNA and Y-chromosome variation is correlated with matrilocal versus patrilocal residence. Nat. Genet. 29, 20–21 (2001).
Salem, A.H., Badr, F.M., Gaballah, M.F. & Pääbo, S. The genetics of traditional living: Y-chromosomal and mitochondrial lineages in the Sinai Peninsula. Am. J. Hum. Genet. 59, 741–743 (1996).
Stoneking, M. Women on the move. Nat. Genet. 20, 219–220 (1998).
Hammer, M.F. et al. Human population structure and its effects on sampling Y chromosome variation. Genetics 164, 1495–1509 (2003).
Hammer, M.F. et al. The geographic distribution of human Y chromosome variation. Genetics 145, 787–805 (1997).
Hammer, M.F. et al. Hierarchical patterns of global human Y-chromosome diversity. Mol. Biol. Evol. 18, 1189–1203 (2001).
Romualdi, C. et al. Patterns of human diversity, within and among continents, inferred from biallelic DNA polymorphisms. Genome Res. 12, 602–612 (2002).
Kayser, M. et al. Reduced Y-chromosome, but not mitochondrial DNA, diversity in human populations from West New Guinea. Am. J. Hum. Genet. 72, 281–302 (2003).
Underhill, P.A. et al. Detection of numerous Y chromosome biallelic polymorphisms by denaturing high-performance liquid chromatography. Genome Res. 7, 996–1005 (1997).
Jorde, L.B. et al. The distribution of human genetic diversity: a comparison of mitochondrial, autosomal, and Y-chromosome data. Am. J. Hum. Genet. 66, 979–988 (2000).
Batzer, M.A. & Deininger, P.L. Alu repeats and human genomic diversity. Nat. Rev. Genet. 3, 370–379 (2002).
Labuda, D. & Striker, G. Sequence conservation in Alu evolution. Nucleic Acids Res. 17, 2477–2491 (1989).
Wright, S. Evolution in Mendelian populations. Genetics 16, 97–159 (1931).
Jin, L. & Chakraborty, R. Population structure, stepwise mutations, heterozygote deficiency and their implications in DNA forensics. Heredity 74, 274–285 (1995).
Low, B.S. Measures of polygyny in humans. Curr. Anthropol. 29, 189–194 (1988).
Murdock, G.P. Atlas of World Cultures (University of Pittsburgh Press, Pittsburgh, 1981).
Chagnon, N.A. Is reproductive success equal in egalitarian societies? in Evolutionary Biology and Human Social Behavior: An Anthropological Perspective (eds. Chagnon, N.A. & Irons, W.) 374–401 (Duxbury Press, North Scituate, Massachusetts, 1979).
Alexander, R.D. et al. Sexual dimorphisms and breeding systems in pinnipeds, ungulates, primates, and humans. in Evolutionary Biology and Human Social Behavior: An Anthropological Perspective (eds. Chagnon, N.A. & Irons, W.) 402–435 (Duxbury Press, North Scituate, Massachusetts, 1979).
Fuselli, S. et al. Mitochondrial DNA diversity in South America and the genetic history of Andean highlanders. Mol. Biol. Evol. 20, 1682–1691 (2003).
Mesa, N.R. et al. Autosomal, mtDNA, and Y-chromosome diversity in Amerinds: pre- and post-Columbian patterns of gene flow in South America. Am. J. Hum. Genet. 67, 1277–1286 (2000).
Al-Zahery, N. et al. Y-chromosome and mtDNA polymorphisms in Iraq, a crossroad of the early human dispersal and of post-Neolithic migrations. Mol. Phylogenet. Evol. 28, 458–472 (2003).
Urbanek, M., Goldman, D. & Long, J.C. The apportionment of dinucleotide repeat diversity in Native Americans and Europeans: a new approach to measuring gene identity reveals asymmetric patterns of divergence. Mol. Biol. Evol. 13, 943–953 (1996).
Skaletsky, H. et al. The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature 423, 825–837 (2003).
Excoffier, L., Smouse, P.E. & Quattro, J.M. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131, 479–491 (1992).
Acknowledgements
We thank D. Garrigan and A. Indap for assistance with data analysis; D. Meltzer and S. Zegura for comments; and P. de Knijff, G. Destro-Bisol, J. Friedlaender, T. Jenkins, H. Soodyall and B. Strassmann for samples. This manuscript was made possible by a grant from the National Institute of General Medical Sciences (to M.H.). Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the National Institutes of Health.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Fig. 1
Parsimony network for 40 Y chromosome haplotypes. (PDF 212 kb)
Supplementary Table 1
Y chromosome polymorphic sites by haplotype in each population. (PDF 10 kb)
Supplementary Table 2
mtDNA polymorphic sites by haplotype in each population. (PDF 11 kb)
Supplementary Note
Comparison of between-population components of genetic variation from coding versus hypervariable mtDNA sequence. (PDF 2 kb)
Cite this article
Wilder, J., Kingan, S., Mobasher, Z. et al. Global patterns of human mitochondrial DNA and Y-chromosome structure are not influenced by higher migration rates of females versus males. Nat Genet 36, 1122–1125 (2004). https://doi.org/10.1038/ng1428