The distinctive geographic patterns of common pigmentation variants at the OCA2 gene

Kidd, Kenneth K.; Pakstis, Andrew J.; Donnelly, Michael P.; Bulbul, Ozlem; Cherni, Lotfi; Gurkan, Cemal; Kang, Longli; Li, Hui; Yun, Libing; Paschou, Peristera; Meiklejohn, Kelly A.; Haigh, Eva; Speed, William C.

doi:10.1038/s41598-020-72262-6

Download PDF

Article
Open access
Published: 22 September 2020

The distinctive geographic patterns of common pigmentation variants at the OCA2 gene

Kenneth K. Kidd¹,
Andrew J. Pakstis¹,
Michael P. Donnelly^1,2,
Ozlem Bulbul³,
Lotfi Cherni^4,5,
Cemal Gurkan^6,7,
Longli Kang^8,9,
Hui Li¹⁰,
Libing Yun¹¹,
Peristera Paschou¹²,
Kelly A. Meiklejohn¹³,
Eva Haigh¹ &
…
William C. Speed¹

Scientific Reports volume 10, Article number: 15433 (2020) Cite this article

7256 Accesses
7 Citations
23 Altmetric
Metrics details

Subjects

Abstract

Oculocutaneous Albinism type 2 (OCA2) is a gene of great interest because of genetic variation affecting normal pigmentation variation in humans. The diverse geographic patterns for variant frequencies at OCA2 have been evident but have not been systematically investigated, especially outside of Europe. Here we examine population genetic variation in and near the OCA2 gene from a worldwide perspective. The very different patterns of genetic variation found across world regions suggest strong selection effects may have been at work over time. For example, analyses involving the variants that affect pigmentation of the iris argue that the derived allele of the rs1800407 single nucleotide polymorphism, which produces a hypomorphic protein, may have contributed to the previously demonstrated positive selection in Europe for the enhancer variant responsible for light eye color. More study is needed on the relationships of the genetic variation at OCA2 to variation in pigmentation in areas beyond Europe.

What colour are your eyes? Teaching the genetics of eye colour & colour vision. Edridge Green Lecture RCOphth Annual Congress Glasgow May 2019

Article Open access 23 August 2021

David A. Mackey

Genome-wide association studies

Article 26 August 2021

Emil Uffelmann, Qin Qin Huang, … Danielle Posthuma

On the genetic basis of tail-loss evolution in humans and apes

Article Open access 28 February 2024

Bo Xia, Weimin Zhang, … Itai Yanai

Introduction

Oculocutaneous Albinism type 2 (OCA2) is a gene of interest for several reasons, not the least of which is its role in oculocutaneous albinism with about 30% of worldwide cases accounted for by 154 mutations in the OCA2 gene¹. Two amino acid substitutions in the coding sequence were shown by Sviderskaya et al.² to be associated with decreased expression of the OCA2 protein but not full ocular albinism. OCA2 was subsequently studied for its association with eye color but common variants are associated not just with variation in eye color but also with variation in skin color^3,4,5. Different polymorphisms in the regulatory and coding regions are primarily associated with different eye, hair, and skin pigmentation phenotypes and show large frequency differences among populations from different parts of the world.

Single nucleotide polymorphisms (SNPs) in the molecular region of OCA2 were first implicated in inheritance of eye color variation in Europeans⁶. The strongest evidence was for variation upstream of the OCA2 coding sequences in one of the introns of HERC2⁷, supported by broader population genetics studies⁸. Sturm et al.⁷ showed that rs12913832 disrupted a conserved regulatory region; the region was subsequently confirmed to be an enhancer of OCA2⁹. Functional variation in the HERC2 coding sequences seems unrelated to eye color⁷. OCA2 also has four commonly occurring SNPs that cause amino acid substitutions: rs1800414 (His615Arg), rs74653330 (Ala481Thr), rs1800407 (Arg419Gln), and rs1800401 (Arg305Trp). The Ala481Thr (rs74653330) and Val443Ile (rs121918166) variants were shown² to be hypomorphic but not pathogenic in their studies of ocular albinism. The Val443Ile missense variant (rs121918166) has been reported at < 1% in Scandinavian populations¹⁰. These missense SNPs are distributed across 63 kb of the gene (Table 1); the enhancer SNP (rs12913832) is 38.6 kb from the start of the coding sequence.

Table 1 Five commonly occurring and one rare functional SNP at OCA2 influencing expression of human pigmentation variation.

Full size table

Three of the OCA2 missense SNPs (rs1800414, rs74653330, rs1800407) have been studied in conjunction with pigmentation phenotypes, primarily in European and East Asian populations where the variants are most common. Walsh et al.^11,12,13 found that including the genotype at rs1800407 in a regression equation improved the ability to predict eye color in their samples. Edwards et al.¹⁴ and Yuasa et al.^15,16,17 found that rs1800414 was associated with skin color variation among individuals of East Asian ancestry. Eaton et al.¹⁸ studied both rs1800414 and rs74653330 on East Asians and found them to be independently associated with skin color. Rawofi et al.¹⁹ confirmed the association of rs1800414 with skin color and found it significantly associated with iris color. Lee et al.²⁰ identified the derived allele at rs74653330 at a frequency of about 1% in Europeans. This hypomorphic OCA2*481Thr (rs74653330) allele was later found to be moderately frequent in many East Asian populations^17,21.

Evidence of recent selection for the derived allele of rs12913832 at the enhancer is clearly documented in European populations as is selection for the derived allele at rs1800414 in East Asia⁸. The skin color effects of rs1800414 have been considered an example of parallel evolution for light skin color¹⁴. We are interested in these and other aspects of the population genetics of the OCA2 variants. To that end we have tested (Table 1) four of the functional SNPs in the large number of population samples we have available²². We have also retrieved data on these SNPs from the 1,000 Genomes (1 KG) project website²³ in those populations and assembled the published data on population frequencies. The derived alleles show very distinct biogeographic variation. That global pattern of variation is the focus of this paper.

Methods

Markers and Populations

Table 1 lists the three amino acid substitution SNPs at OCA2, rs1800414, rs74653330, and rs1800407 and the OCA2 enhancer SNP, rs12913832, in an intron of HERC2, that are the focus of this study. Data on all four of these SNPs come primarily from our genotyping studies (76 populations), from a collaboration with co-author Longli Kang (7 populations), and from the 22 relatively unadmixed populations of the 1 KG project (Phase 3)²³. Additional individual SNP frequencies were obtained primarily from the published literature and were entered into the ALFRED database (https://alfred.med.yale.edu) before it became static. A fourth amino acid substitution, rs1800401 (Arg305Trp), has been typed in the 1 KG samples but is not included here because it has been otherwise studied largely in samples defined by pigmentation phenotypes (eye, hair, skin color) in a few populations^3,24,25. The rare amino acid change (Val443Ile) at SNP rs121918166 has only been studied on a small number of European populations¹⁰ and studied for its effect on eye, hair, and skin color. Only three of the 1 KG populations, all European, have the variant allele at rare frequencies ranging from 0.5% to 0.9%. All of the samples were collected with informed consent for population genetic studies such as this. Because all samples are completely anonymous, the allele frequency collection in this study is not considered human research.

Marker Typing

Various methods were used to type the SNPs and are described in the multiple sources of the data. The source of data for each population sample is listed in Table S1 of supplemental data. The populations typed in Kidd Lab as part of this study were typed using TaqMan SNP Genotyping Assays obtained from Applied Biosystems as previously described; data on some of the SNPs in some of the populations were previously published^8,26.

Statistics

As these SNPs are simple co-dominant genetic systems allele frequencies were estimated by simple gene counting. The density plots were produced by Surfer (version 12.8) software (https://www.goldensoftware.com). The haplotype frequencies were estimated using Phase version 2.1.1^27,28. Each population was phased separately.

Results and discussion

We have assembled data on 238 population samples with allele or genotype frequencies for at least one of the four commonly studied variants. Most of those studies have data on two or more of the SNPs (Table S1). 105 population samples have data for all four of those SNPs at OCA2: three amino acid substitution SNPs at OCA2, rs1800414, rs74653330, and rs1800407 and the OCA2 enhancer SNP, rs12913832, in an intron of HERC2. The population samples with OCA2 data are listed in Supplemental Table S1.

Individual SNP frequencies

The population specific allele frequencies of the four functional SNPs noted in Table 1 are given in Supplemental Table S1 and presented as density plots in Figs. 1, 2, 3 and 4; a different graphic representation indicating the frequency data for each specific population sample is given in Supplemental Figs. S1 through S4. All of the functional SNPs have data for many population samples. Each of the Supplemental figures includes all of the population samples with data for any of the SNPs; blanks represent missing data for a given population sample. Each bar in the Supplemental figures represents the data from a single population study involving that SNP; there are several instances of multiple independent samples for the same ethnic/geographic group.

The derived allele at rs1800414 is largely restricted to but common in many East Asian populations (Figs. 1 and S1). This SNP has been studied in many populations that have not been studied for various of the other three SNPs. This variant reaches frequencies over 50% in most of East and Southeast Asia. It has lower frequencies of 5% to 15% in the Pacific populations and in Central and Northern Asia as well as Tibet and other parts of Southwestern China.

The derived allele at the missense SNP, rs74653330 (Ala481Thr) (Figs. 2 and S2) has been studied less comprehensively than rs1800414 but occurs widely in Northern Eurasia and is especially common in Eastern Siberian and Mongolian populations The report of a frequency of 52% in the Oroqen (sampled in northern China near the Russian border) is an outlier in terms of frequency but not geography: it was omitted from Fig. 2 but not Fig. S2. Off the scale of Fig. 2 (frequencies < 4%) the derived allele occurs rarely in most of Europe, in some Southwest Asian populations (Turkish, Iranians), in South Asia (Hazara), and in China (Tibetans). In northern Europe it occurs at low frequencies (1% to 3%) in some populations (Chuvash, Vologda Russians) and reaches 5% to 7% in Finnish samples. Given that the derived allele at rs74653330 is hypomorphic, it is a clear candidate for studies of selection favoring the allele in the northern populations.

The derived allele at rs1800407 (Figs. 3 and S3) occurs at low frequencies in most populations in North Africa, Europe, South Asia, and in some populations in East Asia but mostly off the scale in Fig. 3 which is driven primarily by a few values greater than 10% frequency. For example, in 18 Spanish Basques the frequency is 21% while in 14 Orcadians, the frequency is 14%.

The rs12913832 SNP (Figs. 4 and S4) is the enhancer polymorphism and has the largest number of population samples with data since most studies of other pigmentation SNPs have also included rs12913832. This variant is well known for high frequencies in Northern Europe (70% to 95%) as seen in Figs. 4 and S4. It is found at more moderate frequencies in populations from Southern Europe, Southwest Asia, North Africa, and at lower frequencies (5% to 20%) in South and Central Asia. It is seen less frequently in North and East Asia and in the Native American populations. While admixture of Europeans in Native American populations is common, our studies overall show very low frequencies in our specific population samples except for the Maya sample (Fig. S4). Given the evidence of the variant in Northern Asia, the likely ancestral region for Native Americans, it is possible that the existence of the promoter variant at a low frequency in Native Americans is ancestral and not due to recent admixture. The same possibility applies to the presence in Australian Aborigines. The subset of 39 less admixed Australian Aborigines have a 15% frequency compared to a frequency of 40% in the full sample of 102 Aborigines.

SNPs rs1800401 and rs121918166 have not been studied in as many populations as any of the four other SNPs and we have not considered them in this study. The variant at rs121918166 has only been observed at rare frequencies in Scandinavians. Based on the populations in the 1 KG the derived allele at rs1800401 occurs most frequently, 10% to 20%, in African and South Asian populations and is absent to < 12% in East Asia and Europe.

Evidence argues that the variant alleles at the four common SNPs depicted in Figs. 1, 2, 3 and 4 are functional^{2,8,9,11,17,24}. Each of the four variants has a distinct geographic distribution but overlaps exist. In East Asia the hypomorphic rs74653330 allele overlaps somewhat with the rs1800414 variant but they appear to occur on separate haplotypes in the population. However, both the enhancer variant at rs12913832 and the amino acid substitution at rs1800407 occur frequently in Europe and surrounding areas and occur on the same chromosome at some unclear frequency.

Two SNP haplotype–rs1800407 and rs12913832

The interaction between the rs12913832 and rs1800407 loci is interesting. The variant allele at rs1800407 has been included in the equations used for eye color prediction¹¹ for nearly a decade and was suggested by Sturm et al.⁷ as functioning to increase the penetrance of the enhancer variant. Duffy et al.²⁹ notes that heterozygosity for the derived allele at rs1800407 decreases the probability of green eyes on the homozygous derived rs12913832 background but increases it on a heterozygous rs12913832 background. Several studies have referred to the relationship of rs1800407, especially the 419Gln allele, and the enhancer variant as an example of epistasis^29,30,31,32. However, if we consider the functional unit as production of a protein we necessarily include the rate of production of mRNA and the coding content of that mRNA. The term epistasis seems inappropriate because these two DNA variants are not functionally independent loci. The haplotype is the functional unit and the locus can be considered as a four-allele locus, at least with respect to the enhancer and rs1800407 (Table 2). The phenotypes determined by three of the alleles (haplotypes) are clear; the fourth is not clear from existing studies.

Table 2 Haplotypes of the ancestral 419Arg and derived 419Gln alleles at rs1800407 and the enhancer normal (E +) and negative (E-) alleles at rs12913832. The doubly-derived (cis) haplotype has frequency estimates of 1% to 3.6% in 14 European populations (see Table S2).

Full size table

If the doubly-derived chromosome for rs1800407 and rs12913832 results in “higher penetrance” for light eye color, the derived allele at rs1800407 must have a functional difference. While it was not studied by Sviderskaya et al.², an obvious implication is that it is a hypomorphic allele. These cis chromosomes would have reduced production (because of the enhancer variant) of a hypomorphic OCA2 protein (because of the 419Gln allele at rs1800407). Selection operated on some trait to increase the frequency of the enhancer variant; this cis combination of the two variants with a presumably hypomorphic protein might have been more strongly affected.

On a background of homozygosity for the enhancer (rs12913832) variant, the frequency of heterozygotes of the amino acid substitution (rs1800407) is 246/(246 + 3,039) or 7.5% in Duffy’s largely British origin population sample. Those genotypes involve one chromosome that is doubly-derived (i.e., cis) for the two variants and one that has only the enhancer variant. On a heterozygous enhancer background genotype, however, the amino acid substitution heterozygotes occur at a higher frequency of 529/(529 + 1,248) or 29.8%. Those nearly 30% of individuals are composed of both cis and trans genotypes for the two functional variants. The evidence is consistent with those two genotypes having different phenotypes as would be predicted by considering the functional context: the cis genotype has one fully normal protein at normal amounts and one variant protein produced at reduced amounts; the trans genotype has a normal protein at reduced amounts and a variant protein at normal amounts.

The proportions of the two enhancer genotypes in that study²⁹ are not necessarily in HW proportions depending on how they were ascertained, which is not specified. In fact, the ratio of the enhancer homozygotes to heterozygotes is 1.849 which is compatible with an enhancer variant frequency of about 0.79, essentially the same as in our summary (Table S1) for Northwest Europe. However, the frequency of the amino acid substitution is not so easily estimated from these data.

By maximum likelihood the phase of the ambiguous double heterozygotes will be estimated to be partly genotypes with the derived alleles in cis if there is evidence that the cis allele exists. We find (Table 3) that direct gene counting evidence of the cis haplotype is seen primarily in northern Europeans. In those populations with the gene counting evidence for this haplotype the frequency of the cis haplotype is 3%. We note the higher frequencies are in the British, Irish, and CEU samples. Several individuals in these and other populations in northern Europe and elsewhere are double heterozygotes with phase to be estimated statistically. The uncertainties of statistical phasing make it difficult with the existing sample sizes to give exact proportions of the two relevant genotypes, cis and trans. Gene counting evidence exists for both the cis and trans chromosomes; the doubly heterozygous genotype must be apportioned statistically and that is the source of uncertainty given the small numbers of the relevant genotypes setting the expectation (Table 3).

Table 3 Observed genotype counts for rs1800407 and rs12913832 among individuals with no missing data for these two SNPs. The groups shown are primarily the subset of 105 populations in Fig. 2 from world regions (Europe/SWAsia/NAfrica/SCAsia) where double heterozygotes were observed. The cells with bold underlined values indicate definite evidence of the cis (doubly-derived) haplotype,TG; see text.

Full size table

There are 10 genotypes possible for the four haplotypes of the rs12913832 and rs1800407 variants. How all of those genotypes relate to phenotypes under selection is not known. The haplotype frequency distribution bar plots of the two SNPs common in Europe, Southwest Asia, and North Africa among 105 populations are shown in Fig. 5. (The haplotype frequencies are in supplemental Table S2). In our data we have seen direct evidence for 8 of those genotypes (Table 4). The variants at rs12913832 and rs1800407 occur in cis at the highest frequencies in the northern European samples and the gene-counting evidence for the cis chromosome occurs almost exclusively in these northern European populations (Table 3). These haplotypes are relevant to how genotypes might influence pigmentation and selection in those northern European populations. While random genetic drift can always be a possible explanation for the pattern, it seems a highly unlikely explanation for the evidence of this doubly-derived chromosome to exist only in the populations for which evidence of selection on the enhancer chromosomes is strongest.

Table 4 Distribution of individuals (by direct gene counting and by inference) in 105 populations for the 10 possible genotypes of the 2-SNP haplotype based on rs1800407, rs12913832.

Full size table

We can expect, given the functional variation at each site, that all but the homozygote for the doubly ancestral genotype will have some positive effect toward lighter pigmentation. However, evidence for the effect on eye color of just the variant (419Gln) as heterozygous with a fully ancestral chromosome is largely absent; its expected frequency is quite small. Even assuming the haplotypes affect phenotype additively, to estimate the three different fitness parameters associated with the three derived chromosomes seems beyond the power of the existing data. We leave such estimation to others.

Many studies have reported on use of the genotypes at these sites at OCA2 and SNPs at other genes to infer the iris, hair, and skin color of an individual from that individual’s DNA. Those efforts are most recently integrated into the HIrisplex-S web site (https://hirisplex.erasmusmc.nl/). Such phenotype inference from a DNA sample can be very useful as an investigative lead in criminal forensics. Our data summaries demonstrate that two of the SNPs, rs12913832 and rs1800414, have common variants with strikingly different geographic patterns that makes them relevant to inference of biogeographic ancestry in some parts of the world. Indeed, rs12913832, the enhancer SNP, was incorporated in the Kidd Lab panel of 55 ancestry informative SNPs²⁶ and rs1800414 is part of the 74 SNPs in a panel by Li et al.³³.

The population distribution of the chromosome with the derived enhancer variant (rs12913832) and the derived amino acid variant (419Gln for rs1800407) in cis is seen almost exclusively in northern Europe. Elsewhere, the rs1800407 variant (419Gln) occurs on a chromosome with the ancestral allele at the enhancer. The common occurrence of the doubly-derived (cis) chromosome, primarily in the populations with the strongest evidence of selection for the enhancer variant, strongly suggests selection on this chromosome in northern Europe. The north Eurasia distribution of the hypomorphic allele–481Thr at rs74653330–suggests parallel evolution for this variant as well.

Our understanding of the role of the known functional and enhancer variants in human pigmentation phenotypes has grown markedly in recent decades but, thus far, the relationships have only been studied simultaneously and in relatively large samples in a subset of populations of European and East Asian ancestry. The very strong geographical frequency patterns shown by the existing patchwork of genetic data in the OCA2-HERC2 gene region are more extensive and suggest that more empirical studies are needed from more world regions so that we can refine and improve our knowledge. The studies supporting strong selection effects done thus far also support the view that more studies are important. Other genetic loci are known to influence pigmentation phenotypes. Their relative roles and the magnitude of their effects during development as well as the evolutionary impact of non-genetic factors will be more clearly understood when we have more worldwide data on the OCA2-HERC2 gene region.

Informed consent

All subjects gave permission for collection of samples and use in population studies such as this. All samples are anonymous.

Data availability

Allele frequencies for each of the three functional SNPs and one enhancer SNP analyzed along with the 2-SNP haplotype frequencies are in supplementary Table S1. Almost all of the individual SNP frequencies and their literature citations are also available in the ALFRED database which is freely accessible online. Five literature citations are given in Table S1 for the allele frequencies of a small number of populations that were more recently published and are not present in the static version of ALFRED.

References

Kamaraj, B. & Purohit, R. Mutational analysis of oculocutaneous albinism: A compact review. BioMed Res. Int. 2014, 905472 (2014).
PubMed PubMed Central Google Scholar
Sviderskaya, E. V. et al. Complementation of hypopigmentation in p-mutant (Pink-Eyed Dilution) mouse melanocytes by normal human P cDNA, and defective complementation by OCA2 mutant sequences. J. Investig. Derm. 108, 30–34 (1997).
Article CAS Google Scholar
Rebbeck, T. R. et al. P gene as an inherited biomarker of human eye colour. Cancer Epidemiol. Biomark. Prev. 11, 782–784 (2002).
CAS Google Scholar
Sturm, R. A. & Larsson, M. Genetics of human iris colour and patterns. Pigment Cell Melanoma Res. 22, 544–562 (2009).
Article PubMed CAS Google Scholar
Liu, F., Wen, B. & Kayser, M. Colorful DNA polymorphisms in humans. Semin. Cell Dev. Biol. 24, 562–575 (2013).
Article PubMed CAS Google Scholar
Eiberg, H. & Mohr, J. Assignment of genes coding for brown eye colour (BEY2) and brown hair colour (HCL3) on chromosome 15q. Eur. J. Hum. Genet. 4, 237–241 (1996).
Article PubMed CAS Google Scholar
Sturm, R. A. et al. A single SNP in an evolutionary conserved region within intron 86 of the HERC2 gene determines human blue-brown eye color. Am. J. Hum. Genet. 82, 424–431 (2008).
Article PubMed PubMed Central CAS Google Scholar
Donnelly, M. P. et al. A global view of the OCA2-HERC2 region and pigmentation. Hum. Genet. 131, 683–696 (2012).
Article PubMed CAS Google Scholar
Visser, M., Kayser, M. & Palstra, R. J. HERC2 rs12913832 modulates human pigmentation by attenuating chromatin-loop formation between a long-range enhancer and the OCA2 promoter. Genome Res. 22, 446–455 (2012).
Article PubMed PubMed Central CAS Google Scholar
Andersen, J. D. et al. Importance of nonsynonymous OCA2 variants in human eyecolor prediction. Molec. Genet. Genom. Med. 4, 420–430 (2016).
Article CAS Google Scholar
Walsh, S. et al. IrisPlex: a sensitive DNA tool for accurate prediction of blue and brown eye colour in the absence of ancestry information. Forensic Sci. Int. Genet. 5, 170–180 (2011).
Article PubMed CAS Google Scholar
Walsh, S. A. et al. DNA-based eye colour prediction across Europe with the IrisPlex system. Forensic Sci. Int. Genet. 6, 330–340 (2012).
Article PubMed CAS Google Scholar
Walsh, S. et al. (2013) The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Sci. Int. Genet. 7, 98–115 (2013).
Article PubMed CAS Google Scholar
Edwards, M. et al. Association of the OCA2 polymorphism His615Arg with melanin content in East Asian populations: further evidence of convergent evolution of skin pigmentation. PLoS Genet. 6, e1000867 (2010).
Article PubMed PubMed Central Google Scholar
Yuasa, I. et al. OCA2⁄481Thr, a hypofunctional allele in pigmentation, is characteristic of northeastern Asian populations. J. Hum. Genet. 52, 690–693 (2007).
Article PubMed CAS Google Scholar
Yuasa, I. et al. Distribution of two Asian-related coding SNPs in the MC1R and OCA2 genes. Biochem. Genet. 45, 535–542 (2007).
Article PubMed CAS Google Scholar
Yuasa, I., Harihara, S., Jin, F. & Saitou, N. Distribution of OCA2*481Thr and OCA2*615Arg, associated with hypopigmentation, in several additional populations. Legal Med. 13, 215–217 (2011).
Article PubMed CAS Google Scholar
Eaton, K. et al. Association study confirms the role of two OCA2 polymorphisms in normal skin pigmentation variation in East Asian populations. Am. J. Hum. Biol. 27, 520–525 (2015).
Article PubMed Google Scholar
Rawofi, L., Edwards, M., Norton, H. & Parra, E. Genome-wide association study of pigmentary traits (skin and iris color) in individuals of East Asian ancestry. PeerJ 5, e2951. https://doi.org/10.7717/peerj.3951 (2017).
Article CAS Google Scholar
Lee, S. T. et al. Mutations of the P gene in oculocutaneous albinism, ocular albinism, and Prader-Willi syndrome plus albinism. N. Engl. J. Med. 330, 529–534 (1994).
Article PubMed CAS Google Scholar
Suzuki, T., Miyamura, Y. & Tomita, Y. High frequency of the Ala481Thr mutation of the P gene in the Japanese population. Am. J. Med. Genet. 118A, 402–403 (2003).
Article PubMed Google Scholar
Pakstis, A. J. et al. Increasing the reference populations for the 55 AISNP panel: the need and benefits. Int. J. Legal Med. 131, 913–917 (2017).
Article PubMed PubMed Central Google Scholar
The 1000 Genomes Project Consortium, Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Duffy, D. L. et al. A three-single-nucleotide polymorphism haplotype in intron 1 of OCA2 explains most human eye-color variation. Am. J. Hum. Genet. 80, 241–252 (2007).
Article PubMed CAS Google Scholar
Siewierska-Górska, A., Sitek, A., Żądzińska, E., Bartosz, G. & Strapagiel, D. Association of five SNPs with human hair colour in the Polish population. Homo 68, 134–144 (2017).
Article PubMed Google Scholar
Kidd, K. K. et al. Progress toward an efficient panel of SNPs for ancestry inference. Forensic Sci. Int. Genet. 10, 23–32 (2014).
Article PubMed CAS Google Scholar
Stephens, M., Smith, N. J. & Donnelly, P. A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet. 68, 978–989 (2001).
Article PubMed PubMed Central CAS Google Scholar
Stephens, M. & Scheet, P. Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation. Am. J. Hum. Genet. 76, 449–462 (2005).
Article PubMed PubMed Central CAS Google Scholar
Duffy, D. Genetics of eye colour. In eLS. Chichester, UK: Wiley (2015). doi.org/10.1002/9780470015902.a0024646.
Pospeich, E. et al. Gene-gene interactions contribute to eye colour variation in humans. J. Hum. Genet. 56, 447–455 (2011).
Article Google Scholar
Pospiech, E. et al. The common occurrence of episasis in the determination of human pigmentation and its impact on DNA-based pigmentation phenotype prediction. Forensic Sci. Int. Genet. 11, 64–72 (2014).
Article PubMed CAS Google Scholar
Wollstein, A. et al. (2017) Novel quantitative pigmentation phenotyping enhances genetic association, epistasis, and prediction of human eye colour. Sci. Rep. 7, 43359 (2017).
Article ADS PubMed PubMed Central Google Scholar
Li, C. X. et al. A panel of 74 AISNPs: Improved ancestry inference within Eastern Asia. Forensic Sci. Int. Genet. 23, 101–110 (2016).
Article PubMed CAS Google Scholar

Download references

Acknowledgments

The assembly and data analyses were funded primarily by NIJ Grants 2018-75-CX-0041, 2015-DN-BX-K023, 2016-DN-BX-0162, and 2014–DN-BX-K030 to KKK awarded by the National Institute of Justice, Office of Justice Programs, U.S. Department of Justice. Points of view in this presentation are those of the authors and do not necessarily represent the official position or policies of the U.S. Department of Justice. The authors thank Dr. Francoise R. Friedlaender for her expert help in creating Figs. 1, 2, 3 and 4. We would like to acknowledge all of our collaborators who helped collect the samples used in this research as well as the National Laboratory for the Genetics of Israeli Populations at Tel Aviv University and the Coriell Cell Repositories. Special thanks are due to the many hundreds of individuals who volunteered to give blood or saliva samples for studies of gene frequency variation.

Author information

Authors and Affiliations

Professor Emeritus, Department of Genetics, Yale University School of Medicine, P.O. Box 208005, New Haven, CT, 06520-8005, USA
Kenneth K. Kidd, Andrew J. Pakstis, Michael P. Donnelly, Eva Haigh & William C. Speed
Biological and Environmental Sciences, Troy University, Dothan, AL, 36303, USA
Michael P. Donnelly
Institute of Forensic Science, Istanbul University–Cerrahpasa, Istanbul, 34500, Turkey
Ozlem Bulbul
Laboratory of Genetics, Immunology and Human Pathologies, Faculty of Sciences of Tunis, University of Tunis El Manar, 2092, Tunis, Tunisia
Lotfi Cherni
Higher Institute of Biotechnology of Monastir, Monastir University, 5000, Monastir, Tunisia
Lotfi Cherni
Turkish Cypriot DNA Laboratory, Committee on Missing Persons in Cyprus Turkish Cypriot Member Office, Nicosia, North Cyprus), Turkey
Cemal Gurkan
Dr. Fazıl Küçük Faculty of Medicine, Eastern Mediterranean University, Famagusta (North Cyprus), Turkey
Cemal Gurkan
Key Laboratory forMolecular GeneticMechanisms and Intervention Research On High Altitude Disease of Tibet Autonomous Region, School of Medicine, Xizang Minzu University, Xianyang, 712082, Shaanxi, China
Longli Kang
Key Laboratory of High Altitude Environment and Genes Related To Disease of Tibet Ministry of Education, School of Medicine, Xizang Minzu University, Xianyang, 712082, Shaanxi, China
Longli Kang
MOE State Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
Hui Li
Institute of Forensic Medicine, West China College of Preclinical and Forensic Medicine, Sichuan University, No.16. Section 3. RenMin Nan Road, Chengdu, 610041, Sichuan, China
Libing Yun
Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
Peristera Paschou
Department of Population Health and Pathobiology, North Carolina State University, 1060 William Moore Drive, Raleigh, NC, 27607, USA
Kelly A. Meiklejohn

Authors

Kenneth K. Kidd
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Pakstis
View author publications
You can also search for this author in PubMed Google Scholar
Michael P. Donnelly
View author publications
You can also search for this author in PubMed Google Scholar
Ozlem Bulbul
View author publications
You can also search for this author in PubMed Google Scholar
Lotfi Cherni
View author publications
You can also search for this author in PubMed Google Scholar
Cemal Gurkan
View author publications
You can also search for this author in PubMed Google Scholar
Longli Kang
View author publications
You can also search for this author in PubMed Google Scholar
Hui Li
View author publications
You can also search for this author in PubMed Google Scholar
Libing Yun
View author publications
You can also search for this author in PubMed Google Scholar
Peristera Paschou
View author publications
You can also search for this author in PubMed Google Scholar
Kelly A. Meiklejohn
View author publications
You can also search for this author in PubMed Google Scholar
Eva Haigh
View author publications
You can also search for this author in PubMed Google Scholar
William C. Speed
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.K.K. and M.P.D. designed the study. A.J.P. and W.C.S. analyzed the data. Other authors helped collect the data and/or contributed new samples. A.J.P. and K.K.K. were primarily responsible for writing the manuscript. All authors read and commented on the manuscript.

Corresponding author

Correspondence to Kenneth K. Kidd.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Tables.

Supplementary Figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kidd, K.K., Pakstis, A.J., Donnelly, M.P. et al. The distinctive geographic patterns of common pigmentation variants at the OCA2 gene. Sci Rep 10, 15433 (2020). https://doi.org/10.1038/s41598-020-72262-6

Download citation

Received: 14 April 2020
Accepted: 17 August 2020
Published: 22 September 2020
DOI: https://doi.org/10.1038/s41598-020-72262-6

This article is cited by

Ungewohnte klinische Bilder
- Peter Schmid-Grendelmeier
Deutsche Dermatologie (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.