Abstract
Skin pigmentation is a complex trait that varies largely among populations. Most genome-wide association studies of this trait have been performed in Europeans and Asians. We aimed to uncover genes influencing skin colour in African-admixed individuals. We performed a genome-wide association study of melanin levels in 285 Hispanic/Latino individuals from Puerto Rico, analyzing 14 million genetic variants. A total of 82 variants with p-value ≤1 × 10−5 were followed up in 373 African Americans. Fourteen single nucleotide polymorphisms were replicated, of which nine were associated with skin colour at genome-wide significance in a meta-analysis across the two studies. These results validated the association of two previously known skin pigmentation genes, SLC24A5 (minimum p = 2.62 × 10−14, rs1426654) and SLC45A2 (minimum p = 9.71 × 10−10, rs16891982), and revealed the intergenic region of BEND7 and PRPF18 as a novel locus associated with this trait (minimum p = 4.58 × 10−9, rs6602666). The most significant variant within this region is common among African-descent populations but not among Europeans or Native Americans. Our findings support the advantages of analyzing African-admixed populations to discover new genes influencing skin pigmentation.
Similar content being viewed by others
Introduction
Skin pigmentation is essential in the protection against ultraviolet (UV) radiation1,2. A complex regulatory system controls the production of melanin3, the main pigment providing colour to the skin4. Melanin is produced by melanocytes in the epidermis and is deposited in melanosomes, which are transferred to adjacent keratinocytes5. Melanocytes are also implicated in several other important bioregulatory, metabolic and homeostatic processes, both in the skin and in other organs5.
Skin colour varies among different populations and is strongly correlated with latitude due to the variation in UV radiation intensity6. Moreover, several selective factors have been implicated in the evolution of human pigmentation towards darker pigmentation in equatorial and tropical regions2, including: protection against the harmful effects of UV radiation exposure7; protection against folate photolysis2,8; maintenance of adequate levels vitamin D9; and contributing to the skin’s barrier function by optimizing water conservation and improving cutaneous antimicrobial defense10.
The colour of unexposed skin (constitutive skin pigmentation) is a complex trait11. Indeed, evidence supports that many genes and other interacting factors are involved in determining normal skin pigmentation12,13. However, candidate-gene and genome-wide association studies (GWAS) have revealed only a few of the total estimated number of genes implicated in the variability of human skin colour2,14,15. Moreover, despite the large differences in skin pigmentation across populations, most genetic association studies of skin colour have been performed in European16,17 and Asian populations18,19,20. Only a handful of candidate-gene association studies has been performed in African ancestry populations14,21,22,23, and one lone GWAS has been carried out in African-European admixed individuals from Cape Verde24.
Hispanics/Latinos from Puerto Rico are the result of the admixture of European, African, and Native American ancestry. Specifically, the Native American component derives from the Taínos, the native population of Puerto Rico, which was highly reduced by slavery trade, warfare and diseases25, and the European component was introduced by the Spanish settlers26,27,28. Later on, the Spanish brought African slaves who replaced the indigenous population of the island. Therefore, the resulting population has nowadays a predominant European admixture, followed by African ancestry and lower Native American component. Given that performing GWAS in recently admixed African-ancestry populations provides an opportunity to identify novel genes implicated in skin colour variability14, we hypothesized that a GWAS in Hispanics/Latinos from Puerto Rico could reveal novel genes contributing to this trait. Herein, we identify genetic variants influencing skin pigmentation in African-admixed individuals analysing 14 million genetic variants across the genome.
Methods
Ethics statement
This study has been approved by the institutional review boards of University of California San Francisco and all participant centres. Written informed consent was obtained from all subjects or from their appropriate surrogates for participants under 18 years old. All methods were performed in accordance with the relevant guidelines and regulations for human subject research, in accordance with the Declaration of Helsinki.
Study populations
Samples from the Genes-environment & Admixture in Latino Americans (GALA II) Study and the Study of African Americans, Asthma, Genes & Environments (SAGE II) were used for the discovery of genetic variants associated with skin colour and replication of results, respectively. The GALA II and SAGE II studies are two independent case-control studies initially conceived for the study of genetic and environmental factors involved in asthma. Both studies used the same protocol and questionnaires to recruit unrelated children aged 8 to 21 years old, but focused on two different racial/ethnic groups: Hispanics/Latinos in GALA II and African Americans in SAGE II. All recruited subjects must have reported that all four grandparents self-identified as Hispanics/Latinos (GALA II) or African Americans (SAGE II). Participants from GALA II with skin colour measurements included in this study were recruited in Puerto Rico, while SAGE II participants were recruited from the San Francisco Bay Area29,30.
Skin colour characterization
We used the DSM II ColorMeter (Cortex Technology, Hadsund, Denmark) to measure skin pigmentation in triplicate for each participant along the inner side of each upper arm. Melanin was measured using the melanin index, defined as the inverse of the melanin reflectance measured at 650 nm24; lower values of the melanin index correspond to light skin colour, whereas larger values correspond to dark skin colour.
Genotyping and assessment of genetic ancestry
Genome-wide genotyping data from participants of both studies were obtained by using the Axiom LAT1 array (World Array 4, Affymetrix, Santa Clara, CA, United States), and quality control procedures were performed as described elsewhere29,30. Genotype data were obtained for the 285 Hispanics/Latinos from Puerto Rico from GALA II and the 373 African Americans from SAGE II who had available skin colour measurement data.
Genetic ancestry was initially assessed by performing a principal components analysis (PCA) using EIGENSOFT31. We assessed ancestry structure among Hispanics/Latinos from Puerto Rico and African Americans using the African (YRI) and European (CEU) reference populations from the 1000 Genomes Project (1KGP)32, and Native American (NAM) individuals genotyped with the Axiom LAT1 array29. Genetic ancestry proportions for each subject were also estimated with an unsupervised model from ADMIXTURE33, using the CEU and YRI as parental populations for African Americans, and CEU, YRI and NAM as parental populations for Hispanics/Latinos from Puerto Rico.
Imputation, association testing and meta-analysis
Genetic variants located in autosomal chromosomes were imputed by means of the Michigan Imputation Server34, using SHAPEIT35 for haplotype reconstruction, and Minimac3 software for the imputation step36. The first release of the Haplotype Reference Consortium (HRC) was used as the reference population37.
Association testing with skin colour was performed in Hispanics/Latinos from Puerto Rico by means of the linear Wald test implemented in the software EPACTS 3.2.638, adjusting by the proportions of African and Native American ancestries. The results were then filtered to retain those variants with a minor allele frequency (MAF) ≥1% and Rsq ≥0.3. Variants associated with skin pigmentation in the GALA II discovery sample at a suggestive significance level (p ≤ 1 × 10−5) were followed up for replication in SAGE II African Americans. Association testing was performed similarly in the replication sample, with the exception that only African ancestry was used to adjust for genetic ancestry.
Results from the discovery and replication samples were meta-analyzed using METASOFT. Random-effects models were applied for single nucleotide polymorphisms (SNPs) showing heterogeneity of effects between studies (Cochran’s Q test p-value ≤ 0.05) and fixed effects models for those SNPs without evidence of heterogeneity (Cochran’s Q test p-value Q > 0.05)39. Genome-wide significance was declared at p-value ≤ 5 × 10−8.
Chromosomal regions containing variants that were genome-wide significant were plotted for the discovery sample using Locus Zoom 1.1 (ref. 40) based on linkage disequilibrium (LD) data from the 1KGP (GRCh37/hg19 build)32. Independence of association signals with skin colour among SNPs located within the same genomic region was assessed by multivariate linear regression analyses conditioned on the most significant SNP of each region using R 3.2.2 (ref. 41).
Allele frequency distribution assessment of rs6602666
We assessed the distribution of the minor allele frequency of the novel associated variant rs6602666 across different populations. We first used the Geography of Genetic Variants Browser Beta v0.2 to plot allele distributions in African, admixed American, East Asian, European, and South Asian populations from 1KGP Phase III42. Given that Native American populations are not represented in the 1KGP dataset, we downloaded publicly available data for 108 Native American individuals described in Lazaridis et al.43 (7 Bolivian, 12 Karitiana, 18 Mayan, 10 Mixe, 10 Mixtec, 10 Nasoi, 4 Piapoco, 14 Pima, 5 Quechua, 8 Surui, and 10 Zapotec). Allele frequency in those groups was assessed using PLINK44.
Results
Ancestry composition and skin colour distribution
Our analysis of the ancestral composition using PCA revealed that no individuals were outliers regarding their ancestry composition (Supplementary Fig. S1). As expected, Hispanics/Latinos from Puerto Rico had a larger proportion of European and lower contribution of African and Native American admixture compared with African Americans (Table 1, Supplementary Fig. S2). The replication sample showed a predominant African component and, to a lesser extent, European ancestry (Table 1, Supplementary Fig. S2). Therefore, despite being two African-admixed populations, Hispanics/Latinos from Puerto Rico had significantly smaller proportions of African admixture (22.8% ± 9.5%) compared with African Americans (80.9% ± 10.0%, p < 0.001).
A summary of the descriptive data of the individuals from our study is shown in Table 1. Age average and proportion of males were similar across the discovery and replication samples, and neither of those characteristics was associated with skin pigmentation (p > 0.05) and therefore they were not included as covariates in the GWAS. Additionally, Hispanics/Latinos from Puerto Rico had lighter skin (45.8 ± 6.8) than African Americans (71.9 ± 13.5) (p < 0.001). Actual distributions of the melanin index for the two populations are shown in Fig. 1.
Discovery study in Hispanics/Latinos from Puerto Rico
Association analyses of the 14 million imputed variants with MAF ≥1% in the discovery sample revealed a total of 82 SNPs associated with skin colour at a suggestive significance level (p-value ≤ 1 × 10−5) (Supplementary Table S1). No major genomic inflation (λGC = 1.02) was observed in the Q-Q plot (Fig. 2A) and the most significant SNPs were located in chromosomes 5, 10, and 15 (Fig. 2B). The top hit was rs2675345, located within SLC24A5, which was near genome-wide significance (p = 5.83 × 10−8; β for G allele: 3.31, 95% CI: 2.14–4.47).
Replication of associated variants in African Americans and meta-analysis
Of the 82 SNPs that were significant at a suggestive level in Hispanics/Latinos from Puerto Rico, 77 were followed up for replication in the African American sample, since the remaining five were either monomorphic (three SNPs) or had a MAF <1% (two SNPs). Out of the 77 SNPs, 14 replicated in the African American sample (p-value < 0.05) and effect sizes were all in the same direction and of similar magnitude as the discovery sample (Table 2).
The meta-analysis showed evidence of association for nine of the 14 SNPs at a genome-wide significance level (p < 5 × 10−8) (Table 2). These SNPs are located within three genomic regions. Two regions are already known to contribute to skin colour, including SLC24A518,19,20,21,23,45 and surrounding genes (Supplementary Fig. S3) as well as the SLC45A218,24,45 gene (Supplementary Fig. S4). We also report one novel region described for the first time in the current study, located in the intergenic region of BEND7 and PRPF18. The remaining five SNPs not reaching genome-wide significance were located in three genes SST-RTP2, ATP8B4, and EIF2S2-ASIP; the two last genes have previously been associated with skin colour-related traits18,23,46,47.
Three SNPs within SLC24A5 showed the strongest meta-analysis association signals: rs1426654 (β for G allele: 4.36, p = 2.62 × 10−14), rs2675345 (β for G allele: 3.89, p = 2.98 × 10−14), and rs2470102 (β for G allele: 4.31, p = 3.70 × 10−14). We also detected two SNPs near SLC24A5 that were genome-wide significant, one located within DUT (rs11637235, β for T allele: −3.83, p = 3.34 × 10−10) and the other within MYEF2 (rs8028919, β for A allele: −3.31, p = 1.62 × 10−10). After including all five SNPs located within or near SLC24A5 in one common regression model, we determined that the association signal among Hispanics/Latinos from Puerto Rico was driven by the top SNP (rs2675345), as the regression coefficients for the other SNPs were not significant in the common model (Supplementary Table S2).
The second locus with genome-wide significant association with skin colour in the meta-analysis was in SLC45A2: rs16891982 (β for G allele: −2.85, p = 9.71 × 10−10) and rs35397 (β for T allele: −2.66, p = 2.05 × 10−8). These two SNPs had high LD (r2 ≥ 0.82) in both populations and their association with skin pigmentation was driven by rs16891982 (a SNP associated with skin colour by previous studies)18, given that rs35397 lost significance after performing regression analysis conditioned on rs16891982 (p = 0.945) (Supplementary Table S2).
Moreover, two SNPs located in the intergenic region of BEND7 and PRPF18 (Fig. 3) were associated with skin colour at genome-wide significance in the meta-analysis: rs6602665 (β for C allele: 4.01, p = 6.14 × 10−9) and rs6602666 (β for G allele: 4.03, p = 4.58 × 10−9), which showed strong LD in the discovery and replication samples (r2 = 0.99). These SNPs were more significantly associated with skin colour in Hispanics/Latinos from Puerto Rico (β = 4.72, p = 7.27 × 10−7 for both SNPs) than in African Americans (β = 3.20, p = 1.80 × 10−3 and β = 3.14, p = 2.34 × 10−3 for rs6602666 and rs6602665, respectively). As expected by the high LD between the two SNPs, they represented one association signal in regression analysis when both SNPs were incorporated into the same model (p = 0.788 for rs6602666).
The frequency distribution of the G allele of rs6602666 in 1KGP Phase III (Fig. 4) showed that this variant is more prevalent in populations with African ancestry (MAF = 30%) and in South Asians (MAF = 8%), and has a lower frequency in admixed American populations (MAF = 3%). Among admixed American populations, this variant was more prevalent in Puerto Ricans residing in Puerto Rico. In contrast, this variant is almost absent in Europeans and East Asians. An assessment of allele frequency in populations of Native American origin revealed that this variant is monomorphic in the 108 samples from the 11 Native American populations with available data43.
Discussion
In this study, we performed the first GWAS of skin colour in Hispanics/Latinos from Puerto Rico from the GALA II study. After performing genotype imputation and subsequent association testing, we detected 82 suggestive association signals in Hispanics/Latinos, 14 of which replicated at nominal significance in an independent African American sample from the SAGE II study. We identified novel, genome-wide significant associations between skin colour and variants from the BEND7/PRPF18 intergenic region. We also validated the association of five genes already known to contribute to skin colour identified primarily in European populations: two loci with genome-wide significance (SLC24A5 and SLC45A2), and three at a suggestive level (EIF2S2, ASIP, and ATP8B4). In addition to replicating previously described SNPs18,21, our results also revealed additional loci within the same region (e.g., rs2675345 from SLC24A5).
Among the three most significantly associated gene regions, variants near or within SLC24A5 showed the strongest association signals. This gene is located in the 15q21.1 chromosomal band and encodes the NCKX5 protein (solute carrier family 24 [sodium/potassium/calcium exchanger], member 5), an intracellular membrane protein whose function has been associated with skin colour and diseases related to skin pigmentation21,48. The top SNP in our meta-analysis (rs1426654) has been also associated with skin colour in African American and African Caribbean populations in a candidate-gene study21, and has broadly replicated across different populations18,19,20,23,24,45.
We also validated the association of SLC45A2 with skin colour in Hispanics/Latinos from Puerto Rico and African Americans. This gene encodes SLC45A2, which is a transporter highly expressed in the melanosomal membrane of melanocytic cell lines, where it is overexpressed in melanoma cells49. SLC45A2 has been associated with different pigmentary traits (e.g., eye, skin, and hair colour)18 and diseases50. We confirmed the association of a previously described SNP with normal skin pigmentation (rs16891982 [Phe374Leu]), which was first identified in South Asians18 and has since been validated in other populations, including African-admixed individuals24,45.
Notably, we detected two novel genome-wide significant associations (rs6602665 and rs6602666) in the intergenic region of BEND7 and PRPF18 with skin colour. Both variants are located closer to PRPF18 (approximately 23 kb) than to BEND7 (approximately 83 kb). The function of the intracellular protein encoded by BEND7 (BEN domain-containing protein 7) is not extensively known. Nevertheless, it contains the BEN domain, which is involved in transcription regulation throughout recruitment of chromatin remodelling factors and DNA-protein interactions51. The other gene located nearest the top SNP of this region, PRPF18 (pre-mRNA processing factor 18), encodes a splicing factor implicated in pre-mRNA splicing by means of protein-protein interactions52. While no skin pigmentation-specific functions have been attributed to any of these two flanking genes, RNA for both genes is expressed in skin regardless of exposure to UV light, with higher levels of expression for PRPF1853. However, at the protein level, only PRPF18 is expressed in melanocytes and other skin cells54.
Interestingly, based on 1KGP data, the rs6602666 G allele (which is associated with darker skin colour) is present in African, South Asian, and admixed American populations, rare in Europeans, and completely absent in Native Americans. Therefore, differences in the proportion of African genetic ancestry may provide a simple explanation as to why this locus has not been detected in previous GWAS, since they were predominantly focused on populations of European descent. This observation underscores the importance and scientific benefit of studying admixed populations, as the inclusion of genetically diverse groups improves statistical power, particularly when genetic variants are rare55.
Identification of genes implicated in human skin pigmentation has high anthropological, forensic and biomedical interest56,57. For example, genes associated with skin colour are also important in regulating vitamin D levels in Caucasian populations58. Given that vitamin D deficiency has been implicated with a variety of diseases59, and the fact that the majority of circulating vitamin D is derived from photochemical reactions in the skin, genes affecting skin pigmentation could play an indirect role in several diseases60. Furthermore, identification of genes involved in controlling melanin levels in the skin could provide new insights regarding the genetics of several types of skin cancer61. In fact, some of the skin pigmentation associated loci in our present study, such as SLC45A2, have also been associated with protection against basal cell carcinoma, squamous cell carcinoma62, and melanoma among Europeans50, who are at increased risk of developing skin cancer60. Furthermore, another gene associated with skin colour in our study, ASIP, has been previously implicated in basal cell carcinoma16. Therefore, the novel locus associated with skin pigmentation in the current GWAS might also be relevant for skin cancer susceptibility in African-descent populations. Indeed, data from the National Cancer Institute’s Surveillance, Epidemiology and End Results Program have shown that melanoma incidence is lowest among African Americans (1.0% in females and 1.1% in males), intermediate among Hispanics/Latinos (4.4% in females and 4.8% in males), and highest among non-Hispanic whites (19.4% in females and 32.2% in males)63. The lower incidence of melanoma in individuals from populations with darker skin may be attributed to the protective effects of higher melanin levels64. Furthermore, there are differences in the prevalence of other types of skin cancer among Hispanics and African Americans, such as basal cell carcinoma, which is the most prevalent skin cancer in Hispanics65,66 and the second most common skin malignancy in non-Hispanics with African ancestry.
In addition to skin cancer, other diseases of the skin (e.g., vitiligo, psoriasis, or alopecia areata) could be affected by the genetic variants identified in the current study. To date, association studies linking either of the genome-wide significant SNPs (rs6602665 and rs6602666) or their flanking genes (BEND7 and PRPF18) with any of these skin diseases are lacking; future studies should therefore investigate the association of the novel locus with these skin diseases.
Some of the SNPs identified as suggestively associated with skin pigmentation in the current study are located in gene regions previously associated with skin pigmentation, such as EIF2S246, ASIP47, and ATP8B418. Nevertheless, we found another suggestive hit near two genes that had not been previously associated with this trait (SST and RTP2) and deserve further attention in future studies.
Our study has several advantages that should be highlighted: a) skin colour was assessed using skin reflectance spectrometry obtaining a quantitative measure of skin pigmentation, as opposed to many previous studies based on self-reported skin colour; b) skin pigmentation measures were obtained using the same instrument in different recruitment centres participating in the GALA II and SAGE II to reduce possible biases; c) the analysed samples were genotyped with a specific array for African-admixed populations, providing good representation of their genomic variation67; d) for the first time in a GWAS of skin colour, the extensive catalogue of genetic variants provided by whole-genome sequencing data from the HRC reference panel was used37.
The current study also has some limitations that should be considered. Sample size was relatively limited, yet we had sufficient statistical power in the discovery sample (83%) to detect the association of genetic variants with allele frequencies ≥25% and effects sizes (β) ≥ 3.5. Statistical power was limited for variants with lower allele frequencies and modest effect sizes. Moreover, the differing proportions of African admixture and distinct distributions of skin colour between our study populations point to differences in the genetic architecture of skin colour between our two populations. In fact, a proportion of the SNPs associated with skin colour in Hispanic/Latinos was monomorphic or had a low frequency in African Americans, precluding replication attempts of those variants among African Americans. Given that Hispanics/Latinos from Puerto Rico have lower Native American proportions than other Hispanic/Latino subgroups29, our results may not generalize to other Hispanic/Latino groups. Therefore, additional replication should be performed in other populations with different ancestry proportions and skin phototypes.
We measured skin pigmentation using the melanin index. It is possible that our GWAS could have yielded different results had we used alternative methods for measuring melanin, such as pyrrole-2,3,5-trycarboxilic acid (PTCA), aminohydroxyphenylalanine (AHP), or electron paramagnetic resonance spectroscopy (EPR)68. Future studies using these alternative methods may provide convergent validity to our results if the associated loci are truly related to the melanogenesis. Finally, clear functional evidence relating the novel locus, BEND7-PRPF18, with skin pigmentation has not yet been described. Given the involvement of these genes in pre-mRNA processing and transcription regulation, these loci could be related to melanocytic proliferation. Unfortunately, skin biopsies from the patients in this study are unavailable for performing histologic studies or in vitro experiments using epidermal cell cultures. Therefore, the functional role of associated variants will need to be assessed by future studies.
In summary, this GWAS validated the role of SLC24A5 and SLC45A2 with skin melanin levels in Hispanics/Latinos from Puerto Rico and African Americans, and identified a novel association of variants in the intergenic region of BEND7 and PRPF18 with this trait. Therefore, this study reinforces the advantages and the necessity of analyzing African-admixed populations to identify new loci involved in complex traits.
Additional Information
How to cite this article: Hernandez-Pacheco, N. et al. Identification of a novel locus associated with skin colour in African-admixed populations. Sci. Rep. 7, 44548; doi: 10.1038/srep44548 (2017).
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
Jablonski, N. G. The evolution of human skin and skin color. Annu. Rev. Anthropol. 33, 585–623 (2004).
Parra, E. J. Human Pigmentation Variation: Evolution, Genetic Basis, and Implications for Public Health Am. J. Phys. Anthropol Suppl 45, 85–105, doi: 10.1002/ajpa.20727 (2007).
Slominski, A., Tobin, D. J., Shibahara, S. & Wortsman, J. Melanin pigmentation in mammalian skin and its hormonal regulation. Physiol. Rev. 84, 1155–1228, doi: 10.1152/physrev.00044.2003 (2004).
Quevedo, W. & Holstein, T. J. General Biology of Mammalian Pigmentation In The Pigmentary System: Physiology and Pathophysiology(eds Nordlund, J. J., Boissy, R. E., Hearing, V. J., King, R. A., Oetting, W. S. & Ortonne, J.-P. ) 61–90 (Blackwell Publishing Ltd, Oxford, UK, doi: 0.1002/9780470987100.ch3 (2006).
Plonka, P. M. et al. What are melanocytes really doing all day long…? Exp. Dermatol. 18, 799–819, doi: 10.1111/j.1600-0625.2009.00912.x (2009).
Relethford, J. H. Hemispheric difference in human skin color. Am. J. Phys. Anthropol. 104, 449–457, doi: 10.1002/(SICI)1096-8644(199712)104:4<449::AID-AJPA2>3.0.CO;2-N (1997).
Rees, J. L. The genetics of sun sensitivity in humans. Am. J. Hum. Genet. 75, 739–751, doi: 10.1086/425285 (2004).
Off, M. K. et al. Ultraviolet photodegradation of folic acid. J. Photochem. Photobiol. B. 80, 47–55, doi: 10.1016/j.jphotobiol.2005.03.001 (2005).
Holick, M. F. Vitamin D: A millenium perspective. J. Cell Biochem. 88, 296–307, doi: 10.1002/jcb.10338 (2003).
Elias, P. M. & Williams, M. L. Basis for the gain and subsequent dilution of epidermal pigmentation during human evolution: The barrier and metabolic conservation hypotheses revisited. Am. J. Phys. Anthropol. 161, 189–207, doi: 10.1002/ajpa.23030 (2016).
Sturm, R. A., Teasdale, R. D. & Box, N. F. Human pigmentation genes: identification, structure and consequences of polymorphic variation. Gene 277, 49–62, doi: 10.1016/j.tig.2006.06.010 (2001).
Branicki, W., Brudnik, U. & Wojas-Pelc, A. Interactions between HERC2, OCA2 and MC1R may influence human pigmentation phenotype. Ann. Hum. Genet. 73, 160–170, doi: 10.1111/j.1469-1809.2009.00504.x (2009).
Candille, S. I. et al. Genome-wide association studies of quantitatively measured skin, hair, and eye pigmentation in four European populations. PLoS One 7, e48294, doi: 10.1371/journal.pone.0048294 (2012).
Shriver, M. D. et al. Skin pigmentation, biogeographical ancestry and admixture mapping. Hum. Genet. 112, 387–399, doi: 10.1007/s00439-002-0896-y (2003).
Binstock, M., Hafeez, F., Metchnikoff, C. & Arron, S. T. Single-nucleotide polymorphisms in pigment genes and nonmelanoma skin cancer predisposition: a systematic review. Br. J. Dermatol. 171, 713–721, doi: 10.1111/bjd.13283 (2014).
Nan, H., Kraft, P., Hunter, D. J. & Han, J. Genetic variants in pigmentation genes, pigmentary phenotypes, and risk of skin cancer in Caucasians. Int. J. Cancer 125, 909–917, doi: 10.1002/ijc.24327 (2009).
Cook, A. L. et al. Analysis of cultured human melanocytes based on polymorphisms within the SLC45A2/MATP, SLC24A5/NCKX5, and OCA2/P loci. J. Invest. Dermatol. 129, 392–405, doi: 10.1038/jid.2008.211 (2009).
Stokowski, R. P. et al. A genomewide association study of skin pigmentation in a South Asian population. Am. J. Hum. Genet. 81, 1119–1132, doi: 10.1086/522235 (2007).
Basu Mallick, C. et al. The light skin allele of SLC24A5 in South Asians and Europeans shares identity by descent. PLoS Genet. 9, e1003912, doi: 10.1371/journal.pgen.1003912 (2013).
Jonnalagadda, M., Norton, H., Ozarkar, S., Kulkarni, S. & Ashma, R. Association of genetic variants with skin pigmentation phenotype among populations of west Maharashtra, India. Am. J. Hum. Biol. 28, 610–618, doi: 10.1002/ajhb.22836 (2016).
Lamason, R. L. et al. SLC24A5, a putative cation exchanger, affects pigmentation in zebrafish and humans. Science 310, 1782–1786, doi: 10.1126/science.1116238 (2005).
Neitzke-Montinelli, V. et al. Polymorphisms upstream of the melanocortin-1 receptor coding region are associated with human pigmentation variation in a Brazilian population. Am. J. Hum. Biol. 24, 853–855, doi: 10.1002/ajhb.22301 (2012).
Lima, F. A., Goncalves, F. T. & Fridman, C. SLC24A5 and ASIP as phenotypic predictors in Brazilian population for forensic purposes. Leg. Med. (Tokyo) 17, 261–266, doi: 10.1016/j.legalmed.2015.03.001 (2015).
Beleza, S. et al. Genetic architecture of skin and eye color in an African-European admixed population. PLoS Genet. 9, e1003372, doi: 10.1371/journal.pgen.1003372 (2013).
Rouse, I. The Tainos: Rise & Decline of the People Who Greeted Columbus 26–105 (Yale University Press, 1993).
Lisker, R., Ramirez, E. & Babinsky, V. Genetic structure of autochthonous populations of Meso-America: Mexico. Hum. Biol. 68, 395–404, doi: 10.1086/302801 (1996).
Hanis, C. L., Hewett-Emmett, D., Bertin, T. K. & Schull, W. J. Origins of U.S. Hispanics. Implications for diabetes. Diabetes Care 14, 618–627 (1991).
Long, J. C. et al. Genetic variation in Arizona Mexican Americans: estimation and interpretation of admixture proportions. Am. J. Phys. Anthropol. 84, 141–157 (1991).
Pino-Yanes, M. et al. Genetic ancestry influences asthma susceptibility and lung function among Latinos. J. Allergy Clin. Immunol. 135, 228–235, doi: 10.1016/j.jaci.2014.07.053 (2015).
White, M. J. et al. Novel genetic risk factors for asthma in African American children: Precision Medicine and the SAGE II Study. Immunogenetics 68, 391–400, doi: 10.1007/s00251-016-0914-1 (2016).
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909, doi: 10.1038/ng1847 (2006).
Abecasis, G. R. et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65, doi: 10.1038/nature11632 (2012).
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664, doi: 10.1101/gr.094052.109 (2009).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287, doi: 10.1038/ng.3656 (2016).
Delaneau, O., Coulonges, C. & Zagury, J. F. Shape-IT: new rapid and accurate algorithm for haplotype inference. BMC Bioinformatics 9, 540, doi: 10.1186/1471-2105-9-540 (2008).
Fuchsberger, C., Abecasis, G. R. & Hinds, D. A. minimac2: faster genotype imputation. Bioinformatics 31, 782–784, doi: 10.1093/bioinformatics/btu704 (2015).
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283, doi: 10.1038/ng.3643 (2016).
Kang, H. M. EPACTS (Efficient and Parallelizable Association Container Toolbox), http://genome.sph.umich.edu/wiki/EPACTS (Date of access:27/01/2016) (2016).
Han, B. & Eskin, E. Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. Am. J. Hum. Genet. 88, 586–598, doi: 10.1016/j.ajhg.2011.04.014 (2011).
Pruim, R. J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337, doi: 10.1093/bioinformatics/btq419 (2010).
R. Core Team . R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/ (2013).
Marcus, J. H. & Novembre, J. (2016) Visualizing the Geography of Genetic Variants. Bioinformaticsin press, doi: 10.1093/bioinformatics/btw643 (2016).
Lazaridis, I. et al. Genomic insights into the origin of farming in the ancient Near East. Nature 536, 419–424, doi: 10.1038/nature19310 (2016).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575, doi: 10.1086/519795 (2007).
Durso, D. F. et al. Association of genetic variants with self-assessed color categories in Brazilians. PLoS One 9, e83926, doi: 10.1371/journal.pone.0083926 (2014).
Liu, F. et al. Genetics of skin color variation in Europeans: genome-wide association studies with functional follow-up. Hum. Genet. 134, 823–835, doi: 10.1007/s00439-015-1559-0 (2015).
Bonilla, C. et al. The 8818G allele of the agouti signaling protein (ASIP) gene is ancestral and is associated with darker skin color in African Americans. Hum. Genet. 116, 402–406, doi: 10.1007/s00439-004-1251-2 (2005).
Salceda, R. & Sanchez-Chavez, G. Calcium uptake, release and ryanodine binding in melanosomes from retinal pigment epithelium. Cell Calcium 27, 223–229, doi: 10.1054/ceca.2000.0111 (2000).
Bin, B. H. et al. Membrane-Associated Transporter Protein (MATP) Regulates Melanosomal pH and Influences Tyrosinase Activity. PLoS One 10, e0129273, doi: 10.1371/journal.pone.0129273 (2015).
Guedj, M. et al. Variants of the MATP/SLC45A2 gene are protective for melanoma in the French population. Hum. Mutat. 29, 1154–1160, doi: 10.1002/humu.20823 (2008).
Abhiman, S., Iyer, L. M. & Aravind, L. BEN: a novel domain in chromatin factors and DNA viral proteins. Bioinformatics 24, 458–461, doi: 10.1093/bioinformatics/btn007 (2008).
Horowitz, D. S. & Krainer, A. R. A human protein required for the second step of pre-mRNA splicing is functionally related to a yeast splicing factor. Genes. Dev. 11, 139–151, doi: 10.1101/gad.11.1.139 (1997).
GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Uhlen, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419, doi: 10.1126/science.1260419 (2015).
Mensah-Ablorh, A. et al. Meta-Analysis of Rare Variant Association Tests in Multiethnic Populations. Genet. Epidemiol. 40, 57–65, doi: 10.1002/gepi.21939 (2016).
Kayser, M. & Schneider, P. M. DNA-based prediction of human externally visible characteristics in forensics: motivations, scientific challenges, and ethical considerations. Forensic Sci. Int. Genet. 3, 154–161, doi: 10.1016/j.fsigen.2009.01.012 (2009).
Draus-Barini, J. et al. Bona fide colour: DNA prediction of human eye and hair colour from ancient and contemporary skeletal remains. Investig. Genet. 4, 3, doi: 10.1186/2041-2223-4-3 (2013).
Saternus, R. et al. A closer look at evolution: Variants (SNPs) of genes involved in skin pigmentation, including EXOC2, TYR, TYRP1, and DCT, are associated with 25(OH)D serum concentration. Endocrinology 156, 39–47, doi: 10.1210/en.2014-1238 (2015).
Holick, M. F. Vitamin D deficiency. N. Engl. J. Med. 357, 266–281, doi: 10.1056/NEJMra070553 (2007).
Slominski, A. & Postlethwaite, A. E. Skin under the sun: when melanin pigment meets vitamin D. Endocrinology 156, 1–4, doi: 10.1210/en.2014-1918 (2015).
Gerstenblith, M. R., Shi, J. & Landi, M. T. Genome-wide association studies of pigmentation and skin cancer: a review and meta-analysis. Pigment Cell Melanoma Res. 23, 587–606, doi: 10.1111/j.1755-148X.2010.00730.x (2010).
Stacey, S. N. et al. New common variants affecting susceptibility to basal cell carcinoma. Nat. Genet. 41, 909–914, doi: 10.1038/ng.412 (2009).
Howlader, N. et al. SEER Cancer Statistics Review, 1975–2013, National Cancer Institute. Bethesda, MD, http://seer.cancer.gov/csr/1975_2013/, based on November 2015 SEER data submission, posted to the SEER web site. (Date of access: 19/11/2016).
Wu, X. C. et al. Racial and ethnic variations in incidence and survival of cutaneous melanoma in the United States, 1999–2006. J. Am. Acad. Dermatol. 65, S26–37, doi: 10.1016/j.jaad.2011.05.034 (2011).
Halder, R. M. & Bridgeman-Shah, S. Skin cancer in African Americans. Cancer 75, 667–673, doi: 10.1002/1097-0142(19950115)75:2+<667::AID-CNCR2820751409>3.0.CO;2-I (1995).
Koh, D. et al. Basal cell carcinoma, squamous cell carcinoma and melanoma of the skin: analysis of the Singapore Cancer Registry data 1968-97. Br. J. Dermatol. 148, 1161–1166, doi: 10.1001/jamadermatol.2015.5731 (2003).
Hoffmann, T. J. et al. Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm. Genomics 98, 422–430, doi: 10.1016/j.ygeno.2011.08.007 (2011).
Maronas, O. et al. Development of a forensic skin colour predictive test. Forensic Sci. Int. Genet 13, 34–44, doi: 10.1016/j.fsigen.2014.06.017 (2014).
Acknowledgements
The authors acknowledge the patients, families, recruiters, health care providers and community clinics for their participation. In particular, the authors thank Sandra Salazar for her support as the GALAII/SAGE II study coordinator. This work was supported in part by the Sandler Family Foundation, the American Asthma Foundation, the RWJF Amos Medical Faculty Development Program, Harry Wm. and Diana V. Hind Distinguished Professor in Pharmaceutical Sciences II, National Institutes of Health 1R01HL117004, R01Hl128439, National Institute of Health and Environmental Health Sciences R01 ES015794, R21ES24844, the National Institute on Minority Health and Health Disparities 1P60 MD006902, U54MD009523, 1R01MD010443 and the Tobacco-Related Disease Research Program under Award Number 24RT-0025. NH-P was funded by a fellowship from the Spanish Ministry of Education, Culture, and Sports (11608852). SSO was supported in part by the Flight Attendant Medical Research Institute.
Author information
Authors and Affiliations
Contributions
M.P.-Y., C.F., and E.G.B. were involved in the conception, hypotheses delineation, and design of the study. N.H.-P., C.F., S.A., C.E., A.C.Y.M., S.H., D.H., M.J.W., S.S.O., K.M., H.J.F., P.C.A., D.S., S.M.T., E.B.-B., W.R.-C., S.S., R.K., M.L., J.R.R.-S., and M.P.Y. participated in the acquisition of the data or the analysis and interpretation of such information. N.H.-P. and M.P.Y. wrote the manuscript and all authors provided revisions and approval of the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Hernandez-Pacheco, N., Flores, C., Alonso, S. et al. Identification of a novel locus associated with skin colour in African-admixed populations. Sci Rep 7, 44548 (2017). https://doi.org/10.1038/srep44548
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep44548
This article is cited by
-
MC1R and age heteroclassification of face phenotypes in the Rio Grande do Sul population
International Journal of Legal Medicine (2024)
-
Association of pigmentation related-genes polymorphisms and geographic environmental variables in the Chinese population
Hereditas (2021)
-
Prediction of eye and hair pigmentation phenotypes using the HIrisPlex system in a Brazilian admixed population sample
International Journal of Legal Medicine (2021)
-
Skin pigmentation polymorphisms associated with increased risk of melanoma in a case-control sample from southern Brazil
BMC Cancer (2020)
-
The Evolutionary History of Human Skin Pigmentation
Journal of Molecular Evolution (2020)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.