Novel quantitative pigmentation phenotyping enhances genetic association, epistasis, and prediction of human eye colour

Wollstein, Andreas; Walsh, Susan; Liu, Fan; Chakravarthy, Usha; Rahu, Mati; Seland, Johan H.; Soubrane, Gisèle; Tomazzoli, Laura; Topouzis, Fotis; Vingerling, Johannes R.; Vioque, Jesus; Böhringer, Stefan; Fletcher, Astrid E.; Kayser, Manfred

doi:10.1038/srep43359

Download PDF

Article
Open access
Published: 27 February 2017

Novel quantitative pigmentation phenotyping enhances genetic association, epistasis, and prediction of human eye colour

Andreas Wollstein^1,2,3,
Susan Walsh^1,4,
Fan Liu^1,5,
Usha Chakravarthy⁶,
Mati Rahu⁷,
Johan H. Seland⁸,
Gisèle Soubrane⁹,
Laura Tomazzoli¹⁰,
Fotis Topouzis¹¹,
Johannes R. Vingerling¹²,
Jesus Vioque¹³,
Stefan Böhringer²,
Astrid E. Fletcher¹⁴ &
…
Manfred Kayser¹

Scientific Reports volume 7, Article number: 43359 (2017) Cite this article

4536 Accesses
25 Citations
25 Altmetric
Metrics details

Subjects

Abstract

Success of genetic association and the prediction of phenotypic traits from DNA are known to depend on the accuracy of phenotype characterization, amongst other parameters. To overcome limitations in the characterization of human iris pigmentation, we introduce a fully automated approach that specifies the areal proportions proposed to represent differing pigmentation types, such as pheomelanin, eumelanin, and non-pigmented areas within the iris. We demonstrate the utility of this approach using high-resolution digital eye imagery and genotype data from 12 selected SNPs from over 3000 European samples of seven populations that are part of the EUREYE study. In comparison to previous quantification approaches, (1) we achieved an overall improvement in eye colour phenotyping, which provides a better separation of manually defined eye colour categories. (2) Single nucleotide polymorphisms (SNPs) known to be involved in human eye colour variation showed stronger associations with our approach. (3) We found new and confirmed previously noted SNP-SNP interactions. (4) We increased SNP-based prediction accuracy of quantitative eye colour. Our findings exemplify that precise quantification using the perceived biological basis of pigmentation leads to enhanced genetic association and prediction of eye colour. We expect our approach to deliver new pigmentation genes when applied to genome-wide association testing.

Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers

Article Open access 18 November 2021

Mapping and annotating genomic loci to prioritize genes and implicate distinct polygenic adaptations for skin color

Article Open access 07 June 2024

A systematic review of skin ageing genes: gene pleiotropy and genes on the chromosomal band 16q24.3 may drive skin ageing

Article Open access 30 July 2022

Introduction

Human eye colour is determined by the type, amount, and distribution of two forms of pigment produced in the melanocytes of the iris, eumelanin and pheomelanin. Eumelanin is a highly compact pigment, packed in ovoid eumelanosomes¹, which absorbs nearly the full light spectrum and is perceived as dark-brown to black colour. Pheomelanin in contrast, is a more sparse pigment that reflects in contrast to eumelanosomes a much broader light spectrum and is perceived as yellow to red colour^1,2. With the complete absence of both pigments, the light is reflected by the stroma of the iris, and the eye colour is perceived as grey to blue through Tyndall scattering³. As with many traits, the nature of human eye colour variation is continuous, spanning from the lightest shades of grey or blue to the darkest shades of brown or black⁴. Dark eye colour reflects the ancestral state in humans linked to their commonly believed origin in Africa, while light eye colour is assumed to be derived; shaped by positive selection perhaps due to sexual selection during European history⁵.

Several gene-mapping studies on eye colour were previously conducted by using manually defined phenotype categories^{6,7,8,9,10,11,12,13,14}, inevitably oversimplifying the continuous nature of human eye colour variation. Although this incomplete use of the underlying basics of eye colour variation reduces the thoroughness of such studies, several eye colour genes were previously identified with this simplified phenotyping approach^{5,6,7,8,9,10,11,12,13}. Moreover, it provides an accurate prediction from DNA with reasonably high accuracies for at least the extreme categories of blue and brown demonstrated via the IrisPlex system¹⁵, a system consisting of only six single nucleotide polymorphisms (SNPs) from six genes. Consequently, eye colour was one of the first externally visible characteristic for which the concept of Forensic DNA Phenotyping (FDP)^16,17 was put into practice¹⁵, later followed by hair colour¹⁸ and most recently by skin colour¹⁹. However, as described elsewhere²⁰ there is a desire to move pigmentation colour prediction from the currently applied categorical level to the continuous level. As prerequisite, this requires an understanding of the genes that determine eye colour in its fully continuous spectrum as well as methodology that allows the capture of continuous eye colour as accurately and completely as possible.

The first quantitative approach to measure eye colour was proposed by Frudakis²¹, who introduced two quantities representing the iris colour properties, i.e. the iris colour score, and the melanin index as derived from average luminosity (L) and colour reflectance values (C) from selected boxes in digital eye-imagery. The melanin index can be directly related to the amount of melanin that is known to decrease to extremely low levels (even complete absence) in blue eyes^22,23. In the hue (H) and saturation (S) measurements introduced to eye colour quantification by Liu et al.²⁴, H defines the colour itself, which can be related to the type of melanin having more red or yellow components. The S value describes the richness of a certain colour (defined in H) that is supposed to correlate with the amount of eu-or pheo-melanin. The V value is usually discarded, as it is supposed to rather represent the brightness due to different lighting conditions^24,25. Digital quantification of eye images of thousands of Europeans using the H-S colour space and its use in a genome-wide association study allowed the identification of three new eye colour genes, not previously identified when using categorical eye colour²⁴. This study clearly demonstrated the increase of power to find new genes when moving pigmentation phenotyping from the classical categorical approach to a quantitative approach. Beleza et al.²⁶ averaged and normalized B-G values from the RGB-space (red, green, blue, value) and proposed a T-index quantity to describe the amount of melanin per iris. Recently, the CIE-L * a * b* values have been used in place of H, S values²⁷ to quantify iris colour, where the L value describes the lightness, the a* the red/green, and b* the yellow/blue component, of the colour respectively. However, taking a quantity that is averaged over the iris as previous methods^21,23,24,27 have done may obscure the different mixture proportions of pigments. An alternative has been proposed by Anderson et al.²⁸, which includes a clustering of the segmented iris into blue and brown pixels deriving a ratio score (PIE score). Different types of pigmentation (i.e. eumelanin or pheomelanin), however, are not distinguished with this approach.

Here, we introduce an improvement of quantitative pigmentation phenotyping based on an automated segmentation of the iris followed by a measurement of the digital equivalents of eumelanin, pheomelanin, and total absence of any pigment in the iris. This clustering is based on manually predefined (assumed) image segments that depict eumelanin, pheomelanin and nonpigmented areas. We exemplified the advantage of this approach by an empirical analysis of high-resolution eye images from over 3000 individuals sampled from seven European countries (EUREYE study). By using genotypes of 12 SNPs previously involved in human eye colour variation that we generated in the same individuals, we demonstrate the impact of this novel pigmentation phenotyping approach on genetic association, epistasis, and prediction.

Results and Discussion

Detection and segmentation of the iris in digital imagery

Prior to colour assessment, the iris needed to be segmented from the pupil and sclera. Several approaches have been proposed previously for the segmentation of the iris in digital eye imagery^{28,29,30,31,32}. The utility of a certain approach depends strongly on the properties of the given image data. In the majority of imagery available to us, (i) the pupil was always centred in the middle (Fig. 1a), (ii) the iris was fully visible (those images where it was not were excluded from the analyses), and iii) eye lashes rarely overlapped with the iris. Because of these features, we followed a previously proposed³⁰ two-step procedure, which we implemented in Matlab (R2007a). First, we used a Canny filter³³ to distinguish the rim confining the iris and then applied the Hough transformation³⁴ to detect the iris circle. To reduce the number of multiple solutions, we constrained the results of the Hough transformation on those circles only that were centred in the middle of the image. As a result, we were able to maintain a high ratio of correctly segmented irides (>90%). Falsely segmented irides, for example when the pupil was extremely dilated, or eyes were closed by chance, were curated manually. It may be that in other types of image data taken under less normalized conditions (i.e. DSR camera systems with macro lenses), other approaches²⁸ might be more sensitive to apply for iris segmentation.

**Figure 1: Example of fully automated iris segmentation and eye colour quantification using our new approach.**

Quantification of iris colour from digital imagery

The developed method assesses each pixel according to its digital classification of pheomelanin, eumelanin, and non-pigmentation, using a machine learning approach (Supplementary Figure S1). The colour information of each image pixel was available as a red-green-blue (RGB) triplet, which we first transformed into a hue-saturation-value (HSV) triplet³⁵. We then used a support vector machine classifier³⁶ with a quadratic kernel to assign each pixel within the HSV space to one of the manually defined classifications representing the total absence of pigment (non-pigmentation), pheomelanin, and eumelanin. To define the distribution of the respective classes for the training of the support vector machine (prior to the sample phenotyping procedure), we used a set of 10 randomly chosen images of different eye colours to manually label areas that most obviously contain the two types of pigments and their absence (Supplementary Figure S1a). The distributions of these three types were well separated in at least one dimension from the selected training images (Supplementary Figure S1b), which implies that the assignment of a pixel from the iris into one of the three types of pigment outcomes can be achieved with good precision. Note that the clustering outcomes were robust against the choice of the kernel and choice of colour model (RGB or HSV, data not shown).

We finally considered the proportion of the clustered pixels relative to the segmented iris as our quantitative eye colour phenotype, reflecting the equivalent amounts of non-pigmented, pheomelanin, and eumelanin areas per iris (see Fig. 1c), the sum of which equals to one.

Comparing the new eye colour quantification method with previous methods

One important motivation behind quantitative measurements is to capture the information about complex phenotypes with a small number of variables. We use a correlation analysis to reveal how well different eye colour quantifications maintain the information about our estimated abundance of different types of melanin. We calculated quantitative iris colour phenotypes using five previously proposed methods: (i) the mean H, S values from the HSV space²⁴ (ii) the mean luminosity value and colour score²¹, (iii) the components of the L * a * b* space²⁷, (iv) the PIE score²⁸ and (v) the T-index²⁶.

Table 1 provides information about the relationship between the considered eye colour quantifiers. The amount of non-pigmentation we find mostly positively correlated with b* (r = 0.93, P < 1e–300), followed by S (−0.74, P < 1e–300). The more melanin is distributed on the iris (fewer non-pigmentation) the lower will be the b* indicating the yellow components contained in the melanin and the higher is the a* value for the red components of the melanin. Therefore we find the amount of pheomelanin was most strongly correlated with a* (r = 0.76, P < 1e–300), followed by b* (r = −0.68, P < 1e–300). The amount of eumelanin estimated with our approach was most strongly positively correlated with L²¹, (r = 0.71, P < 1e–300), followed by S (r = 0.69, P < 1e–300). The colour score C, which summarizes the components a* and b*²¹, shows a weaker correlation with eumelanin (r = −0.52, P = 1.1e–286) and much weaker positive correlation with pheomelanin (r = 0.14, P = 3.6e–14), which indicates a reduced utility in predicting the respective phenotypes from colour components, which is discussed in the next paragraph.

Table 1 Pearson Correlation coefficients (r) between pairs of eye colour phenotype measures obtained with various methods in the European study population (N = 3,087), lower triangular matrix depicts corresponding P-values.

Full size table

Finally, the amount of pheomelanin estimated with our approach showed only weak correlation with the estimated amount of eumelanin (r = −0.06, P = 0.001), confirming our expectation that these two measurements represent different biological phenotypes.

Comparing quantitative eye phenotypes with categorical eye phenotypes

A good quantitative colour measure is expected to distinguish well among manually defined eye colour categories. To test for this, we plotted the manually derived eye colour categories (blue, intermediate, brown) in the different components of the described eye colour quantities (see Fig. 2). To demonstrate the quality in separating the distributions of two selected categories, we used the Hellinger distance (HD, Table 2). The HD describes the distance between two probability distributions, which is zero if both are perfectly overlapping and one in case they are entirely separated. Note that the PIE-score and T-index are only one-dimensional quantities. The manually graded eye colour categories blue and brown were best differentiated by the non-pigmentation/eumelanin-space generated by our new method (Fig. 2c, HD = 0.923), followed by the non-pigmentation/pheomelanin space from our method (Fig. 2b, HD = 0.911). The categories blue and brown can be nearly completely separated with our approach. The categories blue and intermediate were also best separated in the non-pigmentation/pheomelanin space from our method (Fig. 2b, HD = 0.743). However, brown and intermediate were best separated by the L-a* space (Fig. 2g, HD = 0.545), followed closely by the pheomelanin/eumelanin space from our approach (0.495). A large overlap of the intermediate category with both, blue and brown, was evident, likely caused by well-known difficulties in manual assignment of eye colours to the intermediate category.

Figure 2: Manually categorized irises from the entire study population (N = 3087) into 3 eye colour categories blue (depicted in blue colour), intermediate (depicted in green colour), and brown (depicted in red colour) as arranged in the colour space of different quantification approaches for eye colour.

Table 2 Hellinger distances between pairs of colour categories from two dimensional colour subspaces (see Fig. 2).

Full size table

Considering all pairs of categories, the colour score-luminance space performed least accurately in comparison to all other quantitative methods used. The summary of the a* and b* components most likely obscures important information about the different types of melanin. The one-dimensional PIE score and T-index measurements showed the weakest ability for the separation of intermediate eye colours from either brown (PIE score, HD = 0.228) or blue (T-index, HD = 0.323).

Impact of improved eye colour phenotyping on strength of genetic eye colour association

Next, we investigated the impact of our eye colour phenotyping approach, as well as other previously used methods, on the strength of genetic eye colour association. For this, we used 12 SNPs from 11 genes highlighted in previous genetic eye colour studies mostly using categorical phenotypes^{15,18,24,37,38,39,40}. We analysed these 12 SNPs in the same 3,087 individuals from whom we used the eye images for eye colour quantification. We employed partial correlation controlling for age, sex, and the sampling population (see Methods), and report the amount of genetically explained eye colour variance as measured by R² (Table 3). Note that stronger association or higher effect size⁴¹ (as measured by R² in our study) is beside the allele frequency - an important factor controlling for the power of discovering an unknown causal variant⁴². Due to the SNPs HERC2 rs12913832 and HERC2 rs1129038 being in very high linkage disequilibrium (LD) (R² = 98%, P < 1e–300), we only describe the results for HERC2 rs12913832 (for complete results, see Table 3). Among all quantitative measurements, the phenotypic eye colour variance explained by any of these 11 SNPs was highest for the amount of non-pigmentation as measured with our new approach (HERC2 rs12913832 R² = 49.7%, P < 1e–300) followed by S (HERC2 rs12913832 R² = 40.7%, P < e–300), and eumelanin measured with our approach (HERC2 rs12913832 R² = 32.1%, P = 1.3e–261), and H (HERC2 rs12913832 R² = 26.1%, P < 7e–205). The explained variance was considerably less of a* (HERC2 rs12913832 R² = 15.4%, P = 1.e–300) and pheomelanin as measured with our approach (HERC2 rs12913832 R² = 14.6%, P = 1.8e–107). HERC2 rs12913832 (and similarly the LD SNP HERC2 rs1129038, Table 3) individually explained considerably more variation in continuous eye colour than any other SNP tested, which is in agreement with our previous study based on HS colour space²⁴). This also is in line with previous categorical eye colour studies, where HERC2 rs1129038 contributed most^12,15,38.

Table 3 Single associations of 12 SNPs previously involved in human eye colour variation with eye colour phenotype measures in the European study population (N = 3,087) controlled for age and sex in R² (P-values).

Full size table

In contrast to Liu et al.²², we could not replicate a statistically significant association of NPLOC4 rs9894429, DSCR9 rs2835630, and LYST rs3768056 (border-line significant association with S, P = 0.06) with any of the quantitative eye colour phenotypes tested including H and S used by Liu et al.²⁴ (Table 3). The lack of association noted here could be explained by their small effect size together with the smaller sample size used here (N = 3,087) relative to the previous study (N = 5,951)²². Alternative factors could be image and individual age differences between EUREYE samples used here and the on average elder Rotterdam Study samples used previously (note that the effects are measured as single contributions to the phenotype, for combined analysis see Supplementary Table S2). Age has been previously shown to be a significant predictor for eye colour²⁴.

Epistatic effects detected with the detailed quantitative eye colour phenotypes

Next, we studied the impact of the detailed eye colour phenotyping as achieved with our new method, as well as previously used methods, on epistatic effects between pairs of SNPs that deviate from additivity while correcting for sex and age (see Methods and Table 4). For this, we ignored the interaction between HERC2 rs1129038 and HERC2 rs12913832 due to their strong LD (R² = 0.98, P < 1e–300, Supplementary Table S3). We found two novel interactions between SLC24A4 rs12896399 and SLC45A2 rs16891982 strongest in pheomelanin (P = 8.7e–04) as well as LYST rs3768056 and DSCR9 rs2835630 solely evident in eumelanin (P = 2.6e–2). Moreover, we confirmed several interaction pairs reported in previous studies using quantitative or categorical eye colour phenotypes, namely: HERC2 rs12913832 and SLC24A4 rs12896399 previously observed for S²⁴ as well as blue vs. non-blue⁴³, observed here in PIEscore (P = 2.3e–16), b* (P = 3.3e–4), S (P = 4.2e–3), non-pigmentation (P = 5.4e–13), and pheomelanin (P = 8e–9); HERC2 rs12913832 and SCL45A2 rs16891982 previously described in blue vs. non-blue⁴⁴, observed here in S (P = 1.3e–12), pheomelanin (P = 1.2e–16), non-pigmentation (P = 1.8e–20), S (P = 1.3e–12), Colour score (P = 3.5e–7), PIEscore (P = 3.6e–21), and b* (P = 3.8e–18); HERC2 rs12913832 and TYPR1 rs1325127 previously described in hazel vs. non-hazel⁴³ found only in pheomelanin (P = 7.8e–4); HERC2 rs12913832 and IRF4 rs12203592 previously observed in H and S²⁴, found here in H (P = 1.9e–3), a* (P = 2.6e–4), and pheomelanin (P = 3.8e–3).

Table 4 Statistically significant interaction between pairs of SNPs from different pigmentation genes in the European study population (N = 3,087).

Full size table

We exemplified the interaction between HERC2 rs12913832 and SLC45A2 rs16891982 in Supplementary Figure S2. From the marginal distribution of HERC2 rs12913832 (SNP2 in Supplementary Figure S2) the dominant effect for allele T can be observed as the presence of a T allele strongly decreases the amount of non-pigmentation. The interaction partner SLC45A2 rs16891982 shows a recessive behaviour for the G allele that increases the amount of non-pigmentation only for the homozygote genotype GG. From the combined distributions a masking effect of HERC2 rs12913832 can be observed. The effect of the homozygote GG in SLC45A2 rs16891982 becomes much more evident if HERC2 rs12913832 is homozygote for CC (R² = 0.037, P = 3.7e–12). The effect of SLC45A2 rs16891982 on the amount of non-pigmentation is almost neutralized with HERC2 rs12913832 heterozygote CT or homozygote TT (R² = 0.003 P = 1.5e–02). Thus it appears that the allelic states of SLC45A2 rs16891982 will lighten/darken only those irises that are already stated to be “blue” by HERC2 rs12913832 = CC. A similar masking effect can be observed for the interaction pair HERC2 rs12913832 and SLC24A4 rs12896399 (Supplementary Figure S3). We similarly investigated the interaction between HERC2 rs12913832 and TYRP1 rs1325137 (Supplementary Figure S4) and observe a slight reinforcing effect of TYRP1 rs1325137 on the amount of quantified pheomelanin. Having a C allele in HERC2 rs12913832 decreases the amount of pheomelanin by the state of TYRP1 rs1325137 (R² = 0.06, P = 7.6e–3). In contrast, the state of TYRP1 rs1325137 tends to increase the amount of pheomelanin if HERC2 rs12913832 has at least one T allele (R² = 0.02, P = 6.9e–2). Condensing all SNP-SNP interactions for certain quantitative eye colour estimates, the following SNP pairs were found to be most relevant: IRF4 rs12203592 × HERC2 rs12913832, SLC24A4 rs12896399 × SLC45A2 rs16891982, HERC2 rs1291383) × SLC24A4 rs12896399, HERC2 rs12913832 × TYRP1 rs1325127, HERC2 rs12913832 × SLC45A2 rs16891982. Consequently, we included them in the prediction analyses considered as a separate model.

SNP-based prediction of quantitative eye colour phenotypes

We conducted a formal prediction analysis of continuous eye colour expressed by the various quantitative measures with (model 1, Supplementary Table S4) or without considering the SNP-SNP interaction terms previously described (model 2, Supplementary Table S5). For this, we divided our samples into a model building set (N = 2,087) and a model-validation set (N = 1,000). As prediction accuracy measure we used the percentage of phenotypic variance predicted by the respective model (mean coefficient of determination R²). Model 1 (without considering interactions) explained 52% of the phenotypic variance of non-pigmented area, 16% of pheomelanin, and 33% of eumelanin (Table 5), while H and S were predicted as 24% and 42%, respectively. PIE-score can be predicted as 45%, T-index as 31%, a* and b*, as 42% and 30% respectively. Model 2 (with considering interactions) provided increased prediction of 55% (i.e., by 3%) and 20% (i.e., by 4%) for non-pigmentation and pheomelanin, respectively, with eumelanin remaining at 33%, while for S the prediction increases to 44% (i.e., by 1%) and remained the same for H at 24%. Hence, by including the interaction terms in the prediction model, we observed an overall increase in the predictability of our newly derived quantitative eye colour measures. The strongest increase was seen for pheomelanin (i.e., by 4%) that notably is involved in intermediate eye colours. The following interactions were mostly relevant for the prediction of pheomelanin: HERC2 rs12913832 and SLC45A2 rs16891982 (beta = 0.73, P = 6.12e–14), HERC2 rs12913832 and SLC24A4 rs12896399 (beta = 0.32, P = 5.72e–9), as well as IRF4 rs12203592 and HERC2 rs12913832 (beta = 0.39, P = 9.56e–7), (see Supplementary Table S5).

Table 5 Mean coefficient of determination (R² in %) of different quantitative eye colour measures using 12 SNPs* in the European study population (N = 3,087).

Full size table

Notably, when constraining the quantitative eye colour predictions on the 6 SNPs included in the IrisPlex system for categorical eye colour prediction¹⁵, we observed only a minor loss of information relative to the full 12-SNP model with interaction, and varying degrees of effect relative to the 12-SNP model without interaction (Supplementary Table S6). Based on the 6 IrisPlex SNPs with considering interaction (see Supplementary Table S7 for betas), we estimated 53% for the amount of non-pigmentation (2% less than the full 12-SNP model), 18% for the amount of pheomelanin (2% less), and 33% for the amount of eumelanin (same as the 12-SNP model). The observed slight increase of predictability of pheomelanin with the 12-SNPs relative to the 6 IrisPlex SNPs underlines the added value of the additional 6 SNPs in predicting non-blue and non-brown eye colours, which is notably lower than the gain of power by our novel quantification method.

To demonstrate the potential of the 12 SNPs for predicting quantitative eye colour, in Fig. 3 we plotted the observed eye colour phenotypes as measured with our new approach, and their DNA-predictions from 12-SNP genotypes. We already find a very good prediction of the proportions of melanin-types, which is more informative with respect to iris colour than a plain category. Hence, intermediate iris colours can be visually predicted from DNA by the quantitative amounts of pheomelanin, and eumelanin, which would provide a better overall representation of eye colour rather than one single intermediate category. Knowledge about the distribution of the types of melanin on the iris will allow more realistic prediction of iris colour and structure, which can be hypothetically achieved by applying this approach to sub-areas of the iris as applied in Edwards et al.²⁷.

Conclusions

In summary, we have developed an automated computational approach that separates the iris from digital eye images and quantifies iris pigmentation by estimating digital equivalents of eumelanin, pheomelanin, and non-pigmentation. By applying this new approach to high-resolution eye imagery of thousands of Europeans, we demonstrated that it mostly outperformed previous eye colour quantification methods. When only low-resolution imagery is available that does not allow for a distinct clustering of pixels into melanin types, average H and S values are recommended for usage.

The power to detect a causal variant in a genome wide association study is known to depend mainly on its effect and frequency in the population⁴². By using these detailed quantitative eye colour phenotypes, we noted that SNPs previously involved in human eye colour variation showed stronger associations, revealed new and confirmed previously noted SNP-SNP interactions, and increased DNA-based prediction compared to quantitative eye colour phenotypes established by other methods. Overall, our findings imply that using our approach for detailed quantitative pigmentation phenotyping in future genome-wide association studies will likely deliver new pigmentation genes and new pigmentation predictive DNA variants, which is relevant for medical, evolutionary, and forensic genetics.

Materials andMethods

Ethics statement

The study was approved by national ethical committees and met the criteria of the Helsinki declaration. The ethics committees of each of the institutional review boards of the following collaborating eye study centres gave approval: Department of Epidemiology and Biostatistics, National Institute for Health Development, Tallinn, Estonia; Department of Ophthalmology, University of Bergen, School of Medicine, Bergen, Norway; Clinique Ophthalmologique, Universitaire De Creteil, Paris, France; Clinica Oculistica, Universita degli studi di Verona, Italy; Department of Ophthalmology, Aristotle University of Thessaloniki, School of Medicine, Thessaloniki, Greece; Dpto. Salud Publica Universidad Miguel Hernandez, Alicante, El Centro de Investigacion Biomedica en Red de Epidemiologıa y Salud Publica (CIBERESP), Elche, Spain; Faculty of Epidemiology & Population Health, London School of Hygiene & Tropical Medicine, London, United Kingdom. Study participants gave informed written consent. Participants were advised that all information would be kept confidential and no identifying information would be kept.

Subjects, images, and genotyping

Eye images and DNA samples were collected as part of the EUREYE study. EUREYE is a population-based study of age related macular degeneration (AMD) in seven centers located across Europe. Participants were recruited from random sampling of the population aged over 65 years in Bergen (Norway), Tallinn (Estonia), Belfast (UK), Paris Creteil (France), Verona (Italy), Thessaloniki (Greece), and Alicante (Spain). Participants were interviewed by fieldworkers, underwent an eye examination including digital capture of the iris, and provided a blood sample for DNA analysis. A detailed description of the EUREYE study including image collection can be found elsewhere^38,45,46. In brief, iris photography involved illumination of the anterior segment of each eye to show the colour of the iris with a flash intensity of 25–36 mW using a Topcon TRC 50 EX camera (http://www.topconmedical.com/categories/imaging-retinalcameras.htm) under normalized conditions. We manually excluded samples where the image quality was limited due to partially closed eyelids or largely dilated pupils that prevented a full view of the iris.

DNA samples were genotyped using the SNaPShot technology (Life Technologies) via a single multiplex assay targeting 12-eye colour SNPs. We used 6 SNPs that were previously identified to predict categorical eye colour in genome wide studies^12,24,37) and which are included in the IrisPlex and HIrisPlex DNA test systems for categorical eye colour prediction^{15,18,37,38,39}: HERC2 rs12913832, OCA2 rs1800407, SLC24A4 rs12896399, SLC45A2 (MATP) rs16891982, TYR rs1393350, and IRF4 rs12203592. The 6 additional SNPs used were identified to be eye colour predictive in previous studies^24,40: rs2070959 (UGT1A6), rs9894429 (NPLOC4), rs1129038 (HERC2), rs3768056 (LYST), rs2835630 (DSCR9), and rs1325127 (TYRP1) (see Supplementary Table S1 for primer sequences). Our analyses in the present manuscript were based on 3087 samples with suitable iris pictures and complete genotype data available.

Statistical analysis

Deviations from Hardy Weinberg equilibrium (HWE) were calculated using methods described elsewhere⁴⁷. Among other causes, HWE can be violated because of genotyping error or population substructure. The latter is indicated by an increased informativeness of ancestry⁴⁸. The statistical properties of the 12 SNPs from the 3087 samples are summarized in Supplementary Table S1. Three SNPs showed significant deviations from the Hardy-Weinberg equilibrium, namely HERC2 rs1129038 (P_HWE = 8.25e–38), HERC2 rs12913832 (P_HWE = 2.38e–37), and SLC45A2 rs16891982 (P_HWE = 2e–16). In the absence of evidence for genotyping errors, we expect this being caused by the samples coming were from seven European populations as these three SNPs displayed elevated informativeness of ancestry (In) values (see Supplementary Table S1 for HWE and In values). Linkage disequilibrium (LD) between pairs of SNPs was estimated by means of R². Regression analysis, Hellinger distance analysis, and interaction analysis were performed in Matlab (R2007a). For association and prediction analysis, linear models were fitted for both, individual SNPs and a combined model including all SNPs in Matlab (R2007a) while controlling for age, sex and population id. Presence of interactions between pairs of SNPs was tested by comparing nested linear models with and without the interaction term by means of an F-test. DNA-based eye colour prediction was achieved using general linear models Y~b₀ + b₁X₁ + b₂X₂ + b₃X₃ + b₄ age + b₅ sex + b₆ pop, where X₁ and X₂ denote the genotypes of SNP₁ and SNP₂ respectively and X₃ denotes the interaction term on a multiplicative scale. Genotypes were coded additively as counts of minor alleles. P-value for the interaction term (b₃) were used and corrected for multiple testing using Bonferroni correction for 12 interaction pairs (396 tests applied). Linear models were fitted to predict the quantitative amounts of melanin from genotypic variation. To study the importance of genetic interaction, we added the most important interaction identified to the predictor using only main effects of SNPs. Quality of prediction was assessed by means of adjusted coefficient of determination (R²) of the models. We used each colour quantification values separately as response variable. To account for over-fitting, we provide the mean R² from cross validation as estimated from 100 randomized train/holdout experiments where 2/3 of the samples were used as a training set (N = 2087) and 1/3 as a validation set (N = 1000).

The computational method to perform statistical analyses and extract quantitative eye colour from digital images is available on request.

Additional Information

How to cite this article: Wollstein, A. et al. Novel quantitative pigmentation phenotyping enhances genetic association, epistasis, and prediction of human eye colour. Sci. Rep. 7, 43359; doi: 10.1038/srep43359 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Sturm, R. A., Teasdale, R. D. & Box, N. F. Human pigmentation genes: identification, structure and consequences of polymorphic variation. Gene 277, 49–62, doi: 10.1016/S0378-1119(01)00694-1 (2001).
Article CAS PubMed Google Scholar
Sturm, R. A. & Frudakis, T. N. Eye colour: portals into pigmentation genes and ancestry. Trends in genetics: TIG 20, 327–332, doi: 10.1016/j.tig.2004.06.010 (2004).
Article CAS PubMed Google Scholar
Robins, A. H. Biological Perspectives on Human Pigmentation (Cambridge University Press, 1991).
Brues, A. M. Rethinking human pigmentation. Am J Phys Anthropol 43, 387–391, doi: 10.1002/ajpa.1330430320 (1975).
Article CAS PubMed Google Scholar
Frost, P. European hair and eye color - A case of frequency-dependent sexual selection? Evol Hum Behav 27, 85–103, doi: 10.1016/j.evolhumbehav.2005.07.002 (2006).
Article Google Scholar
Eiberg, H. et al. Blue eye color in humans may be caused by a perfectly associated founder mutation in a regulatory element located within the HERC2 gene inhibiting OCA2 expression. Hum Genet 123, 177–187, doi: 10.1007/s00439-007-0460-x (2008).
Article CAS PubMed Google Scholar
Frudakis, T. et al. Sequences associated with human iris pigmentation. Genetics 165, 2071–2083 (2003).
CAS PubMed PubMed Central Google Scholar
Graf, J., Hodgson, R. & van Daal, A. Single nucleotide polymorphisms in the MATP gene are associated with normal human pigmentation variation. Hum Mutat 25, 278–284, doi: 10.1002/humu.20143 (2005).
Article CAS PubMed Google Scholar
Han, J. et al. A genome-wide association study identifies novel alleles associated with hair color and skin pigmentation. PLoS genetics 4, e1000074, doi: 10.1371/journal.pgen.1000074 (2008).
Article CAS PubMed PubMed Central Google Scholar
Kanetsky, P. A. et al. A polymorphism in the agouti signaling protein gene is associated with human pigmentation. Am J Hum Genet 70, 770–775, doi: 10.1086/339076 (2002).
Article CAS PubMed PubMed Central Google Scholar
Kayser, M. et al. Three genome-wide association studies and a linkage analysis identify HERC2 as a human iris color gene. Am J Hum Genet 82, 411–423, doi: 10.1016/j.ajhg.2007.10.003 (2008).
Article CAS PubMed PubMed Central Google Scholar
Liu, F. et al. Eye color and the prediction of complex phenotypes from genotypes. Current biology: CB 19, R192–193, doi: 10.1016/j.cub.2009.01.027 (2009).
Article CAS PubMed Google Scholar
Rebbeck, T. R. et al. P gene as an inherited biomarker of human eye color. Cancer Epidemiol Biomarkers Prev 11, 782–784 (2002).
CAS PubMed Google Scholar
Sulem, P. et al. Genetic determinants of hair, eye and skin pigmentation in Europeans. Nature genetics 39, 1443–1452, doi: 10.1038/ng.2007.13 (2007).
Article CAS PubMed Google Scholar
Walsh, S. et al. IrisPlex: a sensitive DNA tool for accurate prediction of blue and brown eye colour in the absence of ancestry information. Forensic Sci Int Genet 5, 170–180, doi: 10.1016/j.fsigen.2010.02.004 (2011).
Article CAS PubMed Google Scholar
Kayser, M. & de Knijff, P. Improving human forensics through advances in genetics, genomics and molecular biology. Nature reviews. Genetics 12, 179–192, doi: 10.1038/nrg2952 (2011).
Article CAS PubMed Google Scholar
Kayser, M. Forensic DNA Phenotyping: Predicting human appearance from crime scene material for investigative purposes. Forensic Sci Int Genet 18, 33–48, doi: 10.1016/j.fsigen.2015.02.003 (2015).
Article CAS PubMed Google Scholar
Walsh, S. et al. The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Sci Int Genet 7, 98–115, doi: 10.1016/j.fsigen.2012.07.005 (2013).
Article CAS PubMed Google Scholar
Maronas, O. et al. Development of a forensic skin colour predictive test. Forensic Sci Int Genet 13, 34–44, doi: 10.1016/j.fsigen.2014.06.017 (2014).
Article PubMed Google Scholar
Kayser, M. Forensic DNA Phenotyping: Predicting human appearance from crime scene material for investigative purposes. Forensic Sci Int Genet, doi: 10.1016/j.fsigen.2015.02.003 (2015).
Frudakis, A. Molecular Photofitting 497–596 (Academic Press, 2008).
Wielgus, A. R. & Sarna, T. Melanin in human irides of different color and age of donors. Pigment Cell Res 18, 454–464, doi: 10.1111/j.1600-0749.2005.00268.x (2005).
Article CAS PubMed Google Scholar
Dembinski, G. M. & Picard, C. J. Evaluation of the IrisPlex DNA-based eye color prediction assay in a United States population. Forensic Sci Int Genet 9, 111–117, doi: 10.1016/j.fsigen.2013.12.003 (2014).
Article CAS PubMed Google Scholar
Liu, F. et al. Digital quantification of human eye color highlights genetic association of three new loci. PLoS genetics 6, e1000934, doi: 10.1371/journal.pgen.1000934 (2010).
Article CAS PubMed PubMed Central Google Scholar
Jacobs, L. C. et al. Comprehensive candidate gene study highlights UGT1A and BNC2 as new genes determining continuous skin color variation in Europeans. Hum Genet 132, 147–158, doi: 10.1007/s00439-012-1232-9 (2012).
Article CAS PubMed Google Scholar
Beleza, S. et al. Genetic architecture of skin and eye color in an African-European admixed population. PLoS genetics 9, e1003372, doi: 10.1371/journal.pgen.1003372 (2013).
Article CAS PubMed PubMed Central Google Scholar
Edwards, M. et al. Iris pigmentation as a quantitative trait: variation in populations of European, East Asian and South Asian ancestry and association with candidate gene polymorphisms. Pigment cell & melanoma research 29, 141–162, doi: 10.1111/pcmr.12435 (2016).
Article CAS Google Scholar
Andersen, J. D. et al. Genetic analyses of the human eye colours using a novel objective method for eye colour classification. Forensic Sci Int Genet 7, 508–515, doi: 10.1016/j.fsigen.2013.05.003 (2013).
Article CAS PubMed Google Scholar
Daugman, J. New methods in iris recognition. IEEE Trans Syst Man Cybern B Cybern 37, 1167–1175 (2007).
Article PubMed Google Scholar
Wildes, R. P. Iris Recognition: An Emerging Biometric Technogy. Proceedings of the IEEE 85, doi: 10.1109/5.628669 (1997).
Proenca, H. Iris recognition: on the segmentation of degraded images acquired in the visible wavelength. IEEE Trans Pattern Anal Mach Intell 32, 1502–1516, doi: 10.1109/TPAMI.2009.140 (2010).
Article PubMed Google Scholar
Hansen, D. W. & Ji, Q. In the eye of the beholder: a survey of models for eyes and gaze. IEEE Trans Pattern Anal Mach Intell 32, 478–500, doi: 10.1109/TPAMI.2009.30 (2010).
Article PubMed Google Scholar
Canny, J. A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell 8, 679–698, doi: 10.1109/TPAMI.1986.4767851 (1986).
Article CAS PubMed Google Scholar
Duda, R. Use of the Hough Transformation to Detect Lines and Curves in Pictures. Comm. ACM 15, 11–15, doi: 10.1145/361237.361242 (1971).
Article MATH Google Scholar
Smith, A. R. Color Gamut Transform Pairs. SIGGRAPH Comput. Graph. 123, 12–19, doi: 10.1145/965139.807361 (1978).
Article Google Scholar
Shawe-Taylor, N. C. a. J. Support Vector MAchines (Cambridge University Press, 2000).
Walsh, S. et al. Developmental validation of the IrisPlex system: determination of blue and brown iris colour for forensic intelligence. Forensic Sci Int Genet 5, 464–471, doi: 10.1016/j.fsigen.2010.09.008 (2011).
Article CAS PubMed Google Scholar
Walsh, S. et al. DNA-based eye colour prediction across Europe with the IrisPlex system. Forensic Sci Int Genet 6, 330–340, doi: 10.1016/j.fsigen.2011.07.009 (2012).
Article CAS PubMed Google Scholar
Walsh, S. et al. Developmental validation of the HIrisPlex system: DNA-based eye and hair colour prediction for forensic and anthropological usage. Forensic Sci Int Genet 9, 150–161, doi: 10.1016/j.fsigen.2013.12.006 (2014).
Article CAS PubMed Google Scholar
Sturm, R. A. et al. A single SNP in an evolutionary conserved region within intron 86 of the HERC2 gene determines human blue-brown eye color. Am J Hum Genet 82, 424–431, doi: 10.1016/j.ajhg.2007.11.005 (2008).
Article CAS PubMed PubMed Central Google Scholar
Park, J. H. et al. Distribution of allele frequencies and effect sizes and their interrelationships for common genetic susceptibility variants. Proceedings of the National Academy of Sciences of the United States of America 108, 18026–18031, doi: 10.1073/pnas.1114759108 (2011).
Article ADS PubMed PubMed Central Google Scholar
Sham, P. C. & Purcell, S. M. Statistical power and significance testing in large-scale genetic studies. Nature reviews. Genetics 15, 335–346, doi: 10.1038/nrg3706 (2014).
Article CAS PubMed Google Scholar
Pospiech, E., Draus-Barini, J., Kupiec, T., Wojas-Pelc, A. & Branicki, W. Gene-gene interactions contribute to eye colour variation in humans. J Hum Genet 56, 447–455, doi: 10.1038/jhg.2011.38 (2011).
Article CAS PubMed Google Scholar
Branicki, W., Brudnik, U. & Wojas-Pelc, A. Interactions between HERC2, OCA2 and MC1R may influence human pigmentation phenotype. Annals of human genetics 73, 160–170, doi: 10.1111/j.1469-1809.2009.00504.x (2009).
Article CAS PubMed Google Scholar
Augood, C. et al. Methods for a population-based study of the prevalence of and risk factors for age-related maculopathy and macular degeneration in elderly European populations: the EUREYE study. Ophthalmic epidemiology 11, 117–129, doi: 10.1076/opep.11.2.117.28160 (2004).
Article PubMed Google Scholar
Augood, C. A. et al. Prevalence of age-related maculopathy in older Europeans: the European Eye Study (EUREYE). Archives of ophthalmology 124, 529–535, doi: 10.1001/archopht.124.4.529 (2006).
Article PubMed Google Scholar
Wigginton, J. E., Cutler, D. J. & Abecasis, G. R. A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet 76, 887–893, doi: 10.1086/429864 (2005).
Article CAS PubMed PubMed Central Google Scholar
Rosenberg, N. A. et al. Genetic structure of human populations. Science 298, 2381–2385, doi: 10.1126/science.1078311 (2002).
Article ADS CAS PubMed Google Scholar

Download references

Acknowledgements

We thank the EUREYE study participants for providing samples for DNA analysis and eye images. This work was funded in part by the Erasmus MC University Medical Center Rotterdam, by a previous grant from the Netherlands Genomics Initiative (NGI)/Netherlands Organization for Scientific Research (NWO) within the framework of the Forensic Genomics Consortium Netherlands (FGCN) and Netherlands Forensic Institute (NFI). EUREYE was supported by the European Commission 5th Framework (QLK6-CT-1999-02094). Additional funding for cameras was provided by the Macular Disease Society UK and by the UK Medical Research Council for extraction of DNA. The work of MR was supported by the Estonian Ministry of Education and Science (target funding SF0940026s07) and the Estonian Research Council (IUT5-1). AW received additional funding by Volkswagen Foundation (ref. 86042). SW received additional funding from the National Institute of Justice (NIJ) in the US under grant number 2014-DN-BX-K031. Funders had no role in study design, data collection and analysis, manuscript preparation, and decision to publish.

Author information

Authors and Affiliations

Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
Andreas Wollstein, Susan Walsh, Fan Liu & Manfred Kayser
Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands
Andreas Wollstein & Stefan Böhringer
Department of Biology II, Section of Evolutionary Biology, University of Munich LMU, Planegg-Martinsried, Germany
Andreas Wollstein
Department of Biology, Indiana University-Purdue University Indianapolis, Indianapolis, IN, USA
Susan Walsh
Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
Fan Liu
Centre for Vision and Vascular Science, The Queen’s University Belfast, Belfast, United Kingdom
Usha Chakravarthy
Department of Epidemiology and Biostatistics, National Institute for Health Development, Tallinn, Estonia
Mati Rahu
Department of Ophthalmology, University of Bergen, School of Medicine, Bergen, Norway
Johan H. Seland
Clinique Ophthalmologique, Universitaire De Creteil, Paris, France
Gisèle Soubrane
Clinica Oculistica, Universita degli studi di Verona, Italy
Laura Tomazzoli
Department of Ophthalmology, Aristotle University of Thessaloniki, School of Medicine, Thessaloniki, Greece
Fotis Topouzis
Department of Ophthalmology, Erasmus MC University Medical Centre Rotterdam, Rotterdam, The Netherlands
Johannes R. Vingerling
Dpto. Salud Publica Universidad Miguel Hernandez, Alicante, El Centro de Investigacion Biomedica en Red de Epidemiologıa y Salud Publica (CIBERESP), Elche, Spain
Jesus Vioque
Faculty of Epidemiology & Population Health, London School of Hygiene & Tropical Medicine, London, United Kingdom
Astrid E. Fletcher

Authors

Andreas Wollstein
View author publications
You can also search for this author in PubMed Google Scholar
Susan Walsh
View author publications
You can also search for this author in PubMed Google Scholar
Fan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Usha Chakravarthy
View author publications
You can also search for this author in PubMed Google Scholar
Mati Rahu
View author publications
You can also search for this author in PubMed Google Scholar
Johan H. Seland
View author publications
You can also search for this author in PubMed Google Scholar
Gisèle Soubrane
View author publications
You can also search for this author in PubMed Google Scholar
Laura Tomazzoli
View author publications
You can also search for this author in PubMed Google Scholar
Fotis Topouzis
View author publications
You can also search for this author in PubMed Google Scholar
Johannes R. Vingerling
View author publications
You can also search for this author in PubMed Google Scholar
Jesus Vioque
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Böhringer
View author publications
You can also search for this author in PubMed Google Scholar
Astrid E. Fletcher
View author publications
You can also search for this author in PubMed Google Scholar
Manfred Kayser
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: A.W., S.W., F.L., M.K. Performed the experiments: A.W., S.W. Analysed the data: A.W., S.W. Contributed reagents, materials, and analysis tools: A.W., S.W., F.L., J.R.V., J.V., U.C., M.R., J.H.S., G.S., L.T., F.T., S.B., A.E.F., and M.K. Wrote most of the paper: A.W., S.W., F.L., M.K.

Corresponding authors

Correspondence to Andreas Wollstein or Manfred Kayser.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (DOC 908 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Wollstein, A., Walsh, S., Liu, F. et al. Novel quantitative pigmentation phenotyping enhances genetic association, epistasis, and prediction of human eye colour. Sci Rep 7, 43359 (2017). https://doi.org/10.1038/srep43359

Download citation

Received: 12 October 2016
Accepted: 23 January 2017
Published: 27 February 2017
DOI: https://doi.org/10.1038/srep43359

This article is cited by

Predictive accuracy of genetic variants for eye color in a Kazakh population using the IrisPlex system
- Alizhan Bukayev
- Igor Gorin
- Maxat Zhabagin
BMC Research Notes (2024)
Mapping and annotating genomic loci to prioritize genes and implicate distinct polygenic adaptations for skin color
- Beomsu Kim
- Dan Say Kim
- Hong-Hee Won
Nature Communications (2024)
A new approach to broaden the range of eye colour identifiable by IrisPlex in DNA phenotyping
- Ersilia Paparazzo
- Anzor Gozalishvili
- Alberto Montesanto
Scientific Reports (2022)
The distinctive geographic patterns of common pigmentation variants at the OCA2 gene
- Kenneth K. Kidd
- Andrew J. Pakstis
- William C. Speed
Scientific Reports (2020)
Clinical and genetic variability in children with partial albinism
- Patrick Campbell
- Jamie M. Ellingford
- Panagiotis I. Sergouniotis
Scientific Reports (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and Discussion

Detection and segmentation of the iris in digital imagery

Quantification of iris colour from digital imagery

Comparing the new eye colour quantification method with previous methods

Comparing quantitative eye phenotypes with categorical eye phenotypes

Impact of improved eye colour phenotyping on strength of genetic eye colour association

Epistatic effects detected with the detailed quantitative eye colour phenotypes

SNP-based prediction of quantitative eye colour phenotypes

Conclusions

Materials andMethods

Ethics statement

Subjects, images, and genotyping

Statistical analysis

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links