Malaria was a weak selective force in ancient Europeans

Gelabert, Pere; Olalde, Iñigo; de-Dios, Toni; Civit, Sergi; Lalueza-Fox, Carles

doi:10.1038/s41598-017-01534-5

Download PDF

Article
Open access
Published: 03 May 2017

Malaria was a weak selective force in ancient Europeans

Pere Gelabert¹,
Iñigo Olalde¹,
Toni de-Dios¹,
Sergi Civit² &
…
Carles Lalueza-Fox¹

Scientific Reports volume 7, Article number: 1377 (2017) Cite this article

4759 Accesses
28 Citations
19 Altmetric
Metrics details

Subjects

Abstract

Malaria, caused by Plasmodium parasites, is thought to be one of the strongest selective forces that has shaped the genome of modern humans and was endemic in Europe until recent times. Due to its eradication around mid-twentieth century, the potential selective history of malaria in European populations is largely unknown. Here, we screen 224 ancient European genomes from the Upper Palaeolithic to the post-Roman period for 22 malaria-resistant alleles in twelve genes described in the literature. None of the most specific mutations for malaria resistance, like those at G6PD, HBB or Duffy blood group, have been detected among the available samples, while many other malaria-resistant alleles existed well before the advent of agriculture. We detected statistically significant differences between ancient and modern populations for the ATP2B4, FCGR2B and ABO genes and we found evidence of selection at IL-10 and ATP2B4 genes. However it is unclear whether malaria is the causative agent, because these genes are also involved in other immunological challenges. These results suggest that the selective force represented by malaria was relatively weak in Europe, a fact that could be associated to a recent historical introduction of the severe malaria pathogen.

Genomics reveals heterogeneous Plasmodium falciparum transmission and selection signals in Zambia

Article Open access 06 April 2024

Abebe A. Fola, Qixin He, … Giovanna Carpi

In silico characterisation of putative Plasmodium falciparum vaccine candidates in African malaria populations

Article Open access 10 August 2021

O. Ajibola, M. F. Diop, … A. Amambua-Ngwa

Distinctive genetic structure and selection patterns in Plasmodium vivax from South Asia and East Africa

Article Open access 26 May 2021

Ernest Diez Benavente, Emilia Manko, … Taane G. Clark

Introduction

The malaria parasite Plasmodium falciparum is one of the main causes of child mortality worldwide, although other species from the same genus, P. vivax, P. malariae and P. ovale are also causative agents of the disease. Therefore, it is not surprising that malaria is one of the strongest known selective pressures that have recently shaped the human genome. Malaria is the evolutionary driving force behind sickle-cell disease (HbS), thalassemia, glucose-6-phosphatase deficiency (G6PD) and other erythrocyte defects that together comprise the most common Mendelian diseases of humankind. Remarkably, populations from different geographical areas have developed different genetic mechanisms adapted to malaria resistance; for instance, the HbS allele at the HBB gene is common in Africa but rare in Southeast Asia, whereas the HbE alelle shows a reversed pattern. Resistance to malaria includes primarily genes involved in immunological response, but also some involved in inflammation and cell adhesion and even genes related to metabolic pathways. The fact that different malaria-resistance alleles have arisen in different places suggests that this adaptation occurred relatively recent in human history, at least well after the Out of Africa migration¹.

P. falciparum may have rapidly spread from its African tropical origins to other tropical and subtropical regions of the world only within the last 6,000 years². The recent origin of the worldwide P. falciparum populations, which are the most malignant of human malarial parasites, may account for its virulence³. Thus, haplotype analysis at the G6PD locus suggests that the African resistance allele to P. falciparum originated within the last 10,000 years⁴ while a similar analysis at the HbE alleles (variants of the HBB gene) in Southeast Asia yielded an even more recent date, around 5,000 years ago⁵. These and other estimates associate the resistance to P. falciparum with the onset of agriculture and the emergence of favourable environments for mosquito proliferation after the clearance of woodlands for farming. Falciparum-like illness (I.e. miasmas) appears to be very recent in Europe. It seems to have spread from India to the West, appearing in ancient Greece only by the 4th century BCE, where it was implicated in the decline of many city-state populations². The retrieval of the eradicated European P. falciparum from 70-year-old slides from a malaria-endemic region in Spain confirms a recent, phylogenetic link with present-day Indian haplotypes⁶. Therefore, it seems unlikely that P. falciparum would have had a significant impact on prehistoric European genomes.

On the other hand, P. vivax (and probably P. ovale and P. malaria) is likely an ancient parasite that evolved in Africa and spread to the rest of the world with the Out of Africa migration, around 60,000 years ago². Phylogenetic analyses suggest that P. vivax could have an African origin but has largely disappeared from this continent after the spread of Duffy negative resistance⁷. The biology of P. vivax that makes it highly adapted to maintain itself in small mobile groups of hosts and its mild effect also supports the idea that it is an ancient human parasite. The limited data available does show that P. vivax and P. falciparum may have had at least partially different effects on the human genome¹.

Malaria was endemic in Europe until very recently. Historical data describe malaria as affecting people as north as England, Scandinavia or Russia². However, while there is information of some malaria-resistance alleles from Europe, notably from the Mediterranean area (such as the HBB and the G6PD variants), little is known about the magnitude of selection due to malaria in the continent because of the eradication of the parasite around 1950. The recent retrieval of more than 250 ancient genomes from different moments of the prehistory of Europe^8,9,10,11 and the publication of several Genome wide association studies (GWAS) on malaria^{12, 13} now allows for a genome-wide screening of malaria-resistance alleles along space and time. Using this approach, we aim to understand the role of this disease as a selective force in the shaping of the gene pool of modern Europeans.

Results

Sample size and allele imputation

It was possible to screen 224 ancient individuals (Fig. 1 and Table S1) for 20 single nucleotide polymorphisms (SNPs) and two single nucleotide-deletions described to play a role in malaria resistance and distributed in twelve different genes: G6PD, HBB, ACKRQ, FCGR2B, TIRAP, ATP2B4, GRK5, IL-10, MARVELD3, CD36, CD40LG and ABO blood group system (Table 1). Due to low coverage on many ancient samples, complete genotypes are typically not available for all loci; therefore, we needed to rely on imputation methods to infer genotypes, using the 1,000 Genomes phased samples as a reference panel and Beagle-4 software¹⁴.

Table 1 Summary of the genetic variants analysed in the study and the hypothetical functional explanation in relation to malaria resistance, when available.

Full size table

After imputation, the number of genotypes recovered for each nucleotide position was variable (Table 1 and Table S1). The proportion of recovered haplotypes in whole genome sequence data fluctuated from a maximum 96.74% in HBB SNP: rs33950507 and rs334 to a minimum 23.91% in ATP2B4 SNP rs5030868. The proportion of recovered alleles in targeted capture data samples fluctuated from a maximum of 40.91% in ACKRQ SNP: rs2814778 to zero observations for CD36 SNP: rs3092945, CD40LG SNP: rs201346212, and ABO SNP: rs817671. A dataset with genotypes from 92 genome-wide samples is presented in Table S2.

Even with such data limitations, we were able to detect a substantial number of carriers of the proposed malaria resistance alleles in our ancient genome dataset (Table 2). It is noteworthy that different derived genetic variants (at TIRAP, ATP2B4, MARVELD3, ABO, and IL-10) were already present prior to the arrival of farming, in Upper Palaeolithic and/or Mesolithic individuals. For other loci, the derived alleles are only seen in very recent periods. For instance, only one of the Iron Age individuals carries the T allele (rs2230345) at GRK5 gene, one individual carries the T allele (rs8176746) at ABO gene and only two individuals (one Bronze Age Russian and one Post-Roman British) carry the C allele (rs1050501) at the FCGR2B gene.

Table 2 Frequencies of malaria-resistant alleles by time period in nine selected genes and 224 available ancient European genomes.

Full size table

Ancient-modern population comparisons

For three genetic variants, we observed significant frequency differences between ancient and modern European populations from the 1,000 Genomes project, after adjusting for multiple tests: rs1050501 (FCGR2B) (OR = 7.86; CI = 2.07–66.66; p = 2.2e-16), rs4951074 (ATP2B4) (OR = 2.16; CI = 1.06–4.96; p = 0.02) and rs8176746 (ABO) (OR = 9.63; CI = 2.54–81.49; p = 1.19e-5). The three genes show lower frequencies than expected of the derived alleles in ancient populations as compared to present-day Europeans. Samples with data from these three statistically significant ancient vs modern variants are not geographically clustered, as can be seen in Fig. 2. Therefore, no clear climatic or latitudinal pattern that could be associated to the malaria distribution emerges.

Population structure

Although malaria was prevalent along the continent, from Russia and Scandinavia to Southern Europe, some resistance mutations such as G6PD B- are nowadays restricted to the Mediterranean, which suggests that the impact of the disease may have been different between European regions. We therefore tested for population structure at these loci.

The only SNP to show significant North-South differences in allele frequencies in modern Europeans was the rs8177374 SNP at TIRAP gene. The allelic frequency of the derived allele in South-Europe populations (IBS and TSI) was 34%, statistically significant higher than the 16% of the derived alleles in North-European populations (CEU, GBR and FIN) (codominant model: OR = 4.04; CI = 1.37–11.91; p = 1.428e-05 and log-additive model: OR = 2.20; CI = 1.57–3.08; p = 2.522e-06). This trend cannot be discerned in the ancient samples (see Fig. 2).

Although the problems of assessing population structure with a dataset of low-coverage, geographically and temporally disparate ancient genomes cannot be overlooked we sought to explore the genetic structure among ancient and modern European with an F_ST and G-test approach. We have grouped the ancient samples in eight periods (Upper Palaeolithic, Early Neolithic, Middle Neolithic, Late Neolithic, Bronze Age, Iron Age, Roman Age and Post-Roman Age) (Table S1). The F_ST pairwise comparisons did not reveal any structural differentiation in the established sub-groupings (Table S4). A G-test analysis detected a significant geographic differentiation of the variants associated with malaria resistance among the five present-day European populations (CEU, IBS, TSI, GBR, FIN) of the 1,000 Genomes database (G-statistic; 102.304, p-value: 0.01) (Table S5). This suggests that very fine differences in allele frequencies among these populations likely exist. We further explored this issue with Chi-squared test comparisons for each SNP between all pairs of populations; we found that all pairwise tests including Finns (FIN) are statistically significant, and also one North-South test (IBS vs CEU).

Evidence of selection

To further explore the possibility that changes in our time series data could be attributed to selection acting on the derived alleles, we used a Bayesian method that estimates the frequency trajectory of the alleles based on observed frequencies and sampling dates¹⁵. The “Selection” software estimates a posterior probability of selection coefficients for heterozygous haplotypes (alpha 1) and for homozygous derived haplotypes (alpha 2) for each SNP and also the allelic age. The mean values of the selection coefficients and the likelihood values are shown in Fig. 3, and the posterior distribution of selection coefficients (alpha1 and alpha2) are shown in Supplementary Figures S6–S13.

Two genetic variants showed statistically significant evidence of positive selection on the derived allele: rs1800890 SNP at IL-10 gene with a favoured derived-homozygous genotype (alpha 2: mean value = 57.7; p = 0.046) and rs10900585 SNP at ATP2B4 gene, with a favoured heterozygous genotype (alpha 1: mean value = 48.85; p = 0.015); p is the probability of the selection coefficient to be <0.

The allelic ages estimates showed that the oldest derived allele is rs8176719 at the ABO blood group gene. The age estimates of all alleles (Supplementary Table S6) predate our sampling period with the exception of rs1050501 at the FCGR2B gene that yielded an age of 9,500 years.

Discussion

Globally, the comparison of the allele frequencies between ancient and current European populations did not reveal strong variations in most of the analysed genes. In most cases, and despite limitations in sample size, the allele frequencies in the ancient samples were very similar to those of the modern populations (Table 2).

The fact that we can, in most cases, detect the derived allele in a dataset based on a handful of ancient genomes implies that these alleles had already reached relatively high frequencies in the ancient populations. Even with the uncertainties associated to the small sample size, we provide for first time information about the ages of the derived malaria resistance alleles. With the single exception of the FCGR2B gene, all of them predate the emergence of the Neolithic by thousands of years. However, the period when they became prevalent and potentially important from a phenotypical point of view, can be much more recent. It has been suggested that the spread of malaria was triggered by the deforestation associated with the farming expansion¹⁶. However, our results do not indicate any clear shift in the allelic frequencies of the malaria-resistant genes in response to the onset of farming.

P. vivax is thought to be a relatively ancient parasite that probably was common when Homo sapiens colonized Europe, about 45,000 years ago². It is therefore not surprising that this parasite could have had a significant impact on pre-Neolithic European as well as African genomes. The three traits that have been clearly shown by functional studies to protect against P. vivax are: Duffy negativity (ACKR1 gene), Ovalocytosis (in South East Asian groups) and G6PD deficiency^17,18,19. The first two have little if any effect on P. falciparum while G6PD seems to have more effect on P. vivax than on P. falciparum. So far, none of the ancient genomes available displays any mutation at the crucial G6PD or Duffy loci. In addition, we do not find a single carrier of the HBB mutation. We don’t know if this is attributable to the rather limited sample size or if their absence reflects a relatively recent origin of these mutations among European populations. The frequencies of these mutations are very low in current populations (for instance, there is not a single carrier among the European populations of the 1,000 Genomes project). However, the sample size for some of the SNPs, such as the rs372091 at the HBB locus, is not that small and currently includes 155 ancient genomes.

The current shortage of ancient genomic data from the Mediterranean area (where severe malaria was most prevalent in historic times) represents a limitation of our survey. This is partially attributable to the warm environmental conditions in the area, which are unfavourable to DNA preservation²⁰. At present, only one complete genome (a Cardial Early Neolithic specimen)²¹ and several Neolithic individuals from the oriental Mediterranean peninsulas (Greece and Anatolia)^{22, 23} derive from the thermo-Mediterranean zone²¹. However, a significant number of samples derive from regions that are close to the Mediterranean shores (Fig. 1).

While looking at the variation among modern Europeans with the G-statistic, the present-day frequencies of the SNPs related with malaria resistance denote the presence of substructure that should be further investigated with additional modern populations and neutral variation SNPs. The structure could be partly influenced by the distinctiveness of the Finnish sample but also by the SNP at TIRAP gene (rs8177374) (Table S5) that shows a clear latitudinal trend confirmed with the OR results. However, the same mutation has also been related to resistance to bacterial infections such as tuberculosis²⁴. Therefore, it is unclear if the clinal pattern of variation for this allele is in fact associated to malaria infection or to other selective forces acting on a latitudinal basis.

Three genes (FCGR2B, ATP2B4 and ABO) show differences in the allelic frequency between ancient and modern Europeans. This could in principle be the signal of recent events of positive selection acting upon the associated SNPs.

The mutation at the FCGR2B gene (an ILE > THR change in the 232 residue of the protein) has been proposed to have a strong protective effect against severe malaria (OR = 0.56, p < 0.001) in African populations due to the enhancement of phagocythosis of P. falciparum infected-human erythrocytes²⁵. This could explain the high frequency of this mutation in malaria endemic areas, although the same mutation has also been described as causative of Systemic Lupus Erythematosus^{25, 26}. The only two individuals that carry this mutation are from periods and regions where malaria would be endemic in historical times: one is a Bronze Age individual from southern Russia and the other one an Anglo-Saxon from southern England. However, some older specimens (Table S1) also seem to carry the mutation, although the imputation test in those cases did not meet the 0.95 threshold. Polymorphisms at ATP2B4 gene, encoding for membrane calcium-transporter protect against malaria because of its function in homoeostasis control of the erythrocyte membrane²⁷, although this association needs to be assessed with further functional evidence. The rs8176746 SNP at the ABO gene is in linkage disequilibrium with rs8176719 SNP that determines B antigens. The derived allele has been associated with increased risk of severe malaria²⁸.

To evaluate if these genes could bear the footprints of recent positive selection (as opposed to neutral processes such as drift or migration), we consulted a genome browser with information about such signatures²⁹, but none of them displayed any evidence of selective sweeps. An alternative to selection would be migration; it has been recently demonstrated that European populations were modelled by three genetic components that overlapped in the last thousands of years: the Mesolithic hunter-gatherers, the Early Neolithic farmers and the Late Neolithic steppe migrants^{11, 30}. Although the right approach to distinguish between selection and migration would be the one followed by Mathieson et al.⁹, the lack of data in the original source populations for most of our SNPs precludes this approach. However, we noticed that differences in allelic frequencies detected in FCGR2B and ATP2B4 are restricted to very recent periods -after the Bronze Age- when the general composition of the European populations was already established. Therefore, the trend observed towards the increase of the derived allele is unlikely to be attributed to these ancient migrations.

We have modelled all the SNPs with an approach that rejects neutrality across a time series and found evidence for selection in SNPs rs10900585 at ATP2B4 and rs1800890 at IL-10 genes. This suggests that selection acted on these genes, an evidence that should be further explored from a functional point of view. The fact that no previous evidence of selective sweeps on these genes has been reported could be explained if the timing of the selective event was very recent or if the effect was regionally restricted. Nevertheless, it is reasonable to discard large-scale and dramatic selective events such as those seen for the lactase gene⁹.

In conclusion, it seems that most of the screened alleles were equally prevalent in past than in present time; in those alleles were differences are statistically significant, we cannot discard the effect of drift or other selective forces acting upon these immunity genes. This could indicate a weak adaptation to malaria, due to either the benignity of symptoms caused by P. vivax or to an undetectable adaptive response related to a very recent arrival to the European continent (which seems to be the case of severe malaria caused by P. falciparum). We have shown however that, despite the large geographic distribution, malaria was likely not an important selective pressure in ancient European populations and not nearly as important as it is in Africa today. Additionally we demonstrate that the study of the selective effect of malaria on ancient genomes is a feasible approach that will likely provide more information in the future with increasing sample sizes, especially from the Mediterranean area. With additional genomic data, and also the retrieval of DNA from the pathogen directly from ancient bones³¹, it would be possible to check if local selective events took place and to understand the role of malaria in the shaping of modern European genomes.

Methods

Ancient genomes

We have analysed all ancient European -sensu lato- genomes with a mean sequence coverage >1x published so far^{11, 21,22,23, 32,33,34,35,36,37,38,39}. We have also included those genomes for which only targeted capture data is available^{9, 10, 30, 40,41,42}. Our final dataset comprises genome-wide-data from 224 individuals (Fig. 1,Table S1) from which we have at least information for one of the malaria-resistant markers.

Screened genetic variants

We have screened all malaria-resistant alleles from the literature, even if most of them have been described as being selective in current African populations and involved in P. falciparum resistance. Of course this cannot be directly extrapolated to European populations but the limited information on the selective effect of P. vivax or P. falciparum on Europeans -especially outside the Mediterranean area- makes alternative approaches unfeasible. We have included 20 SNPs and two deletion belonging to twelve different genes (Table 1, Table S3). Some of the associations have been investigated from a functional perspective⁴ while others have only been recently detected on GWAS studies and their potential role in the protection against the disease is not yet well understood.

G6PD, the key enzyme in the oxidative pentose phosphate pathway is probably one of the most prevalent signs of malaria-resistance in European populations, at least in the Mediterranean area. The G6PD A- is the most prevalent resistant allele in African populations. The B- mutation, known as Mediterranean G6PD has a more severe effect and is commonly found around the Mediterranean islands, in places where malaria was endemic, in frequencies of 2–20%. We have analysed the SNPs associated to the G6PD A- (rs1050828, rs1050829) and Mediterranean G6PD (rs5030868) forms. Additional G6PD mutations associated to malaria resistance have also been included (rs137852314, rs76723693, rs137852328)⁴³.

Haemoglobin beta locus (HBB) determines the sequence of two types of polypeptidic chains in adult haemoglobin, HbA. Variants in the HBB gene such as HbC and HbS change the structure of the erythrocytes and confer resistance to P. falciparum in Asian and African populations⁴⁴. The mutations that cause both HbS (rs334) and HbC (rs33930165) were screened^{1, 45}. Recent studies based on GWAS have found an additional mutation in the HBB gene related to malaria resistance (rs372091)¹². HbE caused by rs33950507 mutation) is a mainly Asiatic variant of HBB gene that also confers resistance to the infection of P. falciparum ⁴⁶.

Duffy blood group system, Fy (a-b-) (ACKR1 gene) confers resistance to P. vivax and P. knowlesi through the mutation rs2814778 that changes the structure of the protein, which is a receptor for Plasmodium in the erythrocytes⁴⁷. This phenotype is especially present in West African populations were P. vivax originated¹³.

Additional mutations have been detected by GWAS analysis at MARVELD3 (rs2334880), ATP2B4 (rs4951074 and rs10900585) and ABO (rs81767199)¹². The MARVELD3 gene encodes for a tight junction-associated transmembrane protein of endothelial cells, a key point in the pathology of malaria infection. This mutation is supposed to confer resistance to P. falciparum malaria although it is not confirmed¹². The ABO gene encodes for a glycosyltransferase enzyme. Mutations in this enzyme determine the ABO blood group. The individuals who are homozygous for a single nucleotide deletion (rs8176719) are classified as blood group O⁴⁸. O blood group is linked with a reduced risk of severe malaria^{28, 49}. However, we note that, besides malaria, the O blood group has been associated to susceptibility to other pathogens such as Helicobacter pylori ⁵⁰ and severe cholera⁵¹. We have also analyzed the rs8176746 SNP. This mutation, which is in LD with the rs8176719 variant, determines the production of B antigens⁴⁸. The presence of the derived allele in this locus is associated with an increased risk of severe malaria²⁸. Evidence of malaria protection has been also described at CD40LG gene in Gambian populations: homozygotes for the derived rs3092945 allele show a reduced risk of suffering severe malaria. However, a significant increase of severe malaria risk was found for the same allele in Kenyan populations²⁸. In the same study, a severe malaria protective effect of the SNP rs201346212 heterozygous genotype (CD36 gene) was described. Other possible evidences on P. falciparum malaria resistance include additional genes such as GRK5, a G protein receptor (mutation rs2230345)⁵², IL-10 (rs1800890), FCGR2B (rs105050) and TIRAP (rs8177374).

Imputation

Due to the low coverage in many ancient samples and gaps associated to the targeted capture data, we performed imputations of SNPs not covered by any DNA read using Beagle-4 software¹⁴ and the Tuscans (TSI) population of the 1,000 Genomes project⁵³ as a reference panel.

Some of the alleles screened involve C to T or G to A changes that could be affected by post-mortem DNA degradation⁵⁴. To control for this confounding factor we have screened the genetic background in which the SNP is located, following the same approach as in ref. 9. In brief, the problematic SNP was removed on each ancient genome, genotype likelihoods were estimated and the SNP was imputed using Beagle-4 and the phased information of the 1,000 Genomes. This approach provides us with a probability estimate that a specific allele in a given sample is likely to be there or not, even in the absence of reads covering that nucleotide position. We have considered genotypes that yielded a probability higher than 0.95.

The genotypes obtained from genome sequence data and validated with Beagle-4 software were used to generate genotype frequencies. The allelic frequencies were estimated using both validated genome sequence data and targeted capture data. Because of the impossibility to infer genotypes in the case of targeted capture data only one random allele per position was taken into account.

Statistical analysis

The allelic frequencies obtained with validated genotypes and targeted capture data were stratified by geographic position of the samples (North Europe and South Europe -the latter including Iberia, Italy, the Balkan peninsula and Anatolia-) and period (Upper Palaeolithic, Early Neolithic, Middle Neolithic, Late Neolithic, Bronze Age, Iron Age, Roman Age, Post-Roman Age).

To explore potential differences in the genotypic composition between modern South (Iberians and Tuscans) and North Europeans (Finnish, British and CEU) populations, we tested two different genetic models (codominant and logistic additive models) for each SNP, with a case-control approach. A likelihood ratio test was estimated for each SNP and both models using a Bonferroni correction. A Fisher’s exact test with a case-control approach was used for comparing allele frequencies in ancient versus modern European populations in those SNPs with more than one detected case of derived allele. Due to the assumption that ancient frequencies are directly comparable to the modern ones, the Fisher’s exact test is anti-conservative here; however we have included the p-value in the cases were we found an>1 OR, just as an additional, supporting evidence.

Population structure analysis

To determine the level of differentiation and substructure among subpopulations the statistic FST⁵⁵ was calculated. The fixation index was determined performing all the possible pairwise calculations comparing North-South present-day Europeans, the five European populations of 1000 genomes, North-South ancient Europeans and Period-stratified ancient Europeans (the populations were clustered in three supra populations: Upper Paleolithic; Pre-Neolithic, Middle-Neolithic, Past-Neolithic; Bronze Age, Iron Age, Roman, Post-Roman). Nine SNPs that exhibited variability (Table 2) were used to perform the comparisons. To test the divergence and the stratification within the population the statistic FSI (where I stands for individual and S for sub-population), was also used to estimate the departure from panmixia at the sub-population level.

Additionally the G-test was calculated as it is described in Goudet et al.⁵⁶. The authors showed that the likelihood ratio G-test, is the most powerful statistic to detect population differentiation, particularly when samples are unbalanced. These authors also showed that for diploids, the appropriate unit to randomize is the diploid genotype rather than the allele (of course, the contingency tables are based on alleles, not genotypes). The likelihood ratio G-statistic was performed on a contingency table of alleles at one locus x sampling unit; the individual genotypes were randomly permutated among samples and the G-statistic obtained from the original dataset is compared to the G-statistics obtained from the permuted datasets, yielding a P-value of the test.

Neutral selection testing

To test for the presence of selection driving the genotypic frequencies in past-populations and reject neutrality we used a specific software (https://github.com/Schraiber/selection) as described in ref. 15. This analysis based on Bayesian probabilities was performed for each allele that presented diversity among ancient individuals (Fig. 3). The 92 ancient samples with complete genotypic information were grouped in eight periods (Upper Palaeolithic, Early Neolithic, Middle Neolithic, Late Neolithic, Bronze Age, Iron Age, Roman and Post-Roman) to create a time series. The number of derived alleles was counted in each period, and selection coefficients were inferred from allele frequencies along time. The analysis was performed assuming a constant population size and 10,000 individuals of effective population size. The dates used in the simulations were: −0.04, −0.018, −0.015, −0.008, −0.006, −0.004, −0.0004 and 0 relatively to the most recent analysed age (post-Roman period). Two SNPs (rs951074 and rs2334880) presented high frequencies of the derived allele in all the sampling times. Because of the software models the allele arising from a new mutation, a new mutation is much more likely to rise to high frequency if it is under selection. In the analysis of rs951074 and rs2334880 SNPs the origin of the derived allele was not modelled because it was already extended in the initial periods. The value distributions of selection coefficients over time were plotted with R 3.2.2 (http://cran.r-project.org) software.

References

Kwiatkowski, D. P. How Malaria Has Affected the Human Genome and What Human Genetics Can Teach Us about Malaria. Am. J. Hum. Genet. 77, 171–192, doi:10.1086/432519 (2005).
Article CAS PubMed PubMed Central Google Scholar
Carter, R. Speculations on the origins of Plasmodium vivax malaria. Trends Parasitol. 19, 214–219, doi:10.1016/S1471-4922(03)00070-9 (2003).
Article PubMed Google Scholar
Rich, S. M., Licht, M. C., Hudson, R. R. & Ayala, F. J. Malaria’s Eve: Evidence of a rcent population bottleneck throughout the world populations of Plasmodium falciparum. Proc. NAtl. Acad. Sci 95, 4425–4430, doi:10.1073/pnas.95.8.4425 (1998).
Article ADS CAS PubMed PubMed Central Google Scholar
Tishkoff, S. A. et al. Haplotype diversity and linkage disequilibrium at human G6PD: recent origin of alleles that confer malarial resistance. Science 293, 455–462, doi:10.1126/science.1061573 (2001).
Article CAS PubMed Google Scholar
Ohashi, J. et al. Extended linkage disequilibrium surrounding the hemoglobin E variant due to malarial selection. Am. J. Hum. Genet. 74, 1198–1208, doi:10.1086/421330 (2004).
Article CAS PubMed PubMed Central Google Scholar
Gelabert, P. et al. Mitochondrial DNA from the eradicated European Plasmodium vivax and P. falciparum from 70-year-old slides from the Ebro Delta in Spain. Proc. Natl. Acad. Sci USA. 113, 11495–11500, doi:10.1073/pnas.1611017113 (2016).
Article Google Scholar
Liu, W. et al. African origin of the malaria parasite Plasmodium vivax. Nat. Commun. 5, 3346, doi:10.1038/ncomms4346 (2014).
ADS PubMed PubMed Central Google Scholar
Olalde, I. & Lalueza-Fox, C. Modern humans’ paleogenomics and the new evidences on the European prehistory. Sci. Technol. Archaeol. Res. 1, 1–9, doi:10.1179/2054892315Y.0000000002 (2015).
Article Google Scholar
Mathieson, I. et al. Genome-wide patterns of selection in 230 ancient Eurasians. Nature 528, 499–503, doi:10.1038/nature16152 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Haak, W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522, 207–211, doi:10.1038/nature14317 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Allentoft, M. E. et al. Population genomics of Bronze Age Eurasia. Nature 522, 167–172, doi:10.1038/nature14507 (2015).
Article ADS CAS PubMed Google Scholar
Timmann, C. et al. Genome-wide association study indicates two novel resistance loci for severe malaria. Nature 489, 443–446, doi:10.1038/nature11334 (2012).
Article ADS CAS PubMed Google Scholar
Jallow, M. et al. Genome-wide and fine-resolution association analysis of malaria in West Africa. Nat. Genet. 41, 657–665, doi:10.1038/ng.388 (2009).
Article CAS PubMed PubMed Central Google Scholar
Browning, B. L. & Browning, S. R. Genotype imputation with millions of reference samples. Am. J. Hum. Genet. 98, 116–126, doi:10.1016/j.ajhg.2015.11.020 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schraiber, J. G., Evans, S. N. & Slatkin, M. Bayesian Inference of Natural Selection from Allele Frequency Time Series. Genetics 203, 493–511, doi:10.1534/genetics.116.187278 (2016).
Article CAS PubMed Google Scholar
Rich, S. M. et al. The origin of malignant malaria. Proc. Natl. Acad. Sci. USA 106, 14902–14907, doi:10.1073/pnas.0907740106 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Nagao, E., Seydel, K. B. & Dvorak, J. A. Detergent-resistant erythrocyte membrane rafts are modified by a Plasmodium falciparum infection. Exp Parasitol 102, 57–59, doi:10.1016/S0014-4894(02)00143-1 (2002).
Article CAS PubMed Google Scholar
Miller, L. H., Mason, S. J., Clyde, D. F. & McGinniss, M. H. The Resistance Factor to Plasmodium vivax in Blacks. N. Engl. J. Med. 295, 302–304, doi:10.1056/NEJM197608052950602 (1976).
Article CAS PubMed Google Scholar
Luzzatto, L., Usanga, F. A. & Reddy, S. Glucose-6-phosphate dehydrogenase deficient red cells: resistance to infection by malarial parasites. Science 164, 839–842, doi:10.1126/science.164.3881.839 (1969).
Article ADS CAS PubMed Google Scholar
Garcia-Garcera, M. et al. Fragmentation of contaminant and endogenous DNA in ancient samples determined by shotgun sequencing; prospects for human palaeogenomics. PLoS One 6, e24161, doi:10.1371/journal.pone.0024161 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Olalde, I. et al. A Common Genetic Origin for Early Farmers from Mediterranean Cardial and Central European LBK Cultures. Mol. Biol. Evol. 32, 3132–3142, doi:10.1093/molbev/msv181 (2015).
CAS PubMed PubMed Central Google Scholar
Kılınç, G. M. et al. The Demographic Development of the First Farmers in Anatolia. Curr. Biol. 26, 1–8, doi:10.1016/j.cub.2016.07.057 (2016).
Article Google Scholar
Hofmanová, Z. et al. Early farmers from across Europe directly descended from Neolithic Aegeans. Proc. Natl. Acad. Sci. USA 113, 6886–6891, doi:10.1073/pnas.1523951113 (2016).
Article PubMed PubMed Central Google Scholar
Khor, C. C. et al. A Mal functional variant is associated with protection against invasive pneumococcal disease, bacteremia, malaria and tuberculosis. Nat. Genet. 39, 523–528, doi:10.1038/ng1976 (2007).
Article CAS PubMed PubMed Central Google Scholar
Willcocks, L. C. et al. A defunctioning polymorphism in FCGR2B is associated with protection against malaria but susceptibility to systemic lupus erythematosus. Proc. Natl. Acad. Sci. USA 107, 7881–7885, doi:10.1073/pnas.0915133107 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Clatworthy, M. R. et al. Systemic lupus erythematosus-associated defects in the inhibitory receptor FcgammaRIIb reduce susceptibility to malaria. Proc. Natl. Acad. Sci. USA 104, 7169–7174, doi:10.1073/pnas.0608889104 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Gazarini, M. L., Thomas, A. P., Pozzan, T. & Garcia, C. R. S. Calcium signaling in a low calcium environment: how the intracellular malaria parasite solves the problem. J. Cell Biol. 161, 103–110, doi:10.1083/jcb.200212130 (2003).
Article CAS PubMed PubMed Central Google Scholar
Rockett, K. A. et al. Reappraisal of known malaria resistance loci in a large multicenter study. Nat. Genet. 46, 1197–1204, doi:10.1038/ng.3107 (2014).
Article PubMed Central Google Scholar
Pybus, M. et al. 1000 Genomes Selection Browser 1.0: a genome browser dedicated to signatures of natural selection in modern humans. Nucleic Acids Res. 42, D903–D909, doi:10.1093/nar/gkt1188 (2014).
Article CAS PubMed Google Scholar
Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409–413, doi:10.1038/nature13673 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Llamas, B. et al. Ancient mitochondrial DNA provides high-resolution time scale of the peopling of the Americas. Sci. Adv. 2, e1501385–e1501385, doi:10.1126/sciadv.1501385 (2016).
Article ADS PubMed PubMed Central Google Scholar
Olalde, I. et al. Derived immune and ancestral pigmentation alleles in a 7,000-year-old Mesolithic European. Nature 507, 225–228, doi:10.1038/nature12960 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Skoglund, P. et al. Genomic diversity and admixture differs for Stone-Age Scandinavian foragers and farmers. Science 344, 747–750, doi:10.1126/science.1253448 (2014).
Article ADS CAS PubMed Google Scholar
Gamba, C. et al. Genome flux and stasis in a five millennium transect of European prehistory. Nat. Commun. 5, 5257, doi:10.1038/ncomms6257 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Seguin-Orlando, A. et al. Paleogenomics. Genomic structure in Europeans dating back at least 36,200 years. Science 346, 1113–1118, doi:10.1126/science.aaa0114 (2014).
Article ADS CAS PubMed Google Scholar
Jones, E. R. et al. Upper Palaeolithic genomes reveal deep roots of modern Eurasians. Nat. Commun. 6, 8912, doi:10.1038/ncomms9912 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Cassidy, L. M. et al. Neolithic and Bronze Age migration to Ireland and establishment of the insular Atlantic genome. Proc. Natl. Acad. Sci. USA 113, 368–373, doi:10.1073/pnas.1518445113 (2016).
Article ADS CAS PubMed Google Scholar
Martiniano, R. et al. Genomic signals of migration and continuity in Britain before the Anglo-Saxons. Nat. Commun. 7, 10326, doi:10.1038/ncomms10326 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Schiffels, S. et al. Iron Age and Anglo-Saxon genomes from East England reveal British migration history. Nat. Commun. 7, 10408, doi:10.1038/ncomms10408 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Fu, Q. et al. Genome sequence of a 45,000-year-old modern human from western Siberia. Nature 514, 445–449, doi:10.1038/nature13810 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Fu, Q. et al. An early modern human from Romania with a recent Neanderthal ancestor. Nature 524, 216–219, doi:10.1038/nature14558 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Fu, Q. et al. The genetic history of Ice Age Europe. Nature 534, 200–205, doi:10.1038/nature17993 (2016).
ADS CAS PubMed PubMed Central Google Scholar
Hirono, A. & Beutler, E. Alternative splicing of human glucose-6-phosphate dehydrogenase messenger RNA in different tissues. J. Clin. Invest. 83, 343–346, doi:10.1172/JCI113881 (1989).
Article CAS PubMed PubMed Central Google Scholar
Gouagna, L. C. et al. Genetic variation in human HBB is associated with Plasmodium falciparum transmission. Nat. Genet. 42, 328–331, doi:10.1038/ng.554 (2010).
Article CAS PubMed Google Scholar
Agarwal, A. et al. Hemoglobin C associated with protection from severe malaria in the Dogon of Mali, a West African population with a low prevalence of hemoglobin S. Blood 96, 2358–2363 (2000).
CAS PubMed Google Scholar
Zimmerman, P. A. et al. Emergence of FY*A(null) in a Plasmodium vivax-endemic region of Papua New Guinea. Proc. Natl. Acad. Sci. USA 96, 13973–13977, doi:10.1073/pnas.96.24.13973 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Tournamille, C., Colin, Y., Cartron, J. P. & Le Van Kim, C. Disruption of a GATA motif in the Duffy gene promoter abolishes erythroid gene expression in Duffy-negative individuals. Nat. Genet. 10, 224–228, doi:10.1038/ng0695-224 (1995).
Article CAS PubMed Google Scholar
Yamamoto, F., Clausen, H., White, T., Marken, J. & Hakomori, S. Molecular genetic basis of the histo-blood group ABO system. Nature 345, 229–233, doi:10.1038/345229a0 (1990).
Article ADS CAS PubMed Google Scholar
Blumenfeld, O. O. & Patnaik, S. K. Allelic genes of blood group antigens: a source of human mutations and cSNPs documented in the Blood Group Antigen Gene Mutation Database. Hum. Mutat. 23, 8–16, doi:10.1002/humu.10296 (2004).
Article CAS PubMed Google Scholar
Boren, T., Falk, P., Roth, K. A., Larson, G. & Normark, S. Attachment of Helicobacter pylori to human gastric epithelium mediated by blood group antigens. Science 262, 1892–1895, doi:10.1126/science.8018146 (1993).
Article ADS CAS PubMed Google Scholar
Swerdlow, D. L. et al. Severe life-threatening cholera associated with blood group O in Peru: implications for the Latin American epidemic. J. Infect. Dis. 170, 468–472, doi:10.1093/infdis/170.2.468 (1994).
Article CAS PubMed Google Scholar
Gupta, H. et al. Categorical complexities of Plasmodium falciparum malaria in individuals is associated with genetic variations in ADORA2A and GRK5 genes. Infect. Genet. Evol. 34, 188–199, doi:10.1016/j.meegid.2015.06.010 (2015).
Article CAS PubMed Google Scholar
Sudmant, P. H. et al. An integrated map of structural variation in 2,504 human genomes. Nature 526, 75–81, doi:10.1038/nature15394 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gilbert, M. T. P. et al. Distribution patterns of postmortem damage in human mitochondrial DNA. Am. J. Hum. Genet. 72, 32–47, doi:10.1086/345378 (2003).
Article CAS PubMed Google Scholar
Weir, B. S. & Hill, W. G. Estimating F-statistics. Annu. Rev. Genet. 36, 721–750, doi:10.1146/annurev.genet.36.050802.093940 (2002).
Article CAS PubMed Google Scholar
Goudet, J. & Roussett, F. Testing Differentiation in Diploid Populations. Genetics 144, 1933–1940 (1996).
CAS PubMed PubMed Central Google Scholar
Zhao, X., Smith, D. L. & Tatem, A. J. Exploring the spatiotemporal drivers of malaria elimination in Europe. Malar. J. 15, 122, doi:10.1186/s12936-016-1175-z (2016).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was supported by a grant to C.L.-F. from FEDER and Ministry of Economy and Competitiveness (BFU2015–64699-P) of Spain and by a grant 2014 SGR 464 (GRBIO) from the Departament d’Economia i Coneixement de la Generalitat de Catalunya (Spain) to S.C. We are grateful to Joshua Schraiber and Agnar Helgason for helpful comments and suggestions.

Author information

Authors and Affiliations

Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
Pere Gelabert, Iñigo Olalde, Toni de-Dios & Carles Lalueza-Fox
Department of Statistics, Faculty of Biology, University of Barcelona, 08028, Barcelona, Spain
Sergi Civit

Authors

Pere Gelabert
View author publications
You can also search for this author in PubMed Google Scholar
Iñigo Olalde
View author publications
You can also search for this author in PubMed Google Scholar
Toni de-Dios
View author publications
You can also search for this author in PubMed Google Scholar
Sergi Civit
View author publications
You can also search for this author in PubMed Google Scholar
Carles Lalueza-Fox
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.L.-F. and P.G. conceived the study; P.G., I.O., T.d.-D. and S.C. analyzed data; C.L.-F. and P.G. wrote the paper.

Corresponding author

Correspondence to Carles Lalueza-Fox.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Dataset 1

Supplementary Figures

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gelabert, P., Olalde, I., de-Dios, T. et al. Malaria was a weak selective force in ancient Europeans. Sci Rep 7, 1377 (2017). https://doi.org/10.1038/s41598-017-01534-5

Download citation

Received: 19 October 2016
Accepted: 30 March 2017
Published: 03 May 2017
DOI: https://doi.org/10.1038/s41598-017-01534-5

This article is cited by

Ancient DNA suggests anaemia and low bone mineral density as the cause for porotic hyperostosis in ancient individuals
- Manuel Ferrando-Bernal
Scientific Reports (2023)
Pathogenic Interleukin-10 Receptor Alpha Variants in Humans — Balancing Natural Selection and Clinical Implications
- Dominik Aschenbrenner
- Ziqing Ye
- Holm H. Uhlig
Journal of Clinical Immunology (2023)
Machine learning model for malaria risk prediction based on mutation location of large-scale genetic variation data
- Kah Yee Tai
- Jasbir Dhaliwal
Journal of Big Data (2022)
Risk score prediction model based on single nucleotide polymorphism for predicting malaria: a machine learning approach
- Kah Yee Tai
- Jasbir Dhaliwal
- KokSheik Wong
BMC Bioinformatics (2022)
Leveraging Mann–Whitney U test on large-scale genetic variation data for analysing malaria genetic markers
- Kah Yee Tai
- Jasbir Dhaliwal
- Vinod Balasubramaniam
Malaria Journal (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.