Human mitochondrial DNA lineages in Iron-Age Fennoscandia suggest incipient admixture and eastern introduction of farming-related maternal ancestry

Översti, Sanni; Majander, Kerttu; Salmela, Elina; Salo, Kati; Arppe, Laura; Belskiy, Stanislav; Etu-Sihvola, Heli; Laakso, Ville; Mikkola, Esa; Pfrengle, Saskia; Putkonen, Mikko; Taavitsainen, Jussi-Pekka; Vuoristo, Katja; Wessman, Anna; Sajantila, Antti; Oinonen, Markku; Haak, Wolfgang; Schuenemann, Verena J.; Krause, Johannes; Palo, Jukka U.; Onkamo, Päivi

doi:10.1038/s41598-019-51045-8

Article
Open access
Published: 15 November 2019

Sanni Översti¹^na1,
Kerttu Majander^1,2,3^na1,
Elina Salmela ORCID: orcid.org/0000-0003-1326-4462^1,2,
Kati Salo⁴,
Laura Arppe⁵,
Stanislav Belskiy⁶,
Heli Etu-Sihvola ORCID: orcid.org/0000-0001-6669-797X⁵,
Ville Laakso⁷,
Esa Mikkola⁸,
Saskia Pfrengle³,
Mikko Putkonen⁹,
Jussi-Pekka Taavitsainen⁷,
Katja Vuoristo⁸,
Anna Wessman ORCID: orcid.org/0000-0001-6886-5455^4,7,
Antti Sajantila⁹,
Markku Oinonen ORCID: orcid.org/0000-0002-0881-7643⁵,
Wolfgang Haak ORCID: orcid.org/0000-0003-2475-2007²,
Verena J. Schuenemann^3,10,
Johannes Krause ORCID: orcid.org/0000-0001-9144-3920²^na2,
Jukka U. Palo^9,11^na2 &
…
Päivi Onkamo^1,12^na2

Scientific Reports volume 9, Article number: 16883 (2019)

Abstract

Human ancient DNA studies have revealed high mobility in Europe’s past, and have helped to decode the human history on the Eurasian continent. Northeastern Europe, especially north of the Baltic Sea, however, remains less well understood largely due to the lack of preserved human remains. Finland, with a divergent population history from most of Europe, offers a unique perspective on the hunter-gatherer way of life, but thus far genetic information on prehistoric human groups in Finland is nearly absent. Here we report 103 complete ancient mitochondrial genomes from human remains dated to AD 300–1800, and explore mtDNA diversity associated with hunter-gatherers and Neolithic farmers. The results indicate largely unadmixed mtDNA pools of differing ancestries from the Iron Age on, suggesting a rather late genetic shift from hunter-gatherers towards farmers in North-East Europe. Furthermore, the data suggest an eastern introduction of farmer-related haplogroups into Finland, contradicting contemporary genetic patterns in Finns.

Introduction

Genetic studies on anthropological remains have greatly helped to shed light on various human populations as well as on past events and processes. These include, for instance, the initial colonization of Europe by modern humans c. 40 kya^1,2, the Holocene hunter-gatherer communities^1,3, the major population turnover associated with the Neolithic spread of agriculture from Anatolia^3,4,5, and the massive Bronze-Age influx of genes, culture and customs of the Yamnaya-related people into Europe from the Pontic-Caspian steppe^6,7.

In terms of mitochondrial DNA (mtDNA), these cultural turnovers and population migrations in Europe also involved changes in the haplogroup composition. In hunter-gatherer populations the dominating mitochondrial lineage was U, especially its subgroups U4, U5a and U5b³. With the advent of the Neolithic revolution, these U subgroups were largely supplanted by the farmer-associated haplogroups H, HV, J, K, N1a, T2 and W^4,5. The subsequent spread of Yamnaya-related people and the Corded Ware Culture in the late Neolithic and Bronze Age was accompanied by an increase of haplogroups I, U2 and T1 in Europe (see⁸ and references therein).

Whereas the ancient DNA (aDNA) composition and its changes in mainland Europe are increasingly well understood, northeastern Europe has been far less studied. The oldest human DNA analyzed in this region derives from the Mesolithic burial sites in Huseby Klev, western Sweden (9800 calBP⁹), Hummervikholmen in Norway (9300 calBP¹⁰) and Yuznuy Olennij Ostrov on Lake Onega, Russia (~8400 calBP^7,11). In addition, Mesolithic to Bronze-Age DNA data from several sites in the Baltic countries have been published^6,12,13,14.

Despite the close geographical proximity, little is known about the ancient DNA diversity in regions immediately north of the Baltics. This is largely due to the scarcity of preserved anthropological remains. In the hemiboreal forest zone the soil pH, together with annual freeze-thaw cycles, has highly detrimental effects on bone material, to the extent that no unburnt remains older than ~2000 years exist¹⁵. However, archaeological evidence strongly suggests that the most notable colonization events in the region took place much earlier^16,17. Consequently, the lack of archaeological bone material gravely limits the capability of aDNA studies to resolve the human population history of the Taiga belt and the processes that have shaped present-day diversity. Despite these shortcomings, aDNA has recently been recovered from c. 1500-year-old bones from Levänluhta in western central Finland^18,19. Genomic data from these samples show a Siberian ancestry component that is still prominently present today, particularly in the indigenous Saami people, and to a lesser extent in modern Finns. Although these data suggest a widespread presence of genetically Saami-like people around eastern Fennoscandia during the Iron Age, wider sampling in space and time is necessary for understanding the past population dynamics and the emergence of the contemporary genetic diversity in Finland.

In terms of both genetics and culture, modern Finns show a unique combination of eastern and western European elements, which most likely reflects the settlement history. The first archaeological evidence of human presence in Finland dates relatively late, to c. 11000–9000 years ago, soon after the continental ice sheet retreated. According to the archaeological record, the region has since supported continuous human occupation until today¹⁷. Several influential waves of material culture have been shown to extend into Finland: the first brought the Sperrings or Säräisniemi pottery to the area c. 7500 years ago²⁰, and the second introduced the typical Comb Ceramic c. 6000 years ago^16,17, one of the most influential prehistoric cultures in the wider region. Finally, around 4700 years ago, the Corded-Ware culture reached Finland^16,17,21. The most significant cultural changes, possibly driven by expansions, proceeded from the east/south-east and extended into most of today’s territory of Finland, while the Corded-Ware culture influence spread from the south and occupied only the southwestern part of Finland. Later, the Bronze Age brought an increase of human activity¹⁷ and a coinciding advance of cereal cultivation²². Finland also saw a pronounced Scandinavian impact along the western coast, while the inland was dominated by eastern influences, seen e.g. in the arrival of ferrous metallurgy c. 500 BC and in stylistic features of Bronze-Age and Iron-Age items²¹. The Iron Age in Finland starts around 500 BC and continues until the end of the Crusade period (c. 1200 AD in the west and 1300 AD in the east). Unlike in most of Europe, the Middle Ages start as late as 1200 AD in the western parts of the country and 1300 AD in the east, and shift into the Early Modern period at the beginning of the 16th century (for a review of archaeological and historical periods in Finland see²¹).

Linguistically, contemporary Finns and Saami differ from most other Europeans in speaking Uralic languages, unrelated to the majority of European languages, which belong to the Indo-European language family. Finns are also genetically distinct from their neighboring populations and form outliers in the genetic variation within Europe²³. This genetic uniqueness derives from both reduced genetic diversity^24,25 and an Asian influence on the gene pool²⁴. Within Finland, an unusually strong genetic border bisects the population along a northwest to southeast axis^24,26,27, and is interpreted to reflect an ancient boundary between hunter-gatherer and farmer populations²⁸. The expansion of agriculture north-east of this border was probably limited by environmental factors, especially the length of the growing season. Later, this border most likely demarcated the spread of western and eastern political and cultural influences, affecting the placement of the first political border between Sweden and Novgorod through the middle of Finland (Treaty of Nöteborg, 1323 AD).

To gain better insight into the genetic history of Finns, we here describe 103 complete mitochondrial genomes reconstructed from bone samples from ten burial sites in southern Finland and the Republic of Karelia, Russia (former Finnish territory; Fig. 1). The main focus is on the 70 complete mitochondrial genomes from five archaeological burial sites in Finland, spanning spatially from the western coast to Lake Ladoga and temporally from the Late Roman Iron Age (300 AD) to the Middle Ages (1500 AD) (Tables 1, S1 and Supplementary Material S1). In addition, we include 33 mitochondrial genomes from later, mainly historical burials (1400–1800 AD) from five sites across southern Finland. While mtDNA genomes of Iron-Age and Early-Medieval Finland cannot be used to directly target questions about the colonization of Finland, they provide a spatio-temporal transect of the maternal ancestry of the early inhabitants in this region, and help to understand patterns observed in Finland’s modern mtDNA diversity.

Table 1 Sites presented in this study.

Results

Authenticity of ancient-DNA results

Based on the shotgun sequencing, out of the total of 141 individuals sampled, 134 were included in mitochondrial capture. Mitochondrial genomes for 103 individuals passed the quality control thresholds, while 31 samples were excluded from further analyses due to insufficient data (less than fivefold mitochondrial coverage) or high contamination levels (Supplementary Table S1). Ancient-DNA yield for all 103 samples was assessed with several authentication criteria. All samples showed fragment sizes ranging from 40 to 250 bp, as expected for ancient DNA²⁹. Fragments under 30 bp were filtered out as a mapping quality control. All samples had an average fragment length of 47 to 95 bp. Authentic ancient DNA is typically more fragmented than modern DNA, and fragments as short as 50–65 bp are common. The samples included in the downstream analyses yielded between 1426 and 395345 unique human mitochondrial fragments, with an average coverage ranging from 5-fold to 1683-fold. The first-base damage on the fragments varied between 5–36% on the 3′-end and 4–34% on the 5′-end. Previous studies have shown that cytosine deamination is influenced by the age of the sample^30,31 and the mean temperature of the site³¹. Considering the climatic conditions in Finland, e.g., low mean temperature, and the relatively young age especially of the post-medieval samples, 3′ and 5′ damage values below 5% are plausible. No samples were therefore omitted from the study based on these criteria.

The contamination rates of the 103 samples were further evaluated by Schmutzi³². None of the samples had a Schmutzi contamination estimate exceeding 5% (Supplementary Table S1). The samples were then analyzed with ContamMix³³: the resulting crude contamination estimates as well as the a posteriori estimates of contamination along with their 95% confidence intervals (CI) from the MCMC are reported in Supplementary Table S1. The upper bounds of the CIs ranged from 0% to 17.2%; in ten cases they exceeded 10%, even though the estimates by Schmutzi had remained below 5%. These cases were visually inspected with Geneious 11.0.3 (www.geneious.com). For each of them, the majority call supported the previously assigned haplogroup.

Radiocarbon datings

For this study, we report new ¹⁴C dates for 42 individuals (Supplementary Table S1). Radiocarbon dates for nine individuals were determined previously (see Supplementary Table S1). Based on radiocarbon dates and/or dating of the context, the studied burial sites cover the timespan from the Roman Iron Age (300 AD) to historical times (19th century). For sites Levänluhta, Luistari, Hollola, Hiitola, Tuukkala, Pälkäne and Porvoo the highest posterior densities (HPD) for site’s start and end boundaries were determined. The mean values for obtained phase boundaries are presented in Table 1, and 68% and 95% HPD regions are presented in Supplementary Table S2 and in Supplementary Fig. S1. Intervals for mean values of boundaries obtained based on radiocarbon dates were in accordance with dates determined based on the archaeological context (Table 1).

MtDNA data and haplotypic variation

A total of 95 unique complete-mitogenome haplotypes were observed among the 103 complete sequences retrieved: three haplotypes were shared between sampling sites and five within a site. In the latter cases, the placement of the skeletal samples suggests that the shared haplotypes have been carried by different individuals, who may have been maternally related: identical haplotypes (haplogroup U5a2a1e) were obtained from remains of a c. 5-year-old child (grave 18, TU666) and an older woman (grave 7, TU655) from Hollola. Identical haplotypes (haplogroup H85) were also observed in a middle-aged adult (grave 6, TU661) and a c. 18-month-old child (grave 15, TU668) from Hollola. At the Hiitola site, identical haplotypes (haplogroup W6) were shared between two individuals from distinct graves (individual TU566 from grave 80 and individual TU675 from grave 30). At the Tuukkala site, two individuals showed identical haplotypes (haplogroup H10e, individuals TU631 and TU645). At Turku, two adults shared the same haplotype belonging to the basal haplogroup H (samples TU582 and TU588). Haplotypes for 103 individuals are presented in Supplementary Table S3.

As the subsequent statistical methods assume that samples derive from unrelated individuals, five samples - one from each pair of identical haplotypes within a site (TU666, TU668, TU675, TU645 and TU588) - were removed from the subsequent analyses due to their possible maternal relatedness.
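The filtering step described above can be sketched as follows. This is a minimal illustration, not the authors' procedure: the record layout and the function name are hypothetical, the haplogroup label stands in for the full mitogenome haplotype, and the rule of keeping the first-listed member of each pair is an assumption (the text only states which samples were removed).

```python
# Illustrative records: (sample_id, site, haplotype_key). The haplotype_key is a
# stand-in for the complete mitogenome sequence; the haplogroup label is reused
# here only because the paired samples named in the text share identical haplotypes.
records = [
    ("TU655", "Hollola", "U5a2a1e"),
    ("TU666", "Hollola", "U5a2a1e"),
    ("TU661", "Hollola", "H85"),
    ("TU668", "Hollola", "H85"),
    ("TU566", "Hiitola", "W6"),
    ("TU675", "Hiitola", "W6"),
    ("TU631", "Tuukkala", "H10e"),
    ("TU645", "Tuukkala", "H10e"),
    ("TU582", "Turku", "H"),
    ("TU588", "Turku", "H"),
]


def drop_within_site_duplicates(rows):
    """Keep the first record of each (site, haplotype) pair; return (kept, dropped) IDs."""
    seen = set()
    kept, dropped = [], []
    for sample_id, site, haplotype in rows:
        key = (site, haplotype)
        if key in seen:
            dropped.append(sample_id)  # second carrier of an identical haplotype
        else:
            seen.add(key)
            kept.append(sample_id)
    return kept, dropped


kept, dropped = drop_within_site_duplicates(records)
print(dropped)  # ['TU666', 'TU668', 'TU675', 'TU645', 'TU588']
```

Haplotypes shared between different sites are left untouched by this rule, which matches the text: only within-site pairs were treated as potentially related.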

The mean number of pairwise differences, calculated from complete mitochondrial genomes, was highest within Porvoo (MNPD = 33.7 ± 16.8) and lowest within Renko (MNPD = 21.8 ± 10.8) (Supplementary Table S4). Due to the small number of individuals per site and the use of unique complete mtDNA sequences, haplotype diversities (H) were relatively high (with mean 1.0 and standard deviation ranging from 0.0202 to 0.1768).
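As a rough illustration of the two statistics, the sketch below computes the mean number of pairwise differences and a haplotype diversity for a toy set of aligned sequences. It assumes Nei's unbiased estimator for H and simple mismatch counting without special handling of gaps or ambiguous bases; the software and settings actually used are not stated in this section, and the toy sequences are invented.

```python
from collections import Counter
from itertools import combinations


def pairwise_differences(a, b):
    """Number of positions at which two aligned, equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def mean_pairwise_differences(seqs):
    """Average mismatch count over all unordered pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    return sum(pairwise_differences(a, b) for a, b in pairs) / len(pairs)


def haplotype_diversity(seqs):
    """Nei's unbiased estimator: n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    counts = Counter(seqs)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_sq)


# Invented toy site with three distinct haplotypes.
toy_site = ["ACGT", "ACGA", "TCGA"]
mnpd = mean_pairwise_differences(toy_site)
h = haplotype_diversity(toy_site)
print(round(mnpd, 3), round(h, 3))
```

When every sequence in a site is unique, this estimator gives H = 1 regardless of sample size, which is consistent with the mean of 1.0 reported above for sites with unique complete mitogenomes.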

MtDNA haplogroup composition at the ancient sites

Burial site-specific haplogroup frequencies of the 98 complete mitochondrial sequences showed considerable between-site variation (Fig. 2 and Supplementary Table S5). The observed frequencies of the main haplogroups in the whole dataset resembled the prevalence among contemporary Finns. As today, haplogroups U and H were the most common, yet with slightly higher overall frequencies than today (U 33.7% vs. 24.1%, and H 41.8% vs. 33.2%). However, when grouped temporally into Iron-Age and medieval sites (IAM) and early-modern and modern sites (EMM), differences were observed: the IAM sites (i.e., Levänluhta, Luistari, Hollola, Hiitola and Tuukkala) demonstrated significantly higher overall prevalence of haplogroup U (40.9%) than the EMM sites (i.e., Pälkäne, Porvoo, Renko, Turku and Hamina, 18.8%) but also high inter-site variability. Among the EMM samples haplogroup H dominated (U 18.8%, H 46.9%).

This inter-site variability of the haplogroup U/H ratio had a clear spatial pattern also among the IAM samples. The western cluster (IAM south-west: Levänluhta, Luistari and Hollola) had average U and H frequencies of 58.3% and 27.8%, respectively, whereas the corresponding values in the eastern cluster (IAM east: Hiitola, Tuukkala) were 20.0% and 53.3%. In IAM east the highest frequency for an individual subhaplogroup was 30.0% obtained for H1. Strikingly, this U/H ratio is the opposite compared to contemporary eastern and western Finns.

Differences in haplogroup composition between the sites

Among the 12 Levänluhta samples, five individuals carried haplogroup (hg) U5b, four of which belonged to the sub-hg U5b1b1a. Additionally, the Levänluhta site included three individuals with hg U5a, resulting in a total frequency of 66.7% for hg U5. In contrast, with only two haplotypes of sub-hg H1, the frequency of hg H was well below values observed in modern European populations. The high U5b1b1a frequency resembles that observed today in Saami populations of northern Europe. This corresponds well with a related recent study showing the close genetic affinity between Levänluhta individuals and modern Saami¹⁸. However, the Levänluhta individuals also carried mtDNA haplogroups that are absent or rare among the Saami population today: U5a, H1 (0.0–4.0%³⁴) and haplogroups K and T. The Levänluhta site clearly showed a unique composition, which resulted in significant genetic distances to all other ancient sites at sequence level, with Φ_ST values of >10% (see below).
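The U5 frequency quoted above follows directly from the counts in the text, as the short check below shows. The helper name is hypothetical, and the single K and single T carrier are an assumption: the text names these haplogroups without giving counts, and one of each brings the total to 12.

```python
# Levänluhta sub-haplogroup labels as described in the text (n = 12).
# Assumption: one K and one T carrier, which makes the counts sum to 12.
levanluhta = ["U5b"] * 5 + ["U5a"] * 3 + ["H1"] * 2 + ["K", "T"]


def frequency(labels, prefix):
    """Share of labels starting with the given haplogroup prefix.

    Prefix matching is only safe for prefixes that cannot be confused with
    another lineage (for example, "H" would also match "HV").
    """
    return sum(label.startswith(prefix) for label in labels) / len(labels)


u5 = frequency(levanluhta, "U5")
print(f"U5: {u5:.1%}")  # U5: 66.7%
```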

Individuals from the Hollola site, ¹⁴C dated to 955–1390 calAD (Table 1), also showed a high overall frequency of hg U (64.3%), similar to Levänluhta. However, differences in subhaplogroup distribution between Hollola and Levänluhta suggest a possible non-modern-Saami-like hunter-gatherer ancestry in this region. Interestingly, subhaplogroup U5b1b1a, typical among contemporary Saami, was not observed in Hollola. In contrast, most of the Hollola U haplotypes belong to haplogroups U4 and U5a (frequencies in Hollola 28.6% and 14.3%, respectively), which are rare or absent in Saami today³⁴. Moreover, U4 is also rare in modern Finns while the frequency for U5a is around 6%^28,35. Haplotypes belonging to different subhaplogroups of hg H were more common in Hollola than in Levänluhta, occurring in altogether five samples. Haplogroups K and T were absent in the Hollola sample.

A rather different picture emerged from the Luistari samples, which showed a substantial genetic distance to Levänluhta (Φ_ST = 0.134, p < 0.01). Haplogroup U5b1 was entirely absent, and the U haplotypes observed belong to subhaplogroups U4, U5b2 and U2. Lineage U2 is prevalent in some Uralic-speaking groups today³⁶. The overall haplogroup distribution in Luistari, dominated by the agriculture-associated Neolithic haplogroup H and with occurrences of T2 and W1 (see Introduction), was more similar to that of modern European populations than the distributions at the Levänluhta and Hollola sites.

The two easternmost sites, Hiitola and Tuukkala, proved genetically distant from the western Levänluhta and Hollola sites, despite being approximately contemporaneous with the Hollola individuals. The Neolithic signal in the mtDNA gene pool of ancient Finns in general was much stronger in the east. Both Hiitola and Tuukkala samples showed high frequencies of hg H (61.5% and 47.1%, respectively), together with other Neolithic haplogroups J, K, W and X. Notably, these eastern sites shared three haplogroups: H1a7, H1a8a and H10g. According to GenBank searches these three haplogroups are rare in modern populations: for H1a7 four modern sequences were found, two in Finnish (KY620272 and MF686118), one in Swedish (KJ487971) and one in British (GU797829) populations. For haplogroup H1a8a only two matches were found, one among Finnish (JX153203) and one of an unknown origin (JQ701944), whereas three modern sequences were found for haplogroup H10g: two Finnish (KR732275 and MF497508) and one from Russia (GU122976). Notably, H1* are known to be common in modern Karelia³⁷. The eastern sites also comprise rare subhaplogroups U1 (hg U1b2 in Hiitola) and U8 (hg U8b1a2b in Tuukkala), which are atypical for contemporary Finns.

The early-modern and modern sites show frequencies of U and H (18.8% and 46.9%, respectively) similar to those of the combined Iron-Age and medieval eastern sites. In contrast to the IAM sites and contemporary Finns, the EMM sites harbor a high prevalence of haplogroup T: its frequency in EMM is as high as 21.9%, while in other Finnish populations the frequency is less than 8% (Supplementary Tables S5 and S8). Individual JK1954 from Hamina belonged to haplogroup C, which is lacking from contemporary Finns²⁸ (Supplementary Table S5) and suggests a possible eastern origin. Nevertheless, additional autosomal data are needed to confirm the genetic background of individual JK1954.

When contrasted with haplogroup frequencies observed in contemporary Finns, our simulations (Supplementary Fig. S2) showed that the ancient sites are significantly different, and that these differences cannot be explained by sampling effects. This applied especially to haplogroup U5 in total and to subhaplogroup U5b in Levänluhta, hg U4 in both Luistari and Hollola as well as hg H1 in the Hiitola dataset.
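To give a feel for how sampling effects can be ruled out, the sketch below asks how often a sample of 12 drawn from a population with the modern hg U frequency would contain at least 8 carriers. It is a simplified stand-in, not the simulation behind Supplementary Fig. S2: it assumes simple binomial sampling, and it uses the overall modern frequency of hg U (24.1%) as a conservative upper bound for the modern U5 frequency, which is not given in this section. Function names are hypothetical.

```python
import random
from math import comb


def binomial_upper_tail(k, n, p):
    """Exact P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def simulated_upper_tail(k, n, p, draws=20000, seed=1):
    """Monte Carlo estimate of the same tail probability."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(draws):
        carriers = sum(rng.random() < p for _ in range(n))
        if carriers >= k:
            hits += 1
    return hits / draws


n, k = 12, 8        # Levänluhta: 8 of 12 individuals carried hg U5
p_modern = 0.241    # modern frequency of all of hg U, an upper bound for U5
exact = binomial_upper_tail(k, n, p_modern)
approx = simulated_upper_tail(k, n, p_modern)
print(f"exact tail: {exact:.4f}, simulated tail: {approx:.4f}")
```

Under these assumptions the tail probability is well below 1%, so a U5 count this high is unlikely to arise from drawing 12 individuals out of a population with modern frequencies.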

Genetic distances among sites and to contemporary Finns

When we calculated genetic distances between sites, we observed that Levänluhta differed significantly from all the other sites, except Hollola (Φ_ST = 0.05042, p = 0.02441) and Tuukkala (Φ_ST = 0.04387, p = 0.06055) (Fig. 3a and Supplementary Table S6). The largest distance from Levänluhta was to the eastern Hiitola site (Φ_ST = 0.15468). The distance between Levänluhta and contemporary Finns was smaller but still significant, with a distance to contemporary north-east (NE) of Φ_ST = 0.04077 and a slightly higher distance to contemporary south-west (SW) of Φ_ST = 0.06473. While Luistari differed only from Levänluhta, the Hollola site differed both from Hiitola and the EMM (Φ_ST = 0.05205 and Φ_ST = 0.05135, respectively, p < 0.05 for both), but not from Levänluhta (Φ_ST = 0.06445, p > 0.05) (Fig. 3a and Supplementary Table S6). Hiitola differed, in addition to Levänluhta and Hollola, from EMM and from both groups of contemporary Finns (Φ_ST = 0.04111 for NE and Φ_ST = 0.03437 for SW). When considering the genetic distances between individual sites, it has to be noted that the relatively small sample sizes might affect the Φ_ST values, and the results should be interpreted with caution. However, for the pooled IAM and EMM sites (see Fig. 2b), for which the sample sizes are ≥30, the genetic distance calculations should be less sensitive to bias caused by small sample sizes.
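For readers unfamiliar with Φ_ST, the following sketch computes it for toy data using the standard one-level AMOVA variance decomposition, with the number of pairwise differences as the squared distance. This is an assumption about the estimator: the software, distance model and permutation test used for the reported values are not described in this section, and the function names and sequences are illustrative only.

```python
from itertools import combinations


def hamming(a, b):
    """Number of differing positions between two aligned sequences."""
    return sum(x != y for x, y in zip(a, b))


def phi_st(populations):
    """Phi_ST from a one-level AMOVA; populations is a list of lists of aligned sequences."""
    sizes = [len(pop) for pop in populations]
    total_n = sum(sizes)
    n_pops = len(populations)

    # Sum of squared deviations within populations.
    ssd_within = sum(
        sum(hamming(a, b) for a, b in combinations(pop, 2)) / len(pop)
        for pop in populations
    )
    # Total sum of squared deviations over the pooled sample.
    pooled = [seq for pop in populations for seq in pop]
    ssd_total = sum(hamming(a, b) for a, b in combinations(pooled, 2)) / total_n
    ssd_among = ssd_total - ssd_within

    msd_among = ssd_among / (n_pops - 1)
    msd_within = ssd_within / (total_n - n_pops)
    # Effective per-population sample size, which corrects for unequal group sizes.
    n0 = (total_n - sum(n * n for n in sizes) / total_n) / (n_pops - 1)

    var_among = (msd_among - msd_within) / n0
    var_within = msd_within
    denom = var_among + var_within
    return var_among / denom if denom else 0.0


# Two toy "sites" that share no haplotypes and have no internal variation.
fully_differentiated = phi_st([["AAAA", "AAAA"], ["TTTT", "TTTT"]])
# Two toy "sites" with identical haplotype composition.
identical_composition = phi_st([["AAAA", "TTTT"], ["AAAA", "TTTT"]])
print(fully_differentiated, identical_composition)
```

Values near 1 indicate that almost all sequence variation lies between sites, while values at or below 0 indicate no detectable differentiation. With small samples the estimate can be negative, which is one reason for the caution about sample sizes expressed above.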

Clustering the IAM sites further roughly according to their geographical location to IAM south-west (hg U more prevalent) and IAM east (hg H more prevalent) further demonstrated the pattern opposite to modern mtDNA diversity distribution (Fig. 3b). IAM south-west differed statistically significantly from contemporary SW (Φ_ST = 0.01670, p = 0.00488) and EMM (Φ_ST = 0.05350, p = 0.00098) but not from contemporary NE (Φ_ST = 0.00036, p = 0.41895). In addition, EMM and contemporary SW differed from each other (Φ_ST = 0.01140, p = 0.04102). Conversely, IAM east differed from the contemporary NE (Φ_ST = 0.00849) more than from contemporary SW (Φ_ST = 0.00514).

The haplotype-level median-joining network (Supplementary Fig. S3) demonstrates that ancient and contemporary Finns exhibit, in principle, the same main haplogroups, whereas the most notable differences are in the haplogroup frequencies between the ancient populations. Individuals from the IAM eastern sites are more prevalent in the haplogroup H cluster, while individuals from the IAM southwestern sites are more concentrated in the haplogroup U cluster. Contemporary Finns are found in both clusters, indicating a possible mixture of the IAM southwestern and IAM eastern populations.

Main haplogroup frequencies in space and time

To evaluate the possible impact of spatial and temporal factors on the distributions of haplogroup U, largely associated with European hunter-gatherers, and the farmer-associated haplogroup H within the IAM sites, we performed multinomial logistic regression analyses. In a stepwise forward analysis, the only statistically significant independent variable explaining the differences in the haplogroup composition was the distance from the eastern reference point Lahdenpohja (for ‘H’ and ‘Others’, the significance for Lahdenpohja was 0.013 and 0.103, respectively) (Supplementary Table S7). Neither the ages of the samples nor the distances from the southern and western reference points were required for the best-fit model. However, the addition of the eastern reference point significantly improved the fit between model and data (p = 0.027). Based on the odds ratios, an individual from the southwest is less likely to belong to haplogroups ‘H’ or ‘Others’ than an individual from an eastern archaeological site. Similar results were obtained when using hunter-gatherer-associated haplogroups (U and V), farmer-associated haplogroups (H, J, K and T) and ‘Others’ as categorically distributed dependent variables. We chose to include haplogroup V as ‘hunter-gatherer’ although there is no direct evidence for an association of hg V with hunter-gatherers. This is assumed here because of V’s northern distribution and its high prevalence (up to 58%³⁴) among the Saami, the archetypal nomadic population lacking many farmer-associated haplogroups^34,38. Distance from the eastern reference point was the only predictor included in the model (with a significance of 0.031 for farmer-associated haplogroups and 0.082 for other haplogroups). Assuming that haplogroups U and H can be associated with hunter-gatherers and farmers, respectively, the results suggest that the more Central European-like, farmer-related haplogroups spread from the east. However, as mentioned above, the association of hg V is unclear. Omitting V from the hunter-gatherer group does not notably change the results (Supplementary Table S7).
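The structure of such a model can be written out as below. This is the generic form of a multinomial logit with a single predictor, given as an aid to reading the odds ratios; treating hg U as the reference category is an assumption based on the wording above, and the symbols d (distance from Lahdenpohja), β and OR are introduced here for illustration rather than taken from the paper.

```latex
\ln\frac{P(\mathrm{hg}=k \mid d)}{P(\mathrm{hg}=\mathrm{U} \mid d)}
  = \beta_{0k} + \beta_{1k}\, d,
\qquad k \in \{\mathrm{H}, \mathrm{Others}\},
\qquad \mathrm{OR}_k = e^{\beta_{1k}}
```

In this form, an odds ratio below 1 for category k means that the odds of carrying k rather than U decrease as the distance from the eastern reference point grows, which is the direction of effect described in the text.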

Genetic affinities of ancient Finns to other ancient and contemporary populations

To further explore the affinity of ancient Finns to other ancient and contemporary populations, we carried out principal component analysis (PCA) based on haplogroup frequencies. We plotted the first two principal components, which account for 55% of the total variance, for ancient Finns, 31 other ancient populations, contemporary Finns and Saami (Figs 4, S4). Interestingly, the southwestern Iron-Age sites Levänluhta and Hollola fall close to hunter-gatherer populations from Baltic, Central and Southern Europe. In addition, Levänluhta is located in proximity to the modern-day Saami. This suggests a hunter-gatherer type of maternal ancestry at these two sites. In contrast, the eastern IAM sites Hiitola and Tuukkala, the EMM sites and contemporary SW Finns clustered with European Neolithic, Bronze-Age and Iron-Age populations. The southwestern site Luistari, as well as the contemporary NE Finns, were located roughly between these two clusters, indicating a possible mix of maternal ancestry from hunter-gatherers and Neolithic farmers. However, as with the genetic distances between sites presented above, the small sample sizes of the ancient populations might cause the haplogroup frequencies to deviate from those of the original source population, subsequently affecting the PCA. To evaluate the possible bias, we performed random subsampling of contemporary SW and NE Finns (fifty iterations, N = 15 for each) and carried out PCA with the same reference populations as for Fig. 4. Supplementary Fig. S5 demonstrates the amount of variation induced.
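The subsampling step can be sketched as follows. This covers only the resampling, not the PCA, and it is a simplified stand-in for the authors' procedure: the reference pool is hypothetical, built to match the overall modern frequencies of U (24.1%) and H (33.2%) quoted earlier with all remaining lineages lumped together, and the function name and seed are arbitrary.

```python
import random
from collections import Counter


def subsample_frequencies(pool, n, iterations, seed=0):
    """Draw `iterations` random subsamples of size n (without replacement)
    and return the haplogroup frequencies observed in each."""
    rng = random.Random(seed)
    results = []
    for _ in range(iterations):
        counts = Counter(rng.sample(pool, n))
        results.append({hg: c / n for hg, c in counts.items()})
    return results


# Hypothetical pool of 1000 modern individuals (U 24.1%, H 33.2%, rest lumped).
pool = ["U"] * 241 + ["H"] * 332 + ["Other"] * 427
runs = subsample_frequencies(pool, n=15, iterations=50)

# Spread of the U frequency across the fifty subsamples of 15 individuals.
u_freqs = [run.get("U", 0.0) for run in runs]
print(f"U frequency range across subsamples: {min(u_freqs):.2f} to {max(u_freqs):.2f}")
```

With only 15 individuals per draw, each carrier shifts a frequency by almost 7 percentage points, so the subsamples scatter widely around the pool value; this is the kind of variation that the subsampled PCA is meant to visualize.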

Discussion

Here we report 103 human mitochondrial DNA genomes from approx. 300 AD to 1800 AD, a transect both in time and space, which represents thus far the largest collection of individuals with ancient human DNA analyzed from Finland. Analysis of the prehistoric samples from Iron-Age and medieval sites from western and eastern Finland revealed a high overall prevalence of haplogroup U in southwestern sites, in stark contrast with a high frequency of haplogroup H in the east, which is opposite to what is observed in modern day Finland. Moreover, there is relatively high differentiation between the ancient sites.

Genetic layers of mitochondrial variation among the Iron Age and Medieval Finns

Unexpectedly high variation in maternal lineages could be observed between the southwestern Levänluhta, Luistari and Hollola Iron-Age sites. In particular, the distribution of U subhaplogroups differed clearly between sites: the oldest site, Levänluhta, showed a high frequency of U5a and of the modern Saami-related haplogroup U5b1b1a, which is present in contemporary Finns only at a moderate frequency of around 3.0%³⁵. Indeed, recent studies of Levänluhta, in which nuclear genomes have been retrieved, confirm the genetic continuity with the modern Saami population^18,19. The strong drift experienced by present-day Saami groups, shown by their high levels of linkage disequilibrium (LD) throughout the genome and low diversity in uniparental markers (see³⁸ and references therein), could explain why some mtDNA lineages, such as U5a, would have vanished from the present-day Saami.

Both Luistari and Hollola lacked U5b1b1a; instead, Hollola displayed a wider variety of other U5 subhaplogroups, such as U5a1, U5a2 and U5b2. In addition, the Luistari and Hollola sites showed relatively high frequencies of different subhaplogroups of U4 (i.e., U4a, U4b and U4d), which are rare in contemporary Finns and absent from modern Saami. In contemporary populations, U4 instead exists at high frequencies in the Volga-Ural region (up to 24% in Komi-Zyryans)³⁶ and at lower frequencies around the Baltic Sea, such as in Latvians and Tver Karelians (both around 8%)³⁷. Taking into account that U4 has been prevalent in neighboring areas among Scandinavian^10,39,40,41,42,43 and Baltic hunter-gatherers^12,13,44, in the Baltic Comb Ceramic Culture^12,13,14 and in Siberia during the Early Metal Period¹¹, we might be observing ancestries belonging to an earlier layer of ancient inhabitants of the region.

Taking these different distributions of mtDNA haplogroups from the Iron-Age and medieval sites into consideration, our results suggest three different streams of mitochondrial ancestry: Saami-like haplogroups (U5b1b1a, possibly also U5a), non-Saami-like hunter-gatherer-related haplogroups (especially U4) and haplogroups associated with Neolithic farmers (H, J, K and T). In this context we use ‘Saami-like’ for lineages that show genetic continuity with modern-day Saami groups. Different proportions of these ancestries could be observed both in the later EMM sites and in modern-day southwestern and northeastern Finns. This suggests that the proportion of each of these mitochondrial ancestries has fluctuated over space and time.

The ancient distribution of mtDNA lineages contradicts the contemporary east-west divergence

The Finnish population has been the subject of a multitude of medical genetic studies for many decades. Assessments of genetic diversity have revealed a number of idiosyncrasies in the modern Finnish gene pool. These include, for instance, the enrichment of c. 40 rare genetic diseases and the absence of some that are major in the rest of the European metapopulation, in clear distinction from the largely clinal differences observed in most of Europe⁴⁵. Furthermore, these studies have demonstrated the existence of notable genetic differentiation between the southwestern and northeastern parts of Finland^24,26,27. This differentiation is especially pronounced in Y-chromosomes, which show opposite frequency trends for haplogroups N1c (25% SW, 75% NE) and I (56% SW, 24% NE)²⁶.

The modern mitochondrial DNA diversity in Finland resembles that observed in Central Europe, but holds a relatively high overall frequency of haplogroup U, and also a notable proportion of subhaplogroups which have frequency peaks in or are exclusive to Finland³⁵. The genetic substructure within Finland is minimal when considered at the level of mtDNA haplotypes, but pronounced in the frequencies of haplogroups assumed to be Paleolithic (here U and V) or Neolithic (H, J, K, T) in Europe: the Paleolithic haplogroups are more common in the north-east (“Contemporary NE” subpopulation), and the Neolithic haplogroups in the south-west (“Contemporary SW”). This, together with the Y-chromosomal subdivision, has been interpreted to reflect an ancient border between populations relying on farming (south-west Finland) and foraging (north-east). The observed genetic border running diagonally from north-west to south-east coincides with a number of linguistic and cultural differences, extending all the way to folk traditions. It also coincides with the first medieval political border, the Treaty of Nöteborg, between the Swedish and Novgorodian spheres of influence agreed in 1323 AD (see²⁸).

The ancient mitochondrial genomes analyzed here show a notable pattern opposite to the modern variation: mtDNA types usually associated with the hunter-gatherer communities were significantly more common in the ancient western cluster (Levänluhta, Luistari and Hollola) than in the east (Hiitola, Tuukkala), with the haplogroup U frequency as high as 58.3%. In contrast, the farming-related lineages were observed in particular in the ancient eastern cluster. This pattern of division between the ancient sites, and the contradictions with their respective local modern population frequencies emerged also in formal testing of pairwise Φ_ST values: the western cluster was closer to the modern NE subpopulation than to the modern SW subpopulation whereas the eastern cluster showed closer affinity with the modern mtDNA variation in southwestern Finland.

Bidirectional expansion of agriculturally oriented populations into Finland?

Assuming that the haplogroup composition has correlated with the mode of subsistence, the pattern observed along the east-to-west transect suggests a bidirectional spread of agricultural human groups into Finland. Although there is evidence of sporadic small-scale cultivation in southeastern Finland already during the Neolithic Stone Age (c. 5300–4000 BC)^46,47, the start of agriculture in Finland has traditionally been associated with the Corded-Ware Culture (CWC) arriving across the Baltic Sea approximately 4700 years ago. Indeed, there are scattered findings of animal husbandry from the southwestern parts of the country starting from c. 2500 calBC⁴⁸, but in general archaeological evidence supporting a transition to agriculture as a consequence of the introduction of the Corded-Ware Culture is sparse (for discussion see⁴⁹). Some independent observations of animal domestication⁵⁰ and cultivation (see²² and references therein) are identifiable during the Bronze Age, but documentation still remains very limited. This suggests that cultivation was probably relatively uncommon and local for centuries, as little direct evidence exists for cereal cultivation in Finland prior to the Iron Age^22,49. Pollen records show a notable increase of cereals starting only at 100 AD and reaching a maximum as late as 1300 AD²², overlapping the time span of the Iron Age and Medieval sites presented in this study.

In support of the late introduction of farming populations into Finland, we do not see strong affinities of the western IAM to, for example, the CWC maternal gene pools from Estonia and Lithuania^6,12,13,14, suggesting either that the mtDNA gene flow between these two regions has been low or that shared mtDNA variation had dissolved before the Iron Age in Finland. Alternatively, the CWC expansion may have been largely male-driven, as suggested by⁵¹. However, we observe a strong Neolithic signal in the Iron-Age mtDNA pool in Eastern Finland, which rather suggests a southeastern/eastern arrival route of an agro-pastoralist population into the country. Interestingly, their maternal genetic legacy also corresponded to that of contemporary Finland, especially in the SW. We therefore propose that either there has been east-to-west directed gene flow during the Middle Ages, after the introduction of agricultural haplogroups into the east, or that the late change in the SW maternal gene pool may reflect recent immigration from more western/southern sources, such as the migration from Sweden during the Swedish reign in Finland (from the 1200s to 1809). The Iron Age saw high mobility around the Baltic Sea, as evidenced by the genetic and isotope analyses of human remains from the 10th to 12th centuries in Sigtuna, eastern Sweden⁵².

The multinomial logistic regression analysis lent support to the eastern introduction of agriculturally related maternal ancestry. The likely migration routes for the observed ancestral elements were investigated through different combinations of factor dependencies in the multinomial logistic regression. The test revealed the distance from Lahdenpohja on the eastern border of Finland as the only statistically significant variable explaining the differences in the haplogroup composition. Neither time scale nor distance from the southwestern locations (i.e., Hanko and Uusikaupunki) was supported by the best-fit model. It thus seems likely that the major spread of haplogroup H can be explained by presuming its introduction via the eastern land route. In accordance with the inference here, population genetic studies of many organisms in Finland as well as in all of Fennoscandia have suggested bidirectional colonization of the current habitats. The reasons behind this are largely geographical: the Baltic Sea acts, for most species, as a migration barrier.

The Neolithic farmer-related signal in the mtDNA diversity of the Iron-Age samples is mainly found in the southeast, whereas in the contemporary population it predominates in the southwest. The reasons for this discrepancy are likely diverse, and could include such recent events as the evacuation of nearly 0.5 million inhabitants of Karelia during World War II and their resettlement into the area of current Finland. However, these evacuees were resettled rather evenly across southern Finland and should not create the observed pattern. Rather, this suggests that the division between SW and NE Finland was still more substantial in the early 1900s. Another, more fundamental explanation for the genetic subdivision comes from the environmental demands of sedentary farming. In southwestern Finland the soil is more amenable to field farming and, owing to the warming Atlantic effect that gradually shades into a more arid continental climate, the growing season in Finland is longest on the southwestern coast. These environmental differences follow the NW-SE border similarly to the genetic distances. As the country has been sparsely inhabited until modern times, it is plausible that farming-oriented populations, in search of more favorable conditions, have over the centuries concentrated in the SW parts of the country.

The mitochondrial DNA genomes from Iron-Age Finland show variation that can be linked to either hunter-gatherer or agricultural human groups. These elements are still present in the mitochondrial gene pool of contemporary Finns but are relatively evenly distributed throughout the country. In contrast, the Iron-Age mtDNA variation shows significant differences between sampling sites, with hunter-gatherer and farmer-associated elements dominating in different regions than today. Rather surprisingly, the agricultural population signal was stronger in eastern Finland in the past, which might reflect a bidirectional arrival of farming-associated populations into Finland.

Materials and Methods

Sample selection

The human skeletal remains used in this study were collected from five archaeological sites and five historical cemeteries (for more detailed information on the sites and references to the original publications, see Supplementary Material S1). The archaeological sites include Levänluhta, used as a burial place from the Roman Iron Age until the end of the Merovingian Period (archaeological dating 300–800 AD); Luistari, consisting of graves from the Merovingian to the Crusade Period (archaeological dating 600–1200 AD); and Hollola, Hiitola and Tuukkala, largely Christian-style cemeteries spanning from the Crusade Period to the Early Middle Ages (archaeological datings 1050–1400 AD, 1200–1500 AD and 1200–1400 AD, respectively) (Fig. 1, Table 1 and Supplementary Material S1). The five historical sites include the cemetery of the Church of St. Michael in Pälkäne, the cemetery of the Church of St. Jacob in Renko, the Cathedral site in Porvoo, the Julin’s site in Turku, and the Ryazan regimental church cemetery in Hamina. Samples were obtained from the archaeological collections of the Finnish Heritage Agency, Department of Archaeology in the University of Turku, Department of Anatomy in the University of Helsinki, and the Peter the Great Museum of Anthropology and Ethnography (Kunstkamera), Russian Academy of Sciences.

Contextual archaeological evidence, such as grave goods and burial customs, together with radiocarbon analyses, was used to confirm the dating of each site and/or individual. Details of dating, sample sizes and number of genomes obtained are presented in Table 1. Detailed information on individual burial sites and samples is given in Supplementary Material S1 and Supplementary Table S1.

Reference populations used in comparative analyses

To evaluate possible changes in the Finnish mitochondrial gene pool during the past thousand years, samples were compared to HVR1 + HVR2 (16024–16385, 72–340) data from 832 modern Finns²⁷ for which the county-level geographical origin is known. In order to compare the mitochondrial profile of ancient Finns to other ancient nearby populations, haplogroup frequencies were collected from 31 ancient populations (Supplementary Table S8).
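The restriction of complete genomes to the HVR1 + HVR2 ranges quoted above can be illustrated with a minimal sketch. It assumes that each sequence is already in rCRS coordinates (the first character corresponds to rCRS position 1, with no unresolved indels); the function name `extract_hvr` is hypothetical and is not taken from the study.

```python
# rCRS coordinate ranges (1-based, inclusive) as quoted in the text.
HVR_RANGES = [(16024, 16385), (72, 340)]


def extract_hvr(seq, ranges=HVR_RANGES):
    """Concatenate the HVR1 and HVR2 segments of a mitochondrial sequence.

    Assumption: `seq` is in rCRS coordinates, i.e. seq[0] is rCRS position 1.
    """
    # Convert each 1-based inclusive range to a 0-based Python slice.
    return "".join(seq[start - 1:end] for start, end in ranges)
```

With these ranges the extracted segment is 362 + 269 = 631 positions long.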

Sampling

All samples were processed in dedicated aDNA facilities with regularly UV-irradiated and bleach-treated laminar hoods, inside a clean-room space. For the Levänluhta, Porvoo, Renko, Pälkäne and Hamina sites, sampling was conducted at the University of Tübingen, Germany from the start. For the Luistari, Hiitola, Hollola and Tuukkala sites, the bone powder was produced in a clean-room space for small-scale ancient DNA work at the Helsinki University Department of Forensic Medicine, and then transferred in plastic tubes to the Tübingen University facilities. All subsequent laboratory work with the bone powder was conducted at the facilities in Tübingen. A protective overall, facemask, hair net and two layers of disposable gloves were used at all times when handling the samples. Decontamination was carried out by ultraviolet light exposure of the plastic ware, reagents and samples, and by removal of the immediate surface at the point of sampling before drilling into the bone. Teeth were sawed in two, with the exception of the Levänluhta samples, where dentine was already exposed due to heavy fragmentation. Dental pulp and the surrounding dentine were used for DNA sampling, drilling into the crown or the root. For the petrous part of the temporal bone, a wedge was cut off to reveal the inner ear channels and the sample was taken from the inside of the channel, as previously described⁵³. A dentist’s drill together with cooled-down drill heads for minimal heat exposure was used at the University of Tübingen facility, and a “field kit” with a Dremel or dentist’s drill was used at the Helsinki University Department of Forensic Medicine.

Extraction of ancient DNA

The extraction was performed according to a modified version of the original protocol²⁹. For each sample, ~50 mg of bone powder was used for the extraction, and the DNA was eluted in 100 μl of TET (10 mM Tris-HCl, 1 mM EDTA pH 8.0, 0.1% Tween 20). The extracts were used to prepare DNA libraries of 20 μl, without additional treatments. To enable multiplex sequencing, the double-stranded library preparation and the subsequent indexing procedure were performed according to standard recommended protocols for ancient DNA^54,55. The original molecular copy number in the DNA library, as well as the subsequent indexing efficiency, was measured by qPCR, using AccuPrime Pfx polymerase. The molecular copy numbers in the pre-indexed libraries ranged from ~1–100 × 10⁸ copies/μl, and in the indexed libraries from ~1–100 × 10¹¹ copies/μl, indicating successful library preparation and acceptable indexing efficiency. The indexed libraries were amplified using PCR, with the number of heating cycles chosen individually per library according to the copy number after indexing. Amplified libraries were purified using MinElute spin columns with the standard protocol provided by the manufacturer (Qiagen). A qPCR, together with an Agilent Bioanalyzer 2100 device and a DNA 1000 LabChip, was subsequently used on the amplified libraries to measure the concentration of DNA as well as the fragment size distribution. A positive control extracted from a cave bear bone, to confirm the success of extraction and library preparation, as well as two negative laboratory controls, to measure the levels of contamination, were carried along for every batch of 10–16 samples.

Mitochondrial capture and sequencing

Mitochondrial genomes were obtained using a mitochondrial in-solution capture as described in Maricic et al.⁵⁶. The complete human mitochondrial DNA sequence was used to produce in-house made baits, which were ligated to adapters. The bait DNA was then purified and denatured to the single-stranded form, and attached to streptavidin-coated magnetic beads. Pools of 4–6 samples, combined in equal mass ratios for altogether 2 μg of DNA, were captured with the above-mentioned beads and sequenced on Illumina platforms: for samples from the sites Turku, Hiitola, Tuukkala and Hollola, along with four samples from the Luistari site (TU619, TU621, TU622, TU623), single-end sequencing data were produced on a HiSeq4000 run for 75 + 8 + 8 cycles, whereas for samples from all the other sites, including the rest of the samples from Luistari, paired-end data were produced with a NextSeq500 for 2 × 150 + 8 + 8 cycles, at the Max Planck Institute for the Science of Human History, Jena.
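The equal-mass pooling described above amounts to simple arithmetic, sketched here under stated assumptions. The function name `pooling_volumes`, the use of ng/μl as the concentration unit and any concentrations passed to it are hypothetical; only the pool size of 4–6 samples and the 2 μg total come from the text.

```python
def pooling_volumes(concentrations_ng_per_ul, total_mass_ng=2000.0):
    """Return the volume (in microlitres) of each library needed for an equal-mass pool.

    Assumption: concentrations are given in ng/ul; 2000 ng equals the 2 ug total
    stated in the text. Pool sizes outside 4-6 samples are rejected.
    """
    n = len(concentrations_ng_per_ul)
    if not 4 <= n <= 6:
        raise ValueError("pools are described as containing 4-6 samples")
    if any(c <= 0 for c in concentrations_ng_per_ul):
        raise ValueError("concentrations must be positive")
    # Every sample contributes the same mass to the pool.
    mass_each_ng = total_mass_ng / n
    return [mass_each_ng / c for c in concentrations_ng_per_ul]
```

For example, four libraries at 100, 200, 250 and 500 ng/μl would each contribute 500 ng, which corresponds to 5.0, 2.5, 2.0 and 1.0 μl.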

Processing of the sequence data

Raw-read sequencing data were processed using the EAGER pipeline for aDNA sequencing data⁵⁷. Reads were demultiplexed using both indices, and the adapters were clipped off with the AdapterRemoval program integrated in EAGER, with the minimum overlap set to 1 bp. Paired reads with a minimum overlap of 10 bp were merged, validated by the paired-end fragment compatibility. For the single-end-sequenced samples this step was omitted. A minimum read length of 30 bp and a base quality of 20 were required for all final reads. The reads were aligned to the complete human reference genome hg19 with the BWA mapping algorithm for the shotgun sequence data, whereas CircularMapper, a custom-made tool for circular genomes included in EAGER, was used to map the enriched mitochondrial reads to the human mitochondrial reference genome (revised Cambridge reference sequence, rCRS⁵⁸).
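Why a circular genome needs special handling during mapping can be shown with a toy sketch. A common approach is to append the first k bases of the reference to its end, so that reads spanning the origin align linearly, and then to fold the coordinates back onto the circle. The code below illustrates only this general idea on a made-up ten-base sequence; it is not the CircularMapper implementation, and both function names are hypothetical.

```python
def elongate_reference(ref, k):
    """Append the first k bases of a circular reference to its end."""
    return ref + ref[:k]


def to_circular_position(pos, ref_length):
    """Fold a 0-based position on the elongated reference back onto the circle."""
    return pos % ref_length


# Toy example: a read that spans the origin of a 10-base circular sequence.
toy_ref = "ACGTACGGTT"
toy_read = "TTAC"  # last two bases of the reference followed by the first two
elongated = elongate_reference(toy_ref, 3)
hit = elongated.find(toy_read)  # exact string match stands in for a real aligner
```

The read cannot be placed on the linear sequence, but is found at offset 8 of the elongated one, and its last base folds back to position 1 of the original coordinates.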

Authentication of ancient DNA and haplogroup assignments

In order to evaluate the authenticity of the reconstructed mitochondrial genome sequences, contamination rates, read-length distributions and deamination patterns at the 5′ and 3′ ends of DNA fragments were inspected for each sample with the program mapDamage⁵⁹ integrated in EAGER. Further contamination estimation was carried out with the Schmutzi³² and ContamMix³³ programs.
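The deamination signal inspected here is typically summarized as the frequency of C-to-T mismatches at each of the first positions from the 5′ end of the reads. The sketch below computes that frequency from gap-free (reference, read) pairs; it is a simplified illustration of the statistic, not the mapDamage code, and the function name and input format are assumptions.

```python
def c_to_t_frequency(pairs, n_positions=5):
    """Per-position C->T mismatch frequency at the 5' end of aligned reads.

    `pairs` is an iterable of (reference_segment, read) strings of equal length,
    both written 5'->3' and aligned without gaps (a simplifying assumption).
    Returns a list with one frequency per position, or None where no reference
    C was observed at that position.
    """
    c_count = [0] * n_positions
    t_count = [0] * n_positions
    for ref, read in pairs:
        for i in range(min(n_positions, len(ref), len(read))):
            if ref[i] == "C":
                c_count[i] += 1
                if read[i] == "T":
                    t_count[i] += 1
    return [t / c if c else None for t, c in zip(t_count, c_count)]
```

In authentic ancient DNA this frequency is expected to be elevated at the first position and to decay towards the interior of the read.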

The mitochondrial consensus-calling in Schmutzi was used to produce the consensus sequences of both the endogenous source and the most likely single contaminating source. Complete consensus sequences were called against the rCRS with a filter value of q20. For the Schmutzi-based consensus sequences a manual correction was performed for position 3107 to correspond to the “N” embedded in rCRS. The mtDNA haplogroups were determined using HaploGrep2⁶⁰ with respect to PhyloTree version 17⁶¹.
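The manual correction at position 3107 can be expressed as a small helper. This is a sketch that assumes the consensus string is in rCRS coordinates (first character is position 1); the function name is hypothetical, and the study describes the correction as manual rather than scripted.

```python
def set_rcrs_placeholder(consensus, position=3107, placeholder="N"):
    """Return the consensus with the rCRS placeholder restored at `position`.

    Assumption: `consensus` is in rCRS coordinates, so the 1-based position 3107
    corresponds to string index 3106.
    """
    index = position - 1
    if len(consensus) <= index:
        raise ValueError("consensus is shorter than the requested position")
    return consensus[:index] + placeholder + consensus[index + 1:]
```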

For contamination estimation in ContamMix, consensus sequences created by Schmutzi were combined with a reference dataset of 311 mitochondrial genomes from worldwide populations (provided by ContamMix) and aligned with mafft version 7.305^62,63. The untrimmed mitochondrial reads from the EAGER pipeline were then extracted from BAM files into FASTQ files and mapped back to the assembly. ContamMix then evaluates whether the reads are more likely to derive from their respective consensus or from one of the worldwide mitochondrial genomes, i.e., a possible contaminant source. ContamMix was run with trimming of seven bases from each end of the read to remove the accumulated damage typical for ancient DNA.
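The effect of trimming seven bases from each read end can be sketched as follows. This is a stand-alone illustration of the trimming step, not the option handling inside ContamMix; the function name and the convention of returning empty strings for reads that are too short are assumptions.

```python
def trim_read(seq, qual, n=7):
    """Remove n bases (and their quality values) from both ends of a read.

    Reads of length <= 2 * n have nothing left after trimming and are returned
    as a pair of empty strings.
    """
    if len(seq) != len(qual):
        raise ValueError("sequence and quality strings differ in length")
    if len(seq) <= 2 * n:
        return "", ""
    end = len(seq) - n
    return seq[n:end], qual[n:end]
```

Because deamination damage concentrates at the fragment ends, removing these bases reduces the chance that damage-derived mismatches are mistaken for contaminant alleles.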

The mitochondrial genome sequences with highest ContamMix estimates were further visually inspected in Geneious 11.0.3 (www.geneious.com). In this inspection, the majority call support for relevant diagnostic mutations against the rCRS reference genome was compared to the PhyloTree version 17. We applied the automated variant caller in Geneious to the alignments with minimum support of 3x coverage and variant frequency of 66.6% for diagnostic SNPs to confirm the authenticity of the haplogroup assignments.
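The two thresholds applied in this inspection (at least 3x coverage and a variant frequency of at least 66.6%) can be written as a simple decision rule. The sketch below is an illustration of those thresholds only; it does not reproduce the Geneious variant caller, and the function name and input format (a string of observed bases at one site) are assumptions.

```python
from collections import Counter


def call_variant(bases, ref_base, min_coverage=3, min_frequency=0.666):
    """Return the supported non-reference base at a site, or None.

    `bases` holds the read bases observed at one position. A variant is reported
    only if coverage reaches `min_coverage` and the most common base differs from
    the reference and reaches `min_frequency`.
    """
    coverage = len(bases)
    if coverage < min_coverage:
        return None
    base, count = Counter(bases).most_common(1)[0]
    if base != ref_base and count / coverage >= min_frequency:
        return base
    return None
```

With three reads, two supporting reads (2/3 ≈ 66.7%) are just enough to pass the frequency threshold.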

Radiocarbon dating

Radiocarbon dates were produced by the Laboratory of Chronology, Finnish Museum of Natural History Luomus in Helsinki, Finland (Hela) and the Klaus-Tschira C14-laboratory in Mannheim, Germany (MAMS). Bone collagen was extracted with the modified Longin method^64,65, and the collagen samples were combusted, graphitized and measured using Accelerator Mass Spectrometry (AMS). The results are provided as conventional radiocarbon dates without potential reservoir effect corrections. The radiocarbon dates were calibrated using the OxCal program version 4.3⁶⁶, with IntCal13 as the calibration curve⁶⁷. For sites with four or more individuals with ¹⁴C dates, the boundaries for the start and end of the phase were also determined with OxCal 4.3. An example of the OxCal code is given in Supplementary Material S2. The timescale discussed throughout the text is defined in calendar years.

Statistical analyses

A summary of statistical analyses performed in this study for different sets of populations and dataset (i.e., complete sequence, HVR1 + HVR2, haplogroup frequencies) is presented in Table 2.

Table 2 Overview of statistical analyses and datasets used in this study.


For the statistical analysis, sequences were aligned with Muscle v3.8.31⁶⁸. To explore the genetic diversity within ancient populations, basic diversity indices such as haplotype diversity, mean pairwise distance and nucleotide diversity were calculated for the complete mitochondrial genomes with Arlequin 3.5.2.2⁶⁹. To determine genetic distances between ancient and modern Finns, pairwise Φ_ST values based on the sequence data were calculated with Arlequin 3.5.2.2. Genetic distances were calculated from HVR1 + HVR2, as the contemporary reference data with detailed geographical origin were restricted to the HVR regions only^27,28. To estimate the significance of the Φ_ST values, permutation tests with 10,000 permutations were used. The Φ_ST values were visualized with the R heatmap.2 function, with hierarchical clustering based on the Euclidean distance. The best-fit models for the different datasets were estimated with jModelTest⁷⁰. The substitution model used was Tamura & Nei⁷¹ with gamma correction (shape parameter α = 0.67) for the complete sequence data and Tamura & Nei with gamma correction (α = 0.44) for the HVR1 + HVR2 data. The poly-C region (positions 309–315), AC indels (positions 515–522) and the mutational hotspot at position 16519 were masked for the population-level analysis. Further, to evaluate the relation of ancient individuals to contemporary Finns on a haplotype level, median-joining network analysis⁷² was performed with PopArt⁷³. Positions bearing more than 5% of missing data were masked for the network analysis.
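Two of the steps above lend themselves to a short illustration: masking the listed positions and computing haplotype diversity. The sketch assumes sequences in rCRS coordinates and uses the standard unbiased estimator of gene diversity, H = n/(n − 1) · (1 − Σ pᵢ²); it is assumed, not confirmed by the text, that this corresponds to the index reported by Arlequin. Function names are hypothetical.

```python
from collections import Counter

# Positions masked for the population-level analysis (1-based rCRS coordinates):
# poly-C region 309-315, AC indels 515-522 and the hotspot 16519.
MASKED_POSITIONS = set(range(309, 316)) | set(range(515, 523)) | {16519}


def mask_positions(seq, positions=MASKED_POSITIONS, mask="N"):
    """Replace the bases at the given 1-based positions with `mask`."""
    return "".join(mask if i in positions else base
                   for i, base in enumerate(seq, start=1))


def haplotype_diversity(haplotypes):
    """Unbiased haplotype (gene) diversity: n / (n - 1) * (1 - sum of p_i squared)."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two haplotypes are required")
    freqs = [count / n for count in Counter(haplotypes).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))
```

Masking before comparison prevents length variation in the poly-C and AC-repeat tracts, and the recurrent mutation at 16519, from splitting otherwise identical haplotypes.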

To statistically test the impact of possible factors affecting the spatial and temporal distribution of haplogroups U and H in our ancient samples, we conducted a multinomial logistic regression analysis. Each individual was considered as a separate occurrence, and haplogroups ‘U’, ‘H’ and ‘Others’ were set as the categories of the dependent variable. Because the main interest was to estimate the impact of time and geography on the occurrence of the haplogroups, the following independent variables were chosen: (1) median age of the sample (or the mean age of the site if radiocarbon dates were not available), (2) the site’s distance (in km) from the southern reference point Hanko, (3) the site’s distance (in km) from the western reference point Uusikaupunki and (4) the site’s distance (in km) from the eastern reference point Lahdenpohja (see Fig. 1). These geographical points were chosen to represent the most plausible entry points of different migration routes to the study area: south (Hanko on the southern coast), west (Uusikaupunki on the western coast) and east/south-east (Lahdenpohja). The last-mentioned represents the eastern migration route both along the Karelian Isthmus and north of Lake Ladoga, strongly supported by archaeological evidence²¹. For the multinomial logistic regression, a stepwise forward method was used with an entry probability of 0.05, and probability was tested with likelihood ratios. To evaluate the impact of grouping the haplogroups into hunter-gatherer and farmer-related haplogroups, the categories ‘U + V’, ‘H + J + K + T’ and ‘Others’ were also tested. In addition, the categories ‘U’, ‘H + J + K + T’ and ‘Others’ were tested, due to the controversial definition of haplogroup V (see Section 2.7). Analyses were performed with IBM SPSS version 25 (IBM Corp. Released 2017, IBM SPSS Statistics for Windows, Version 25.0 Armonk, NY: IBM Corp.).
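The distance covariates used in the regression can be illustrated with a short sketch. The text does not state how distances were measured, so the great-circle (haversine) distance used here is an assumption, and the coordinates of the three reference points are approximate values supplied only for illustration. The regression itself was run in SPSS and is not reproduced.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius

# Approximate (latitude, longitude) in decimal degrees; illustrative assumptions only.
REFERENCE_POINTS = {
    "Hanko": (59.82, 22.97),
    "Uusikaupunki": (60.80, 21.41),
    "Lahdenpohja": (61.52, 30.20),
}


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1 = math.radians(a[0]), math.radians(a[1])
    lat2, lon2 = math.radians(b[0]), math.radians(b[1])
    dlat, dlon = lat2 - lat1, lon2 - lon1
    h = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def distances_to_reference_points(site, points=REFERENCE_POINTS):
    """Return the distance in km from one site to every reference point."""
    return {name: haversine_km(site, coords) for name, coords in points.items()}
```

Each site then contributes three distance values, which, together with the sample age, form the four independent variables described above.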

To trace genetic affinities between ancient Finns and other ancient populations, we visualized haplogroup composition of each site (based on haplogroup frequencies) using principal component analysis (PCA). PCA was computed using MATLAB and Statistical Toolbox Release 2015b (The MathWorks, Inc., Natick, Massachusetts, United States). Populations and haplogroup frequencies used are presented in Supplementary Table S8.
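The principle of a PCA on haplogroup frequencies can be sketched in plain Python: centre each haplogroup column, build the covariance matrix and find the leading axis by power iteration. This illustrates the method in general and is not the MATLAB code used in the study; only the first component is computed, and all function names are hypothetical.

```python
import math


def center_columns(rows):
    """Subtract the column mean from every entry (rows = populations)."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    return [[x - m for x, m in zip(row, means)] for row in rows]


def covariance_matrix(centered):
    """Sample covariance matrix of already centred data."""
    n, d = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def first_component(cov, iterations=500):
    """Leading eigenvector of `cov` by power iteration.

    The start vector is deliberately non-uniform: frequencies sum to 1 per row,
    so the all-ones vector lies in the null space of the covariance matrix.
    """
    d = len(cov)
    v = [float(i + 1) for i in range(d)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0:
            break
        v = [x / norm for x in w]
    return v


def pc1_scores(rows):
    """Project each population onto the first principal component."""
    centered = center_columns(rows)
    axis = first_component(covariance_matrix(centered))
    return [sum(x * a for x, a in zip(row, axis)) for row in centered]
```

Populations with similar haplogroup compositions receive similar scores, which is what places them close together in a PCA plot.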

Data Availability

Complete mitochondrial sequences will be deposited in GenBank under accession numbers MN540463-MN540565.

References

Posth, C. et al. Pleistocene mitochondrial genomes suggest a single major dispersal of non-Africans and a Late Glacial population turnover in Europe. Current Biology 26, 827–833 (2016).
Fu, Q. et al. An early modern human from Romania with a recent Neanderthal ancestor. Nature 524, 216 (2015).
Bramanti, B. et al. Genetic discontinuity between local hunter-gatherers and central Europe’s first farmers. Science 326, 137–140 (2009).
Haak, W. et al. Ancient DNA from the first European farmers in 7500-year-old Neolithic sites. Science 310, 1016–1018 (2005).
Haak, W. et al. Ancient DNA from European early neolithic farmers reveals their near eastern affinities. PLoS Biology 8, e1000536 (2010).
Allentoft, M. E. et al. Population genomics of Bronze Age Eurasia. Nature 522, 167–172 (2015).
Haak, W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522, 207–211 (2015).
Stolarek, I. et al. A mosaic genetic structure of the human population living in the South Baltic region during the Iron Age. Scientific Reports 8, 2455 (2018).
Kashuba, N. et al. Ancient DNA from mastics solidifies connection between material culture and genetics of mesolithic hunter–gatherers in Scandinavia. Communications Biology 2, 185 (2019).
Günther, T. et al. Population genomics of Mesolithic Scandinavia: Investigating early postglacial migration routes and high-latitude adaptation. PLoS Biology 16, e2003703 (2018).
Der Sarkissian, C. et al. Ancient DNA reveals prehistoric gene-flow from Siberia in the complex human population history of North East Europe. PLoS Genetics 9, e1003296 (2013).
Jones, E. R. et al. The Neolithic transition in the Baltic was not driven by admixture with early European farmers. Current Biology 27, 576–582 (2017).
Saag, L. et al. Extensive farming in Estonia started through a sex-biased migration from the Steppe. Current Biology 27, 2185–2193.e6 (2017).
Mittnik, A. et al. The genetic prehistory of the Baltic Sea region. Nature communications 9, 442 (2018).
Ahola, M., Salo, K. & Mannermaa, K. Almost Gone: Human Skeletal Material from Finnish Stone Age Earth Graves. Fennoscandia Archaeologica 33, 95–122 (2016).
Carpelan, C. Käännekohtia Suomen esihistoriassa aikavälillä 5100–1000 eKr. Pohjan poluilla. Suomalaisten juuret nykytutkimuksen mukaan. Bidrag till kännedom av Finlands natur och folk 153, 249–280 (1999).
Tallavaara, M., Pesonen, P. & Oinonen, M. Prehistoric population history in eastern Fennoscandia. Journal of Archaeological Science 37, 251–260 (2010).
Lamnidis, T. C. et al. Ancient Fennoscandian genomes reveal origin and spread of Siberian ancestry in Europe. Nature Communications 9, 5018 (2018).
Sikora, M. et al. The population history of northeastern Siberia since the Pleistocene. Nature 570, 182–188 (2019).
Pesonen, P., Oinonen, M., Carpelan, C. & Onkamo, P. Early Subneolithic ceramic sequences in eastern Fennoscandia—a Bayesian approach. Radiocarbon 54, 661–676 (2012).
Haggren, G., Halinen, P., Lavento, M., Raninen, S. & Wessman, A. In Muinaisuutemme jäljet: Suomen esi-ja varhaishistoria kivikaudelta keskiajalle (Gaudeamus, 2015).
Lahtinen, M., Oinonen, M., Tallavaara, M., Walker, J. W. & Rowley-Conwy, P. The advance of cultivation at its northern European limit: Process or event? The Holocene 27, 427–438 (2017).
Lao, O. et al. Correlation between genetic and geographic structure in Europe. Current Biology 18, 1241–1248 (2008).
Salmela, E. et al. Genome-wide analysis of single nucleotide polymorphisms uncovers population structure in Northern Europe. PloS One 3, e3519 (2008).
Sajantila, A. et al. Paternal and maternal DNA lineages reveal a bottleneck in the founding of the Finnish population. Proc. Natl. Acad. Sci. USA 93, 12035–12039 (1996).
Lappalainen, T. et al. Regional differences among the finns: A Y-chromosomal perspective. Gene 376, 207–215 (2006).
Palo, J. U., Ulmanen, I., Lukka, M., Ellonen, P. & Sajantila, A. Genetic markers and population history: Finland revisited. Eur. J. Hum. Genet. 17, 1336–1346 (2009).
Neuvonen, A. M. et al. Vestiges of an Ancient Border in the Contemporary Genetic Diversity of North-Eastern Europe. PLoS One 10, 1–19 (2015).
Dabney, J. et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl. Acad. Sci. USA 110, 15758–15763 (2013).
Sawyer, S., Krause, J., Guschanski, K., Savolainen, V. & Pääbo, S. Temporal patterns of nucleotide misincorporations and DNA fragmentation in ancient DNA. PloS One 7, e34131 (2012).
Kistler, L., Ware, R., Smith, O., Collins, M. & Allaby, R. G. A new model for ancient DNA decay based on paleogenomic meta-analysis. Nucleic Acids Res. 45, 6310–6320 (2017).
Renaud, G., Slon, V., Duggan, A. T. & Kelso, J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biol. 16, 224 (2015).
Fu, Q. et al. A revised timescale for human evolution based on ancient mitochondrial genomes. Current Biology 23(7), 553–559 (2013).
Tambets, K. et al. The western and eastern roots of the Saami—the story of genetic “outliers” told by mitochondrial DNA and Y chromosomes. The American Journal of Human Genetics 74, 661–682 (2004).
Översti, S. et al. Identification and analysis of mtDNA genomes attributed to Finns reveal long-stagnant demographic trends obscured in the total diversity. Scientific Reports 7, 6193 (2017).
Bermisheva, M., Tambets, K., Villems, R. & Khusnutdinova, E. Diversity of mitochondrial DNA haplotypes in ethnic populations of the Volga-Ural region of Russia. Molecular Biology (Mosk) 36, 990–1001 (2002).
Lappalainen, T. et al. Migration waves to the Baltic Sea region. Ann. Hum. Genet. 72, 337–348 (2008).
Ingman, M. & Gyllensten, U. A recent genetic link between Sami and the Volga-Ural region of Russia. European Journal of Human Genetics 15, 115–120 (2007).
Malmström, H. et al. Ancient DNA reveals lack of continuity between neolithic hunter-gatherers and contemporary Scandinavians. Current Biology 19, 1758–1762 (2009).
Skoglund, P. et al. Origins and genetic legacy of Neolithic farmers and hunter-gatherers in Europe. Science 336, 466–469 (2012).
Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409–413 (2014).
Malmström, H. et al. Ancient mitochondrial DNA from the northern fringe of the Neolithic farming expansion in Europe sheds light on the dispersion process. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 370, 20130373 (2015).
Skoglund, P. et al. Genomic diversity and admixture differs for Stone-Age Scandinavian foragers and farmers. Science 344, 747–750 (2014).
Mathieson, I. et al. The genomic history of southeastern Europe. Nature 555, 197 (2018).
Norio, R. Finnish disease heritage I. Hum. Genet. 112, 441–456 (2003).
Alenius, T., Mökkönen, T. & Lahelma, A. Early Farming in the Northern Boreal Zone: Reassessing the History of Land Use in Southeastern Finland through High‐Resolution Pollen Analysis. Geoarchaeology 28, 1–24 (2013).
Herva, V., Mökkönen, T. & Nordqvist, K. A northern Neolithic? Clay work, cultivation and cultural transformations in the boreal zone of north‐eastern Europe, c. 5300–3000 bc. Oxford Journal of Archaeology 36, 25–41 (2017).
Cramp, L. J. et al. Neolithic dairy farming at the extreme of agriculture in northern Europe. Proceedings of the Royal Society B: Biological Sciences 281, 20140819 (2014).
Lahtinen, M. & Rowley-Conwy, P. Early farming in Finland: was there cultivation before the Iron Age (500 BC)? European Journal of Archaeology 16, 660–684 (2013).
Bläuer, A. & Kantanen, J. Transition from hunting to animal husbandry in Southern, Western and Eastern Finland: new dated osteological evidence. Journal of Archaeological Science 40, 1646–1666 (2013).
Goldberg, A., Günther, T., Rosenberg, N. A. & Jakobsson, M. Ancient X chromosomes reveal contrasting sex bias in Neolithic and Bronze Age Eurasian migrations. Proceedings of the National Academy of Sciences, 201616392 (2017).
Krzewińska, M. et al. Genomic and Strontium Isotope Variation Reveal Immigration Patterns in a Viking Age Town. Current Biology 28, 2730–2738.e10 (2018).
Pinhasi, R. et al. Optimal ancient DNA yields from the inner ear part of the human petrous bone. PloS One 10, e0129102 (2015).
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb Protoc. 2010, pdb.prot5448 (2010).
Kircher, M. In Ancient DNA 197–228 (Springer, 2012).
Maricic, T., Whitten, M. & Pääbo, S. Multiplexed DNA sequence capture of mitochondrial genomes using PCR products. PloS One 5, e14004 (2010).
Peltzer, A. et al. EAGER: efficient ancient genome reconstruction. Genome Biology 17, 60 (2016).
Andrews, R. M. et al. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nature Genetics 23, 147–147 (1999).
Ginolhac, A., Rasmussen, M., Gilbert, M. T. P., Willerslev, E. & Orlando, L. mapDamage: testing for damage patterns in ancient DNA sequences. Bioinformatics 27, 2153–2155 (2011).
Weissensteiner, H. et al. HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic Acids Res. (2016).
Van Oven, M. & Kayser, M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Human Mutation 30, E386–E394 (2009).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution 30, 772–780 (2013).
Katoh, K. et al. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research 30, 3059–3066 (2002).
Longin, R. New method of collagen extraction for radiocarbon dating. Nature 230, 241 (1971).
Bocherens, H. et al. Paleobiological implications of the isotopic signatures (¹³C, ¹⁵N) of fossil mammal collagen in Scladina Cave (Sclayn, Belgium). Quatern. Res. 48, 370–380 (1997).
Ramsey, C. B. Bayesian analysis of radiocarbon dates. Radiocarbon 51, 337–360 (2009).
Reimer, P. J. et al. Selection and treatment of data for radiocarbon calibration: an update to the International Calibration (IntCal) criteria. Radiocarbon 55, 1923–1945 (2013).
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32, 1792–1797 (2004).
Excoffier, L. & Lischer, H. E. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Molecular Ecology Resources 10, 564–567 (2010).
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nature Methods 9, 772 (2012).
Tamura, K. & Nei, M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Molecular Biology and Evolution 10, 512–526 (1993).
Bandelt, H., Forster, P. & Röhl, A. Median-joining networks for inferring intraspecific phylogenies. Molecular Biology and Evolution 16(1), 37–48 (1999).
Leigh, J. W. & Bryant, D. PopART: Full-feature software for haplotype network construction. Methods in Ecology and Evolution 6(9), 1110–1116 (2015).
Talve, I. Finnish Folk Culture. Finnish Literature Society; English edition. pp. 349 (2000)

Acknowledgements

We are grateful to the Levänluhta project and the Elämän historiat hampaissa project for providing us with samples from Levänluhta and Luistari, respectively. For the Julin’s site samples we would like to thank Benito Casagrande, Liisa Seppänen, Sirkku Pihlman, Heikki Vuorinen and Juha Varrela. For the samples from Renko and Porvoo, Professor Helena Ranta and the University of Helsinki are acknowledged. We would also like to thank Markku Niskanen for providing us with the ¹⁴C dating for one Renko individual (Renko H29, JK1927). Ella Reiter and Shweta Venkatakrishnan are thanked for technical assistance. This work was funded by the Finnish Cultural Foundation (S.Ö.), the Academy of Finland (grant no. 133056, S.Ö., P.O., M.P.), the Kone Foundation (S.Ö., K.M., E.S., L.A., H.E.-S.), the Emil Aaltonen Foundation (K.M., A.W., H.E.-S.), the Jane and Aatos Erkko Foundation (K.M., E.S.), the Ella and Georg Ehrnrooth Foundation (E.S.), the Jenny and Antti Wihuri Foundation (E.S.), the Finnish Foundations’ Professor Pool Grant (Paulo Foundation, A.S.), the University of Zurich’s University Research Priority Program “Evolution in Action: From Genomes to Ecosystems” (V.J.S.) and the Mäxi Foundation Zurich (V.J.S.). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author information

Sanni Översti and Kerttu Majander contributed equally.
Johannes Krause, Jukka U. Palo and Päivi Onkamo jointly supervised this work.

Authors and Affiliations

Department of Biosciences, University of Helsinki, Helsinki, Finland
Sanni Översti, Kerttu Majander, Elina Salmela & Päivi Onkamo
Department of Archaeogenetics, Max Planck Institute for the Science of Human History, Jena, Germany
Kerttu Majander, Elina Salmela, Wolfgang Haak & Johannes Krause
Institute for Archaeological Sciences, Archaeo- and Palaeogenetics, University of Tübingen, Tübingen, Germany
Kerttu Majander, Saskia Pfrengle & Verena J. Schuenemann
Department of Cultures, University of Helsinki, Helsinki, Finland
Kati Salo & Anna Wessman
Laboratory of Chronology, Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland
Laura Arppe, Heli Etu-Sihvola & Markku Oinonen
Peter the Great Museum of World Anthropology and Ethnography (Kunstkamera), Russian Academy of Sciences, St. Petersburg, Russia
Stanislav Belskiy
Department of Archaeology, University of Turku, Turku, Finland
Ville Laakso, Jussi-Pekka Taavitsainen & Anna Wessman
Finnish Heritage Agency, Helsinki, Finland
Esa Mikkola & Katja Vuoristo
Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
Mikko Putkonen, Antti Sajantila & Jukka U. Palo
Institute of Evolutionary Medicine, University of Zürich, Zürich, Switzerland
Verena J. Schuenemann
Forensic Genetics Unit, National Institute for Health and Welfare, Helsinki, Finland
Jukka U. Palo
Department of Biology, University of Turku, Turku, Finland
Päivi Onkamo

Contributions

K.S., L.A., S.B., H.E.-S., V.L., E.M., J.-P.T., K.V., A.W. provided archaeological material and related information. K.M., K.S., S.Ö., M.P. performed the sampling. K.M., S.Ö., S.P. performed the laboratory work. K.M., S.Ö. processed the sequence reads and generated the mtDNA genotypes. S.Ö., J.U.P., E.S. performed the statistical analyses. M.O., L.A., H.E.-S. performed the radiocarbon dating analysis. S.Ö., M.O. post-processed the ¹⁴C datings (i.e. phase start and end boundaries). S.Ö., J.U.P., K.M., P.O., E.S., K.S. wrote the manuscript with input from all co-authors. P.O., J.K., J.U.P., V.J.S., W.H., A.S. supervised the study.

Corresponding author

Correspondence to Sanni Översti.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials and Figures

Supplementary Tables

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Översti, S., Majander, K., Salmela, E. et al. Human mitochondrial DNA lineages in Iron-Age Fennoscandia suggest incipient admixture and eastern introduction of farming-related maternal ancestry. Sci Rep 9, 16883 (2019). https://doi.org/10.1038/s41598-019-51045-8

Received: 03 April 2019
Accepted: 19 September 2019
Published: 15 November 2019
DOI: https://doi.org/10.1038/s41598-019-51045-8
