We set out to identify the origins of the Árpád Dynasty based on genome sequencing of DNA derived from the skeletal remains of Hungarian King Béla III (1172–1196) and eight additional individuals (six males, two females) originally interred at the Royal Basilica of Székesfehérvár. Y-chromosome analysis established that two individuals, Béla III and HU52 assign to haplogroups R-Z2125 whose distribution centres near South Central Asia with subsidiary expansions in the regions of modern Iran, the Volga Ural region and the Caucasus. Out of a cohort of 4340 individuals from these geographic areas, we acquired whole-genome data from 208 individuals derived for the R-Z2123 haplogroup. From these data we have established that the closest living kin of the Árpád Dynasty are R-SUR51 derived modern day Bashkirs predominantly from the Burzyansky and Abzelilovsky districts of Bashkortostan in the Russian Federation. Our analysis also reveals the existence of SNPs defining a novel Árpád Dynasty specific haplogroup R-ARP. Framed within the context of a high resolution R-Z2123 phylogeny, the ancestry of the first Hungarian royal dynasty traces to the region centering near Northern Afghanistan about 4500 years ago and identifies the Bashkirs as their closest kin, with a separation date between the two populations at the beginning of the first millennium CE.
The Árpád Dynasty (ca. 850–1301 CE) established the Hungarian state in the Carpathian Basin and played a formative role in Eastern European history. The Dynasty was founded by Prince Álmos (ca. 820 CE–ca. 894 CE), but in the modern historiography got its name from his son, Prince Árpád (ca. 845 CE–ca. 907 CE) who ruled between ca. 894 CE and ca. 907 CE and lead the Hungarians into the Carpathian basin between 862 CE and 896 CE . Although partial mitochondrial and Y-chromosome analyses of remains from cemeteries of conquering Hungarian nobility have been performed, the ethnic origins of the Árpád Dynasty are subject to scientific debate in the absence of genetic evidence [2,3,4,5,6]. Historical sources indicate that ten Árpáds, eight kings, and two princes, were laid to rest in the provostry church of the Virgin Mary, commonly known as the Royal Basilica of Székesfehérvár, before the Turkish occupation of that city in 1543 [7,8,9]. During the following centuries of war and neglect, the Basilica was destroyed, and the only royal graves left undisturbed were those of King Béla III (ca. 1148–1196) and his first spouse, Anna of Antioch (ca. 1150–1184/85). The royal remains were discovered in 1848 by the leading archaeologist and historian of the time, János Érdy, a member of the Hungarian Academy and curator of the National Museum in Budapest . The royal regalia unearthed with the remains of Béla III and Anna of Antioch are in the possession of the National Museum of Hungary in Budapest. The remains were reinterred first in the undercroft of Matthias Church of Buda in 1862 and later in the Holy Trinity Chapel of the church in 1898. Additional remains from the site of the Royal Basilica of Székesfehérvár were excavated in 1862 and 1874 and eight of these were placed in the Matthias Church in 1900 .
Y-chromosomal Short Tandem Repeat (STR) analysis of King Béla III and six other unidentified male skeletons indicated that in addition to Béla III, only one of the remains, marked HU52, belongs to the Árpád Dynasty .
Developments in next-generation sequencing (NGS) during the past decade have made it possible to economically sequence the mitochondrial and Y chromosomes in large number of individuals . Two pivotal studies [13, 14] sequenced 456 and 1244 globally diverse samples, respectively, leading to the identification of over 65,000 Y-chromosome SNPs. Since these papers were published, the number of known Y-chromosomal SNPs has tripled allowing high-resolution analysis of patrilineal relationships . We undertook NGS of the DNA derived from the remains studied by Olasz et al.  and determined the phylogenetic origins and closest kinship of the Árpád Dynasty based on shared Y chromosome haplogroup derivation in the context of 40 Eurasian populations.
Materials and methods
The ancient remains examined in this study are under the legal guardianship of the Hungarian Catholic Church. Péter Erdő, Archbishop of Esztergom-Budapest, provided written consent to perform genetic analysis on the remains that was sanctioned by the Hungarian Ministry of Human resources (EMMI) under project number 26090/2019/MINKABINET.
Sample collection was performed at the National Institute of Oncology in Budapest, Hungary and DNA extraction of the nine ancient skeletal remains was performed in the laboratory of Susanna Hummel at the Department of Historical Anthropology and Human Ecology of the Johann-Friedrich-Blumenbach Institute for Zoology and Anthropology . Next-generation sequencing (NGS) and primary data analysis were performed at Praxis Genomics LLC (Atlanta, GA) and the results were verified by reanalysis of all data at the Department of Archaeogenetics, Institute of Hungarian Research, Budapest, Hungary.
Further technical detail about the NGS process and data analysis is provided in the “Library preparation and NGS” and “Data analysis” sections. All handling of DNA samples were under strict observance of CLIA and CAP guidelines.
4340 modern samples from 40 different populations were genotyped for the Z2125 and Z2123 SNPs using Sanger sequencing (Fig. 1, Table S1). 206 samples belonging to the R-Z2123 haplogroup from 20 populations were paired-end sequenced (2 × 151 bp) using Novaseq 6000 (Illumina) (Fig. 1, Table S1). Use of these samples for this study was approved by the Ethics Committee of the Ufa Federal research Centre of the Russian Academy of Sciences; document number: MKI-F/321-1/2019. Further technical detail is provided in the “Library preparation and NGS” section. All subjects were voluntary participants and gave written consent to use their samples for genetic analysis. In addition, two published Y-chromosome sequencing datasets corresponding to an Iraqi and an “Iraqi Jew”, were downloaded from the European Nucleotide Archive (http://www.ebi.ac.uk/ena) under the accession number PRJEB21310 and were used in the analysis .
Library preparation and next-generation sequencing
Modern DNA samples were sonicated using Covaris S220 Ultrasonicator (Woburn, MA) to yield fragments with a median fragment length of 300 bp according to the manufacturer’s recommendations. For ancient DNA samples, no sonication was necessary. Low-molecular weight DNA (<300 bp) enrichment from all samples was performed using AMPure XP beads (Beckman Coulter, Indianapolis, IN). Library preparation was performed using the TruSeq Nano DNA LT kit (Illumina, San Diego, CA) according to the manufacturer’s recommendations. Library size and quality was confirmed with Fragment Analyzer (Advanced Analytical, Santa Clara, CA) and quantitation was done using qPCR (S1000/CFX96 Real Time System; Biorad, Hercules, CA) Paired-end sequencing (2 × 125 bp; and 2 × 151 bp) was performed on HiSeq 2500 and NovaSeq 6000 Systems (Illumina) following the manufacturer’s recommendations.
Initial analysis of both ancient and modern datasets was performed at Praxis Genomics LLC (Atlanta, GA). Demultiplexing of NGS runs was performed in BaseSpace (Illumina, San Diego) . Adapter and quality trimming was performed on the ancient samples by Trim Galore (Babraham Bioinformatics) . Reads with length under 40 bp after quality and adapter trimming were removed and only properly paired read data were retained and aligned to the GRCh38 reference by Dragen v.3.2.5 (Illumina) using default settings. Variant calling was also performed using Dragen. The Integrative Genome Viewer [19, 20] was used for visual confirmation of calls made.
The conclusions of the Dragen alignment and variant calling were corroborated by reanalysis of the ancient datasets at the Department of Archaeogenetics, Institute of Hungarian Research, Budapest, Hungary. The ancient datasets were aligned to both GRCh37 (hs37d5) and GRCh38 using Burrow-Wheels-Aligner (v 0.7.17) using the MEM command with reseeding disabled. PICARD tools were used to mark duplicates and only properly paired primary alignments with ≥90% identity to reference were considered in all downstream analyses.
Ancient DNA damage patterns were assessed using MapDamage 2.0  and read quality scores were modified with the Rescale option to account for post-mortem damage. Sex determination was performed according to the method described in Skoglund et al. . Mitochondrial genome contamination was estimated using Schmutzi algorithm  (Table 1). Contamination for the male samples was assessed by the ANGSD X chromosome contamination method .
Mitochondrial Haplogroup determination was performed using HaploGrep  (Tables 2, and S2). Y chromosome haplogroup determination was initially performed using Yleaf . SNPs described in FamilyTreeDNA Y-DNA Haplotree and ISOGG SNP databases [27, 28] were used to further interrogate genomic datasets using bam-readcount program . Bam readcount provides depth of coverage, nucleotide composition, and read quality at the specified chromosomal positions. This approach allowed us to further derive the Y-leaf generated haplogroups, and to identify a novel haplogroup defining the Árpád Dynasty (R-ARP). We used Clustal Omega Version 1.2.4  to reconstruct the R-Z2123 haplogroup phylogeny and calculate coalescence times between derived haplogroups. Only SNPs that fall within the ~10 Mb region of the Y-chromosome described by Poznik et al.  were used in the calculation of coalescence times. Drawing of the phylogenetic tree was performed using FigTree v1.4.4 .
The BAM files of the mitochondrial and Y chromosome sequences for all samples presented in this paper are available at https://www.ncbi.nlm.nih.gov/bioproject/PRJNA490697: Determination of the phylogenetic origins of the Árpád Dynasty based on Y-chromosome sequencing of Béla the Third.
Next-generation sequencing data
We have performed NGS on nine ancient DNA samples. Unique reads per sample, endogenous DNA content, X and Y chromosome, as well as autosomal coverage varied widely between samples (Tables 1 and S3). X and Y chromosome coverage was calculated for the entire length of the chromsomes. Y and X chromosome read ratios allowed unequivocal assignment of chromosomal sex in all samples (Table S3b) . Mitochondrial genomes for all individuals had >100-fold average coverage except sample HUAA (Anna of Antioch), for whom we only obtained 23-fold average coverage. The estimated X contamination based on the ANGSD X chromosome contamination assessment tool varied from 0.3 to 1.34% , except for one sample, HU53, in which it was considerably higher (11.94%). The estimated mitochondrial contamination determined by Schmutzi varied from 1 to 2% for most samples, but was at 5% in HU53 (Table 1). The overall amount and quality of the data allowed precise derivation of mitochondrial and Y chromosomal haplogroups in all samples studied (Table 2, Figures S1, S2 and Tables S2, S4, 5).
Y chromosome haplogroup derivation for ancient samples
The Yleaf software outputs Y-chromosome haplogroups for male samples in the format of ISOGG (Tables 2 and S4). In all cases, haplogroups output by Yleaf could be further derived using manual asessment of data based on Family Tree haplogroup and SNP information. For this reason, for each individual haplogroup derivation, the Family Tree based haplogroup is provided along with the ISOGG based haplogroup assignment (Fig. 2, Tables 2 and S5).
The HU53 sample was derived for the E-BY4992 (E1b1b1a1b1a) haplogroup based on brachpoint SNPs BY4991 and BY4999 that could be ascertained. This haplogroup is frequent in the southern Balkans, especially in Greece  and was also detected Hungarian conquerors and Avars .
The HU3G sample was derived for haplogroup J-ZS7626 (J1a2b1b2c1a2b) based on the presence of 6 out of 25 haplogroup defining SNPs. This haplogroup radiates across eastern Africa and the southern tip of the Arabian Peninsula, but is also seen among inhabitants of the western shores of the Caspian Sea among Tabasaran, Kumyk, Avar, and Lezgin populations [14, 33].
Sample HU4H is derived for R-BY3642 past R-PF6658 (R1b1a1b1a1a2b). Consistent with this result the R-BY3636; R-BY3630; R-BY3851; R-FGC30121; R-BY42688; R-Y16335 haplogroup defining SNPs downstream of R-PF6658 are not derived and thus these branches can be excluded. The geographic distribution of R1b-U152, that is upstream of R-BY3642 is mainly restricted to the Alpine area of Italy and Corsica .
Samples HU3B, HU52, and HU54 belong to haplogroup R1a. HU54 is derived for R-YP1626 (R1a1a1b1a2b3a4a2c) which is most common in southwest Russia and Ukraine, Belarus and eastern Poland .
The remains belonging to Béla III (HU3B) and HU52 are derived for R-Z2125 (R1a1a1b2a2a). The R-Z2125 haplogroup is common in northeastern Afghanistan, Tajikistan, Kyrgyzstan, and southern Kazakhstan and to a lesser extent in the Volga Ural region, the Caucasus and Iran (Fig. 1) . We could further derive the Árpád Dynasty lineage based on SNPs defining R-Y2632, R-Y20746, R-Y2633, and 16 SNPs associated with haplogroup R-SUR51 (Figs. 2 and 3, Table S5) [13, 14, 16, 36, 37].
High resolution characterization of the R-Z2123 haplogroup
4340 samples from over 40 populations from the Volga-Ural region, the Caucasus, Central and South Central Asia were genotyped for R-Z2125 and/or R-Z2123 (Fig. 1, and Table S1). 400 plus samples were shown to be derived for R-Z2125 and 320 samples were derived for the R-Z2123 haplogroup. 206 representative R-Z2123 samples were chosen for whole-genome sequencing to provide high-resolution characterization of this haplogroup (Figs. 2 and 3). Two previously published high resolution Y chromosome sequencing datasets of an Iraqi and an Iraqi Jew were also included in the analysis along with the two Árpád Dynasty members, Béla III (HU3B) and HU52  (Table S6). From these 210 individuals, five Afghans and one Chechen could not be derived beyond R-Z2123* (R1a1a1b2a2a1). Twelve individuals, all from Afghanistan, are derived for R-Y47 (R1a1a1b2a2a1b). 58 individuals, including 48 Bashkirs predominantly from the Burzyansky and Abzelilovsky districts of Bashkortostan, 4 Afghans , 2 Árpád Dynasty members, an individual from modern day Serbia [7, 39], a Punjabi from Lahore (HG03636) , an Iraqi Jew (GRC14414377 a.k.a 16198) [14, 16, 36] and an Iraqi (GRC15570738)  were derived for R-Y2632 (R1a1a1b2a2a1c3). While the Iraqi individual (GRC15570738) could not be derived past R-Y2632* (R1a1a1b2a2a1c3), the other 57 samples are all derived for R-Y2633 (R1a1a1b2a2a1c3a). Post Y2633, these individuals form three different haplogroups defined by SNPs B139 (present only in the Iraqi Jew), Y2634 and Y16006 (present in the Punjabi from Lahore (HG03636) and 4 Afghans, and SUR51 present in the 48 Bashkirs, the individual from modern day Serbia and the two Árpád Dynasty members, Béla III (HU3B) and HU52. The R-SUR51 haplogroup bifurcates further after 17 shared SNPs (Table S7). The Bashkirs have the additional six SNPs associated with R-SUR51, while the individual from modern day Serbia and the two Árpád Dynasty members lack these, but are derived for a novel haplogroup, R-ARP, defined by nine shared SNPs. The individual from modern day Serbia has nine additional private SNPs, absent from the ancient samples, that define a novel haplogroup R-UVD (Ujvidék; the Hungarian name of modern day Novi Sad). 38 Bashkirs are further derived for R-SUR2 and its subhaplogroups R-SUR72 and R-SUR3/SUR95 while 7 have not been derived past R-SUR51*.
The other 134 individuals, including all 89 samples from the Caucasus, 30 Bashkirs predominantly from the Arhangelsky region and 15 Afghans are derived for Y934 and its subhaplogroups (Figs. 2 and 3, Table S6).
We generated a phylogenetic tree of the 208 modern samples and the two Árpád Dynasty members described above using the Clustal Omega software (v.1.2.4)  to assess evolutionary relationships and time to most recent common ancestor. Only SNPs that fall within the ~10 Mb region of the Y-chromosome described by Poznik et al.  were used in the calculation of coalescence times (Table S8). Using the appearance of R-Y2633 haplogroup 4100 years ago as an anchor based on prior work , the data indicates that the individual from modern day Serbia and the two Árpád Dynasty members, Béla III (HU3B) and HU52 separated from the Bashkirs about 2000 years ago, while the four main branches of the R-2123 haplogroup represented in this cohort, R-Z2123*, R-Y47, R-Y2632, R-Y934 arose ~4500 years ago.
Our objective was to identify and characterize SNPs defining the Árpád Dynasty and establish their phylogenetic origin.
Y chromosome haplogroup analysis of the seven ancient male skeletons previously Y-STR genotyped by Olasz et al.  confirms that only one, HU52, shares the Y chromosome haplogroup R-ARP with King Béla III confirming the hypothesis that this individual belongs to the House of Árpád. The other individuals are of diverse origins (J-ZS7626, E-BY4992, R-BY41605, R-BY3642, and R-Y2608) and are not related to the royals or to each other along paternal lineages. Mitochondrial haplogroup analysis revealed different haplogroups for all samples indicating no shared maternal lineage (Tables 2 and S2). The fact that neither Anna of Antioch nor Béla III share mitochondrial haplogroup with HU52 excludes the possibility that HU52 is their son, or Béla III’s brother. He could be Béla III’s father King Géza II (lived 1130–1162), or grandfather Béla II (1110–1141) or a more distant ancestor.
Identification of two members of the Árpád Dynasty provided us with the tools to address the phylogenetic origins of this dynasty. Based on 17 shared R-SUR51 SNPs, we established that Bashkirs, predominantly from the Burzyansky and Abzelilovsky districts, are the closest kin of the Árpád Dynasty from among the populations examined (Table S7). We further derived the Árpád Dynasty Y-chromosome haplogroup to R-ARP (Árpád) (proposed to be equivalent to R1a1a1b2a2a1c3a3b following ISOGG naming conventions) based on nine shared SNPs between them and an individual from modern day Serbia.
It was proposed by multiple authors that one new Y chromosome SNP gets introduced into the germline every 100–150 years . Based on this calibration, Behar et al. (2017) estimated that the Iraqi and the Iraqi Jew lineages diverged ~4100 years ago  coinciding with the appearance of the Y2633 SNP in the latter individual. Using this date as an anchor, the coalescence estimation using the Clustal Omega software suggests that the Z2123* starburst (appearance of R-YP3920, R-YP4907, R-Y47, R-Y934, and R-Y2632) occurred >4500 years ago (Fig. 3). This likely occurred in the region centered on northern Afghanistan since all Z2123* individuals of our cohort, as well as individuals belonging to the early branches of the R-Y47 (R-46), R-Y934 (R-Y874*, R-Y15121*, R-YP520*) and the R-Y2632 haplogroups (R-Y2634 and R-Y16006; R-YP6321*; R-YP6547*) are seen predominantly in Afghan individuals . The more derived haplogroups of R-Y2632, (R-SUR51) and R-Y934 (R-YP451 and R-Y5977) are to be found in the Volga Ural and the Caucasus regions, respectively, suggesting a founder effect and a relatively recent population expansion at their current locations.
Our analysis indicates that the ancestral lineage of modern Bashkirs separated from the lineage of the Árpáds about 2000 years ago marked by the appearance of the R-ARP haplogroup. Appearance of the R-UVD haplogroup present in the individual from modern day Serbia is estimated to have occurred about 900 years ago consistent with the burial date (1196) of King Béla III. Of course these numbers are estimates and hinge significantly on the SNPs that are accepted currently as legitimate to be used in such calculations (Table S8). Out of a total of nine ARP SNPs we identified, only eight fell into the region recommended, while of the nine UVD SNPs only 6 could be included in our analysis. In our opinion the region of interest from which SNPs could and should be taken into account for creating phylogenetic trees should be revised based on the better reference sequences available for the Y-chromosome.
The phylogenetic origins of the Hungarians who occupied the Carpathian basin has been much contested . Based on linguistic arguments it was proposed that they represented a predominantly Finno-Ugric speaking population while the oral and written tradition of the Árpád dynasty suggests a relationship with the Huns. Based on the genetic analysis of two members of the Árpád Dynasty, it appears that they derived from a lineage (R-Z2125) that is currently predominantly present among ethnic groups (Pashtun, Tadjik, Turkmen, Uzbek, and Bashkir) speaking Iranian or Turkic languages. However, their closest kin, the Bashkirs live in close proximity with Finno-Ugric speaking populations with the N-B539 haplogroup. A recent study shows that this haplogroup is also found in modern Hungarians . Intriguingly, the most recent separation of the N-B539 derived lineages found in Hungarians and Bashkirs is estimated to have occurred ~2000 years before present . This would suggest that a group of people consisting of a Turkic (R-SUR51) component and a Finno-Ugric (N-B539) component left the Volga Ural region about 2000 years ago and started a migration that eventually culminated in settlement in the Carpathian Basin. Higher resolution studies of the prevalence of the N-B539, R-SUR51, and R-ARP haplogroups in the Carpathian basin are needed to test this hypothesis, both in the current populations, as well as in the remains from cemeteries from the period of the Hungarian invasion. Targeted sampling of the regions of Levédia and Etelköz, proposed earlier living areas of Hungarians north of the Black Sea between the river Don (or Dnieper) and the Eastern Carpathians, could provide further data to determine the precise timing of the Hungarian migrations prior to their entry into the Carpathian Basin. The rarity of the Y-chromosome lineage of the Árpád Dynasty will allow a very detailed and accurate mapping of Hungarian prehistory and identification of additional descendents of the dynasty, which has been a goal of scholars interested in the subject for centuries.
The BAM files of the mitochondrial and Y-chromosome sequences for all samples presented in this paper are available at https://www.ncbi.nlm.nih.gov/bioproject/PRJNA490697: Determination of the phylogenetic origins of the Árpád Dynasty based on Y-chromosome sequencing of Béla the Third.
Hóman B. Geschichte des Ungarischen Mittelalters. Berlin: Walter de Gruyter; 1940–43.
Neparaczki E, Juhasz Z, Pamjav H, Feher T, Csanyi B, Zink A, et al. Genetic structure of the early Hungarian conquerors inferred from mtDNA haplotypes and Y-chromosome haplogroups in a small cemetery. Mol Genet Genom. 2017;292:201–14.
Neparaczki E, Kocsy K, Toth GE, Maroti Z, Kalmar T, Bihari P, et al. Revising mtDNA haplotypes of the ancient Hungarian conquerors with next generation sequencing. PLoS ONE 2017;12:e0174886.
Tomory G, Csanyi B, Bogacsi-Szabo E, Kalmar T, Czibula A, Csosz A, et al. Comparison of maternal lineage and biogeographic analyses of ancient and modern Hungarian populations. Am J Phys Anthropol. 2007;134:354–68.
Csosz A, Szecsenyi-Nagy A, Csakyova V, Lango P, Bodis V, Kohler K, et al. Maternal genetic ancestry and legacy of 10(th) century AD Hungarians. Sci Rep. 2016;6:33446.
Neparaczki E, Maroti Z, Kalmar T, Maar K, Nagy I, Latinovics D, et al. Y-chromosome haplogroups from Hun, Avar and conquering Hungarian period nomadic people of the Carpathian Basin. Sci Rep. 2019;9:16569.
Olasz J, Seidenberg V, Hummel S, Szentirmay Z, Szabados G, Melegh B, et al. DNA profiling of Hungarian King Béla III and other skeletal remains originating from the Royal Basilica of Székesfehérvár. Archaeol Anthropol Sci. 2019;11:1345–57.
Szentpétery I. Scriptores Rerum Hungaricarum. Budapest: Academia Litteraria Hungarica; 1937–38.
Engel P. Temetkezések a középkori székesfehérvári bazilikában [Burials in the medieval Basilica of Székesfehérvár]. Századok. 1987;121:613–37.
Érdy J III. Béla király és nejének Székes-Fehérvárott talált síremlékei [The tombs of king Béla III and his spouse found in Székes-Fehérvár]. Pest: Emich G; 1853.
Éry K (ed). A székesfehérvári királyi bazilika embertani leletei 1848-2002. Budapest: Balassi Kiadó; 2008.
Poznik GD, Henn BM, Yee MC, Sliwerska E, Euskirchen GM, Lin AA, et al. Sequencing Y chromosomes resolves discrepancy in time to common ancestor of males versus females. Science 2013;341:562–5.
Poznik GD, Xue Y, Mendez FL, Willems TF, Massaia A, Wilson Sayres MA, et al. Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences. Nat Genet. 2016;48:593–9.
Karmin M, Saag L, Vicente M, Wilson Sayres MA, Jarve M, Talas UG, et al. A recent bottleneck of Y chromosome diversity coincides with a global change in culture. Genome Res. 2015;25:459–66.
International Society of Genetic Genealogy. Y-DNA Haplogroup Tree 2019, Version:14.22, Date: 25 January 2019, http://www.isogg.org/tree/
Behar DM, Saag L, Karmin M, Gover MG, Wexler JD, Sanchez LF, et al. The genetic variation in the R1a clade among the Ashkenazi Levites’ Y chromosome. Sci Rep. 2017;7:14969.
Inc. I. Basespace application. www.illumina.com/BaseSpaceApps.
Bioinformatics B. Trim Galore. http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/
Miller NA, Farrow EG, Gibson M, Willig LK, Twist G, Yoo B, et al. A 26-hour system of highly sensitive whole genome sequencing for emergency management of genetic diseases. Genome Med. 2015;7:100.
Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6.
Jonsson H, Ginolhac A, Schubert M, Johnson PL, Orlando L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 2013;29:1682–4.
Skoglund PSJ, Götherström A, Jakobssonac M. Accurate sex identification of ancient human remains using DNA shotgun sequencing. J Archaeological Sci. 2013;40:4477–82.
Renaud G, Slon V, Duggan AT, Kelso J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biol. 2015;16:224.
Rasmussen M, Guo X, Wang Y, Lohmueller KE, Rasmussen S, Albrechtsen A, et al. An Aboriginal Australian genome reveals separate human dispersals into Asia. Science 2011;334:94–8.
Weissensteiner H, Pacher D, Kloss-Brandstatter A, Forer L, Specht G, Bandelt HJ, et al. HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic Acids Res. 2016;44(W1):W58–63.
Ralf A, Gonzalez DM, Zhong K, Kayser M. Yleaf: software for human Y-Chromosomal Haplogroup inference from next-generation sequencing data. Mol Biol Evol. 2018;35:1820.
Y-DNA Haplogroup Tree 2019: International Society of Genetic Genealogy; 2020. http://www.isogg.org/tree.
Y-chromosome DNA haplotree: Family Tree DNA Ltd. https://www.familytreedna.com/public/y-dna-haplotree.
Au CH, Ho DN, Kwong A, Chan TL, Ma ESK. BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing. Sci Rep. 2017;7:1567.
Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011;7:539.
FigTree v1.4.4. http://tree.bio.ed.ac.uk/software/figtree/.
Cruciani F, La Fratta R, Trombetta B, Santolamazza P, Sellitto D, Colomb EB, et al. Tracing past human male movements in northern/eastern Africa and western Eurasia: new clues from Y-chromosomal haplogroups E-M78 and J-M12. Mol Biol Evol. 2007;24:1300–11.
Balanovsky O, Dibirova K, Dybo A, Mudrak O, Frolova S, Pocheshkhova E, et al. Parallel evolution of genes and languages in the Caucasus region. Mol Biol Evol. 2011;28:2905–20.
Myres NM, Rootsi S, Lin AA, Jarve M, King RJ, Kutuev I, et al. A major Y-chromosome haplogroup R1b Holocene era founder effect in Central and Western Europe. Eur J Hum Genet. 2011;19:95–101.
Underhill PA, Poznik GD, Rootsi S, Jarve M, Lin AA, Wang J, et al. The phylogenetic and geographic structure of Y-chromosome haplogroup R1a. Eur J Hum Genet. 2015;23:124–31.
Rootsi S, Behar DM, Jarve M, Lin AA, Myres NM, Passarelli B, et al. Phylogenetic applications of whole Y-chromosome sequences and the Near Eastern origin of Ashkenazi Levites. Nat Commun. 2013;4:2928.
Muratov. The genus of Shagali Shakman, the clan of Olobure and the descendants of Inas (Kipchak Khan) according to Big-Y. Bulletin of the EI Project ‘Suyun’. 2014;1:7–29.
Di Cristofaro J, Pennarun E, Mazieres S, Myres NM, Lin AA, Temori SA, et al. Afghan Hindu Kush: where Eurasian sub-continent gene flows converge. PLoS ONE 2013;8:e76748.
Zgonjanin D, Alghafri R, Antov M, Stojiljkovic G, Petkovic S, Vukovic R, et al. Genetic characterization of 27 Y-STR loci with the Yfiler((R)) Plus kit in the population of Serbia. Forensic Sci Int Genet. 2017;31:e48–9.
Neparaczki E, Maroti Z, Kalmar T, Kocsy K, Maar K, Bihari P, et al. Mitogenomic data indicate admixture components of Central-Inner Asian and Srubnaya origin in the conquering Hungarians. PLoS ONE 2018;13:e0205920.
Post H, Nemeth E, Klima L, Flores R, Feher T, Turk A, et al. Y-chromosomal connection between Hungarians and geographically distant populations of the Ural Mountain region and West Siberia. Sci Rep. 2019;9:7786.
Amorim CEG, Vai S, Posth C, Modi A, Koncz I, Hakenbeck S, et al. Understanding 6th-century barbarian social organization and migration through paleogenomics. Nat Commun. 2018;9:3547.
Team QD. QGIS Geographic Information System; Open Source Geospatial Foundation Project. 2015. http://www.qgis.org.
The authors express their gratitude to Péter Erdő Cardinal, Archbishop of Esztergom-Budapest for the permission to exhume the human remains for the purpose of genetic analysis and Tibor Török for critical reading of the paper.
Funding for the NGS sequencing and data analysis was provided by the Hungarian Ministry of Human resources (EMMI) under project number 26090/2019/MINKABINET. Endre Neparáczki is supported by grants: No. 62722/2018 certificate and Excellence Program of 2019 (TUDFO/5157-1/2019-ITM). Doron Behar is partially supported by the Estonian Research Council grant IUT-24. Elza Khusnutdinova and Natalia Ekomasova are supported by grants: 17-44-020748 and 19-04-01195 from the Russian Foundation for Basic Research. Murat Dzhaubermezov is supported by grant FZWU-2020-0027 from the Ministry of Science and Higher Education of the Russian Federation. JDC and JC are in part supported by the Agence National de la Recherche (Grant #BLAN07-3_222301, CSD 9 - Sciences humaines et sociales). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the paper.
Conflict of interest
The authors declare that they have no conflict of interest.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Nagy, P.L., Olasz, J., Neparáczki, E. et al. Determination of the phylogenetic origins of the Árpád Dynasty based on Y chromosome sequencing of Béla the Third. Eur J Hum Genet 29, 164–172 (2021). https://doi.org/10.1038/s41431-020-0683-z
This article is cited by
Dual origins of the Northwest Chinese Kyrgyz: the admixture of Bronze age Siberian and Medieval Niru’un Mongolian Y chromosomes
Journal of Human Genetics (2022)
Genome-wide autosomal, mtDNA, and Y chromosome analysis of King Bela III of the Hungarian Arpad dynasty
Scientific Reports (2021)