Ancient genomes from the last three millennia support multiple human dispersals into Wallacea

Oliveira, Sandra; Nägele, Kathrin; Carlhoff, Selina; Pugach, Irina; Koesbardiati, Toetik; Hübner, Alexander; Meyer, Matthias; Oktaviana, Adhi Agus; Takenaka, Masami; Katagiri, Chiaki; Murti, Delta Bayu; Putri, Rizky Sugianto; Mahirta; Petchey, Fiona; Higham, Thomas; Higham, Charles F. W.; O’Connor, Sue; Hawkins, Stuart; Kinaston, Rebecca; Bellwood, Peter; Ono, Rintaro; Powell, Adam; Krause, Johannes; Posth, Cosimo; Stoneking, Mark

doi:10.1038/s41559-022-01775-2

Download PDF

Article
Open access
Published: 09 June 2022

Ancient genomes from the last three millennia support multiple human dispersals into Wallacea

Nature Ecology & Evolution volume 6, pages 1024–1034 (2022)Cite this article

17k Accesses
11 Citations
225 Altmetric
Metrics details

Subjects

Abstract

Previous research indicates that human genetic diversity in Wallacea—islands in present-day Eastern Indonesia and Timor-Leste that were never part of the Sunda or Sahul continental shelves—has been shaped by complex interactions between migrating Austronesian farmers and indigenous hunter–gatherer communities. Yet, inferences based on present-day groups proved insufficient to disentangle this region’s demographic movements and admixture timings. Here, we investigate the spatio-temporal patterns of variation in Wallacea based on genome-wide data from 16 ancient individuals (2600–250 years BP) from the North Moluccas, Sulawesi and East Nusa Tenggara. While ancestry in the northern islands primarily reflects contact between Austronesian- and Papuan-related groups, ancestry in the southern islands reveals additional contributions from Mainland Southeast Asia that seem to predate the arrival of Austronesians. Admixture time estimates further support multiple and/or continuous admixture involving Papuan- and Asian-related groups throughout Wallacea. Our results clarify previously debated times of admixture and suggest that the Neolithic dispersals into Island Southeast Asia are associated with the spread of multiple genetic ancestries.

Widespread Denisovan ancestry in Island Southeast Asia but no evidence of substantial super-archaic hominin admixture

Article 22 March 2021

Genetic history from the Middle Neolithic to present on the Mediterranean island of Sardinia

Article Open access 24 February 2020

The genomic landscape of Mexican Indigenous populations brings insights into the peopling of the Americas

Article Open access 12 October 2021

Main

Wallacea (Fig. 1), a region of deep-sea islands located between the Sunda and Sahul continental shelves¹, has been both a bridge and a barrier for humans migrating from Asia to Oceania. Anatomically modern humans (AMHs) presumably first crossed Wallacea before reaching Sahul, for which the earliest unequivocal dates are approximately 47 ka^2,3,4,5 (but see Clarkson et al.⁶). In Wallacea itself, the archaeological record indicates occupation by AMHs starting around 46 ka in the southern islands^7,8,9, 45.5 ka in Sulawesi¹⁰ and 36 ka in the northern islands (North Moluccas)¹¹. After a long period of isolation, the region was impacted by the Austronesian expansion. Equipped with new sailing and farming technologies, Austronesian-speaking groups likely expanded out of Taiwan 4,000–4,500 ya^12,13,14 and eventually settled in Island Southeast Asia (ISEA), Oceania and Madagascar. Their arrival is generally linked to the earliest appearance of pottery, which dates to approximately 3,500 ya in Wallacea^{11,15,16,17,18}. During the Late Neolithic and early Metal Age (2,300–2,000 ya), the maritime trade network intensified, with a movement of spices, bronze drums and glass beads connecting Wallacea to India and mainland SEA (MSEA)^{11,17,19,20,21,22,23,24}.

**Fig. 1: Sample provenance and the results of principal component and DyStruct analyses.**

The contact between Austronesian-speaking farmers and hunter–gatherer communities is still reflected in the linguistic and biological diversity of Wallacea today. Austronesian languages of the Malayo-Polynesian subgroup are widespread throughout the region²⁵ but a few dozen non-Austronesian (that is, Papuan) languages are also spoken in the North Moluccas, Timor, Alor and Pantar²⁶; some Austronesian languages show features acquired from Papuan languages²⁷.

The genomic composition of present-day Wallaceans shows signals of admixture between Papuan- and Asian-related ancestry most similar to that of present-day Austronesians^28,29,30. This dual ancestry is geographically distributed as a gradient of increasing Papuan-related ancestry from west to east^28,30. Previous studies have estimated admixture times based on present-day groups^28,29,30, providing the first inferences on the direction and rate of spread of genetic ancestry²⁸. However, the time estimates from different studies show discrepancies of more than 3,000 years (Supplementary Table 1) that cannot be solely attributed to ascertainment bias but also reflect limitations in admixture dating methods^28,29, which are differentially affected by scenarios involving continuous or multiple pulses of gene flow from closely related sources³¹. Resolving the uncertainty in admixture dates has important implications for understanding the interactions between Austronesians and pre-Austronesian populations. Admixture dates close to the archaeological dates proposed for the Austronesian arrival would indicate that admixture occurred soon after contact, while more recent dates would imply that communities coexisted for some time before genetically mixing or were mixing for a prolonged period. Moreover, admixture dates predating the Austronesian arrival would suggest alternative explanations, such as genetic influences from other Asian-related groups³².

In this study, we leveraged the power of ancient DNA to investigate spatio-temporal patterns of variation within Wallacea during the last 2,500 years. We provide insights into the time of arrival of the Austronesian-related ancestry, the temporal span of admixture and the relationship between the ancestry of incomers and that of other groups from Asia and Oceania. Additionally, we explore the impact and timing of an additional migration from MSEA to Wallacea.

Results

We extracted DNA from skeletal remains from 16 individuals dated to approximately 2,600–250 BP from 8 archaeological sites spanning the North Moluccas, Sulawesi and East Nusa Tenggara (for our purposes, East Nusa Tenggara has been abbreviated to NTT for Nusa Tenggara Timur) (Fig. 1a and Table 1). Sequencing libraries were then constructed and capture-enriched for approximately 1.2 million genome-wide single-nucleotide polymorphisms (SNPs)³³ and the complete mitochondrial DNA (mtDNA). The authenticity of ancient DNA was confirmed based on the elevated amounts of deaminated positions at the ends of reads and the short average fragment size (Supplementary Table 2). Contamination estimates were low (Supplementary Table 2).

Table 1 Ancient samples from Wallacea included in this study

Full size table

The mtDNA and Y-chromosome haplogroups show that both Asian- and Papuan-related ancestries were already present in the North Moluccas approximately 2,150 BP (Table 1). Furthermore, 2 North Moluccas individuals dating to approximately 2,150–2,100 BP carried mtDNA and Y-chromosome haplogroups associated with different ancestries, indicating that admixture started before then. In comparison to the individuals from NTT and Sulawesi, those from the North Moluccas showed a higher proportion of mtDNA lineages connecting them to Near Oceania, as attested by the Q haplogroups characteristic of Northern Sahul³⁴ and by the so-called ‘Polynesian pre-motif’ (B4a1a/B4a1a1) (ref. ³⁵). None of the individuals from Sulawesi or NTT carry Papuan-related mtDNA haplogroups, even though they are found there today^36,37.

To explore the genome-wide patterns of variation in ancient Wallaceans, we performed principal component analysis (PCA) based on different sets of present-day populations from Asia and Oceania and two combinations of SNP arrays (Methods). Ancient Wallaceans cluster between Papua New Guinea and Asia, together with present-day Wallaceans (Fig. 1b and Extended Data Fig. 1). However, the trajectory outlined by individuals from the northern (North Moluccas) versus southern (NTT) islands is slightly different, suggesting they may have distinct genetic histories. Ancient individuals from NTT cluster on a cline towards mainland Asians and some Western Indonesian groups, while ancient individuals from the North Moluccas align on a trajectory towards present-day Taiwanese/Philippine populations or even towards ancient individuals from Guam 2,200 BP, Vanuatu 2,900 BP and Tonga 2,500 BP (previously shown to have almost exclusively Austronesian-related ancestry)^38,39. The differences between the two Wallacean regions are more pronounced when projecting the ancient individuals into principal components that feature Asian-related variation (PC2 versus PC3; Extended Data Fig. 2).

We next used a model-based clustering method (DyStruct) to infer shared ancestry⁴⁰. The results for the best supported number of clusters in each of the tested datasets (Supplementary Fig. 1a,b) show that ancient Wallaceans shared ancestry with Papuan-speaking groups from New Guinea (dark blue component) and multiple Asian groups whose ancestry can be partitioned into three main components (Fig. 1c; see full results in Supplementary Fig. 1e,f). One component (yellow) is present at high frequencies in Austronesian-speaking groups from Taiwan, the Philippines and Indonesia, and ancient individuals from Taiwan; a second component (mango) is maximized in Polynesian-speaking groups from the Pacific and ancient individuals from the same region; and a third component (dark red) is widespread in present-day and ancient individuals from SEA. The most striking difference among ancient Wallaceans is the presence of the SEA component in ancient NTT and Sulawesi individuals but not in North Moluccan individuals. A more subtle difference occurs in the relative proportion of the two Austronesian-related components (Extended Data Fig. 3): ancient individuals from Sulawesi and NTT have a higher relative proportion of the Austronesian-related (yellow) component that predominates in Taiwan, compared to ancient individuals from the North Moluccas, who are more similar to groups from the Pacific.

To directly compare allele-sharing between ancient Wallaceans and different Asian-related groups, we used f-statistics⁴¹. First, we computed an f₄-statistic of the form F₄(Mbuti, ancient Wallacean; Amis, test), where the test group includes ancient and present-day groups from mainland Asia, ISEA and the Pacific who have no discernible Papuan-related ancestry (Supplementary Fig. 2 and Supplementary Table 3). Our results show that ancient individuals from the North Moluccas share more drift with ancient individuals from Vanuatu (2,900 BP) and Tonga (2,500 BP) than with Amis (z > 2). In contrast, ancient individuals from Sulawesi and NTT do not share additional drift with any tested groups. Nonetheless, the higher number of f₄-statistics consistent with zero in tests involving ancient individuals from NTT (Komodo and Liang Bua) indicates that they share as much drift with Amis as with several other groups, not only from Taiwan/Philippines but also SEA or Western Indonesia. This result, together with the identification of an ancestry component related to SEA (Supplementary Fig. 1e,f) in ancient NTT and Sulawesi individuals, supports a more complex admixture history in these parts of Wallacea.

We next analysed pairs of f₄-statistics designed to capture any differences in Asian-related ancestry between individuals from the North Moluccas and NTT (Supplementary Figs. 3 and 4 and Supplementary Tables 4 and 5). All f₄-statistics had the form F₄(Mbuti, test; New Guinea Highlanders, ancient Wallacean) and each pair compared the results for a fixed test group on the x axis (Amis or Vanuatu 2,900 BP for comparisons between modern or ancient test pairs, respectively) and various Asian-related test groups on the y axis. Since individuals from the North Moluccas lacked the SEA component (Fig. 1c), the best proxy for this component in individuals from NTT should maximize the differences in f₄-statistics between regions. To estimate these differences, we used a Bayesian approach that accounts for measurement error in the f₄-statistics (Supplementary Figs. 5 and 6). We concluded that the groups maximizing differences between ancient individuals from Wallacea are the present-day Mlabri or Nicobarese and ancient individuals from Vietnam (Mán Bạc 3,800 BP), Laos (Tam Pa Ping 3,000 BP) and Thailand (Ban Chiang 2,600 BP) (Fig. 2), but other related MSEA proxies could not be excluded (Supplementary Figs. 7 and 8). For several MSEA test groups, the 95% credible interval of the differences did not overlap zero within the range of F₄ values covering the Aru Manara 2,150 BP, Tanjung Pinang 2,100 BP, Uattamdi 1,900 BP, Liang Bua 2,600 BP and Komodo 750 BP, indicating strong support for the differences between NTT and the North Moluccas. Below this range (that is, for lower F₄ values), there is no support for regional differences, probably due to a decrease in power for differentiating Asian ancestries when the total Asian ancestry is low and the Papuan-related ancestry is high.

**Fig. 2: Biplots showing the results of two pairs of f₄-statistics of the form F₄(Mbuti, test; New Guinea Highlanders, ancient Wallacea).**

We also investigated the relationships between ancient Wallaceans and groups associated with the first colonization of Sahul or Wallacea using an f-statistic of the form F₄(Mbuti, new ancient Wallacean; New Guinea Highlanders, test). Most ancient Wallacean individuals showed a significantly closer affinity to New Guinea Highlanders than to Australians, the Bismarck group or the recently published pre-Neolithic individual from Sulawesi (Leang Panninge)⁴² (Supplementary Fig. 9 and Supplementary Table 6). The non-significant results are probably due to low amounts of data available for Leang Panninge and/or low amount of Papuan ancestry in Liang Bua and Topogaro (Supplementary Fig. 10). Nonetheless, tests involving Leang Panninge consistently exhibited the lowest F₄ values. Therefore, despite being from Wallacea, this ancient individual was not a good proxy for the Papuan-related ancestry of the newly reported ancient Wallaceans.

We further investigated potential differences in ancestry sources and proportions among ancient Wallaceans using the qpAdm software⁴¹. Our results indicate that whereas ancient individuals from the North Moluccas can be modelled as having both Papuan- and Austronesian-related ancestry, ancient individuals from NTT and Sulawesi were either consistent with or required a three-wave model, with additional SEA-related ancestry (Fig. 3 and Supplementary Table 7). Despite cases for which we identified more than 1 fitting model (P > 0.01), the estimated proportions under the model with the highest P value correlated with the proportions of Austronesian, Papuan and SEA ancestry inferred by DyStruct (Mantel statistic r = 0.97, P < 0.001). Ancient NTT individuals displayed more inter-island variance in their Papuan- and SEA-related ancestries (s² = 0.026 and 0.046, respectively) compared to their Austronesian-related ancestry (s² = 0.003).

**Fig. 3: Ancestry proportions estimated with qpAdm for the model with the highest P value in each group.**

A comparison between the ancestry composition of ancient and present-day individuals from the same region (Fig. 3 and Supplementary Table 7) suggests that a small part (8%) of the Austronesian-related ancestry of ancient individuals from the North Moluccas was replaced by SEA ancestry in present-day groups, masking former differences between regions of Wallacea. The present-day groups from Sulawesi and NTT can be modelled by the same three ancestry components found in ancient individuals from those regions. However, the ancestry proportions of ancient and present-day groups showed some differences, which could indicate ancestry shifts over time or reflect the small sample sizes.

To gain insights into the relative order of admixture events between different ancestries in Wallacea, we used the admixture history graph (AHG) approach⁴³, which relies on differences in covariance between the components inferred by DyStruct (Supplementary Tables 8–10). The AHG, applied to both ancient and present-day data from NTT, suggests that the admixture of SEA- and Papuan-related ancestries occurred before the arrival of the Austronesian-related ancestry (Supplementary Table 10). An analogous test based on the three main ancestry components observed in the North Moluccas (Papuan, Taiwan-Austronesian and Pacific-Austronesian) does not provide compelling evidence for backflow from the Pacific since the AHG inferred that Papuan ancestry was introduced into a population that already had both Austronesian-related components (Supplementary Table 10). This result suggests that drift had a more important role in the occurrence and distribution of the two Austronesian-related components.

Finally, we investigated the timing of admixture using the software DATES (Supplementary Table 11), applicable to ancient DNA from single individuals⁴⁴. With time series ancient data, we expected to reconcile previous admixture time estimates, despite gene flow complexity. Using Papuans and a pool of Asian groups as sources, we found that estimates for the oldest individuals from the Northern Moluccas (2,150 BP) and NTT (2,600 BP) are very similar (approximately 3,000 BP, adjusting for the archaeological age of each sample; Fig. 4), approaching archaeological dates for the arrival of the Austronesians in Wallacea. However, younger individuals displayed more recent estimates. This trend of decreasing admixture times extends to the present-day groups from the North Moluccas who show even younger admixture dates (approximately 1,400 BP) than ancient individuals from the same region. Admixture times for present-day and ancient samples from NTT overlapped. Our results indicate that both regions probably experienced multiple admixture pulses (and/or continuous gene flow), as suggested by the changes in ancestry proportions or composition over time (Fig. 3); however, the overall duration of admixture differed between regions.

Since we inferred that the Asian-related ancestry of ancient individuals from NTT was introduced by two Asian groups in separate events, we might expect admixture time estimates to differ using different proxies for this Asian ancestry. Therefore, we looked for differences in estimates using as sources Papuans and Austronesians (test a) versus Papuans and MSEA (test b). While the point estimates for the two NTT samples with the most MSEA ancestry were older in test b than test a, the confidence intervals overlapped (Extended Data Fig. 4). Either the different Asian ancestry sources were too similar or the admixture times were too close in time to reliably distinguish.

Discussion

This study greatly increases the amount of ancient genomic data from ISEA, a tropical region unsuitable for DNA preservation but particularly important for understanding human population interactions. The new data clarify Wallacea’s admixture history and expose genetic relationships that were masked by recent demographic processes in present-day populations. Our results reveal striking regional variation in Wallacea, some of which is found among the Austronesian-related ancestry of ancient individuals. The most remarkable differences are associated with ancestry contributions from MSEA that were probably already part of the NTT genomic landscape when Austronesians arrived but were absent from the North Moluccas until recently.

Papuan ancestry in Wallacea

All newly presented ancient Wallaceans are genetically closer to present-day Papuans than to the pre-Neolithic Leang Panninge individual from Sulawesi⁴², suggesting little direct continuity between pre-Neolithic and post-Austronesian Wallaceans. Additionally, the ancestry of the newly presented Wallaceans is closer to the ancestry of Papuans than indigenous Australians. This suggests that either the group that gave rise to Australians split first or there was contact between Wallacea and New Guinea after their initial settlement. The second scenario is supported by an mtDNA study that inferred major influxes of Papuan ancestry into Wallacea: after the Last Glacial Maximum (around 15 ka); and Austronesian contact (around 3 ka)⁴⁵.

Previous studies also reported elevated amounts of Denisovan ancestry in present-day Wallaceans, which correlate with the amount of Papuan-related ancestry⁴⁶. We confirmed that the same relationship holds for ancient Wallaceans (Extended Data Fig. 5); therefore, their Denisovan-related ancestry was probably contributed via Papuan-related admixture.

Early SEA ancestry in NTT

Wallacea is generally assumed to have been mostly isolated and shaped by two main streams of ancestry: one related to the first settlement of Sahul and another associated with the Austronesian expansion^28,29,30. The results presented in this study show that the genetic variation of ancient individuals from NTT also requires ancestry contributions from MSEA. The inferred order of events makes it unlikely that the SEA and Austronesian-related ancestries were introduced together from Western Indonesia, where both ancestries are found. Instead, it seems that human groups from MSEA crossed into southern Wallacea before the Austronesian-related groups spread into the region. Further support for this scenario comes from genetic analyses of the commensal black rat—often a good indicator of human migration—which suggests that this species was also first introduced in NTT from MSEA⁴⁷.

The broad geographical distribution of groups best matching the MSEA ancestry of southern Wallaceans raises questions about the actual origin(s) of peoples who reached those islands. The best present-day proxies, the Mlabri from Thailand/Laos and the Nicobarese from the Nicobar islands, speak Austroasiatic languages and have been relatively isolated compared to other MSEA groups that recently experienced extensive admixture^48,49,50,51. Their isolation might explain why they appear as best proxies without being necessarily connected to the inferred migration event. Moreover, there is no clear link between any specific ancient group from MSEA and the actual source that contributed ancestry to NTT since we identified several equivalent ancient proxies for this ancestry.

Other evidence for this migration is equivocal. NTT languages are either Austronesian or Papuan-related and no influences from MSEA language families have been reported. Similarly, there is no archaeological evidence for pre-Austronesian contact between MSEA and southern Wallacea; the earliest evidence is the appearance of the Đông Sơn bronze drums, which spread to southern but not northern Wallacea around the early centuries AD, following maritime trade routes⁵². These drums probably originated in northern Vietnam or adjacent provinces of southern China⁵². Although we cannot rule out some MSEA ancestry contributions from the Đông Sơn period (or even later) for the younger NTT individuals, the high amount of MSEA ancestry in the Liang Bua individual (2,600 BP) and our AHG inferences support an earlier presence in southern Wallacea. This has important implications for our understanding of the Neolithic expansion into ISEA since it brings to light a previously undescribed human dispersal from MSEA. Future archaeological studies in SEA might help link this human dispersal to any contemporaneous material culture. Additionally, ancient DNA from older periods will help clarify the time of arrival of this ancestry.

Austronesian expansion into the North Moluccas and Pacific

The fine-scale structure observed among Austronesian-related groups from ISEA and the Pacific, and the higher genetic proximity of the ancient North Moluccans to the latter, are pertinent for previous considerations of the role of the North Moluccas in dispersals to the Pacific²⁰. When analysed through seafaring and climatic models, the North Moluccas is one of the most likely starting points for settlers that ventured into the Palau or Mariana Islands (western Micronesia)^53,54. Their geographical setting also led archaeologists to search the region for pottery that might be ancestral to the Lapita cultural complex (distributed from the Bismarck Archipelago to Samoa), as well as the Marianas Redware culture^11,18. However, current evidence does not connect the North Moluccas red-slipped pottery to either of these material cultures but instead to pottery from the Talaud Islands, northern and western Sulawesi, North Luzon, Batanes and south-eastern Taiwan¹¹.

The genetic affinity between the ancient individuals from the North Moluccas and the Mariana Islands suggested by our results has some parallels in mtDNA studies based on present-day groups⁵⁵. However, ancient DNA from Guam supports an origin for the settlement of the Mariana Islands from the Philippines³⁹. Under a simple expansion scenario, without back migration, the increasing amounts of Austronesian ancestry characteristic of the Pacific (and decrease of ancestry characteristic of Taiwan/Philippines) from the ancient North Moluccas to Guam (2,200 BP), Vanuatu (2,900 BP) and Tonga (2,500 BP) could reflect their relative position along the peopling wave that eventually reached the eastern parts of the Pacific (Extended Data Fig. 3). Yet, the position in this study refers to the split order of groups without any necessary attachment to their geographical location. Therefore, it is possible that the higher proximity between the North Moluccas and groups from the Pacific, compared to NTT or Sulawesi, simply reflects their more recent ancestry tracing back to a common Austronesian source, regardless of its location. This scenario also implies that the Austronesian-related ancestry found in NTT or Sulawesi is somewhat differentiated from that found in the North Moluccas. Nonetheless, we cannot exclude the possibility of more complex migration scenarios (for example, involving back migrations).

It is also important to consider that the dates of the oldest individuals from the North Moluccas (2,150 BP in Morotai and 1,900 BP in Kayoa Island) overlap with the start of the Early Metal Age 2,300–2,000 ya in this region¹¹. This period is characterized by the appearance of copper, bronze and iron artefacts and glass beads in the region, as well as the spread of pottery into Morotai. Thus, these individuals might not be good representatives of the first Austronesians, thought to have reached Kayoa island 3,500 ya^11,18, but instead might reflect additional genetic influences brought by later contacts.

However, linguistic evidence parallels the genetic evidence for a closer relationship of North Moluccans with Oceanians, compared to peoples from NTT. The Austronesian (Malayo-Polynesian major subgroup) languages of the Northern Moluccas are part of the South Halmahera–West New Guinea (SHWNG) regional subgroup, which are closer to Oceanic languages than to any other Western Malayo-Polynesian major subgroup^13,56,57, whereas the languages spoken in NTT are an outgroup to both SHWNG and Oceanic languages¹³.

The timing of admixture

Besides providing direct evidence for Austronesian-Papuan contact before 2,150 BP in the North Moluccas and 2,600 BP in NTT, the oldest individuals gave admixture date estimates close to 3,000 BP. This period is slightly younger than the earliest archaeological traces of the Neolithic (Austronesian) arrival in the North Moluccas (approximately 3,500 BP for Kayoa Island^11,18) but predates the adoption of pottery on Morotai Island (2,300–2,000 BP), where the oldest North Moluccan individuals in this study were found^11,58. However, it is similar to some of the earliest secure dates from NTT (3,000 BP for eastern Flores)¹⁷. Previous studies conducted on present-day eastern Indonesian populations suggested that this admixture lagged about a millennium behind the arrival of Austronesian populations³⁰. Our admixture analysis for ancient individuals, and the comparison with present-day data, provides an alternative explanation and helps to clarify previous debates concerning admixture times^{28,29,30,32,59}. The decreasing trend in admixture time estimates from the oldest individuals until present-day populations is a strong indicator of multiple pulses or continuous admixture. Therefore, even our oldest estimates might not correspond to the actual start of admixture but to a more recent time due to additional gene flow.

Gene flow events might have been facilitated by emergent maritime networks and spice trade interactions in the Metal Age¹¹. In the North Moluccas, this period not only corresponds to a more rapid spread of material culture between regions^11,21,22,58 but also to the period of language levelling or radiation described for both the Austronesian (SHWNG) and Papuan (West Papuan phylum, Northern Halmahera stock) languages¹¹. The historical socio-economic systems of the North Moluccas and western Papua also brought together Papuan-speaking resident populations and a Malay-speaking elite¹¹, thus mixing could have occurred until very recently. In contrast to the North Moluccas, NTT and Sulawesi individuals do not show genetic traces of very recent contact. Still, their demographic history was nonetheless characterized by a long-term process of admixture involving at least two Asian-related ancestries.

The evidence for ongoing contact in Wallacea has important implications for efforts that use present-day genomic data to discern the direction and number of human migrations to Sahul (for example, Brucato et al.⁶⁰). Failure to consider such contact may result in wrongly considering the genetic affinity between Papuans and northern versus southern Wallaceans to reflect ancestral relationships of these groups rather than differences in the degree of contact. Overall, our findings suggest different histories for northern versus southern Wallaceans that reflect differences in contact with MSEA, in the duration of contact with Papuans and perhaps even with different Austronesian-related groups. Future ancient DNA studies involving individuals from earlier periods will help to improve our understanding of the demographic changes occurring before and after the arrival of Austronesians in Wallacea.

Methods

Sampling

All samples were processed in dedicated ancient DNA laboratories at the Max Planck Institute for the Science of Human History and the Max Planck Institute for Evolutionary Anthropology. At the Max Planck Institute for the Science of Human History, the petrous bone of samples AMA001, AMA004 and AMA009 was first drilled from the outside, identifying the position of the densest part by orienting on the internal acoustic metre and drilling parallel to it into the target area to avoid damaging the semicircular ducts⁶¹ (protocol: https://doi.org/10.17504/protocols.io.bqd8ms9w). After that, the petrous bone was cut along the margo superior partis petrosae (crista pyramidis) and 50–150 mg of bone powder were drilled from the densest part around the cochlea⁶². All other elements processed at the Max Planck Institute for the Science of Human History (AMA003, AMA005, AMA008, JAB001, KMO001, LIA001, LIA002, LIT001, TOP002, TOP004) were sampled by cutting and drilling the densest part. At the Max Planck Institute for Evolutionary Anthropology, the Tanjung Pinang and Uattamdi specimens were sampled by targeting the cochlea from the outside. For this, a thin layer of surface (approximately 1 mm) was removed with a sterile dentistry drill. Small holes were then drilled into the cleaned areas, yielding between 42 and 63 mg of bone powder. Detailed information on the analysed samples, radiocarbon dating and archaeological context are provided in the supplementary information and in Supplementary Tables 2 and 12.

DNA extraction

DNA extraction in both laboratories was carried out using a silica-based method optimized for the recovery of highly degraded DNA^63,64. To release DNA from 50–100 mg of bone powder, a solution of 900 μl EDTA, 75 μl H₂O and 25 μl proteinase K was added. In a rotator, samples were digested for at least 16 h at 37 °C, followed by an additional hour at 56 °C. The suspension was then centrifuged and transferred into a binding buffer. To bind DNA, large-volume silica spin columns (High Pure Viral Nucleic Acid Large Volume Kit; Roche Molecular Systems) were used. After two washing steps using the manufacturer’s wash buffer, DNA was eluted in TET (10 mM Tris, 1 mM EDTA and 0.05% Tween 20). At the Max Planck Institute for the Science of Human History, the second elution of DNA from the spin column was carried out using a fresh aliquot of elution buffer for a total of 100 µl DNA extract, whereas at the Max Planck Institute for Evolutionary Anthropology the same aliquot of elution buffer was loaded twice for a total of 50 µl DNA extract (protocol: https://doi.org/10.17504/protocols.io.baksicwe).

Library preparation

At the Max Planck Institute for the Science of Human History, double-stranded DNA libraries were built from 25 μl of DNA extract in the presence of uracil DNA glycosylase (UDG) (half libraries) according to a protocol that uses the UDG enzyme to reduce, but not eliminate, the amount of deamination-induced damage towards the ends of ancient DNA fragments⁶⁵. Negative and positive controls were carried alongside each experiment (extraction and library preparation) (protocol: https://doi.org/10.17504/protocols.io.bmh6k39e). Libraries were quantified with the IS7 and IS8 primers⁶⁶ in a quantification assay using a DyNAmo SYBR Green qPCR Kit (Thermo Fisher Scientific) on the LightCycler 480 (Roche). Each ancient DNA library was double-indexed⁶⁷ in parallel 100 μl reactions using PfuTurbo DNA Polymerase (Agilent Technologies) (protocol: https://doi.org/10.17504/protocols.io.bakticwn). The indexed products for each library were pooled, purified over MinElute columns (QIAGEN), eluted in 50 μl TET and again quantified with the IS5 and IS6 primers⁶⁶ using the quantification method described above; 4 μl of the purified product were amplified in multiple 100 μl reactions using Herculase II Fusion DNA Polymerase (Agilent Technologies) according to the manufacturer’s specifications with 0.3 μM of the IS5/IS6 primers. After another MinElute purification, the product was quantified with the Agilent 2100 Bioanalyzer DNA 1000 chip. An equimolar pool of all libraries was then prepared for shotgun sequencing on the Illumina HiSeq 4000 platform using an SR75 sequencing kit. Libraries were further amplified with IS5/IS6 primers to reach a concentration of 200–400 ng μl⁻¹ as measured on a NanoDrop spectrophotometer (Thermo Fisher Scientific). At the Max Planck Institute for Evolutionary Anthropology, single-stranded DNA libraries were prepared without UDG treatment using the Bravo NGS Workstation B (Agilent Technologies), exactly as described in Gansauge et al.⁶⁸. Briefly, after an initial denaturation step, adaptor oligonucleotides were ligated to the 3′ ends of the single-stranded ancient DNA fragments using T4 DNA ligase. Using streptavidin-covered magnetic beads, the ligation products and excess adaptors were immobilized, a primer hybridized to the adaptor and a copy of the ancient DNA molecule generated using the Klenow fragment of Escherichia coli DNA polymerase I. Excess primer was then removed in a washing step at increased temperature, which prevented the formation of adaptor dimers. Blunt-end ligation with T4 DNA ligase was used to ligate a second, double-stranded adaptor. Finally, the library strand was released from the beads by heat denaturation. Libraries were quantified through two probe-based quantitative PCR assays and amplified and indexed via PCR⁶⁸.

Targeted enrichment and high-throughput sequencing

MtDNA capture⁶⁹ was performed on screened libraries which, after shotgun sequencing, showed the presence of ancient DNA, highlighted by the typical C to T and G to A substitution pattern towards the 5′ and 3′ molecule ends, respectively. Furthermore, samples with a percentage of human DNA in shotgun data around 0.1% or greater were enriched for a set of 1,237,207 targeted SNPs across the human genome (1,240 K capture)³³. The enriched DNA product was sequenced on an Illumina HiSeq 4000 instrument with 75 cycles single reads or 50 cycles paired-end reads according to the manufacturer’s protocol (at the Max Planck Institute for the Science of Human History) or on a HiSeq 2500 with 75 paired-end reads (at the Max Planck Institute for Evolutionary Anthropology). The output was demultiplexed using in-house scripts requiring either a perfect match of the expected and observed index sequences (Max Planck Institute for Evolutionary Anthropology samples) or allowing a single mismatch between the expected and observed index sequences (Max Planck Institute for the Science of Human History samples).

Genomic data processing

Preprocessing of the sequenced reads was performed using EAGER v.1.92.55 (ref. ⁷⁰). The resulting reads were clipped to remove residual adaptor sequences using Clip&Merge v.1.7.6⁷¹ and AdapterRemoval v.2 (ref. ⁷²). Clipped sequences were then mapped against the human reference genome hg19 using the Burrows–Wheeler Aligner v.0.7.12 (ref. ⁷³), disabling seeding (-l 16,500) and allowing for 2 mismatches (--n 0.01). Duplicates were removed with DeDup v.0.12.2 (ref. ⁷⁰). Additionally, a mapping quality filter of 30 was applied using SAMtools v.1.3 (ref. ⁷⁴). Different sequencing runs and libraries from the same individuals were merged and duplicates were removed and sorted again using SAMtools v.1.3 (ref. ⁷⁴). Genotype calling was performed separately for trimmed and untrimmed reads using pileupCaller v.8.6.5 (https://github.com/stschiff/sequenceTools), a tool that randomly draws one allele at each of the targeted SNPs covered at least once. For the UDG-treated libraries produced at the Max Planck Institute for the Science of Human History, two bases were trimmed on both ends of the reads. For libraries produced at the Max Planck Institute for Evolutionary Anthropology (without UDG treatment), the damage plots were inspected to determine the number of bases to trim from each read. For all libraries, the residual damage extended 8 base pairs into the read, after which it was below 0.05%, and trimmed accordingly. We combined the genotypes keeping all transversions from the untrimmed genotypes and transitions only from the trimmed genotypes to eliminate problematic, damage-related transitions overrepresented at the ends of reads. The generated pseudo-haploid calls were merged with previously published ancient data^{38,39,42,48,75,76,77,78,79}, present-day genomes from the Simons Genome Diversity Project⁸⁰, and worldwide populations genotyped on the Affymetrix Human Origins array^{38,41,50,75,76,81,82,83,84,85,86}. For the PCA and DyStruct analyses, we additionally merged the data with populations from ISEA genotyped on the Affymetrix 6.0 (refs. ^46,87) (dataset 1) or Affymetrix Axiom Genome-Wide Human Array³⁰ (dataset 2), filtering out SNPs with a missing rate higher than 10%. Related individuals were excluded if they exhibited a proportion of identity by descent (IBD) higher than 0.3, computed in PLINK v.1.9 (ref. ⁸⁸) as P(IBD = 2) + 0.5 × P(IBD = 1). We additionally pruned datasets 1 and 2 for linkage disequilibrium with PLINK v.1.9, removing SNPs with r² > 0.4 in 200 kilobase windows, shifted at 25-SNP intervals. After pruning, a total of 89,597 and 65,880 SNPs remained in datasets 1 and 2, respectively.

Y-chromosome haplogroups were identified by calling the SNPs covered on the Y chromosome of all male individuals using the pileup from the Rsamtools v1.3.⁸⁹ package and by recording the number and form of derived and ancestral SNPs overlapping with the International Society of Genetic Genealogy SNP index v.14.07 (https://github.com/Integrative-Transcriptomics/DamageProfiler)⁹⁰.

Authentication of ancient DNA

The typical features of ancient DNA were inspected with DamageProfiler v.0.3.1 (http://bintray.com/apeltzer/EAGER/DamageProfiler)⁷⁰. Sex determination was performed by comparing the coverage on the targeted X-chromosome SNPs to the coverage on the Y-chromosome SNPs, both normalized by the coverage on the autosomal SNPs⁷¹ (Supplementary Table 2). For male individuals, ANGSD v.0.919 was run to measure the rate of heterozygosity of polymorphic sites on the X chromosome after accounting for sequencing errors in the flanking regions⁹¹. This provides an estimate of nuclear DNA contamination in males since they are expected to have only one allele at each site. For both male and female individuals, mtDNA-captured data were used to jointly reconstruct the mtDNA consensus sequence and estimate contamination levels with contamMix v.1.0-10⁶⁹ (Supplementary Table 2) using an in-house pipeline (https://github.com/alexhbnr-mitoBench-ancientMT³⁹).

Statistical analyses

PCAs were carried out using smartpca v.10210 (ref. ⁹²) based on present-day Asian and Oceanian populations from datasets 1 and 2. Ancient individuals were projected onto the calculated components using the options lsqproject: YES and numoutlieriter: 0. We used DyStruct v.1.1.0 (ref. ⁴⁰) to infer shared genetic ancestry taking into account archaeological age. The uncalibrated radiocarbon dates of each ancient sample were converted to generations, assuming a generation time of 29 years⁹³. For each dataset (1 and 2), we performed 25 independent runs, using 2–15 ancestral populations (K). To compare runs for different values of K, a subset of loci (5%) was held out during training and the conditional log-likelihood was subsequently evaluated (Supplementary Fig. 1a,b). Within the best K, the run with the highest objective function was selected (Supplementary Fig. 1c,f).

To formally test population relationships we used the f₄-statistics implemented in the ADMIXTOOLS software v.4.1⁴¹. This analysis was carried out using the admixr v.0.9.1 R package⁹⁴. To evaluate differences in f₄-statistics for individuals from NTT and the North Moluccas, we built a Bayesian linear regression model:

$$B_{{{{\mathrm{OBS}}}},i} \sim {{{\mathrm{Normal}}}}\left( {B_{{{{\mathrm{TRUE}}}},i},B_{{{{\mathrm{SE}}}},i}} \right)$$

$$B_{{{{\mathrm{TRUE}}}},i} \sim {{{\mathrm{Normal}}}}\space (\mu _i,\sigma )$$

$$\mu _i = \alpha _{{{{\mathrm{REGION}}}}[i]} + \beta _{{{{\mathrm{REGION}}}}[i]}A_{{{{\mathrm{TRUE}}}},i}$$

$$A_{{{{\mathrm{OBS}}}},i} \sim {{{\mathrm{Normal}}}}\left( {A_{{{{\mathrm{TRUE}}}},i},A_{{{{\mathrm{SE}}}},i}} \right)$$

$$A_{{{{\mathrm{TRUE}}}},i} \sim {{{\mathrm{Normal}}}}\left( {0,1} \right)$$

$$\alpha _{{{{\mathrm{REGION}}}}[i]} \sim {{{\mathrm{Normal}}}}\left( {0,1} \right)$$

$$\beta _{{{{\mathrm{REGION}}}}\left[ i \right]} \sim {{{\mathrm{Normal}}}}\left( {0,10} \right)$$

$$\sigma \sim {{{\mathrm{Exponential}}}}\left( 1 \right)$$

The model was stratified by region (NTT versus North Moluccas) and takes into account measurement error in both A_OBS and B_OBS variables (corresponding to the f₄-statistics displayed on the x and y axes of the biplots, respectively)⁹⁵. The parameters μ and σ represent the mean and s.d. A_OBS and B_TRUE correspond to the unobserved true values of A and B. Both variables were standardized. The posterior distribution was obtained via Hamiltonian Monte Carlo approximation as implemented in the R package rethinking v2.21 (https://github.com/rmcelreath/rethinking), using 6 chains of 4,000 samples. We used a non-centred parameterization of the error model to aid in posterior exploration. Convergence of the chains was assessed by inspection of the trace plots, Rhat and the effective number of samples. All of these criteria indicate reliable sampling. All Rhat values were equal to 1.00 and all effective number of samples values were above 500. This procedure was applied to each pair of f₄-statistics separately. The code is available at https://github.com/sroliveiraa/ancient_Wallacea_f4_differences.

We used qpWave v.410 (ref. ⁹⁶) and qpAdm v.650 (ref. ⁴¹) to test two- and three-wave admixture models, using a ‘rotating’ strategy⁹⁷. A reference set of populations was chosen to represent diverse human groups and include potential source populations for the ancient Wallacean individuals: Mbuti, English, Brahui, Onge, Yakut, Oroqen, Lahu, Miao, Dai, Khomu, Denisova, Papuan, Kankanaey and Mlabri. We rejected models if their P values were lower than 0.01, if there were negative admixture proportions or if the s.e. was larger than the corresponding admixture proportion. When more than one model was accepted (Supplementary Table 7), the estimated admixture proportions under the model with the highest P value was preferred and used in subsequent analyses (Supplementary Fig. 10 and Extended Data Fig. 5) because the results better matched the DyStruct ancestry proportions and the ability to reject models might be affected by several factors (for example, the ancestry proportion, the quality of the target sample, the combination of ancient and present-day samples in the same analysis). The correlation between ancestry proportions inferred with qpAdm and DyStruct was assessed with a Mantel test with 10,000 permutations of the distance matrix to determine significance.

The relative order of the mixing of different ancestries was inferred using the AHG approach³⁹. Assuming that an admixed population with two ancestry components (A and B) later receives a third component (C) via admixture, then ancestry components A and B from the first admixture event will covary with the component that comes later (C) but the ratio of A and B throughout the population will be independent from C. Thus, the covariance of the recent ancestry C with the ratio of the two older ancestries A and B should be zero. The AHG approach thus involves estimating the covariance of the frequencies of A/B with C, A/C with B and B/C with A across all individuals in the population; the covariance closest to 0 then indicates the order of admixture events. We used the DyStruct ancestry proportions for each ancient and present-day Wallacean individual included in dataset 1 and 2 (Supplementary Tables 8 and 9) to calculate the covariances between ancestry components as indicated in Supplementary Table 10. The sequence of admixture events was then determined by the configuration that produced the smallest absolute value of the covariance estimate. The time since admixture was estimated based on the decay of ancestry covariance using the software DATES v.753 (ref. ⁴⁴) with the following parameters: binsize = 0.001; maxdis = 1.0; jackknife, YES; qbin = 10; runfit, YES; afffit, YES; lovalfit = 0.45; mincount = 1. In our main analysis, to maximize the number of SNPs included in the analysis and have equal sample sizes, we used as sources 16 Papuan individuals and 16 Asian-related individuals (2 Amis, 1 Atayal, 2 Kankanaey, 5 Dai, 2 Dusun, 2 She, 2 Kinh) with data covering the approximate 1,240,000 SNPs captured in the ancient samples. Two additional admixture tests were conducted using the same Papuan source and either Austronesians (2 Amis, 1 Atayal, 2 Kankanaey) or MSEA (2 Cambodian, 2 Kinh, 2 Thai) as the Asian-related source.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All newly reported ancient DNA data, including nuclear DNA and mtDNA alignment sequences, are archived in the European Nucleotide Archive (accession no. PRJEB48109).

References

Dickerson, R. E. Distribution of life in the Philippines. Monogr. Bur. Sci. (Manila) 21, 1–322 (1928).
Google Scholar
O’Connell, J. F. et al. When did Homo sapiens first reach Southeast Asia and Sahul? Proc. Natl Acad. Sci. USA 115, 8482–8490 (2018).
Article PubMed PubMed Central Google Scholar
Allen, J. & O’Connell, J. Both half right: updating the evidence for dating first human arrivals in Sahul. Aust. Archaeol. 79, 86–108 (2014).
Article Google Scholar
Veth, P. Breaking through the radiocarbon barrier: Madjedbebe and the new chronology for Aboriginal occupation of Australia. Aust. Archaeol. 83, 165–167 (2017).
Article Google Scholar
Allen, J. & O’Connell, J. F. A different paradigm for the initial colonisation of Sahul. Archaeol. Ocean. 55, 1–14 (2020).
Article Google Scholar
Clarkson, C. et al. Human occupation of northern Australia by 65,000 years ago. Nature 547, 306–310 (2017).
Article CAS PubMed Google Scholar
Hawkins, S. et al. Oldest human occupation of Wallacea at Laili Cave, Timor-Leste, shows broad-spectrum foraging responses to late Pleistocene environments. Quat. Sci. Rev. 171, 58–72 (2017).
Article Google Scholar
Shipton, C., O’Connor, S., Reepmeyer, C., Kealy, S. & Jankowski, N. Shell Adzes, exotic obsidian, and inter-island voyaging in the early and middle Holocene of Wallacea. J. Isl. Coast. Archaeol. 15, 525–546 (2020).
Article Google Scholar
Sutikna, T. et al. The spatio-temporal distribution of archaeological and faunal finds at Liang Bua (Flores, Indonesia) in light of the revised chronology for Homo floresiensis. J. Hum. Evol. 124, 52–74 (2018).
Article PubMed Google Scholar
Brumm, A. et al. Oldest cave art found in Sulawesi. Sci. Adv. 7, eabd4648 (2021).
Article PubMed PubMed Central Google Scholar
Bellwood, P. The Spice Islands in Prehistory: Archaeology in the Northern Moluccas, Indonesia (ANU Press, 2019).
Bellwood, P. & Dizon, E. The Batanes archaeological project and the “Out of Taiwan” hypothesis for Austronesian dispersal. J. Austronesian Stud. 1, 1–31 (2005).
Google Scholar
Gray, R. D., Drummond, A. J. & Greenhill, S. J. Language phylogenies reveal expansion pulses and pauses in Pacific settlement. Science 323, 479–483 (2009).
Article CAS PubMed Google Scholar
Ko, A. M.-S. et al. Early Austronesians: into and out of Taiwan. Am. J. Hum. Genet. 94, 426–436 (2014).
Article CAS PubMed PubMed Central Google Scholar
Anggraeni, S. T., Bellwood, P. & Piper, P. Neolithic foundations in the Karama valley, West Sulawesi, Indonesia. Antiquity 88, 740–756 (2014).
Article Google Scholar
O’Connor, S. Rethinking the Neolithic in island Southeast Asia, with particular reference to the archaeology of Timor-Leste and Sulawesi. Archipel 90, 15–47 (2015).
Article Google Scholar
Galipaud, J.-C. et al. The Pain Haka burial ground on Flores: Indonesian evidence for a shared Neolithic belief system in Southeast Asia. Antiquity 90, 1505–1521 (2016).
Article Google Scholar
Ono, R., Oktaviana A. A. & Sriwigati, A. N. in The Archaeology of Island Colonization: Global Approaches to Initial Human Settlement (eds Napolitano, M. F. et al.) 293–326 (Univ. Press of Florida, 2021).
Bellwood, P., Waluyo, A., Nitihaminoto, G. & Irwin, G. Archaeological research in the Northern Moluccas; interim results, 1991 field season. Bull. Indo-Pacific Prehistory Assoc. 13, 20–33 (1993).
Google Scholar
Bellwood, P. S. First Islanders: Prehistory and Human Migration in Island Southeast Asia (Wiley Blackwell, 2017).
Ono, R. et al. Development of regional maritime networks during the Early Metal Age in Northern Maluku Islands: a view from excavated glass ornaments and pottery variation. J. Isl. Coast. Archaeol. 13, 90–108 (2018).
Article Google Scholar
Ono, R. et al. The development of pottery making traditions and maritime networks during the Early Metal Age in Northern Maluku Islands. AMERTA 35, 109–122 (2017).
Article Google Scholar
Lape, P. V. Political dynamics and religious change in the late pre-colonial Banda Islands, Eastern Indonesia. World Archaeol. 32, 138–155 (2000).
Article Google Scholar
Solheim, W. G. The University of Hawaiʻi archaeological programme in Eastern Indonesia. Southeast Asian Archaeol. 1996, 61–73 (1998).
Google Scholar
Blust, R. The Austronesian homeland and dispersal. Annu. Rev. Linguist 5, 417–434 (2019).
Article Google Scholar
Holton, G. & Klamer, M. in The Languages and Linguistics of the New Guinea Area (ed. Palmer, B.) 569–640 (De Gruyter Mouton, 2017).
Klamer, M. The dispersal of Austronesian languages in Island South East Asia: current findings and debates. Lang. Linguist. Compass 13, e12325 (2019).
Article Google Scholar
Xu, S. et al. Genetic dating indicates that the Asian–Papuan admixture through Eastern Indonesia corresponds to the Austronesian expansion. Proc. Natl Acad. Sci. USA 109, 4574–4579 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lipson, M. et al. Reconstructing Austronesian population history in Island Southeast Asia. Nat. Commun. 5, 4689 (2014).
Hudjashov, G. et al. Complex patterns of admixture across the Indonesian archipelago. Mol. Biol. Evol. 34, 2439–2452 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pugach, I. et al. The gateway from near into remote Oceania: new insights from genome-wide data. Mol. Biol. Evol. 35, 871–886 (2018).
Article CAS PubMed PubMed Central Google Scholar
Denham, T. & Donohue, M. Lack of correspondence between Asian-Papuan genetic admixture and Austronesian language dispersal in eastern Indonesia. Proc. Natl Acad. Sci. USA 109, E2577 (2012).
Article PubMed PubMed Central Google Scholar
Fu, Q. et al. An early modern human from Romania with a recent Neanderthal ancestor. Nature 524, 216–219 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hudjashov, G. et al. Revealing the prehistoric settlement of Australia by Y chromosome and mtDNA analysis. Proc. Natl Acad. Sci. USA 104, 8726–8730 (2007).
Article CAS PubMed PubMed Central Google Scholar
Soares, P. et al. Ancient voyaging and Polynesian origins. Am. J. Hum. Genet. 88, 239–247 (2011).
Article CAS PubMed PubMed Central Google Scholar
Mona, S. et al. Genetic admixture history of Eastern Indonesia as revealed by Y-chromosome and mitochondrial DNA analysis. Mol. Biol. Evol. 26, 1865–1877 (2009).
Article CAS PubMed Google Scholar
Tumonggor, M. K. et al. The Indonesian archipelago: an ancient genetic highway linking Asia and the Pacific. J. Hum. Genet. 58, 165–173 (2013).
Article CAS PubMed Google Scholar
Skoglund, P. et al. Genomic insights into the peopling of the Southwest Pacific. Nature 538, 510–513 (2016).
Article PubMed PubMed Central Google Scholar
Pugach, I. Ancient DNA from Guam and the peopling of the Pacific. Proc. Natl Acad. Sci. USA 118, e2022112118 (2021).
Article CAS PubMed Google Scholar
Joseph, T. A. & Pe’er, I. Inference of population structure from time-series genotype data. Am. J. Hum. Genet. 105, 317–333 (2019).
Article CAS PubMed PubMed Central Google Scholar
Patterson, N. et al. Ancient admixture in human history. Genetics 192, 1065–1093 (2012).
Article PubMed PubMed Central Google Scholar
Carlhoff, S. et al. Genome of a middle Holocene hunter-gatherer from Wallacea. Nature 596, 543–547 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pugach, I. et al. The complex admixture history and recent southern origins of Siberian populations. Mol. Biol. Evol. 33, 1777–1795 (2016).
Article CAS PubMed PubMed Central Google Scholar
Narasimhan, V. M. et al. The formation of human populations in South and Central Asia. Science 365, eaat7487 (2019).
Article CAS PubMed PubMed Central Google Scholar
Purnomo, G. A. et al. Mitogenomes reveal two major influxes of Papuan ancestry across Wallacea following the Last Glacial Maximum and Austronesian contact. Genes 12, 965 (2021).
Article CAS PubMed PubMed Central Google Scholar
Reich, D. et al. Denisova admixture and the first modern human dispersals into Southeast Asia and Oceania. Am. J. Hum. Genet. 89, 516–528 (2011).
Article CAS PubMed PubMed Central Google Scholar
Louys, J. et al. Expanding population edge craniometrics and genetics provide insights into dispersal of commensal rats through Nusa Tenggara, Indonesia. Rec. Aust. Mus. 72, 287–303 (2020).
Article Google Scholar
Lipson, M. et al. Ancient genomes document multiple waves of migration in Southeast Asian prehistory. Science 361, 92–95 (2018).
Article CAS PubMed PubMed Central Google Scholar
Oota, H. et al. Recent origin and cultural reversion of a hunter–gatherer group. PLoS Biol. 3, e71 (2005).
Article PubMed PubMed Central Google Scholar
Liu, D. et al. Extensive ethnolinguistic diversity in Vietnam reflects multiple sources of genetic diversity. Mol. Biol. Evol. 37, 2503–2519 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kutanan, W. et al. Reconstructing the human genetic history of mainland Southeast Asia: insights from genome-wide data from Thailand and Laos. Mol. Biol. Evol. 38, 3459–3477 (2021).
Article CAS PubMed PubMed Central Google Scholar
Calò, A. Trails of Bronze Drums Across Early Southeast Asia: Exchange Routes and Connected Cultural Spheres (Institute of Southeast Asian Studies, 2013).
Montenegro, Á., Callaghan, R. T. & Fitzpatrick, S. M. Using seafaring simulations and shortest-hop trajectories to model the prehistoric colonization of Remote Oceania. Proc. Natl Acad. Sci. USA 113, 12685–12690 (2016).
Article CAS PubMed PubMed Central Google Scholar
Fitzpatrick, S. M. & Callaghan, R. T. Estimating trajectories of colonisation to the Mariana Islands, western Pacific. Antiquity 87, 840–853 (2013).
Article Google Scholar
Vilar, M. G. et al. The origins and genetic distinctiveness of the Chamorros of the Marianas Islands: an mtDNA perspective. Am. J. Hum. Biol. 25, 116–122 (2013).
Article PubMed Google Scholar
Blust, R. Eastern Malayo-Polynesian: a subgrouping argument. In Proc. Second International Conference on Austronesian Linguistics 1 (eds Wurm, S. A. & Carrington, L.) 181–234 (Department of Linguistics, Research School of Pacific Studies, Australian National University, 1978).
Kamholz, D. C. Austronesians in Papua: Diversification and Change in South Halmahera–West New Guinea. PhD thesis, Univ. of California, Berkeley (2014).
Ono, R. et al. Early Metal Age interactions in Island Southeast Asia and Oceania: jar burials from Aru Manara, northern Moluccas. Antiquity 92, 1023–1039 (2018).
Article Google Scholar
Xu, S. & Stoneking, M. Reply to Denham and Donohue: Asian-Papuan genetic admixture is in excellent agreement with Austronesian dispersal in eastern Indonesia. Proc. Natl Acad. Sci. USA 109, E2578 (2012).
PubMed Central Google Scholar
Brucato, N. et al. Papua New Guinean genomes reveal the complex settlement of north Sahul. Mol. Biol. Evol. 38, 5107–5121 (2021).
Article PubMed PubMed Central Google Scholar
Himmel, M. Etablierung und Evaluierung einer CT-Scan-informierten minimalinvasiven Methode zur Maximierung der DNA-Gewinnung aus prähistorischem Skelettmaterial (Establishment and Evaluation of a CT Scan-informed Minimally Invasive Method to Maximize DNA Recovery from Prehistoric Skeletal Material). BSc thesis, Ernst-Abbe-Hochschule Jena (2017).
Pinhasi, R. et al. Optimal ancient DNA yields from the inner ear part of the human petrous bone. PLoS ONE 10, e0129102 (2015).
Article PubMed PubMed Central Google Scholar
Dabney, J. et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl Acad. Sci. USA 110, 15758–15763 (2013).
Article CAS PubMed PubMed Central Google Scholar
Rohland, N., Glocke, I., Aximu-Petri, A. & Meyer, M. Extraction of highly degraded DNA from ancient bones, teeth and sediments for high-throughput sequencing. Nat. Protoc. 13, 2447–2461 (2018).
Article CAS PubMed Google Scholar
Rohland, N., Harney, E., Mallick, S., Nordenfelt, S. & Reich, D. Partial uracil-DNA-glycosylase treatment for screening of ancient DNA. Philos. Trans. R. Soc. Lond. B Biol. Sci. 370, 20130624 (2015).
Article PubMed PubMed Central Google Scholar
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 2010, pdb.prot5448 (2010).
Article PubMed Google Scholar
Kircher, M., Sawyer, S. & Meyer, M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, e3 (2012).
Article CAS PubMed Google Scholar
Gansauge, M.-T., Aximu-Petri, A., Nagel, S. & Meyer, M. Manual and automated preparation of single-stranded DNA libraries for the sequencing of DNA from ancient biological remains and other sources of highly degraded DNA. Nat. Protoc. 15, 2279–2300 (2020).
Article CAS PubMed Google Scholar
Fu, Q. et al. A revised timescale for human evolution based on ancient mitochondrial genomes. Curr. Biol. 23, 553–559 (2013).
Article CAS PubMed PubMed Central Google Scholar
Peltzer, A. et al. EAGER: efficient ancient genome reconstruction. Genome Biol. 17, 60 (2016).
Article PubMed PubMed Central Google Scholar
Fu, Q. et al. The genetic history of Ice Age Europe. Nature 534, 200–205 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schubert, M., Lindgreen, S. & Orlando, L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res. Notes 9, 88 (2016).
Article PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Lipson, M. et al. Population turnover in Remote Oceania shortly after initial settlement. Curr. Biol. 28, 1157–1165.e7 (2018).
Article CAS PubMed PubMed Central Google Scholar
McColl, H. et al. The prehistoric peopling of Southeast Asia. Science 361, 88–92 (2018).
Article CAS PubMed Google Scholar
Posth, C. et al. Language continuity despite population replacement in Remote Oceania. Nat. Ecol. Evol. 2, 731–740 (2018).
Article PubMed PubMed Central Google Scholar
Yang, M. A. et al. Ancient DNA indicates human population shifts and admixture in northern and southern China. Science 369, 282–288 (2020).
Article CAS PubMed Google Scholar
Yang, M. A. et al. 40,000-year-old individual from Asia provides insight into early population structure in Eurasia. Curr. Biol. 27, 3202–3208.e9 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mallick, S. et al. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations. Nature 538, 201–206 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409–413 (2014).
Article CAS PubMed PubMed Central Google Scholar
Skoglund, P. et al. Genetic evidence for two founding populations of the Americas. Nature 525, 104–108 (2015).
Article CAS PubMed PubMed Central Google Scholar
Meyer, M. et al. A high-coverage genome sequence from an archaic Denisovan individual. Science 338, 222–226 (2012).
Article CAS PubMed PubMed Central Google Scholar
Nakatsuka, N. et al. The promise of discovering population-specific disease-associated genes in South Asia. Nat. Genet. 49, 1403–1407 (2017).
Article CAS PubMed PubMed Central Google Scholar
Prüfer, K. et al. The complete genome sequence of a Neanderthal from the Altai Mountains. Nature 505, 43–49 (2014).
Article PubMed Google Scholar
Qin, P. & Stoneking, M. Denisovan ancestry in East Eurasian and Native American populations. Mol. Biol. Evol. 32, 2665–2674 (2015).
Article CAS PubMed Google Scholar
Jinam, T. A. et al. Discerning the origins of the Negritos, First Sundaland People: deep divergence and archaic admixture. Genome Biol. Evol. 9, 2013–2022 (2017).
Article CAS PubMed PubMed Central Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Morgan, M., Pagès, H., Obenchain, V. & Hayden, N. Rsamtools: Binary alignment (BAM), FASTA, variant call (BCF), and tabix file import. R package version 1.34.1 https://bioconductor.org/packages/release/bioc/html/Rsamtools.html (2019).
Rohrlach, A. B. et al. Using Y-chromosome capture enrichment to resolve haplogroup H2 shows new evidence for a two-Path Neolithic expansion to Western Europe. Sci. Rep. 11, 15005 (2021).
Article CAS PubMed PubMed Central Google Scholar
Korneliussen, T. S., Albrechtsen, A., & Nielsen, R.ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics 15, 356 (2014).
Article PubMed PubMed Central Google Scholar
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Article PubMed PubMed Central Google Scholar
Fenner, J. N. Cross‐cultural estimation of the human generation interval for use in genetics‐based population divergence studies. Am. J. Phys. Anthropol. 128, 415–423 (2005).
Article PubMed Google Scholar
Petr, M., Vernot, B. & Kelso, J. admixr—R package for reproducible analyses using ADMIXTOOLS. Bioinformatics 35, 3194–3195 (2019).
Article CAS PubMed PubMed Central Google Scholar
McElreath, R. Statistical Rethinking: a Bayesian Course with Examples in R and Stan 495–497 (CRC Press, Francis & Taylor, 2020).
Reich, D. et al. Reconstructing Native American population history. Nature 488, 370–374 (2012).
Article CAS PubMed PubMed Central Google Scholar
Harney, É., Patterson, N., Reich, D. & Wakeley, J. Assessing the performance of qpAdm: a statistical tool for studying population admixture. Genetics 217, iyaa045 (2021).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank R. Radzeviciute, A. Wissgott, the Max Planck Institute for Evolutionary Anthropology lab technicians and the Max Planck Institute for Evolutionary Anthropology Sequencing and Bioinformatics Groups for their excellent support, C. Jeong for valuable comments, L. Iasi for helpful discussions on admixture dating and R. McElreath for help with statistics. This research was supported by the Max Planck Society. K.N., S.C. and A.P. were supported by the European Research Council Starting Grant ‘Waves’ (no. ERC758967). The research conducted on the samples from Liang Toge, Liang Bua and Komodo was part of a New Zealand Fast-Start Marsden Grant (no. 18-UOO-135). The research conducted on the Jareng Bori site was part of a joint project between the Australian National University Universitas Gadjah Maja funded by an ARC Laureate Project no. FL120100156.

Funding

Open access funding provided by Max Planck Society

Author information

These authors contributed equally: Sandra Oliveira, Kathrin Nägele, Selina Carlhoff.
These authors jointly supervised this work: Johannes Krause, Cosimo Posth, Mark Stoneking.

Authors and Affiliations

Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Sandra Oliveira, Irina Pugach, Alexander Hübner, Matthias Meyer & Mark Stoneking
Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Kathrin Nägele, Selina Carlhoff, Alexander Hübner, Johannes Krause & Cosimo Posth
Department of Anthropology, Faculty of Social Sciences and Political Sciences, Universitay Airlangga, Surabaya, Indonesia
Toetik Koesbardiati, Delta Bayu Murti & Rizky Sugianto Putri
The National Research Center for Archaeology, Jakarta, Indonesia
Adhi Agus Oktaviana
Kagoshima Women’s College, Kagoshima, Japan
Masami Takenaka
Okinawa Prefectural Archaeological Center, Nishihara, Japan
Chiaki Katagiri
Jurusan Arkeologi, Fakultas Ilmu Budaya, Universitas Gadjah Mada, Yogyakarta, Indonesia
Mahirta
Radiocarbon Dating Laboratory, University of Waikato, Hamilton, New Zealand
Fiona Petchey
ARC Centre of Excellence for Australian Biodiversity and Heritage, College of Arts, Society and Education, James Cook University, Cairns, Queensland, Australia
Fiona Petchey
Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
Thomas Higham
Oxford Radiocarbon Accelerator Unit, Research Laboratory for Archaeology and the History of Art, University of Oxford, Oxford, UK
Thomas Higham
Department of Anthropology, University of Otago, Dunedin, New Zealand
Charles F. W. Higham
School of Culture, History and Language, College of Asia and the Pacific, Australian National University, Acton, Australian Capital Territory, Australia
Sue O’Connor & Stuart Hawkins
Australian Research Council Centre of Excellence for Australian Biodiversity and Heritage, Australian National University, Canberra, Australian Capital Territory, Australia
Sue O’Connor & Stuart Hawkins
Department of Anatomy, School of Medical Sciences, University of Otago, Dunedin, New Zealand
Rebecca Kinaston
Griffith Centre for Social and Cultural Research, Griffith University, Southport, Queensland, Australia
Rebecca Kinaston
BioArch South, Waitati, New Zealand
Rebecca Kinaston
School of Archaeology and Anthropology, College of Arts and Social Sciences, Australian National University, Canberra, Australian Capital Territory, Australia
Peter Bellwood
Center for Cultural Resource Studies, National Museum of Ethnology, Osaka, Japan
Rintaro Ono
Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Adam Powell
Institute for Archaeological Sciences, Archaeo- and Palaeogenetics, University of Tübingen, Tübingen, Germany
Cosimo Posth
Senckenberg Centre for Human Evolution and Palaeoenvironment, University of Tübingen, Tübingen, Germany
Cosimo Posth
Université Lyon 1, Centre National de la Recherche Scientifique, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
Mark Stoneking

Authors

Sandra Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Kathrin Nägele
View author publications
You can also search for this author in PubMed Google Scholar
Selina Carlhoff
View author publications
You can also search for this author in PubMed Google Scholar
Irina Pugach
View author publications
You can also search for this author in PubMed Google Scholar
Toetik Koesbardiati
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Hübner
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Adhi Agus Oktaviana
View author publications
You can also search for this author in PubMed Google Scholar
Masami Takenaka
View author publications
You can also search for this author in PubMed Google Scholar
Chiaki Katagiri
View author publications
You can also search for this author in PubMed Google Scholar
Delta Bayu Murti
View author publications
You can also search for this author in PubMed Google Scholar
Rizky Sugianto Putri
View author publications
You can also search for this author in PubMed Google Scholar
Mahirta
View author publications
You can also search for this author in PubMed Google Scholar
Fiona Petchey
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Higham
View author publications
You can also search for this author in PubMed Google Scholar
Charles F. W. Higham
View author publications
You can also search for this author in PubMed Google Scholar
Sue O’Connor
View author publications
You can also search for this author in PubMed Google Scholar
Stuart Hawkins
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Kinaston
View author publications
You can also search for this author in PubMed Google Scholar
Peter Bellwood
View author publications
You can also search for this author in PubMed Google Scholar
Rintaro Ono
View author publications
You can also search for this author in PubMed Google Scholar
Adam Powell
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Krause
View author publications
You can also search for this author in PubMed Google Scholar
Cosimo Posth
View author publications
You can also search for this author in PubMed Google Scholar
Mark Stoneking
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.O., P.B., R.K., T.K. and S.H. contributed archaeological material, collected with the critical support of A.A.O., M.T., C.K., D.B.M., R.S.P., M., A.P. and S.O.C. T.H., C.F.W.H., R.K., F.P. and P.B. contributed the radiocarbon data. K.N., S.C. and M.M. conducted the ancient DNA laboratory work. S.O., K.N., S.C., I.P. and A.H. performed the genetic analysis. S.O. and K.N. wrote the manuscript with input from all authors. M.S., C.P. and J.K. conceived and coordinated the study.

Corresponding authors

Correspondence to Sandra Oliveira, Johannes Krause, Cosimo Posth or Mark Stoneking.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks Lluis Quintana-Murci, Guy Jacobs and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 PCA of publicly available whole genome data merged with Human Origins genotype data and Affymetrix Axiom Genome-Wide Human genotype data (dataset 2).

Ancient individuals (shown with a black contour) are projected and their fill color matches the color of present-day samples from the same geographic area. The location of the ancient individuals newly presented in this study is shown in the map on the right panel. For our purposes East Nusa Tenggara is abbreviated to NTT.

Extended Data Fig. 2 PCA generated with a subset of present-day populations from dataset 1 (A) and dataset 2 (B) that emphasize differences between closely related Asian ancestries.

Ancient individuals (shown with a black/grey contour) are projected and their fill color matches the color of present-day samples from the same geographic area. For our purposes East Nusa Tenggara is abbreviated to NTT.

Extended Data Fig. 3 Representation of two Austronesian-related components.

The frequencies of the “yellow” and “mango” components identified in Supplementary Figure 1E were normalized to sum to 1.

Extended Data Fig. 4 Admixture dates estimated with different source groups.

The admixture dates estimated with a pool of Asian groups are shown in black, while the admixture dates estimated with a more specific set of Asian-related groups (SEA or Austronesians) are shown in dark red and yellow. Data are presented as point estimates ± 2 SE. The individuals age is shown by filled symbols with a black contour. The number of individuals included in each group is shown in parenthesis, next to the group label.

Extended Data Fig. 5 Papuan vs Denisova ancestry.

The x-axis presents the Papuan-related ancestry proportion ± 2 SE calculated with block jackknife in the qpAdm software. The Denisova ancestry is represented in the y-axis by an f₄-statistic of the form f₄(Mbuti, Denisova; French, test) ± 2 SE.

Supplementary information

Supplementary Information

Supplementary Figs. 1–11.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–12.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oliveira, S., Nägele, K., Carlhoff, S. et al. Ancient genomes from the last three millennia support multiple human dispersals into Wallacea. Nat Ecol Evol 6, 1024–1034 (2022). https://doi.org/10.1038/s41559-022-01775-2

Download citation

Received: 26 October 2021
Accepted: 13 April 2022
Published: 09 June 2022
Issue Date: July 2022
DOI: https://doi.org/10.1038/s41559-022-01775-2

This article is cited by

The Allen Ancient DNA Resource (AADR) a curated compendium of ancient human genomes
- Swapan Mallick
- Adam Micco
- David Reich
Scientific Data (2024)
Reanalyzing the genetic history of Kra-Dai speakers from Thailand and new insights into their genetic interactions beyond Mainland Southeast Asia
- Piya Changmai
- Yutthaphong Phongbunchoo
- Pavel Flegontov
Scientific Reports (2023)