Introduction

It is widely accepted that anatomically modern humans (Homo sapiens) from Asia colonized the New World through Beringia during the ice age.1 Many researchers in archaeology, linguistics and evolutionary genetics have investigated the route, timing and size of the first migration event to America.2, 3 The single migration hypothesis was proposed based mainly on the analysis of uniparental genetic markers of current Native Americans.1, 4 Genome-wide single nucleotide polymorphism data for Native Americans suggest three streams of Asian gene flow, one major plus two additional minor gene flow events.5 These data also suggest a coastal route for the initial peopling of America, in agreement with the results of mitochondrial DNA (mtDNA) analyses.4, 5

The Mesoamerican region is adjacent to the Isthmus of Panama, a narrow strip of land that links North and South America. It is an important region for understanding the demography of Native Americans because the ancestors of South Americans presumably passed through this region during their migration from North America. In a previous study, we analyzed mtDNA haplotypes to estimate genetic diversity within the Mesoamerican region.6 Here, we report a more detailed demographic history of the Zapotec inferred from Bayesian Skyline Plots (BSP) to gain better insight into their genetic history. Of particular interest is whether a recent population decline corresponded with European contact, as some studies have proposed, or other factors such as climatic events.7, 8, 9, 10, 11

Materials and methods

We used 88 complete Zapotec mtDNA genome sequences reported in the study by Mizuno et al.6 The poly(C) tracts beginning at position 16 182 were not considered in this study. Information regarding data collection and data availability can be found in the study by Mizuno et al.

To obtain BSP, we used BEAST v1.7 suite.12 We applied the Tamura and Nei TN93 mutation model using gamma-distributed rates and a proportion of invariant sites. A clock model was used assuming a log-normal distribution. In the Skyline model, we assumed a piecewise-constant model with 10 groups. We set the Markov Chain Monte Carlo chain length to 1 × 108 with 1 × 107 burn-in steps to collect sufficient samples for parameter estimation. We performed the Markov Chain Monte Carlo simulation twice independently to confirm that the simulation converged to the same state. We also estimated the times to the most recent common ancestor (tMRCAs) by this Markov Chain Monte Carlo method for haplogroups A2, B2, C1 and D1 using Beauti.12 The time was scaled by the number of mutations for both BSP construction and the estimation of tMRCAs for each haplogroup. To estimate divergence time based on the number of mutations, we assumed a molecular clock of 1.665 × 10−8 and applied the correction method of Soares et al.13 To scale the female effective population size (Nef), we assumed a generation time of 25 years.14 The simulation results were analyzed using Tracer v1.5 (http://tree.bio.ed.ac.uk/software/tracer/). For estimating tMRCAs, we also used another common method, ρ statistics, to estimate the tMRCAs of the four haplogroups using median joining calculation implemented in Network (http://www.fluxus-engineering.com).15

Results

Zapotec mtDNA was classified as haplogroup A2, B2, C1 or D1, and these groups included 59, 16, 10 and 3 sequences, respectively. In total, the data included 183 segregating sites (S) and were characterized by a nucleotide diversity per site (π) of 0.002850. Tajima’s D was −1.464, and was significantly different from zero (P=0.036).16 This excess of low frequency polymorphisms indicates that the Zapotec experienced a past population expansion. However, these summary statistics enable only a rough estimation of the timing and magnitude of demographic events.

To obtain a more detailed demographic history of the Zapotec, we constructed a BSP using BEAST v1.7 suite.12 BSP showed that the ancestors of the Zapotec experienced a population expansion around 17 000–19 000 years ago (Figure 1). The Nef was approximately 1500 and reached approximately 20 000 after the expansion. This observation is highly concordant with previous result;4 if we assume that this time of population expansion corresponds to the peopling of America, we can infer that the ancestors of the Zapotec migrated from Beringia to the American continent at least 17 000–19 000 years ago. This time range predates the opening of the ice-free orridor between the Laurentide and Cordilleran ice sheets, estimated to have occurred roughly 12 000 years ago.4, 17 This suggests that the Zapotec ancestors migrated via a coastal route rather than the ice-free corridor.4, 5 The BSP estimated from the mtDNA coding regions and the full-length sequences showed the same results (data not shown).

Figure 1
figure 1

Bayesian Skyline Plot (BSP) of the Zapotec. Nef (y axis, log-scale) was plotted against time (x axis, years ago). A mutation rate of 1.664 × 10−8 per site per year and a generation time of 25 years were assumed.14, 15 Solid line shows the mean Nef from the posterior distribution. Dashed lines show the 95% posterior density intervals.

Table 1 shows the estimated tMRCA for the four haplogroups determined using BEAST.12 Haplogroup A2 showed the oldest tMRCA, while Haplogroup D1 showed the youngest tMRCA. The tMRCAs of Haplogroups B2 and C1 were intermediate. These results are not concordant with a previous study.4 Because the Zapotec people are considered to be the descendants of the first major migrants to the New World,5 overlap between the tMRCAs of the four haplogroups was expected.4 To verify our results, we estimated tMRCA for the four haplogroups using ρ statistics.15 Table 1 shows the estimates of ρ and the associated dates inferred from the statistic, using the molecular clock of Soares et al.13 The estimates based on the ρ statistics were consistent with the results obtained using BEAST.

Table 1 Estimated tMRCAs in years for Haplogroups A2, B2, C1 and D1 from the Zapotec using BEAST and ρ statistics

We carefully examined very recent demographic events estimated from BSP to gain insights into the population decline of Native Americans. The BSP of the Zapotec shows a population decline after an approximately 10-fold population expansion (Figure 1). Nef decreased to around 5000 from a peak of around 20 000. If we assume that this population decline was caused by the first contact with Europeans, the time of this decline should correspond to the time of the contact event that occurred 500 years ago.8 However, our BSP showed that the population decline started approximately 4000 years ago, which is 3500 years earlier than the arrival of the first Europeans to the New World.

Discussion

The tMRCAs estimated for the four haplogroups showed different result from previous study4 (Table 1). The sample size of Haplogroup D1 was only three; therefore, it is quite possible that the tMRCA of D1 was underestimated. The sample sizes of Haplogroups B2 and C1 were also small (16 and 10, respectively), which might also result in an underestimate of the mutations and therefore can lead to more recent coalescent time estimates.

Our BSP showed a population decline started approximately 4000 years ago (Figure 1). On the other hand, O’Fallon and Fehren-Schmitz8 detected a population decline of Native Americans that occurred 500 years ago, which coincides with the first European contact. Unlike our analysis, O’Fallon and Fehren-Schmitz used extended BSP. We performed extended BSP with our Zapotec data, applying the same parameters used in the study by O’Fallon and Fehren-Schmitz, and detected a population decline starting 5000–6000 years ago. When we applied the molecular clock of Soares et al.13 in this extended BSP analysis, the time of population decline was estimated at around 4000 years ago, which matches our BSP results (data not shown). Therefore, the difference between our results and those of O’Fallon and Fehren-Schmitz was not due to methodological differences. We assume that the difference may reflect the different samples used in the studies.

In the Mesoamerican area, agriculture began more than 5000 years ago18 and the growth of civilizations occurred afterward.19 Steady population expansion is generally expected after the introduction of agriculture, but such steady growth was not observed in Europe.20 Our study is the first to uncover unexpected demographic patterns in a population of Mesoamericans after the introduction of agriculture. During the Holocene, millennial or multi-century periodical cycles of climate change have been suggested.9, 10 The timing of these cycles is related to Bond event no. 3 of the Holocene Bond cycles.10 An abrupt climate change occurred in Mexico around 4000 years ago.11 We concluded that the Nef of the ancestors of the Zapotec population decreased around 4000 years ago, long before European contact, possibly owing to severe climate change.9, 10, 11