Demographic history and biologically relevant genetic variation of Native Mexicans inferred from whole-genome sequencing

Romero-Hidalgo, Sandra; Ochoa-Leyva, Adrián; Garcíarrubio, Alejandro; Acuña-Alonzo, Victor; Antúnez-Argüelles, Erika; Balcazar-Quintero, Martha; Barquera-Lozano, Rodrigo; Carnevale, Alessandra; Cornejo-Granados, Fernanda; Fernández-López, Juan Carlos; García-Herrera, Rodrigo; García-Ortíz, Humberto; Granados-Silvestre, Ángeles; Granados, Julio; Guerrero-Romero, Fernando; Hernández-Lemus, Enrique; León-Mimila, Paola; Macín-Pérez, Gastón; Martínez-Hernández, Angélica; Menjivar, Marta; Morett, Enrique; Orozco, Lorena; Ortíz-López, Guadalupe; Pérez-Villatoro, Fernando; Rivera-Morales, Javier; Riveros-McKay, Fernando; Villalobos-Comparán, Marisela; Villamil-Ramírez, Hugo; Villarreal-Molina, Teresa; Canizales-Quinteros, Samuel; Soberón, Xavier

doi:10.1038/s41467-017-01194-z

Download PDF

Article
Open access
Published: 18 October 2017

Demographic history and biologically relevant genetic variation of Native Mexicans inferred from whole-genome sequencing

Sandra Romero-Hidalgo ORCID: orcid.org/0000-0003-3077-7825¹^na1,
Adrián Ochoa-Leyva^1,2^na1,
Alejandro Garcíarrubio²^na1,
Victor Acuña-Alonzo³,
Erika Antúnez-Argüelles¹,
Martha Balcazar-Quintero⁴,
Rodrigo Barquera-Lozano ORCID: orcid.org/0000-0003-0518-4518³,
Alessandra Carnevale¹,
Fernanda Cornejo-Granados²,
Juan Carlos Fernández-López¹,
Rodrigo García-Herrera ORCID: orcid.org/0000-0002-7972-5746¹,
Humberto García-Ortíz¹,
Ángeles Granados-Silvestre⁵,
Julio Granados⁶,
Fernando Guerrero-Romero⁷,
Enrique Hernández-Lemus ORCID: orcid.org/0000-0002-1872-1397¹,
Paola León-Mimila^1,5,
Gastón Macín-Pérez³,
Angélica Martínez-Hernández¹,
Marta Menjivar^1,5,8,
Enrique Morett¹,
Lorena Orozco¹,
Guadalupe Ortíz-López⁸,
Fernando Pérez-Villatoro^1,9,
Javier Rivera-Morales³,
Fernando Riveros-McKay^1,9,
Marisela Villalobos-Comparán¹,
Hugo Villamil-Ramírez^1,5,
Teresa Villarreal-Molina¹,
Samuel Canizales-Quinteros^1,5 &
…
Xavier Soberón^1,2

Nature Communications volume 8, Article number: 1005 (2017) Cite this article

25k Accesses
38 Citations
115 Altmetric
Metrics details

Subjects

Abstract

Understanding the genetic structure of Native American populations is important to clarify their diversity, demographic history, and to identify genetic factors relevant for biomedical traits. Here, we show a demographic history reconstruction from 12 Native American whole genomes belonging to six distinct ethnic groups representing the three main described genetic clusters of Mexico (Northern, Southern, and Maya). Effective population size estimates of all Native American groups remained below 2,000 individuals for up to 10,000 years ago. The proportion of missense variants predicted as damaging is higher for undescribed (~ 30%) than for previously reported variants (~ 15%). Several variants previously associated with biological traits are highly frequent in the Native American genomes. These findings suggest that the demographic and adaptive processes that occurred in these groups shaped their genetic architecture and could have implications in biological processes of the Native Americans and Mestizos of today.

Differences in local population history at the finest level: the case of the Estonian population

Article Open access 25 July 2020

The genetic legacy of the expansion of Bantu-speaking peoples in Africa

Article Open access 29 November 2023

Human genetic structure in Northwest France provides new insights into West European historical demography

Article Open access 07 August 2024

Introduction

The peopling of the Americas was the most recent continental occupation to occur in human history. Although still a matter of controversy, according to the most widely accepted model based on archaeological and genetic evidence, American Natives originated in Eastern Asia no earlier than 23 thousand years ago (kya), reached America through the Bering Strait and expanded across the Americas along the North–South direction^{1, 2}. The inhabitants of the Americas of today are the result of several ongoing migration, admixture, and adaptive processes that have varied throughout the continent over several centuries. In the fifteenth century, prior to the Conquest of Mexico, Mexican territory was inhabited by many different Native groups located mainly in Mesoamerica, but also in Northern Mexico, inhabited by nomadic or semi-nomadic peoples³. The Mexican population of today is the result of complex and ongoing admixture processes that began with the Conquest of Mexican territory by the Spaniards in 1521⁴. The admixture process occurred mainly between Native Americans and Europeans, although there was a smaller contribution of the African population introduced by slave traders during the colonial period^{5, 6}. Currently, there are 68 acknowledged Native American languages in Mexico. Native populations not only contributed very importantly to the admixture process of the Mexican population, but currently up to 21% of Mexicans identify themselves as members of one of the acknowledged Indigenous groups of Mexico. Moreover, according to a recent survey, approximately 7% of the Mexican population, that is over 7 million Mexicans, speak a Native language⁷.

The genetic structure of Native Americans (NAs) of Mexico has been previously analyzed using microarray technology, shedding light on peopling and migration processes, and more recently on the high diversity of the NA populations^{1, 8}. Much of what we know comes from analyzing patterns of common, and, therefore, ancient genetic polymorphisms via genotyping across diverse human populations^{9, 10}. Recent studies have used sequencing approaches to reveal a more complete and genome-wide picture of variation, including low frequency variants of a more recent origin^{11, 12}. However, only a few whole genome sequences from NA individuals have been reported, and the genetic variation of NA remains largely unexplored^{2, 13, 14}.

Understanding human genomic variation is a central focus of medical and population genomics^{8, 15}. Thus, to better understand the genetic variability of NA and the genetic factors that may contribute to biological traits in this population, high coverage whole-genome sequences of 12 NA individuals from six distinct ethnic groups and a trio of Mexican Mestizos were analyzed, together with microarray data from 312 NA individuals. These data shed light on recent demographic history, better define the diversity of the populations of Mexico, and discover genetic variation shared among these populations, which may have an impact on health and other phenotypic traits.

Results

Sample selection

Whole genomes of a total of 15 Mexican individuals were sequenced, including 12 NAs from six distinct ethnic groups (Tarahumara and Tepehuano from the North; Nahua, Totonaca, Zapoteca from the South; and Maya from the South-East) and a trio of Mexican Mestizos (mother, father, and offspring) (Fig. 1). NA participants were selected considering ancestry, linguistic group, geographic location, and representation of three of the main genetic clusters previously described in the NA Map of Mexico: Northern, Southern, and Mayan, as described by Moreno-Estrada et al.⁸ Estimated NA ancestry using genome-wide data was over 98% in all indigenous participants, except for both Tepehuanos who were selected for having the highest NA ancestry (91%) among those available for whole-genome sequencing (WGS).

Analysis of population admixture

Reference NA and continental populations were used to generate multidimensional scaling (MDS) plots. Figure 2a shows that components 1 and 2 distinguish Africans and Europeans from NAs of Mexico, and the 12 NA individuals clustered within the latter group. Moreover, component 3 separates NAs, showing the 12 NA individuals clustered within 3 of the main NA components described by Moreno-Estrada et al.⁸ (Fig. 2b). Three-hundred and twelve additional samples from Nahua, Totonaca, Zapoteca, and Maya individuals were genotyped and included in the admixture analysis, revealing the presence of two additional and distinct Southern components: Nahua and Totonaca (Fig. 2c). Thus, the previously described Southern component includes at least three distinctive ancestral components: Nahua, Totonaca, and Zapoteca. Figure 2d shows ancestry proportions of the 12 NAs. The Tarahumara and Tepehuano individuals showed the highest average proportions of the Northern native component, while Mayas and Zapotecas showed higher proportions of their respective reference components. Interestingly, admixture analyses revealed that the four Nahua and Totonaca individuals show a distinct Native substructure. The high demographic and genetic diversity of Nahua-speaking populations is most likely a consequence of the expansion of the Aztec Empire mainly during the Postclassic period (1427–1520 A.D.)^{16, 17}. Because the inclusion of a greater number of Nahuas and Totonacas in the analysis revealed the presence of previously unidentified components, and there are 1.7 million Nahuatl-speaking individuals (the largest indigenous population in Mexico)⁷, it is necessary to obtain genomic data from different Nahua populations to identify whether Nahuatl-speaking individuals share the Nahua component here identified.

Sequence variation of Native Mexican and Mestizo individuals

The mean coverage for all genomes was 40× (Supplementary Table 1). On average, 96.8% of the genome was called for both alleles, and 99.4% of the bases in the reference genome (build 37.2) were covered by at least one read. The mean number of single-nucleotide variants (SNVs) was 3.2 million for the 12 NAs and 3.4 million for the three Mexican Mestizos, while the mean percentage of novel SNVs was 1.76% (Supplementary Table 2). Overall, 0.62% SNVs were located in coding regions, 0.28% were missense, and 0.002% were nonsense. Interestingly, the proportion of novel missense variants predicted to be damaging was higher for novel (30%) than for all missense variants (15%) (Supplementary Table 3). This is consistent with previous observations of an increased proportion of deleterious vs. neutral variation in Mexican-American individuals, compatible with population bottlenecks possibly experienced by NAs^{18, 19}. The distribution of insertions/deletions (indels), copy number variations (CNVs) and mobile element insertions (MEIs) are described in Supplementary Tables 4 and 5.

The mean percentage of SNVs in heterozygosis was similar among NAs, but on average was higher in the Mestizo genomes (59.9%) as compared to the NA genomes (51.8%) (Supplementary Table 6). Considering the three previously described clusters, mean SNV heterozygosis was similar in Northern groups (Tarahumaras and Tepehuanos, 51.4%) Southern groups (Nahuas, Zapotecas, and Totonacas, 51.7%) and Mayas (52.5%). Relatedness between pairs of individuals as assessed by kinship coefficient was highest between both Totonacas (0.0639), suggesting they are third degree relatives. Estimated kinship coefficient between both Tepehuanos was 0.0005, and was zero for Mayas, Nahuas, Zapotecos, and Tarahumaras.

We then compared the local density of SNVs across the genome between the 12 NA and 1000 Genomes (1KG) populations excluding the Americas (AMR) populations (Supplementary Fig. 1). Overall, SNV density was similar in both populations. Notably, chromosome 6 showed three high-density peaks shared by both populations, and one peak found only in the 12 NA genomes. The latter peak includes the PRIM2 gene, which encodes the 58 KDa subunit of DNA primase that plays a key role in DNA replication. This finding is consistent with that observed by Zhang et al.²⁰, who sequenced 35 Korean genomes finding that the PRIM2 gene had an extremely high number of SNVs not found in the 1KG project.

Inference of population history from sequence data

As shown in Table 1, all five male indigenous participants shared the same Y-chromosome haplogroup (Q1a2a) distinctive of NA populations²¹, and the male Mestizo had the R1b1 Y chromosome haplogroup, common in the Iberian Peninsula²². As expected, all individuals, NA and Mestizos, showed common NA mitochondrial DNA (mtDNA) haplogroups^{23, 24}. Thus, these uniparental marker analyses confirm the demographic history of the Mexican Mestizo and Native populations, where the contribution of the European gene pool to the admixture process was mainly of male origin²⁵.

Table 1 Characteristics of Native American and Mestizo individuals

Full size table

A maximum likelihood tree was generated to ascertain the population history of present day NA in relation to worldwide populations by Treemix (Supplementary Fig. 2) using sequencing data from the 12 NA genomes (Tarahumara, Tepehuano, Nahua, Totonaca, Zapoteca, and Maya), 11 genomes from worldwide populations²⁶, and 4 ancient individuals (Neanderthal, Denisovan, Anzick-1, and the Mal’ta child)^{27, 28}. The inferred tree recapitulates the North to South differentiation gradients for the NA groups observed from the MDS analysis. Southern NA populations of Mexico are grouped in a major clade (Totonacas, Zapotecas, Nahuas, and Mayas), while Northern NA populations of Mexico (Tarahumaras and Tepehuanos) branch from the same initial split. In addition, this analysis confirms the gene flow from the Mal’ta lineage, (known to have genetic affinities of both European and NA populations), to the ancestors of all present day NAs^{2, 29}.

Demographic history reconstruction using a pairwise sequentially Markovian coalescent (PSMC) model showed evidence of a strong bottleneck in the European, Asian and NA populations, reaching the lowest effective population size (N _e) between 50–60 thousand years ago (kya) (Fig. 3a). Although European and Asian N _e recovered afterwards, an extended period of low population size is observed in NAs with N _e around 2000 individuals for up to 20 kya, in consistency with previous reports^{30, 31}. This extended bottleneck is also consistent with estimates of the time that the NA ancestors crossed the Bering Strait and moved into America^{2, 32}.

Because PSMC can only infer population size estimates beyond 20 kya^{31, 33}, multiple sequentially Markovian coalescent (MSMC) analyses were performed combining four and eight phased haplotypes. Figure 3b depicts MSMC analysis combining two genomes per NA group, showing that the N _e remained close to 2000 individuals in all groups for another 10 thousand years (10–20 kya). We then performed MSMC analysis combining four individuals from Northern NA groups (Tarahumaras and Tepehuanos), and four individuals from Southern NA groups (Nahuas and Zapotecas). Figure 3c shows the analysis of these eight haplotypes, allowing the estimation of N _e up to 2 kya. N _e of Southern NA groups increased constantly between 9 and 2 kya, in contrast with Northern groups who continued to decline from 9 to 4 kya. This is consistent with the previously reported higher effective population sizes for Southern and Mayas compared to Northern NAs of Mexico³⁴. In addition, this population growth concurs with the time estimated for domestication of maize in Southwestern Mexico, approximately 6000–10,000 kya^35,36,37.

Identification alleles of biomedical relevance

We analyzed all NA genomes for the presence of potentially pathogenic alleles as defined by the publication of the American College of Medical Genetics and Genomics (ACMG)³⁸. Seeking for incidental findings within the 56 genes recommended by the ACMG, we found a total of 90 non-synonymous variants in the 12 NA genomes. Sixty-eight of these variants were polymorphisms (MAF > 1% in at least one of the 1KG populations). The 22 remainder variants were absent from the 1KG project (phase 3), three of which are annotated in the ClinVar database. One individual carried a mutation in the KCNH2 gene (rs199472885, R312C) previously found in a patient with congenital long QT syndrome;³⁹ another carried a BRCA2 variant annotated as of uncertain clinical significance in ClinVar (rs80358606), and a third mutation (T187S) was identified in the SCN5A gene, in the same position as a loss of function mutation (T187I) found in a patient with Brugada syndrome⁴⁰. Because the phenotypes of these individuals and the penetrance of the variants annotated as pathogenic are unknown, it is not possible to determine whether the 19 remainder variants predicted as deleterious or probably damaging are in fact pathogenic.

In the last decade, up to 2400 genome-wide association studies (GWAS) have been published reporting over 20,000 loci associated to many different phenotypes. The vast majority of the studies have been performed in populations of European descent and few loci have been replicated in the Mexican population using designs that simultaneously analyze a broad number of single-nucleotide polymorphisms (SNPs)^41,42,43. Based on the NHGRI GWAS catalog, which contains all the SNP-trait associations with a significance threshold of 10⁻⁵, a total of 10,347 variants were present in at least one of the 12 genomes and 1,888 variants were shared among the 12 individuals. The latter shared variants were grouped based on their experimental factor ontology (EFO) and their frequencies were compared with other continental populations, particularly Europeans (CEU) and Asians (CHB and JPT) from 1KG project. Notably, variants whose frequency in the 12 genomes most differed from European or Asian populations have been previously associated with human morphology and clinical traits (Supplementary Fig. 3 and Supplementary Table 7). Some of these variants have been reported to be involved in natural selection. For instance, the frequency of a skin pigmentation variant (rs1834640 near the SLC24A5 gene) most differed from Europeans, but was similar to that in Asians. Earlier studies have highlighted SLC24A5 as one of the top candidate genes demonstrating evidence for positive selection in Europeans, suggesting that the fixation of the “A” allele is related with adaptive processes^{44, 45}. Moreover, the allele frequency of the rs765132 “T” allele, previously associated with response to anti-tumor necrosis factor alpha therapy in inflammatory bowel disease⁴⁶, was 87.5% in the 12 NA whole genomes, 59% in the 312 additional NA samples with microarray genotyping data, but only 5% in Europeans (5%) and not found in Asians. Thirteen additional SNVs with high population allele frequency differences were also available in the 312 NA samples. Overall, allele frequencies of these SNVs were similar in the 12 NA whole genomes and the 312 NA samples, although differences ranged from 0.01 to 0.19 (Supplementary Table 7). Because of the potential clinical implications of these variants, it is important to design studies aiming to identify their biological implications in NAs and Mestizos of today.

Pathway and gene ontology enrichment analysis

Another approach to identify biologically relevant genes is to look not at the variations of individual genes, but rather at gene sets or pathways⁴⁷. We clustered the genes with novel non-synonymous and promoter region SNVs (Supplementary Table 2) in order to search for pathway and gene ontology (GO) enrichment. Supplementary Fig. 4 shows gene sets and/or pathways enriched with novel missense and promoter region mutations in the 12 NA individuals here analyzed. This analysis identified a wide range of enriched pathways in all groups (Supplementary Fig. 4a–g). Interestingly, one of the most significant enrichment values was for genes related to collagen, muscular and musculoskeletal diseases for both promoter and missense variants in Tarahumaras (Fig. 4). The combined analysis of these SNVs also showed highly significant enrichment for these pathways. This enrichment was also observed in other NA groups, although with less significant values. Moreover, to explore whether the genes with novel missense mutations share specific functional features, we performed GO enrichment analysis by population. Interestingly, the only significantly enriched GO terms in Tarahumaras were related to molecular and cellular processes involved in musculoskeletal function (Supplementary Fig. 5), previously suggested to be involved in athletic performance^{48, 49}.

In order to seek further evidence of the enrichment in muscle and collagen-related pathways found in Tarahumaras, we sequenced the exome of three more individuals from this group, known to have participated in high resistance physical activity events. These individuals showed significant gene enrichment in muscle-related pathways, in consistency with the two initially sequenced Tarahumaras (Fig. 4). While these findings could be related with the well-known high physical resistance of this Native Mexican group^{50, 51}, they should be interpreted with caution. Further research including a larger sample size and phenotypic data should be performed to establish any possible relationship between gene enrichment and physical resistance in the Tarahumara population.

Discussion

In conclusion, the study of NA genomes provides an enriching perspective on the demographic history of the region and is relevant in the genetic characterization of phenotypes in these populations. This study shows that the diversity of NA populations, particularly the Southern populations, is greater than what has been previously described⁸. Moreover, the demographic and adaptive processes suffered by these groups shaped their genetic architecture, having implications in biological and health-related processes of NAs and Mestizos of today. This is the first effort to characterize whole genomes of Native individuals from different regions of Mexico leading to the identification of variants associated with biological and clinical traits.

Methods

Ethics statement

Institutional review board of the National Institute of Genomic Medicine (INMEGEN) approved this project. Written informed consent was obtained from all participants. For participants not fluent in Spanish, a translator was used. Blood samples were drawn with the permission of local authorities. The ethical duty to return individual research results or incidental findings is currently a matter of debate, depending on characteristics of the finding such as analytical and clinical validity, clinical actionability and significance as well as magnitude of potential damage. Because the samples included were donated anonymously, it was not possible to inform the participants about the incidental findings found here.

Analysis of population admixture

Genotype data from 112 CEU and 106 YRI from 1KG project; 401 NA samples from Moreno-Estrada et al.⁸ (described in Supplementary Table 8) and 312 additional NA samples (103 Nahuas, 62 Totonacas, 49 Zapotecas, and 98 Mayas) genotyped with SNP 6.0 microarray (Affymetrix), were used for population admixture analysis. A total of 322,098 autosome-wide SNPs shared among these populations were used to generate MDS plots based on genome-wide identity by state pairwise distances as implemented in PLINK⁵². Supplementary Figs. 6 and 7 show the estimated individual ancestry proportions for K = 3 to K = 10 in 312 NA samples, continental reference populations, and the 12 NA and 3 Mexican Mestizo whole genomes using ADMIXTURE⁵³. The fit of different values of K was assessed using cross-validation (CV) procedures, where K = 10 showed the lowest CV error (Supplementary Fig. 8).

Samples and sequencing procedure

Sampling locations are described in Fig. 1. A total of 15 genomes (12 NAs from Tarahumara, Tepehuano, Nahua, Totonaca, Zapoteca, and Maya groups, and a trio of Mexican Mestizos) were submitted to WGS by Complete Genomics (Mountain View, California, USA). Individuals whose parents and grandparents recognized themselves as indigenous, had been born and lived in their home communities and spoke their native language were considered as NA. Individuals were selected according to their estimated NA ancestry, based on genome-wide data using the block relaxation algorithm implemented in ADMIXTURE⁵³, assuming k = 3 and including genotype data from CEU and YRI populations (1KG project) as well as additional NA samples included with SNP 6.0 microarray (Affymetrix) data.

All sequencing data were generated with Complete Genomics local pipeline⁵⁴. Sequence reads were mapped to the reference human genome (GRCh37) and variants were called using version 2.4 of the Complete Genomics software. All variant information (SNVs, indels, CNVs, MEIs) and structural variation] was obtained from the master variation files reported by Complete Genomics. Only variants classified as “PASS” were considered for analyses. Variants not reported in dbSNP v137 were considered novel. SNVs within the 7.5 kb region upstream of the 5′ transcription initiation site of genes were considered as promoter variants. The functional consequences of nonsynonymous SNVs were predicted using PolyPhen-2 (Polymorphism Phenotyping v2)⁵⁵. The mean concordance rate between SNP 6.0 microarray and WGS data was 99.2% (Supplementary Table 9). Six selected variants found in COL4A2, COL5A2, and COL18A1 genes in Tarahumaras were confirmed by Sanger sequencing.

Relatedness between among the 12 NA individuals was inferred by the kinship coefficient using KING v2.0⁵⁶ from 4.8 million autosomal SNVs fully called in the 15 whole genomes (12 NA and the Mestizo trio), and biallelic for at least one genome. Kinship coefficients between parents and offspring of the trio were 0.251 and 0.253.

Whole-exome sequencing

Samples of three additional Tarahumaras known to have participated in high resistance physical activity events were prepared using Agilent SureSelect V4 All Exome kit (Agilent, Santa Clara, CA) and were sequenced on an Illumina NextSeq 500 system (Illumina, San Diego, CA) at the Sequencing Unit of INMEGEN. Reads were mapped to the reference human genome (GRCh37) with BWA⁵⁷, GATK was used for variant calling⁵⁸, and snpEff was used for variant annotation⁵⁹.

Haplogroup assignment

mtDNA sequences were compared to the revised Cambridge Reference Sequence⁶⁰ using Phylotree⁶¹ and the nomenclature adopted by the International Society for Forensic Genetics⁶². Haplogrep software was used to assign haplogroups⁶³. Y chromosome sequences were merged with a set of 19 worldwide Y chromosomes made publicly available by Complete Genomics, in order to determine their broad haplogroup affiliation.

Genome-wide SNVs density

The distribution of SNVs along genomes was analyzed dividing the genome into 20 Kb windows, and reporting the mean number of SNVs per window in the 12 NA genomes and in all non-AMR individuals from the 1KG project.

Maximum likelihood analysis

History of population splits and mixtures was inferred with TreeMix⁶⁴. A total of 21 individuals were used to infer admixture graphs, 17 present-day individuals including the 12 NA genomes sequenced in this study, and 4 ancient individuals (Neanderthal, Denisovan, Anzick-1, and the Mal’ta child)^26,27,28. Transitions were excluded from the data set to prevent biases introduced by the four ancient genomes. Only SNVs called in all genomes were included in the analysis. A maximum likelihood tree was inferred including 336,272 autosomal SNVs, using the default parameters and fitted allowing two migration edges.

Inference of population history from sequence data

Firstly, a pairwise sequentially Markovian caolescent (PSMC) analysis was used to reconstruct the demographic history of the NA populations here analyzed. Only data from highly covered regions were used. Reference population genomes were obtained from Complete Genomics (http://www.completegenomics.com/public-data/69-genomes/.) Bootstrapping analysis was performed to validate the inference as described by Li et al.³¹ (Supplementary Fig. 9).

Secondly, MSMC models were used to analyze multiple genomes³³, grouped as follows: both individuals from each of the six NA groups of Mexico here analyzed; the four genomes from Northern NA populations of Mexico (Tepehuanos and Tarahumaras) and the four genomes from Southern Mexico (Zapotecas and Nahuas). All genomes were previously phased with SHAPEIT V2.12⁶⁵, using all individuals from 1KG project as reference panel. Demographic history of world populations was inferred using phased variants from the last release of the 1KG project.

Potentially pathogenic alleles causing Mendelian disease

The list of 56 genes recommended for reporting incidental findings by the ACMG was used as reference to identify potentially pathogenic variants³⁸. Non-synonymous variants in the 12 NA genomes absent from the 1KG project (phase 3) were annotated using ClinVar database to identify potential pathogenicity.

GWAS variants in NA individuals

Based on the NHGRI GWAS Catalog containing all SNP-trait associations with a significance level of 10⁻⁵, we identified GWAS variants present in each genome⁶⁶. Variants shared by the 12 NA individuals in heterozygous or homozygous form were grouped based on their EFO. The allele frequency of these variants was compared with that of other continental populations, particularly Europeans (CEU) and Asians (CHB and JPT) from the 1KG project (phase 3) (Supplementary Table 7). Thirteen variants were found in SNP 6.0 microarray and their allele frequencies were evaluated in 312 additional NA samples included in the present study (Supplementary Table 7).

Pathway enrichment and gene ontology analysis

Functional enrichment (Disease association and GO enrichment analysis) was performed using the WebGestalt online tool⁶⁷. For disease association analysis, WebGestalt uses disease terms downloaded from PharmGKB and genes associated with individual disease terms were inferred using GLAD4U (Gene List Automatically Derived For You)⁶⁸. Human genome data (hsapiens_genome) was used as reference set, the Benjamini-Hochberg multiple test was used to adjust for multiple testing⁶⁹. Statistical significance was considered at P < 0.01. Three annotation matrix for enriched disease-associated genes were tested: (A) all genes with novel non-synonymous variants; (B) protein-coding genes with highest number of SNVs in the promoter region (above 95th percentile); and (C) genes from groups A and B combined. All analyses were performed seperately in each ethnic group.

Data availability

In consistency with the respective Institutional Review Board approval and individual informed consents, genome and exome variation files will be available at www.12g-data.inmegen.gob.mx, upon request to the corresponding authors.

References

Reich, D. et al. Reconstructing Native American population history. Nature 488, 370–374 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Raghavan, M. et al. Genomic evidence for the pleistocene and recent population history of Native Americans. Science 349, 1280–1285 (2015).
Article Google Scholar
Kroeber, A. L. et al. Configurations for Culture Growth. (University of California Press, Berkeley, 1944).
Sánchez-Albornoz, N. The Population of Latin America: A History (University of California Press, Berkeley, 1974).
Silva-Zolezzi, I. et al. Analysis of genomic diversity in Mexican Mestizo populations to develop genomic medicine in mexico. Proc. Natl Acad. Sci. USA 106, 8611–8616 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Bryc, K. et al. Colloquium paper: genome-wide patterns of population structure and admixture among hispanic/latino populations. Proc. Natl Acad. Sci. USA 107, 8954–8961 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Instituto Nacional de Estadística y Geografía (INEGI). http://www.inegi.org.mx/.
Moreno-Estrada, A. et al. Human genetics. the genetics of mexico recapitulates Native American substructure and affects biomedical traits. Science 344, 1280–1285 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, J. Z. et al. Worldwide human relationships inferred from genome-wide patterns of variation. Science 319, 1100–1104 (2008).
Article ADS CAS PubMed Google Scholar
Auton, A. et al. Global distribution of genomic diversity underscores rich complex history of continental human populations. Genome Res. 19, 795–803 (2009).
Article CAS PubMed PubMed Central Google Scholar
Abecasis, G. R. et al. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2010).
Article ADS PubMed Google Scholar
Sudmant, P. H. et al. An integrated map of structural variation in 2,504 human genomes. Nature 526, 75–81 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ribeiro-dos-Santos, A. M. et al. High-throughput sequencing of a south american amerindian. PLoS ONE 8, e83340 (2013).
Article ADS PubMed PubMed Central Google Scholar
Mallick, S. et al. The simons genome diversity project: 300 genomes from 142 diverse populations. Nature 538, 201–206 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Bustamante, C. D., De La Vega, F. M. & Burchard, E. G. Genomics for the world. Nature 475, 163–165 (2011).
Article CAS PubMed PubMed Central Google Scholar
Brumfiel, E. M. Aztec state making: ecology, structure, and the origin of the state. Am. Anthropol. 85, 1852–1861 (1983).
Article Google Scholar
Bernan, F. F. & Smith, M. E. in The Postclassic Mesoamerican World. (eds Smith, M. E. & Berdan, F. F.). 67–72 (The University of Utah Press, Salt Lake City, 2003).
Kidd, J. M. et al. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation. Am. J. Hum. Genet. 91, 660–671 (2012).
Article CAS PubMed PubMed Central Google Scholar
Balick, D. J., Do, R., Cassa, C. A., Reich, D. & Sunyaev, S. R. Dominance of deleterious alleles controls the response to a population bottleneck. PLoS Genet. 11, e1005436 (2015).
Article PubMed PubMed Central Google Scholar
Zhang, W. et al. Whole genome sequencing of 35 individuals provides insights into the genetic architecture of korean population. BMC Bioinformatics. 15, S6 (2014).
Article Google Scholar
Zegura, S. L., Karafet, T. M., Zhivotovsky, L. A. & Hammer, M. F. High-resolution SNPs and microsatellite haplotypes point to a single, recent entry of Native American Y chromosomes into the Americas. Mol. Biol. Evol. 21, 164–175 (2004).
Article CAS PubMed Google Scholar
Brión, M. et al. Introduction of a single nucleotide polymorphism-based ‘major y-chromosome haplogroup typing kit’ suitable for predicting the geographical origin of male lineages. Electrophoresis 26, 4411–4420 (2005).
Article PubMed Google Scholar
Shields, G. F. et al. mtDNA sequences suggest a recent evolutionary divergence for Beringian and northern North American populations. Am. J. Hum. Genet. 53, 549–562 (1993).
CAS PubMed PubMed Central Google Scholar
Malhi, R. S. et al. The structure of diversity within new world mitochondrial DNA haplogroups: implications for the prehistory of north america. Am. J. Hum. Genet. 70, 905–919 (2002).
Article CAS PubMed PubMed Central Google Scholar
Mörner, M. Race Mixture in the History of Latin America (Little, Brown and Co., Boston, 1967).
Meyer, M. et al. A high-coverage genome sequence from an archaic denisovan individual. Science 338, 222–226 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Reich, D. et al. Genetic history of an archaic hominin group from denisova cave in Siberia. Nature 468, 1053–1060 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Rasmussen, M. et al. The genome of a late pleistocene human from a clovis burial site in western montana. Nature 506, 225–229 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Raghavan, M. et al. Upper palaeolithic siberian genome reveals dual ancestry of Native Americans. Nature 505, 87–91 (2014).
Article ADS PubMed Google Scholar
1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article Google Scholar
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Article CAS PubMed PubMed Central Google Scholar
Goebel, T. et al. The late Pleistocene dispersal of modern humans in the Americas. Science 319, 1497–1502 (2008).
Article ADS CAS PubMed Google Scholar
Schiffels, S. & Durbin, D. Inferring human population size and separation history from multiple genome sequences. Nat. Genet. 46, 919–925 (2014).
Article CAS PubMed PubMed Central Google Scholar
Rubi-Castellanos, R. et al. Pre-hispanic mesoamerican demography approximates the present-day ancestry of Mestizos throughout the territory of mexico. Am. J. Phys. Anthropol. 139, 284–294 (2009).
Article PubMed Google Scholar
Matsuoka, Y. et al. A single domestication for maize shown by multilocus microsatellite genotyping. Proc. Natl. Acad. Sci. USA 99, 6080–6084 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Ranere, A. J. et al. The cultural and chronological context of early Holocene maize and squash domestication in the Central Balsas River Valley, Mexico. Proc. Natl Acad. Sci. USA 106, 5014–5018 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
van Heerwaarden, J. et al. Genetic signals of origin, spread, and introgression in a large sample of maize landraces. Proc. Natl Acad. Sci. USA 108, 1088–1092 (2011).
Article ADS PubMed Google Scholar
Green, R. C. et al. ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing. Genet. Med. 15, 565–574 (2013).
Article CAS PubMed PubMed Central Google Scholar
Splawski, I. et al. Spectrum of mutations in long-QT syndrome genes. KVLQT1, HERG, SCN5A, KCNE1, and KCNE2. Circulation 102, 1178–1185 (2000).
Article CAS PubMed Google Scholar
Makiyama, T. et al. High risk for bradyarrhythmic complications in patients with brugada syndrome caused by SCN5A gene mutations. J. Am. Coll. Cardiol. 46, 2100–2106 (2005).
Article CAS PubMed Google Scholar
Gamboa-Meléndez, M. A. et al. Contribution of common genetic variation to the risk of type 2 diabetes in the Mexican Mestizo population. Diabetes 61, 3314–3321 (2012).
Article PubMed PubMed Central Google Scholar
León-Mimila, P. et al. Contribution of common genetic variants to obesity and obesity-related traits in mexican children and adults. PLoS ONE 8, e70640 (2013).
Article ADS PubMed PubMed Central Google Scholar
Williams, A. L. et al. Sequence variants in SLC16A11 are a common risk factor for type 2 diabetes in mexico. Nature 506, 97–101 (2014).
Article ADS CAS PubMed Google Scholar
Voight, B. F., Kudaravalli, S., Wen, X. & Pritchard, J. K. A map of recent positive selection in the human genome. PLoS Biol. 4, e72 (2006).
Article PubMed PubMed Central Google Scholar
Sabeti, P. C. et al. Genome-wide detection and characterization of positive selection in human populations. Nature 449, 913–918 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Dubinsky, M. C. et al. Genome wide association (GWA) predictors of anti-tnfα therapeutic responsiveness in pediatric inflammatory bowel disease. Inflamm. Bowel Dis. 16, 1357–1366 (2010).
Article PubMed PubMed Central Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Kjær, M. Role of extracellular matrix in adaptation of tendon and skeletal muscle to mechanical loading. Physiol. Rev. 84, 649–698 (2004).
Article PubMed Google Scholar
Kjær, M., Jørgensen, N. R., Heinemeier, K. & Magnusson, S. P. Exercise and regulation of bone and collagen tissue biology. Prog. Mol. Biol. Transl. Sci. 135, 259–291 (2015).
Article PubMed Google Scholar
Balke, B. & Snow, C. Anthropological and physiological observations on Tarahumara endurance runners. Am. J. Phys. Anthropol. 23, 293–301 (1965).
Article CAS PubMed Google Scholar
Groom, D. Cardiovascular observations on Tarahumara Indian runners—the modern Spartans. Am. Heart J. 81, 204–314 (1971).
Article Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
Article CAS PubMed PubMed Central Google Scholar
Drmanac, R. et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science 327, 78–81 (2010).
Article ADS CAS PubMed Google Scholar
Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249 (2010).
Article CAS PubMed PubMed Central Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheele Transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
Article CAS PubMed PubMed Central Google Scholar
Andrews, R. M. et al. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat. Genet. 23, 147 (1999).
Article CAS PubMed Google Scholar
van Oven, M. & Kayser, M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum. Mutat. 30, E386–E394 (2009).
Article PubMed Google Scholar
Carracedo, A. et al. DNA commission of the international society for forensic genetics: guidelines for mitochondrial DNA typing. Forensic Sci. Int. 110, 79–85 (2000).
Article CAS PubMed Google Scholar
Kloss-Brandstätter, A. et al. HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum. Mutat. 32, 25–32 (2011).
Article PubMed Google Scholar
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 8, e1002967 (2012).
Article CAS PubMed PubMed Central Google Scholar
Delaneau, O., Marchini, J. & Zagury, J. F. A linear complexity phasing method for thousands of genomes. Nat. Methods 9, 179–181 (2011).
Article PubMed Google Scholar
Welter, D. et al. The NHGRI GWAS catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42, 1001–1006 (2014).
Article Google Scholar
Wang, J., Duncan, D., Shi, Z. & Zhang, B. WEB-Based GEne SeT analysis toolkit (webgestalt): update 2013. Nucleic Acids Res. 41, 77–83 (2013).
Article CAS Google Scholar
Jourquin, J., Duncan, D., Shi, Z. & Zhang, B. GLAD4U: deriving and prioritizing gene lists from PubMed literature. BMC Genomics 13, S20 (2012).
Article PubMed PubMed Central Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Series B (Methodol.) 57, 289–300 (1995).
MathSciNet MATH Google Scholar

Download references

Acknowledgements

We thank the volunteers for their support for this research. We also thank Miguel Angel Contreras Siecks, Manuela Irina Ortega-Sánchez, Martha Lucía Granados-Riveros, and María Teresa Flores-Dorantes for assistance with sample processing and data analyses; Stephan Schiffels and Rolando González-José for comments on the manuscript; and Alejandro Rodríguez-Torres for his help in Fig. 1. This work was supported by funds from the Federal Government of Mexico to the Instituto Nacional de Medicina Genómica.

Author information

Sandra Romero-Hidalgo, Adrián Ochoa-Leyva and Alejandro Garcíarrubio contributed equally to this work.

Authors and Affiliations

Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, 14610, Mexico
Sandra Romero-Hidalgo, Adrián Ochoa-Leyva, Erika Antúnez-Argüelles, Alessandra Carnevale, Juan Carlos Fernández-López, Rodrigo García-Herrera, Humberto García-Ortíz, Enrique Hernández-Lemus, Paola León-Mimila, Angélica Martínez-Hernández, Marta Menjivar, Enrique Morett, Lorena Orozco, Fernando Pérez-Villatoro, Fernando Riveros-McKay, Marisela Villalobos-Comparán, Hugo Villamil-Ramírez, Teresa Villarreal-Molina, Samuel Canizales-Quinteros & Xavier Soberón
Instituto de Biotecnología, Universidad Nacional Autónoma de México (UNAM), Cuernavaca, Morelos, 62210, Mexico
Adrián Ochoa-Leyva, Alejandro Garcíarrubio, Fernanda Cornejo-Granados & Xavier Soberón
Escuela Nacional de Antropología e Historia, Mexico City, 14030, Mexico
Victor Acuña-Alonzo, Rodrigo Barquera-Lozano, Gastón Macín-Pérez & Javier Rivera-Morales
Dirección de Formación y Acción Social, Universidad Iberoamericana, Mexico City, 01298, Mexico
Martha Balcazar-Quintero
Facultad de Química, UNAM, Mexico City, 04510, Mexico
Ángeles Granados-Silvestre, Paola León-Mimila, Marta Menjivar, Hugo Villamil-Ramírez & Samuel Canizales-Quinteros
Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, 14080, Mexico
Julio Granados
Unidad de Investigación Biomédica, Instituto Mexicano del Seguro Social, Durango, 34067, Mexico
Fernando Guerrero-Romero
Hospital Juárez de México, Mexico City, 07760, Mexico
Marta Menjivar & Guadalupe Ortíz-López
Winter Genomics, Mexico City, 07300, Mexico
Fernando Pérez-Villatoro & Fernando Riveros-McKay

Authors

Sandra Romero-Hidalgo
View author publications
You can also search for this author in PubMed Google Scholar
Adrián Ochoa-Leyva
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Garcíarrubio
View author publications
You can also search for this author in PubMed Google Scholar
Victor Acuña-Alonzo
View author publications
You can also search for this author in PubMed Google Scholar
Erika Antúnez-Argüelles
View author publications
You can also search for this author in PubMed Google Scholar
Martha Balcazar-Quintero
View author publications
You can also search for this author in PubMed Google Scholar
Rodrigo Barquera-Lozano
View author publications
You can also search for this author in PubMed Google Scholar
Alessandra Carnevale
View author publications
You can also search for this author in PubMed Google Scholar
Fernanda Cornejo-Granados
View author publications
You can also search for this author in PubMed Google Scholar
Juan Carlos Fernández-López
View author publications
You can also search for this author in PubMed Google Scholar
Rodrigo García-Herrera
View author publications
You can also search for this author in PubMed Google Scholar
Humberto García-Ortíz
View author publications
You can also search for this author in PubMed Google Scholar
Ángeles Granados-Silvestre
View author publications
You can also search for this author in PubMed Google Scholar
Julio Granados
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Guerrero-Romero
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Hernández-Lemus
View author publications
You can also search for this author in PubMed Google Scholar
Paola León-Mimila
View author publications
You can also search for this author in PubMed Google Scholar
Gastón Macín-Pérez
View author publications
You can also search for this author in PubMed Google Scholar
Angélica Martínez-Hernández
View author publications
You can also search for this author in PubMed Google Scholar
Marta Menjivar
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Morett
View author publications
You can also search for this author in PubMed Google Scholar
Lorena Orozco
View author publications
You can also search for this author in PubMed Google Scholar
Guadalupe Ortíz-López
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Pérez-Villatoro
View author publications
You can also search for this author in PubMed Google Scholar
Javier Rivera-Morales
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Riveros-McKay
View author publications
You can also search for this author in PubMed Google Scholar
Marisela Villalobos-Comparán
View author publications
You can also search for this author in PubMed Google Scholar
Hugo Villamil-Ramírez
View author publications
You can also search for this author in PubMed Google Scholar
Teresa Villarreal-Molina
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Canizales-Quinteros
View author publications
You can also search for this author in PubMed Google Scholar
Xavier Soberón
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed study: S.R.-H., A.O.-L., A.G., V.A.-A., S.C.-Q., X.S. Contributed reagents/materials: V.A.-A., E.A.-A., M.B.-Q., R.B.-L., H.G.-O., A.G.-S., J.G., F.G.-R., P.L.-M., G.M.-P., A.M.-H., M.M., L.O., G.O.-L., J.R.-M., H.V.-R., T.V.-M. Performed the experiments: S.R.-H., A.O.-L., A.G. Analyzed data: S.R.-H., A.O.-L., A.G., A.C., F.C.-G., J.C.F.-L., H.G.-O., E.H.-L., E.M., F.P.-V., F.R.-M, M.V.-C. Wrote the paper, incorporating input from all authors: S.R.-H., A.O.-L., A.G., T.V.-M., S.C.-Q., X.S.

Corresponding authors

Correspondence to Samuel Canizales-Quinteros or Xavier Soberón.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Romero-Hidalgo, S., Ochoa-Leyva, A., Garcíarrubio, A. et al. Demographic history and biologically relevant genetic variation of Native Mexicans inferred from whole-genome sequencing. Nat Commun 8, 1005 (2017). https://doi.org/10.1038/s41467-017-01194-z

Download citation

Received: 23 December 2016
Accepted: 25 August 2017
Published: 18 October 2017
DOI: https://doi.org/10.1038/s41467-017-01194-z

This article is cited by

The role of single nucleotide variant rs3819817 of the Histidine Ammonia-Lyase gene and 25-Hydroxyvitamin D on bone mineral density, adiposity markers, and skin pigmentation, in Mexican population
- B. Rivera-Paredez
- A. Hidalgo-Bravo
- R. Velázquez-Cruz
Journal of Endocrinological Investigation (2023)
Mexican Biobank advances population and medical genomics of diverse ancestries
- Mashaal Sohail
- María J. Palma-Martínez
- Andrés Moreno-Estrada
Nature (2023)
Challenges in selecting admixture models and marker sets to infer genetic ancestry in a Brazilian admixed population
- Luciana Maia Escher
- Michel S. Naslavsky
- Silviene F. Oliveira
Scientific Reports (2022)
Differences in MTHFR and LRRK2 variant’s association with sporadic Parkinson’s disease in Mexican Mestizos correlated to Native American ancestry
- Elizabeth Romero-Gutiérrez
- Paola Vázquez-Cárdenas
- Oscar Arias-Carrión
npj Parkinson's Disease (2021)
The genomic landscape of Mexican Indigenous populations brings insights into the peopling of the Americas
- Humberto García-Ortiz
- Francisco Barajas-Olmos
- Lorena Orozco
Nature Communications (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Sample selection

Analysis of population admixture

Sequence variation of Native Mexican and Mestizo individuals

Inference of population history from sequence data

Identification alleles of biomedical relevance

Pathway and gene ontology enrichment analysis

Discussion

Methods

Ethics statement

Analysis of population admixture

Samples and sequencing procedure

Whole-exome sequencing

Haplogroup assignment

Genome-wide SNVs density

Maximum likelihood analysis

Inference of population history from sequence data

Potentially pathogenic alleles causing Mendelian disease

GWAS variants in NA individuals

Pathway enrichment and gene ontology analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links