Origin and evolution of qingke barley in Tibet

Zeng, Xingquan; Guo, Yu; Xu, Qijun; Mascher, Martin; Guo, Ganggang; Li, Shuaicheng; Mao, Likai; Liu, Qingfeng; Xia, Zhanfeng; Zhou, Juhong; Yuan, Hongjun; Tai, Shuaishuai; Wang, Yulin; Wei, Zexiu; Song, Li; Zha, Sang; Li, Shiming; Tang, Yawei; Bai, Lijun; Zhuang, Zhenhua; He, Weiming; Zhao, Shancen; Fang, Xiaodong; Gao, Qiang; Yin, Ye; Wang, Jian; Yang, Huanming; Zhang, Jing; Henry, Robert J.; Stein, Nils; Tashi, Nyima

doi:10.1038/s41467-018-07920-5

Article
Open access
Published: 21 December 2018

Nature Communications volume 9, Article number: 5433 (2018)

Abstract

Tibetan barley (Hordeum vulgare L., qingke) has been the principal cereal cultivated on the Tibetan Plateau for at least 3,500 years, but its origin and domestication remain unclear. Here, based on deep-coverage whole-genome and published exome-capture resequencing data for a total of 437 accessions, we show that contemporary qingke is derived from eastern domesticated barley and was introduced to southern Tibet most likely via north Pakistan, India, and Nepal between 4,500 and 3,500 years ago. The low genetic diversity of qingke suggests Tibet can be excluded as a center of origin or domestication for barley. The rapid decrease in genetic diversity from eastern domesticated barley to qingke can be explained by a founder effect from 4,500 to 2,000 years ago. The haplotypes of the five key domestication genes of barley support a feral or hybridization origin for Tibetan weedy barley and reject the hypothesis of native Tibetan wild barley.

Introduction

Barley (Hordeum vulgare L.) is one of the founder crops of Old World agriculture and probably the first crop cultivated by humans¹. At present, most barley is used as animal feed, malt, or as a component of various health foods². Called “qingke” in Chinese or “nas” in Tibetan, six-rowed hulless (or naked) barley has been used as a major staple food of Tibetans for generations^3,4,5. Some studies have suggested that in addition to a center of domestication in the Near East, Tibet was one of the centers of barley domestication^3,4,5,6,7. This hypothesis was based on: (1) the discovery of six-rowed wild barleys (Hordeum agriocrithon Åberg) in Tibet and surrounding areas^3,4,5,6,7,8 (Ganzi, Sichuan province, China, Fig. 1); (2) the discovery of barleys with an intermediate phenotype between wild barley (Hordeum vulgare ssp. spontaneum) and qingke, such as two-rowed hulless barley and six-rowed hulled barley in Tibet^3,4,5. H. agriocrithon and intermediate barley do not exist as wild populations in Tibet but occur as weeds only at the edges of fields in the region, and have been known as Tibetan weedy barley by Tibetans for generations and described as Tibetan semiwild barley by some barley researchers^3,4,5. It should be noted that while Tibetan weedy barley or Tibetan semiwild barley is not a name used in standard barley taxonomy, it has been a popular name used in Tibet by Tibetans or some qingke researchers to distinguish qingke from other Tibetan barleys. Tibetan weedy barley may have made more genetic contribution than Near East barleys to Chinese barleys^6,7. However, many studies suggested that H. agriocrithon may have originated from natural hybridization between H. spontaneum and six-rowed domesticated barley^9,10,11. Thus the existence of Tibetan wild barley¹² provided only weak support to the hypothesis of Tibet representing one of the centers of barley origin or domestication.

Several routes have been proposed to explain how western Eurasian domesticates such as wheat and barley may have entered East Asia. One of the proposed routes is that wheat and barley entered East Asia from areas to the north of the Tibetan plateau via the Inner Asian Mountain corridor that skirts the Taklimakan desert to the south and the Inner Asian Mountains^13,14,15,16,17,18,19 (route I, Fig. 1). Both crops arrive on the northeastern and southeastern Tibetan plateau by 4000 calendar years before the present (cal y B.P.)¹⁹ (route II, Fig. 1). This route of transmission has been favored in archaeological research, but it is also the area in which the most archaeobotanical research has been carried out. Another scenario proposes that these domesticates could have moved east along the southern rim of the Tibetan Plateau, an area where unfortunately little archaeological research has been carried out^20,21,22. In sites in the northeastern Tibetan plateau, barley occurs alongside wheat, but also with other crops native to China such as broomcorn and foxtail millet¹⁹. When barley does appear in central Tibet, and to some extent the southeastern Tibetan plateau, it appears alongside other agricultural products. In addition to Chinese millets, a variety of other southwest Asian domesticates appear by roughly 3500 cal y B.P., including pea and rye at Changguogou in the Yarlung Tsangpo river basin of southern Tibet²² and flax at Ashaonao on the southeastern Tibetan plateau^23,24, indicating that the introduction of Tibetan barley from South Asia was also possible. Genetic analysis of barley populations thus might help reveal which routes barley has taken on its spread to the plateau.

Single nucleotide polymorphisms (SNPs) derived from RNA-seq have been used to study the genetic relationship between qingke and other barleys^25,26. However, most of the samples in these studies were cultivars lacking unambiguous geographic origin information rather than geo-referenced landraces; furthermore, the sample size of qingke in these studies was insufficient (fewer than 20) to represent the qingke populations. According to an official record (http://www.stats.gov.cn/tjsj/tjgb/rkpcgb/dfrkpcgb/201202/t20120228_30406.html), by 2010 nearly three million people were living in Tibet (Supplementary Table 1), most of them in the southern and eastern Tibetan areas. Sixty-nine qingke landraces, 35 qingke cultivars (produced by cross-breeding with different qingke landraces), and ten Tibetan weedy barleys were collected from the major inhabited areas in seven Tibetan regions and adjacent areas (Qinghai and Yunnan province) to represent the diversity present in Tibetan barley (Fig. 1b; Supplementary Table 1).

Here, we investigate the origin and domestication history of qingke by whole-genome sequencing (WGS) of Tibetan barley including (i) qingke landraces and cultivars from most Tibetan inhabited areas, (ii) Tibetan weedy barleys (including two brittle rachis samples), as well as (iii) eastern and western barley landraces and cultivars (Supplementary Data 1). Population genomic analyses are performed in the context of previously published diversity datasets^27,28,29 (Supplementary Datas 1 and 2), that comprise barley originating from Africa, Europe, Central, and East Asia including the Tibetan plateau (Fig. 1a). Our analyses strongly suggest that contemporary qingke are derived from eastern domesticated barley providing genomic evidence that the earliest barley was introduced to southern Tibet most likely via north Pakistan, India, and Nepal between 4500 and 3500 cal y B.P.

Results

Origin of qingke

Resequencing of 177 barley genomes, predominantly sampled from Tibet, generated a total of 8.5 terabase (Tb) of high-quality cleaned sequences, with an average of 48.1 gigabase (Gb) per accession (~9.6-fold haploid barley genome coverage, Supplementary Data 1) and revealed 56.3 million (M) SNPs and 3.9 M small insertions and deletions (INDELs) (Supplementary Table 2). A total of 0.54% of the identified SNP and 0.35% of the INDEL polymorphisms resided in coding sequences (CDS) of high-confidence genes³⁰. The ratio of nonsynonymous/synonymous SNPs was ~1.11, while 0.23% of the total INDELs led to frameshifts (Supplementary Table 3). An overlapping total of 1.55 M SNPs (Supplementary Table 2) was found in a set of 260 published exome sequences²⁹ (ES) from a barley world collection.

Population structure analyses using this overlapping set of SNPs (principal component analysis (PCA), phylogenomic tree, Fig. 2a, b) separated wild and domesticated barleys into two clades/clusters, confirming previous findings^29,31,32. The domesticated barley clade/cluster was further divided into two subclades/subclusters explained by the geographic origin of the genotypes, as reported by Morrell et al.³². One clade (clade I, Fig. 2c) included most of the cultivars and landraces of western Asia, central Asia, Africa, and Europe; the other (clade II, Fig. 2c) included landraces of central Asia, eastern Asia, as well as Tibetan barley (qingke and Tibetan weedy barley). For convenience, excluding the cultivars and Tibetan barley, we defined the domesticated barley landraces of clade I as western barley, and those of clade II as eastern barley. Clade II showed that qingke was closer to all eastern barleys than to wild and western barleys.

The evolutionary history of qingke was further inferred by individual ancestry coefficients (Fig. 2d). With K from 4 to 9, new subpopulations arose from each barley clade (wild, western, and eastern). PCA confirmed the existence of such subpopulations (Supplementary Figure 1a, c, e). We studied the relationship of these subpopulations with respect to their geographic origin (Supplementary Figure 1b, d, f). For western and eastern barley, accessions originating from close geographic proximity showed a closer relatedness, emphasizing that geographic origin was the main differentiating factor^31,32.

By filtering admixed samples, we divided the western and eastern barleys, as well as wild and qingke accessions into four groups based on the PCA: wild, western, eastern, and qingke group (Supplementary Figure 2a). In addition, the majority of the 177 WGS samples, which were qingke and cultivars clustered with western barley (Supplementary Figure 3), were divided into two groups: western cultivars and the qingke group (Supplementary Figure 2b). These defined groups represented the pure barley populations of wild barley, western landraces, eastern landraces, qingke, and western cultivars. Population genetic analysis, including nucleotide diversity (π), Watterson’s estimator (θ_W), gene diversity/heterozygosity (H_E), Tajima’s D, recombination rate (ρ), minor allele frequency (MAF) distributions, and linkage disequilibrium (LD) (r²), of the defined barley groups was carried out to test whether Tibet could be recognized as a center of barley diversity and origin as previously suggested^3,4,5,6,7. This could also help us understand whether it was rather a center of adaptive and diversifying selection. Across the genome, we observed an average reduction in genetic diversity described by π, θ_W, H_E of 50% in western and eastern landraces relative to wild barley, and ~50% in qingke relative to western and eastern landraces (Table 1; Supplementary Tables 4–6). Interestingly, chromosome 4H exhibited the lowest genetic diversity, which, furthermore, significantly decreased from wild barley to all domesticated barley groups. Similar observations have been made previously for barley³⁰ and were also revealed for the syntenic wheat chromosome 4D³³, possibly indicating a universal phenomenon for the group 4 chromosomes of the Triticeae tribe. Overall genetic diversity is higher toward the terminal regions of all chromosomes in barley and low in the proximal nonrecombining regions as shown before³⁰ (Supplementary Figures 4–7). 
The wild group had the highest proportion of low-frequency alleles (also supported by the negative Tajima’s D, Table 1), while the qingke group had the lowest (Supplementary Figure 8). The highest level of LD was present in the qingke group (Supplementary Figure 9), with the genome-wide population recombination rate ρ in qingke estimated to be ~16% of the rate in the western group, or to be ~50% of the rate in the eastern group (Table 1; Supplementary Table 7). Based on the even distribution of the geographic origin of the qingke samples used in this study, representing most of the inhabited area of Tibet, we conclude that the set used is most likely representative for the diversity present in Tibetan barley. The qingke group possessed the lowest genetic diversity, lowest proportion of low-frequency alleles and the highest LD compared to all other wild and domesticated barley groups. All of these factors favor that Tibet was not a center of origin or domestication of barley. A very recent study on H. agriocrithon came to a similar conclusion¹¹.
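The diversity statistics compared above can be illustrated with a minimal sketch. It assumes a sample of n haploid (or fully inbred) sequences, biallelic SNPs summarized by their alternate-allele counts, and a known number of surveyed base pairs; the function names are hypothetical and this is not the software used in the study. Tajima's D takes the sign of π − θ_W, so an excess of low-frequency alleles (θ_W > π) gives a negative D.

```python
def nucleotide_diversity(alt_counts, n, seq_len):
    """Per-bp pi: mean pairwise differences over n sequences.

    alt_counts: alternate-allele count at each segregating site.
    n: number of sampled sequences; seq_len: surveyed base pairs.
    """
    pairs = n * (n - 1)
    total = sum(2.0 * k * (n - k) / pairs for k in alt_counts)
    return total / seq_len


def watterson_theta(num_segregating, n, seq_len):
    """Per-bp Watterson estimator: S / a1, with a1 = sum_{i=1}^{n-1} 1/i."""
    a1 = sum(1.0 / i for i in range(1, n))
    return num_segregating / a1 / seq_len


# Toy data (illustrative only): 4 sequences, 2 SNPs, 10 bp surveyed.
pi = nucleotide_diversity([1, 2], n=4, seq_len=10)
theta_w = watterson_theta(2, n=4, seq_len=10)
```

In this toy case π exceeds θ_W, so Tajima's D would be positive; the wild group in Table 1 shows the opposite pattern.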

Table 1 Genetic diversity and Tajima’s D in barley groups

We applied D statistics for a better understanding of the relationship between qingke and the other barley groups and to infer the most likely origin of qingke. First, we studied the relationship of the qingke group with the wild, western, and eastern groups (Fig. 3a, b; Supplementary Table 8). The wild group was geographically and genetically divided into two subpopulations of western Asia (wild-WA) and central Asia (wild-CA). With the qingke group fixed in P₃, the highest D value was generated when P₁ was the eastern group, and the second highest appeared when P₁ was the western group. Although western domesticated barley (western group) is geographically more distant from Tibet than the wild-CA barleys originating from central Asia, it had a more positive D value. This indicated a domesticated barley origin for qingke, e.g., from eastern domesticated barley.

We also studied qingke in relation to two subpopulations of the eastern group representing Central and South Asian origins like North Pakistan, India, and Nepal/Western Tibetan Plateau (eastern-CA) and East-Asian origins like East China and the eastern Tibetan Plateau (eastern-EA) (Fig. 3c, d; Supplementary Table 9). When qingke was fixed as P₃, the D value for P₁ = eastern-CA was more positive than for P₁ = eastern-EA, revealing a higher probability for a South Asian introduction of barley into Tibet, which is in contrast to traditional narratives for the introduction of wheat and barley to the Tibetan plateau through Northwest China^13,14,15,16,17,18,19. Our analysis did not provide evidence for the East-Asian introduction of barley into Tibet but favored an introduction via North Pakistan, India, and Nepal to the southern Tibetan plateau (route III, Fig. 1c). This hypothesis was supported by a recent archaeological study²¹, which reported some new barley archaeological sites in northeastern India. The newly discovered carbonized barley is earlier (~4500 cal y B.P.) than the previously reported archaeological sites in the northeastern Tibetan Plateau¹⁹ (~4000–3500 cal y B.P.).
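For readers unfamiliar with D statistics, the following is a minimal sketch of the standard ABBA-BABA form computed from derived-allele frequencies, assuming a tree (((P1, P2), P3), outgroup). The function name and the toy frequencies are hypothetical, and sign conventions differ between tools: in this sketch a positive D means P2 shares more derived alleles with P3 than P1 does. The population arrangement and convention used in the study are given in its own methods and may differ.

```python
def patterson_d(site_freqs):
    """Patterson's D from per-SNP derived-allele frequencies.

    site_freqs: iterable of (p1, p2, p3, p_out) tuples, one per SNP.
    Returns (sum ABBA - sum BABA) / (sum ABBA + sum BABA).
    """
    num = 0.0
    den = 0.0
    for p1, p2, p3, p_out in site_freqs:
        abba = (1 - p1) * p2 * p3 * (1 - p_out)  # P2 and P3 share the derived allele
        baba = p1 * (1 - p2) * p3 * (1 - p_out)  # P1 and P3 share the derived allele
        num += abba - baba
        den += abba + baba
    return num / den if den > 0 else float("nan")


# Toy example: two ABBA sites and one BABA site.
d_value = patterson_d([(0, 1, 1, 0), (1, 0, 1, 0), (0, 1, 1, 0)])
```

Significance is normally assessed with a block jackknife over the genome, which this sketch omits.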

We observed genome-wide genetic diversity, presented as per-bp values (Table 1; Supplementary Tables 4–7) or as values in 10 kb windows across the barley genome (Fig. 4a; Supplementary Figures 4–7), to be lower in qingke than in the other barley groups, indicating a founder effect event (bottleneck) in the history of qingke. Demographic analyses based on our WGS SNPs revealed that the effective population size was low in qingke compared to eastern Asian landraces and western cultivars from ~2000 to ~4500 years ago, suggesting a continuous founder effect lasting ~2500 years (Fig. 4b). We surmised three possibilities to explain the founder effect. (i) A small subpopulation of eastern domesticated barley with an initially small effective population size was introduced to Tibet and evolved into qingke. (ii) Tibet has complex geographic patterns, including plateau in the west, river valleys in the south, and canyons in the east. The barley that was adaptable to the local Tibetan environment was possibly selected by Tibetan settlers as the main local landraces. (iii) A six-rowed spike has more grains than a two-rowed spike, and a hulless caryopsis is more convenient for human food than a hulled caryopsis, resulting in Tibetans preferring the six-rowed hulless barley. Over the same period (~2000–~4500 cal y B.P.), the effective population size of the eastern Asian landraces, which entered central China around Tibet (route I, Fig. 1c), remained constant between ~4374 and ~4500 (Fig. 4b), indicating that the Tibetan Plateau environment was the main factor behind the founder effect. However, the small sample size of eastern barley from central Asia in this study limited further examination of the founder effect. For instance, we do not know the distribution and proportion of six-rowed hulless barley in eastern barley in general, or whether the six-rowed hulless barley was selected in India and Nepal before introduction to Tibet. 
Resolving this will require the comprehensive investigation and collection of barley landraces in central and southern Asia.

The postulated beginning of the founder effect (~4500 cal y B.P.) is very similar to the age of ancient barley (>4500 cal y B.P.) in northeast India²¹. Considering barley had arrived in southern Tibet by ~3500 cal y B.P.²² (Changguogou site), we inferred the approximate earliest introduction time of qingke to southwestern Tibet was between 4500 and 3500 cal y B.P.

The differentiation of two subpopulations for eastern barley was revealed by their individual ancestry coefficients (Fig. 2d; Supplementary Figure 1e, f), indicating that ancient barley in central Asia split into two clades. One spread to India, Nepal, and Tibet, evolving into qingke (route III, Fig. 1c); the other entered North and East China, possibly from areas in northwestern China (route I, Fig. 1c). The demographic analyses revealed that the separation time of the two clades was ~8000 cal y B.P. (Fig. 4c).

We used the fixation index (F_ST) approach to investigate the selection signals of local population adaptation for qingke in the exome capture target region of the barley genome (Fig. 5a; Supplementary Data 3). By comparing qingke with eastern landraces, eight regions were identified as candidate selective regions, including the region of Naked caryopsis (nud) residing on chromosome 7H. Strong selective sweep signals were revealed in several F_ST-based candidate selected regions (Fig. 5b). The analysis relying on exome capture target regions only did not provide sufficient resolution for determining the exact physical boundaries of the selective sweeps and for identification of individual major selected genes. This step of analysis will depend on the accumulation of additional whole genome resequencing data of eastern barley in future studies.
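As an illustration of an F_ST comparison between two groups, here is a minimal sketch. This section does not state which F_ST estimator was used, so the sketch assumes Hudson's estimator (ratio of averages over SNPs) as a stand-in; the function name and inputs are hypothetical.

```python
def hudson_fst(snps):
    """Hudson's F_ST as a ratio of averages over SNPs.

    snps: iterable of (p1, n1, p2, n2), where p is the allele frequency
    and n the number of sampled chromosomes in each population.
    """
    num = 0.0
    den = 0.0
    for p1, n1, p2, n2 in snps:
        # Squared frequency difference, corrected for sampling variance.
        num += (p1 - p2) ** 2 - p1 * (1 - p1) / (n1 - 1) - p2 * (1 - p2) / (n2 - 1)
        den += p1 * (1 - p2) + p2 * (1 - p1)
    return num / den if den > 0 else float("nan")


# A SNP fixed for different alleles in the two groups gives F_ST = 1.
fst_fixed = hudson_fst([(1.0, 10, 0.0, 10)])
```

Applied in sliding windows along a chromosome, high values of such a statistic flag candidate regions such as the one around nud; the window size and the threshold for calling a candidate are choices this sketch does not make.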

Qingke was derived from barley in the Fertile Crescent

In addition to the global barley diversity analysis, the haplotypes of five genes involved in three key domestication traits of barley were determined. These were the genes for nonbrittle rachis (btr1 and btr2), six-rowed spike (vrs1 and int-c), and naked caryopsis (nud)^34,35,36,37. Sanger sequencing confirmed the accuracy of the automated genotype calls for these genes.

Qingke shared the same haplotypes with other domesticated barleys at the btr1, btr2, and int-c loci (Supplementary Datas 4–6; Supplementary Figures 10 and 11). All of the hulless barleys, including qingke and two Ethiopian hulless landraces (WGS-Ld1 and WGS-Ld2), showed the same large 1.6 kb deletion involving the nud gene (Supplementary Data 7; Supplementary Figure 12). Excluding three two-rowed qingke cultivar accessions (WGS-Qk67, WGS-Qk68, and WGS-Qk69), 30% of qingke accessions carried the allele vrs1.a1, while the other 70% carried the allele vrs1.a4 reported by Cuesta-Marcos et al.³⁸ (Fig. 6b; Supplementary Data 8). A recent study¹¹ suggested that vrs1.a4 arose in a wild barley population of Uzbekistan in central Asia. This underpinned a central Asian origin of qingke and supported our inference that qingke was derived from eastern domesticated barley (Fig. 6d). Mutations in the coding region of vrs1 converted two-rowed into six-rowed barley³⁵. However, six-rowed vrs1.a4 carriers have not been found to show any lesions within the Vrs1 ORF³⁸. The markedly reduced abundance of the Vrs1 transcript in vrs1.a4 carriers has been proposed as the determinant of the six-rowed phenotype, but the causative mutation(s) have not been identified^38,39. As reported³⁸, the vrs1.a4 allele of six-rowed qingke showed no mutation in any exon of the gene; the only unique SNP was at bp position 387 of the vrs1.b complete gene sequence (gi|119943316) and therefore resides in an intron. The SNP has been reported in four vrs1.a4 accessions³⁸ and was found uniquely fixed in 87 vrs1.a4 carriers in this study, while other reported vrs1.a4 variants³⁸ were not uniquely fixed (Fig. 6a). Thus, this SNP is a candidate causal mutation for the six-rowed phenotype and should be studied in the future.

Altogether, qingke shared the known domestication gene alleles with worldwide cultivated barley thus supporting that barley domestication has only occurred in the Fertile Crescent¹¹ and rejecting a Chinese (or Tibetan) domestication of qingke.

Origin of H. agriocrithon and Tibetan weedy barleys

The origin of Tibetan wild or weedy barley was still unclear despite the previous studies^8,9,10,11,12. Six H. agriocrithon (Tibetan six-rowed wild barley) accessions and ten Tibetan weedy barley accessions were included in this study for resequencing. In all analyses of population structure (phylogenetic tree, PCA, and individual ancestry coefficients) only two H. agriocrithon accessions clustered close to the true wild barley (H. spontaneum) cluster (Fig. 2a, b); all other accessions of H. agriocrithon and Tibetan weedy barleys clustered within the eastern barley as previously reported^11,34,40. Although two of the six H. agriocrithon accessions in the present study clustered with H. spontaneum, at the six-rowed trait locus vrs1 all H. agriocrithon accessions showed the six-row conferring haplotypes vrs1.a1 or vrs1.a4 indicating their feral or hybridization origin from domesticated or partially domesticated barley^8,9,10,11 (Fig. 6). Ten Tibetan weedy barleys showed slightly higher genetic diversity (π = 1.43 × 10⁻³, θ_W = 1.37 × 10⁻³) than qingke (Table 1), but lower than other domesticated barleys (Table 1), and also showed domestication gene haplotypes that were shared with domesticated barley. At the btr1 locus, we found two Tibetan weedy barleys (WGS-Tw4 and WGS-Tw7) with brittle rachis. They exhibited a new haplotype which could have occurred from hybridization between the western (btr1Btr2) and the eastern domesticated type (Btr1btr2) followed by recombination between the btr1 and btr2 loci (Supplementary Datas 4 and 5; Supplementary Figure 10). Such a recombination could have restored the Btr1Btr2 genotype conferring brittleness of the rachis thus mimicking the phenotype of true wild barley. In a recent report by Pourkheirandish et al.¹¹, the same scenario was proposed for Tibetan H. agriocrithon having originated from hybridization between six-rowed landraces carrying btr1Btr2 and Btr1btr2 genotypes.

Qingke was the dominant barley in Tibet, and other Tibetan barleys have been considered weeds by Tibetans in their agricultural activity for generations^3,4,5. These weeds might have occurred through fertilization on field borders and spread of seeds through animal dung. A similar case, explained by de-domestication, has been reported for weedy rice^41,42. Although H. agriocrithon and other Tibetan weedy barley accessions were only present as a small sample in this study, our results, without exception, clearly supported a feral or hybridization origin of Tibetan wild barley¹¹.

Discussion

We want to conclude with a cautionary note on methodology. Our conclusions regarding the origin of qingke are supported by multiple lines of evidence from analyses carried out at the whole-genome level. However, there are multiple reasons why caution needs to be applied in interpreting patterns of diversity at finer scales: (i) incompleteness and inaccuracy of the current barley reference genome sequence assembly⁴³; (ii) uncertainties of aligning short reads in a plant species with a large and complex reference genome, which may result in reduced mapping rates for haplotypes divergent from the Morex reference; and related to this (iii) presence–absence variation, which results in ignoring sequence variation in genes absent from the Morex reference. In the future, the construction of multiple high-quality reference sequences including representatives of qingke germplasm may contribute to obtaining a full picture of haplotype diversity in barley.

Our population genomics study showed that qingke originated from the eastern domesticated barleys. Qingke landraces brought to the Tibetan plateau were derived primarily from a south Asian origin between ~4500 and ~3500 cal y B.P., supporting the hypothesis that a southern route of crop introduction into Tibet was an important factor. A founder effect event occurred in the qingke population between ~4500 and ~2000 cal y B.P. The accumulated sequence data as well as the SNP and INDEL information will be of great use for further barley population genomic studies.

Methods

Sample preparation and sequencing

The 172 WGS barley accessions used in this study, including wild barleys (including a semiwild accession: Hordeum vulgare var. gymnospermum Korn), cultivars (produced by cross-breeding), western and eastern landraces, qingke landraces, qingke cultivars (produced by cross-breeding with different qingke landraces; three two-rowed qingke accessions, WGS-Qk67, WGS-Qk68, and WGS-Qk69, were produced by cross-breeding between a qingke landrace and two-rowed domesticated barley), and Tibetan weedy barley, were provided by (i) the National Crop Genebank of China (NCGC) and (ii) the Tibetan Academy of Agricultural and Animal Husbandry Sciences (TAAAS) (Supplementary Data 1). NCGC is an official germplasm resource bank of the Chinese Academy of Agricultural Sciences, and detailed information on the barley accessions is available on the website http://www.cgris.net/cgris_english.html. DNA was extracted from the leaves of 4-week-old seedlings. Sequencing was performed on an Illumina HiSeq 2000 or HiSeq 4000 platform.

Previously published WGS samples, including Hordeum pubiflorum²⁸ and 5 barley cultivars²⁷, and 260 exome sequencing samples²⁹, including 97 wild barley accessions (91 H. spontaneum and 6 H. agriocrithon accessions) and 163 landraces, were downloaded (Supplementary Data 2).

Alignment and variant calling

We cleaned the Illumina NGS raw data to remove adaptors, trim low-quality bases, and remove “N” bases with Trimmomatic⁴⁴ (V0.36). The clean reads were mapped to the barley reference genome³⁰ with BWA⁴⁵ (V0.7.10-r789, mapping method: MEM). The high-quality mapped reads (mapped, nonduplicated reads with mapping quality ≥ 20) were selected with the Samtools⁴⁶ (V1.3.1) commands “view -F 4 -q 20” and “rmdup”. For the 178 WGS samples (including Hordeum pubiflorum), the mapping statistics based on the high-quality mapped reads of each accession included: (1) the coverage depth of each chromosomal position (Samtools command “depth”); (2) the proportion of the barley genome covered at different read depths (Supplementary Figure 13a). Considering that on average ~30% of the barley genome was uncovered in the WGS accessions (Supplementary Figure 13b), the region covered by at least two reads in ≥80% of the WGS accessions was defined as the WGS effective covered region of the barley genome. The overlap between the WGS effective covered region and the exome target region²⁹ (https://doi.org/10.5447/IPK/2016/27) was termed the overlapped effective covered region (Supplementary Table 10).
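The definition of the WGS effective covered region (at least two reads in ≥80% of accessions) can be sketched as follows. This is an illustrative in-memory version with hypothetical names and toy depths; a real barley-scale run would stream per-position depth output rather than hold whole chromosomes in lists.

```python
def effective_covered_positions(depths, min_depth=2, min_fraction=0.8):
    """Return indices of positions covered in enough accessions.

    depths: dict mapping accession name -> list of per-position read
    depths (all lists the same length).
    """
    per_accession = list(depths.values())
    n_acc = len(per_accession)
    n_pos = len(per_accession[0])
    kept = []
    for pos in range(n_pos):
        covered = sum(1 for d in per_accession if d[pos] >= min_depth)
        if covered / n_acc >= min_fraction:
            kept.append(pos)
    return kept


# Toy example: five accessions, three positions.
toy = {
    "acc1": [2, 1, 10],
    "acc2": [3, 1, 10],
    "acc3": [5, 5, 10],
    "acc4": [2, 5, 10],
    "acc5": [0, 5, 10],
}
region = effective_covered_positions(toy)
```

Position 0 passes with exactly 4 of 5 accessions covered, position 1 fails with 3 of 5.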

Only high-quality mapped reads were used for variant calling. BAM files were sorted and PCR duplicates were marked with Picard (V1.117, http://broadinstitute.github.io/picard/), then variant calling was performed using the Genome Analysis Toolkit⁴⁷ (GATK, V3.3-0-g37228af). The 178 WGS and 260 ES samples were combined for variant calling. For barley, no whole-genome SNP and INDEL datasets were available for running the Base Quality Score Recalibrator (BQSR) and INDEL Realigner, so we used the following approach, as the GATK website recommends for non-human data (https://gatkforums.broadinstitute.org/gatk/discussion/1706/best-recommendation-for-base-recalibration-on-non-human-data). First, we did an initial round of variant calling on our original data with the HaplotypeCaller command. Without available truth/training variants, we used the GATK hard filter to filter the variants instead of Variant Quality Score Recalibration, as GATK recommends (https://software.broadinstitute.org/gatk/documentation/article.php?id=3225). The hard-filter parameters were left at their defaults (for SNPs: QD < 2.0, FS > 60.0, MQ < 40.0, MQRankSum < −12.5; for short INDELs: QD < 2.0, FS > 200.0, ReadPosRankSum < −20.0). Applying the hard filter provided an initial confidence in the SNP and INDEL sets. Second, the original BAM files were processed with BQSR and INDEL Realigner using the initial-confidence SNPs and INDELs. Running HaplotypeCaller and the hard filter again further improved confidence in the SNP and INDEL sets. This dataset of 438 accessions (1 Hordeum pubiflorum, 177 WGS, and 260 ES barley accessions) was considered the raw confidence variant set.
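The hard-filter thresholds listed above can be restated as a small predicate. This is only a sketch of the logic applied to already-parsed INFO annotations, not GATK itself; the function name is hypothetical, and treating a missing annotation as "does not fail" is an assumption of this sketch.

```python
def fails_hard_filter(ann, kind):
    """Return True if a variant fails the thresholds quoted in the text.

    ann: dict of INFO annotations (e.g. {"QD": 12.3, "FS": 1.2, ...}).
    kind: "SNP" or "INDEL".
    Assumption: an annotation absent from `ann` never triggers a failure.
    """
    def below(key, threshold):
        return key in ann and ann[key] < threshold

    def above(key, threshold):
        return key in ann and ann[key] > threshold

    if kind == "SNP":
        return (below("QD", 2.0) or above("FS", 60.0)
                or below("MQ", 40.0) or below("MQRankSum", -12.5))
    if kind == "INDEL":
        return (below("QD", 2.0) or above("FS", 200.0)
                or below("ReadPosRankSum", -20.0))
    raise ValueError("kind must be 'SNP' or 'INDEL'")
```

Note that the strand-bias cutoff (FS) is much looser for INDELs than for SNPs, so the same annotation values can pass as an INDEL and fail as a SNP.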

To obtain high-quality variant sets, the raw confidence variants were filtered following the steps Russell et al.²⁹ used for barley exome resequencing data. For WGS data, the variants of the 177 WGS barley accessions were extracted from the raw confidence variant set and subjected to the following filtering steps: (1) only variants in the WGS effective covered region were kept; (2) only bi-allelic and polymorphic variants were kept; (3) genotype calls were considered successful if read depth was ≥2 and ≤50, otherwise they were regarded as missing; (4) variant positions with more than 80% heterozygous calls or more than 20% missing genotype calls were discarded; (5) both alleles of a variant were required to occur in at least one individual in the homozygous state. For the overlapped variants between WGS samples and ES samples, the variants of 437 barley samples (without Hordeum pubiflorum) were extracted from the raw confidence variant set and subjected to similar filtering steps. The differences were: (1) only variants in the overlapped effective covered region were kept and (2) genotype calls were considered successful if read depth and the genotype quality score were both ≥10 for deeply sequenced ES samples, as used by Russell et al.²⁹.
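Steps (3) to (5) of the WGS filtering can be sketched for a single bi-allelic site as below. The function name and the genotype encoding are hypothetical. The text does not say whether the 80% heterozygosity cutoff is taken over all samples or over successful calls only; this sketch assumes successful calls.

```python
def keep_site(calls, min_dp=2, max_dp=50, max_het=0.8, max_missing=0.2):
    """Decide whether one bi-allelic site passes steps (3)-(5).

    calls: list of (genotype, depth) per accession, with genotype one of
    "0/0", "0/1", "1/1", or None for an uncalled genotype.
    """
    # Step 3: a call counts only if its depth lies within [min_dp, max_dp].
    called = [gt for gt, dp in calls
              if gt is not None and min_dp <= dp <= max_dp]
    # Step 4: discard sites with too many missing or heterozygous calls.
    missing_fraction = (len(calls) - len(called)) / len(calls)
    if missing_fraction > max_missing or not called:
        return False
    het_fraction = called.count("0/1") / len(called)  # assumption: of called
    if het_fraction > max_het:
        return False
    # Step 5: each allele must appear in at least one homozygote.
    return "0/0" in called and "1/1" in called
```

For example, a site with nine reference homozygotes and one heterozygote is dropped at step (5), because the alternate allele never appears in the homozygous state.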

After filtering, two kinds of high-quality variants were obtained. One was the genome-wide variants of the 177 WGS barley accessions (the SNPs are called WGS SNPs data in the following analyses); the other was the overlapped SNPs between the 177 WGS accessions and the 260 ES accessions (called overlapped SNPs data in the following analyses). To estimate the quality of our genome-wide SNPs, ten primers were designed with Primer-BLAST (https://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome). Thirteen accessions, including two wild barleys, six cultivars, and five qingke accessions, were used for Sanger sequencing (Supplementary Datas 9 and 10). The annotation of genome-wide SNPs and INDELs was based on barley’s high-confidence (39,734 genes) and low-confidence (41,545 genes) gene sets³⁰ with an in-house Perl script and Reseqtools⁴⁸ (V0.25, https://github.com/BGI-shenzhen/Reseqtools).

Population structure analyses

A phylogenomic tree was constructed from the distance matrix calculated with PHYLIP 3.68 (http://evolution.genetics.washington.edu/phylip.html) and displayed with iTOL⁴⁹ (V3, http://itol.embl.de/). PCA was performed with EIGENSOFT⁵⁰ (V6.0.1).

Individual ancestry coefficients were estimated from the overlapped SNPs data with sNMF⁵¹ (V1.2), which is more appropriate for inbred species⁵¹. Moreover, we chose sNMF because it had shown good performance in barley²⁹. Before running sNMF, we treated the overlapped SNP sets as haploid, coding all heterozygous sites as missing data. The sNMF command was called with parameter −m 1 (assuming haploid data for the predominantly inbreeding barley) for K values between 1 and 15. For each K value, 100 replicate runs were performed with random, varied seeds. The Q proportions were averaged across the 10 replicates with the lowest cross-entropy by CLUMPP⁵² (V1.1.2) and plotted by Distruct⁵³. K = 9 was chosen because the major subpopulations reported by Russell et al.²⁹ occurred from K = 2 to K = 9 and these stable subpopulation results were sufficient for the following analyses. For wild and western barley, if the component for subpopulations within the major group was ≥0.65, the samples were classified as wild or western, respectively, and the remaining samples were deemed admixed. For eastern barley, the critical component value was set to 0.5. To confirm the existence of these subpopulations, we performed PCA separately for wild, western and eastern barleys with EIGENSOFT⁵⁰ (V6.0.1).
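The threshold rule for assigning samples to major groups can be sketched as follows. This is only an illustration under stated assumptions: the mapping from sNMF clusters to major groups (`cluster_to_group`) is hypothetical, and the sketch assumes that the "component within the major group" is the sum of the Q coefficients of all subpopulations belonging to that group.

```python
from typing import Dict, List

# Critical values from the text: 0.65 for wild and western barley, 0.5 for eastern barley.
THRESHOLDS: Dict[str, float] = {"wild": 0.65, "western": 0.65, "eastern": 0.50}


def classify_sample(q: List[float], cluster_to_group: Dict[int, str]) -> str:
    """Assign one sample to a major group, or to 'admixed', from its Q vector.

    q[k] is the ancestry coefficient for cluster k; cluster_to_group maps each
    cluster index to 'wild', 'western' or 'eastern' (a hypothetical mapping).
    """
    totals = {group: 0.0 for group in THRESHOLDS}
    for k, coefficient in enumerate(q):
        totals[cluster_to_group[k]] += coefficient
    # The coefficients sum to 1 and every threshold is at least 0.5,
    # so at most one group can reach its threshold.
    for group, threshold in THRESHOLDS.items():
        if totals[group] >= threshold:
            return group
    return "admixed"
```

With a mapping such as `{0: "wild", 1: "western", 2: "western", 3: "eastern"}`, a sample with Q = [0.1, 0.4, 0.4, 0.1] would be labelled western, while Q = [0.4, 0.3, 0.3, 0.0] would be labelled admixed.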

Population genetic statistics

Only the non-admixed samples in both PC1–PC2 and PC1–PC3 of PCA results were defined as groups (Supplementary Figure 2). Admixed samples can be the result of recent outcrossing between traditional qingke and Chinese elite varieties growing side-by-side in the fields of Tibetan farmers. Such admixed samples would not be informative about the origin and history of qingke. The groups with sample size ≥20 were used in the following population genetic analyses.

The proportion of heterozygous SNPs among the total SNPs was <2% in most of the accessions, while the accessions with a high proportion of heterozygous SNPs (>4%) included a large proportion of cultivars, indicating that barley is a tightly inbred species (Supplementary Figure 14). Because inbred samples are closer to haploid than diploid⁵⁴, we treated the SNP datasets as haploid for all of the following population genetic analyses, coding all heterozygous sites as missing data. LD was calculated in the barley groups using PopLDdecay⁵⁵ (V3.31) with the command “−MAF 0.01 −Het 0.8 −Miss 0.8 −MaxDist 1000”. For genome-wide LD, the pairwise r² values were calculated for individual chromosomes using SNPs from the corresponding chromosome and then averaged across the whole genome. The nucleotide diversity (π), Watterson’s estimator (θ_W), gene diversity/heterozygosity (H_E) per bp and Tajima’s D for haploid data were estimated with our in-house Perl scripts based on their definitions^56,57,58,59. The unbiased value of π, θ_W, and H_E for a window was equal to the sum of the per-site values divided by the size of the effective covered region of the window. The recombination rate (ρ = 4N_e r) was estimated using a composite likelihood approach⁶⁰ with Maxhap (http://home.uchicago.edu/rhudson1/source/maxhap.html) on a per-contig level^29,61. We considered only SNPs with minor allele counts ≥3 located in contigs that contained at least 20 SNPs. Values of ρ per bp were estimated across a grid of values from 1 × 10⁻⁴ to 0.2, assuming no homologous gene conversion. The fixation index (F_ST) between pairs of groups was calculated using Hudson’s estimator with the explicit formula given as Eq. (10) in Bhatia et al.⁶² (Supplementary Table 11), which is independent of sample sizes. The average F_ST of a window was considered only when the window comprised at least 5 SNPs per kb. For window-based calculation of π, θ_W, H_E and F_ST, the window size was set to 10 kb with a 2 kb step. Only windows that comprised ≥2 kb of effective covered region were considered. The distributions of π, θ_W, H_E, and ρ across the barley chromosomes were plotted using Gnuplot (V5.2, http://www.gnuplot.info/) with the “smooth bezier” treatment based on the per-window or per-contig values.
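As an illustration of the per-window diversity statistics for haploid data, the sketch below computes π and θ_W per bp (normalized by the effective covered region) and Tajima's D from the standard definitions. It is not the authors' in-house Perl code. The function name is hypothetical, and the sketch assumes that every site in the window has the same number of sampled sequences `n`, that is, no missing data; how the original scripts handle sites with missing calls is not described here.

```python
import math
from typing import List, Tuple


def window_diversity(alt_counts: List[int], n: int, covered_bp: int) -> Tuple[float, float, float]:
    """Return (pi per bp, theta_W per bp, Tajima's D) for one window.

    alt_counts[i] is the number of the n haploid sequences carrying the
    alternate allele at site i; covered_bp is the effective covered region
    of the window in bp.
    """
    if n < 2 or covered_bp <= 0:
        raise ValueError("need n >= 2 sequences and a positive covered region")
    segregating = [k for k in alt_counts if 0 < k < n]
    s = len(segregating)
    # Sum over sites of the mean number of pairwise differences.
    pi_sum = sum(2.0 * k * (n - k) / (n * (n - 1)) for k in segregating)
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    theta_sum = s / a1
    # Variance constants of Tajima (1989).
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * s + e2 * s * (s - 1)
    tajima_d = (pi_sum - theta_sum) / math.sqrt(variance) if variance > 0 else math.nan
    return pi_sum / covered_bp, theta_sum / covered_bp, tajima_d
```

Dividing by the effective covered region rather than by the nominal 10 kb window size keeps windows with sparse coverage from appearing artificially low in diversity.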

In addition, we considered the ten Tibetan weedy barleys as a group and calculated the nucleotide diversity (π) and Watterson’s estimator (θ_W) based on the overlapped SNPs data using the same methods as above.

D statistics

The four-population D statistics⁶³ were calculated using ADMIXtools⁶⁴ (V4.1). The SNP matrix was converted to EIGENSOFT format using fcGENE⁶⁵ (V1.0.7) and CONVERTF⁶⁴. The barley relative H. pubiflorum was set as the out-group. The genotype of H. pubiflorum was directly extracted from the raw confidence variant sets of 438 samples.
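For readers unfamiliar with the four-population test, the following sketch computes D from population allele frequencies using the ABBA/BABA form given by Durand et al.⁶³. It is not the ADMIXtools implementation: the function name and input layout are assumptions, no standard error or block jackknife is computed, and sign conventions can differ between tools.

```python
import math
from typing import Iterable, Tuple


def d_statistic(freqs: Iterable[Tuple[float, float, float, float]]) -> float:
    """D for the population order (P1, P2, P3, outgroup).

    Each tuple holds the frequency of the same allele in P1, P2, P3 and the
    outgroup at one SNP. In this form, a positive D indicates excess allele
    sharing between P2 and P3, and a negative D between P1 and P3.
    """
    abba = 0.0
    baba = 0.0
    for p1, p2, p3, p_out in freqs:
        abba += (1 - p1) * p2 * p3 * (1 - p_out)
        baba += p1 * (1 - p2) * p3 * (1 - p_out)
    total = abba + baba
    return (abba - baba) / total if total > 0 else math.nan
```

With a single-sample outgroup such as H. pubiflorum, `p_out` is simply 0 or 1 at each site, so only sites where the outgroup carries the other allele contribute.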

Demographic history

The sequential Markov coalescent implemented in SMC++⁶⁶ (V1.13) was used to estimate the demographic history of qingke. SMC++ is better suited to genome-wide SNPs than to exon-wide SNPs⁶⁶, so the three groups (western cultivar group, qingke group, and eastern Asian group) based on WGS SNPs data were used. The recommended number of samples per population for an SMC++ run was 2–10 (https://github.com/popgenmethods/smcpp). The eastern Asian group included only seven samples. To equalize the sample size, we randomly selected seven different samples that were evenly distributed in the PCA of the western cultivar and qingke groups (Supplementary Figure 15). For each group, two replicate selections were performed. The non-WGS effective covered region was masked with the parameter “vcf2smc −m”. For each group, every sample was set as the pair of distinguished lineages once (parameter “vcf2smc −d”) to generate varied, independently evolving sequences. All of the sequences were used as input for each group when running “SMC++ estimate”. The split time between the qingke and eastern Asian groups was estimated with the command “SMC++ split”. A mutation rate of 6.5 × 10⁻⁹ per site per generation, which was used for demographic estimation in rice⁶⁷, and a constant generation time of 1 year were assumed to convert coalescent generations into years.

Candidate genomic region for plateau adaption of qingke

To examine local adaptation of qingke, the F_ST calculated above between the qingke group and the eastern group was used. Windows (10 kb with a 2 kb step) with F_ST ≥ 0.6 were regarded as candidate selective regions. Genes overlapping these regions (including 2 kb upstream and downstream) were considered candidate genes. These genes were aligned against the Swiss-Prot protein sequence database (UniProt release 2015_04) with BLAST⁶⁸ (V2.2.26), using an E-value cutoff of 1 × 10⁻⁵. The Swiss-Prot symbols were used to search for gene function. In addition, SweepFinder⁶⁹ was used to examine whether these candidate regions overlapped with selective sweeps.
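The window scan can be sketched in two parts: Hudson's F_ST estimator as given in Eq. (10) of Bhatia et al.⁶², and the selection of genes lying within 2 kb of windows with F_ST ≥ 0.6. This is an illustrative sketch, not the authors' scripts. The function names and tuple layouts are hypothetical, and combining SNPs within a window as a ratio of averages is an assumption; the text only states that an average F_ST per window was used.

```python
import math
from typing import List, Tuple


def hudson_terms(p1: float, n1: int, p2: float, n2: int) -> Tuple[float, float]:
    """Numerator and denominator of Hudson's F_ST for one SNP.

    p1, p2 are sample allele frequencies and n1, n2 the numbers of sampled
    haploid sequences (each at least 2) in the two groups.
    """
    numerator = (p1 - p2) ** 2 - p1 * (1 - p1) / (n1 - 1) - p2 * (1 - p2) / (n2 - 1)
    denominator = p1 * (1 - p2) + p2 * (1 - p1)
    return numerator, denominator


def window_fst(sites: List[Tuple[float, int, float, int]]) -> float:
    """F_ST of a window from its SNPs, combined as a ratio of averages (assumption)."""
    terms = [hudson_terms(*site) for site in sites]
    total_denominator = sum(den for _, den in terms)
    if total_denominator <= 0:
        return math.nan
    return sum(num for num, _ in terms) / total_denominator


def candidate_genes(windows: List[Tuple[int, int, float]],
                    genes: List[Tuple[str, int, int]],
                    fst_cutoff: float = 0.6,
                    flank: int = 2000) -> List[str]:
    """Names of genes whose span, extended by `flank` bp on both sides,
    overlaps at least one window with F_ST >= fst_cutoff.

    windows are (start, end, fst) and genes are (name, start, end) on one chromosome.
    """
    hits = []
    for name, gene_start, gene_end in genes:
        for win_start, win_end, fst in windows:
            if fst >= fst_cutoff and win_start <= gene_end + flank and win_end >= gene_start - flank:
                hits.append(name)
                break
    return hits
```

Because Hudson's estimator subtracts a sampling correction, windows with little differentiation can yield slightly negative values; these fall below the cutoff and are ignored.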

Haplotype of key domestication genes

The sequences of the five genes (btr1, btr2, vrs1, int-c, and nud, Supplementary Data 11) were downloaded from NCBI and aligned to the barley genome using IPK’s BLAST server (http://webblast.ipk-gatersleben.de/barley_ibsc/). The genotypes of the five genes were identified as follows. (1) If the gene was annotated in the barley genome high-quality gene set (int-c: HORVU4Hr1G007040.1; nud: HORVU7Hr1G089930.5), we directly extracted its genotype from our high-quality SNPs and INDELs data. (2) If the gene was not annotated in the barley genome or not uniquely mapped to it (btr1 and vrs1), the best mapped region ±20 kb in the barley genome was extracted and used as a reference for variant calling. The variant calling steps were the same as those used for our confidence variant calling. (3) An assembly error in the barley genome resulted in part of the btr2 sequence mapping to one position and another part mapping to another position (Supplementary Data 11). We therefore directly used the downloaded wild Btr2 sequence (gi|914342917) as a reference for variant calling. The coding regions of btr1, btr2, int-c, and nud were not covered by exome sequences, so the haplotypes of these four genes were identified only in the 177 WGS samples. The vrs1 locus was covered by exome sequences, providing the haplotype in all of the barley samples. Only variants with a MAF ≥ 5% were considered. To assess the genotype quality of these genes, four primers were designed with Primer-BLAST (https://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome) for btr1, btr2, vrs1, and int-c (Supplementary Data 11). Seventy-three successful PCR-based Sanger sequences confirmed the genotypes examined. The median-joining haplotype networks based on the SNPs of these genes were constructed with PopART⁷⁰ (V1.7). The deletion around the nud locus in each sample was revealed by read depth.
In addition, phased SNPs around each gene locus were prepared using SHAPEIT⁷¹ (v2.r790). Genotypes identical to the barley reference genome were coded as 0, while altered ones were coded as 1. The two phased haplotypes of each accession were plotted with Gnuplot (V5.2, http://www.gnuplot.info/).
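The 0/1 coding of phased haplotypes and the MAF ≥ 5% criterion can be illustrated with a short sketch. It assumes that each phased haplotype is available as a string of alleles at the SNP positions of a locus; the function names and this input layout are hypothetical and do not reflect the SHAPEIT output format.

```python
from typing import List


def encode_against_reference(haplotype: str, reference: str) -> List[int]:
    """Code each allele as 0 if it matches the reference genome, else 1."""
    if len(haplotype) != len(reference):
        raise ValueError("haplotype and reference must cover the same SNP positions")
    return [0 if allele == ref else 1 for allele, ref in zip(haplotype, reference)]


def minor_allele_frequency(column: List[int]) -> float:
    """MAF of one SNP from its 0/1 codes across all haplotypes."""
    alt_frequency = sum(column) / len(column)
    return min(alt_frequency, 1.0 - alt_frequency)


def filter_by_maf(matrix: List[List[int]], min_maf: float = 0.05) -> List[List[int]]:
    """Keep only SNP columns with MAF >= min_maf; rows are haplotypes."""
    n_sites = len(matrix[0]) if matrix else 0
    keep = [j for j in range(n_sites)
            if minor_allele_frequency([row[j] for row in matrix]) >= min_maf]
    return [[row[j] for j in keep] for row in matrix]
```

The resulting 0/1 matrix, with two rows per accession, is the kind of input that can be drawn directly as a haplotype plot.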

Reporting summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this article.

Code availability

The in-house Perl and Shell scripts for read mapping, variant calling, filtering and annotation, and population genetic analyses (π, θ_W, H_E, Tajima’s D, and F_ST) are available at https://sourceforge.net/projects/origin-of-qingke-barley/files/?source=navbar

Data availability

All data and genetic material used for this paper are available from the authors on request. The sequence data that support the findings of this study have been deposited in NCBI under the BioProject PRJNA417220 with the Sequence Read Archive (SRA) number SRP131710. The accession codes of the 200 Sanger sequences deposited in NCBI are MG879031 to MG879230. Genotype matrices for SNPs and INDELs are available from https://doi.org/10.5447/IPK/2018/15. Plant materials used in this study can be requested from the corresponding author Nyima Tashi or from the Chinese Crop Germplasm Resources Information System (http://www.cgris.net/). A reporting summary for this article is available as a Supplementary Information file. The source data underlying Figs. 1–6, Table 1, Supplementary Figures 1–9, 10a–c, 11–15, and Supplementary Tables 4–7 and 11 are provided as a Source Data file.

References

Badr, A. et al. On the origin and domestication history of barley (Hordeum vulgare). Mol. Biol. Evol. 17, 499–510 (2000).
Newman, C. W. & Newman, R. K. A brief history of barley foods. Cereal Foods World 51, 4–7 (2006).
Hsu, T. Origin and phylogeny of cultivated barley with reference to the discovery of Ganze wild two-rowed barley Hordeum spontaneum C. koch. Acta Genet. Sin. 2, 006 (1975).
Ma, D. et al. The classification and distribution of wild barley in the Tibet Autonomous Region. Sci. Agric. Sin. 20, 1–6 (1987).
Ma, D. The research on classification and origin of cultivated barley in Tibet Autonomous Region. Sci. Agric. Sin. 21, 7–14 (1988).
Dai, F. et al. Tibet is one of the centers of domestication of cultivated barley. Proc. Natl Acad. Sci. 109, 16969–16973 (2012).
Ren, X. et al. Tibet as a potential domestication center of cultivated barley of China. PLoS One 8, e62700 (2013).
Åberg, E. Hordeum agriocrithon nova sp., a wild six-rowed barley. Ann. Agric. Coll. Swed. 6, 159–212 (1938).
Konishi, T. Genetic diversity in Hordeum agriocrithon E. Åberg, six-rowed barley with brittle rachis, from Tibet. Genet. Resour. Crop Evol. 48, 27–34 (2001).
Tanno, K. & Takeda, K. On the origin of six-rowed barley with brittle rachis, agriocrithon [Hordeum vulgare ssp. vulgare f. agriocrithon (Åberg) Bowd.], based on a DNA marker closely linked to the vrs1 (six-row gene) locus. Theor. Appl. Genet. 110, 145–150 (2004).
Pourkheirandish, M. et al. Elucidation of the origin of “agriocrithon” based on domestication genes questions the hypothesis that Tibet is one of the centers of barley domestication. Plant J. 94, 525–534 (2018).
von Bothmer, R., Yen, C. & Yang, J. Does wild, six-rowed barley, Hordeum agriocrithon really exist. Plant Genet. Resour. Newsl. 77, 17–19 (1990).
Li, X. et al. Early cultivated wheat and broadening of agriculture in Neolithic China. Holocene 17, 555–560 (2007).
Zhijun, Z. Eastward Spread of Wheat into China—New Data and New Issues. Chin. Archaeol. 9, 1–9 (2009).
Dodson, J. R. et al. Origin and spread of wheat in China. Quat. Sci. Rev. 72, 108–111 (2013).
Barton, L. & An, C. B. An evaluation of competing hypotheses for the early adoption of wheat in East Asia. World Archaeol. 46, 775–798 (2014).
Long, T. et al. The early history of wheat in China from ¹⁴C dating and Bayesian chronological modelling. Nat. Plants 4, 272–279 (2018).
Wang, J. et al. Revealing a 5000-y-old beer recipe in China. Proc. Natl Acad. Sci. 2016, 01465 (2016).
Chen, F. H. et al. Agriculture facilitated permanent human occupation of the Tibetan Plateau after 3600 B.P. Science 347, 248–250 (2015).
Aldenderfer, M. & Yinong, Z. The prehistory of the Tibetan Plateau to the seventh century AD: perspectives and research from China and the West since 1950. J. World Prehist. 18, 1–55 (2004).
Liu, X. et al. Journey to the east: diverse routes and variable flowering times for wheat and barley en route to prehistoric China. PLoS One 12, e0187405 (2017).
Fu, D. X., Xu, T. W. & Feng, Z. Y. The ancient carbonized barley (Hordeum vulgare L. var. nudum) kernel discovered in the middle of Yalu Tsanypo river basin in Tibet. Southwest China J. Agric. Sci. 13, 38–41 (2000).
Guedes, J. A. et al. Moving agriculture onto the Tibetan plateau: the archaeobotanical evidence. Archaeol. Anthropol. Sci. 6, 255–269 (2014).
Guedes, J. A. et al. Early evidence for the use of wheat and barley as staple crops on the margins of the Tibetan Plateau. Proc. Natl Acad. Sci. 112, 5625–5630 (2015).
Dai, F. et al. Transcriptome profiling reveals mosaic genomic origins of modern cultivated barley. Proc. Natl Acad. Sci. 111, 13403–13408 (2014).
Dai, F. et al. Assembly and analysis of a qingke reference genome demonstrate its close genetic relation to modern cultivated barley. Plant Biotechnol. J. 16, 760–770 (2017).
International Barley Genome Sequencing Consortium. A physical, genetic and functional sequence assembly of the barley genome. Nature 491, 711–716 (2012).
Mascher, M. et al. Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond. Plant J. 76, 494–505 (2013).
Russell, J. et al. Exome sequencing of geographically diverse barley landraces and wild relatives gives insights into environmental adaptation. Nat. Genet. 48, 1024–1030 (2016).
Mascher, M. et al. A chromosome conformation capture ordered sequence of the barley genome. Nature 544, 427–433 (2017).
Poets, A. et al. Barley landraces are characterized by geographically heterogeneous genomic origins. Genome Biol. 16, 173 (2015).
Morrell, P. L. et al. Resequencing data indicate a modest effect of domestication on diversity in barley: a cultigen with multiple origins. J. Hered. 105, 253–264 (2013).
Jordan, K. W. et al. A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes. Genome Biol. 16, 48 (2015).
Pourkheirandish, M. et al. Evolution of the grain dispersal system in barley. Cell 162, 527–539 (2015).
Komatsuda, T. et al. Six-rowed barley originated from a mutation in a homeodomain-leucine zipper I-class homeobox gene. Proc. Natl Acad. Sci. 104, 1424–1429 (2007).
Ramsay, L. et al. INTERMEDIUM-C, a modifier of lateral spikelet fertility in barley, is an ortholog of the maize domestication gene TEOSINTE BRANCHED 1. Nat. Genet. 43, 169–172 (2011).
Taketa, S. et al. Barley grain with adhering hulls is controlled by an ERF family transcription factor gene regulating a lipid biosynthesis pathway. Proc. Natl Acad. Sci. 105, 4062–4067 (2008).
Cuesta-Marcos, A. et al. Genome-wide SNPs and re-sequencing of growth habit and inflorescence genes in barley: implications for association mapping in germplasm arrays varying in size and structure. BMC Genom. 11, 707 (2010).
Sakuma, S. et al. Divergence of expression pattern contributed to neofunctionalization of duplicated HD-Zip I transcription factor in barley. New Phytol. 97, 939–948 (2013).
Azhaguvel, P. & Komatsuda, T. A phylogenetic analysis based on nucleotide sequence of a marker linked to the brittle rachis locus indicates a diphyletic origin of barley. Ann. Bot. 100, 1009–1015 (2007).
Li, L. F. et al. Signatures of adaptation in the weedy rice genome. Nat. Genet. 49, 811–814 (2017).
Qiu, J. et al. Genomic variation associated with local adaptation of weedy rice during de-domestication. Nat. Commun. 8, 15323 (2017).
Beier, S. et al. Construction of a map-based reference genome sequence for barley Hordeum vulgare L. Sci. Data 4, 170044 (2017).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
He, W. et al. ReSeqTools: an integrated toolkit for large-scale next-generation sequencing based resequencing analysis. Genet. Mol. Res. 12, 6275–6283 (2013).
Letunic, I. & Bork, P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 44, W242–W245 (2016).
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Frichot, E. et al. Fast and efficient estimation of individual ancestry coefficients. Genetics 113, 160572 (2014).
Jakobsson, M. & Rosenberg, N. A. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 (2007).
Rosenberg, N. A. distruct: a program for the graphical display of population structure. Mol. Ecol. Notes 4, 137–138 (2003).
Nordborg, M. & Donnelly, P. The coalescent process with selfing. Genetics 146, 1185–1195 (1997).
Zhang C., et al. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. https://doi.org/10.1093/bioinformatics/bty875 (2018).
Tajima, F. Evolutionary relationship of DNA sequences in finite populations. Genetics 105, 437–460 (1983).
Watterson, G. A. On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7, 256–276 (1975).
Nei, M. Analysis of gene diversity in subdivided populations. Proc. Natl Acad. Sci. 70, 3321–3323 (1973).
Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123, 585–595 (1989).
Hudson, R. R. Two-locus sampling distributions and their application. Genetics 159, 1805–1817 (2001).
Hufford, M. B. et al. Comparative population genomics of maize domestication and improvement. Nat. Genet. 44, 808 (2012).
Bhatia G., et al. Estimating and interpreting FST: the impact of rare variants. Genome Res. https://doi.org/10.1101/gr.154831.113 (2013).
Durand, E. Y. et al. Testing for ancient admixture between closely related populations. Mol. Biol. Evol. 28, 2239–2252 (2011).
Patterson N. J., et al. Ancient admixture in human history. Genetics. https://doi.org/10.1534/genetics.112.145037 (2012).
Roshyara, N. R. & Scholz, M. fcGENE: a versatile tool for processing and transforming SNP datasets. PLoS One 9, e97589 (2014).
Terhorst, J., Kamm, J. A. & Song, Y. S. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat. Genet. 49, 303 (2017).
Meyer, R. S. et al. Domestication history and geographical adaptation inferred from a SNP map of African rice. Nat. Genet. 48, 1083 (2016).
Altschul, S. F. et al. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Nielsen, R. et al. Genomic scans for selective sweeps using SNP data. Genome Res. 15, 1566–1575 (2005).
Leigh, J. W. & Bryant, D. popart: full-feature software for haplotype network construction. Methods Ecol. Evol. 6, 1110–1116 (2015).
Delaneau, O., Marchini, J. & Zagury, J. F. A linear complexity phasing method for thousands of genomes. Nat. Methods 9, 179–181 (2011).

Acknowledgements

We thank Dr. Eviatar Nevo, Dr. Takao Komatsuda, Dr. Jade d’Alpoim Guedes, and Dr. Mark S. Aldenderfer for critical reading of this manuscript and for their important suggestions. We thank State Key Laboratory of Agricultural Genomics (No. 2011DQ782025) for their suggestion. This work was supported by the following funding sources: the Financial Special Fund (2014CZZX001, 2015CZZX001, 2017CZZX001, and XZNKY-2018-C-021) and the Tibet Department of Major Projects (XZ201801NA01).

Author information

These authors contributed equally: Xingquan Zeng, Yu Guo.

Authors and Affiliations

State Key Laboratory of Hulless Barley and Yak Germplasm Resources and Genetic Improvement, Lhasa, 850002, China
Xingquan Zeng, Qijun Xu, Hongjun Yuan, Yulin Wang, Zexiu Wei, Sang Zha, Yawei Tang & Nyima Tashi
Research Institute of Agriculture, Tibet Academy of Agriculture and Animal Husbandry Sciences, Lhasa Tibet, 850002, China
Xingquan Zeng, Qijun Xu, Hongjun Yuan, Yulin Wang, Sang Zha & Yawei Tang
BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China
Yu Guo, Likai Mao, Qingfeng Liu, Zhanfeng Xia, Juhong Zhou, Shuaishuai Tai, Li Song, Shiming Li, Weiming He, Shancen Zhao, Xiaodong Fang, Qiang Gao & Ye Yin
Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Seeland, Germany
Martin Mascher & Nils Stein
Institute of Crop Science, Chinese Academy of Agriculture Sciences, Beijing, 100081, China
Ganggang Guo & Jing Zhang
Department of Computer Science, City University of Hong Kong, Hong Kong, 999077, China
Shuaicheng Li
Tibet Academy of Agriculture and Animal Husbandry Sciences, Lhasa Tibet, 850002, China
Zexiu Wei & Nyima Tashi
Chengdu Life Baseline Technology Co., Ltd., Chengdu, 610041, China
Lijun Bai & Zhenhua Zhuang
BGI-Shenzhen, Shenzhen, 518083, China
Jian Wang & Huanming Yang
James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
Jian Wang & Huanming Yang
Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Brisbane, QLD, 4072, Australia
Robert J. Henry

Contributions

N.T., X.Z., and Q.X. designed and managed the project. Y.G. performed all of the bioinformatics analyses and wrote the manuscript. N.S. gave insightful instructions on the data analysis, suggestions, comments, and revision on the manuscript. M.M. gave insightful instructions on the data analysis. N.T., X.Z., S.L., L.M. and R.J.H. revised the manuscript. N.T., X.Z., G.G., H.Y., Y.W., Z.W., S.Z., Y.T. and J.Z. collected the accessions. Q.L., Z.X. and J.Z. helped to plot the figures. N.T., X.Z., S.T., L.S., S.L., L.B., Z.Z., W.H., S.Z., X.F., Q.G., Y.Y., J.W. and H.Y. helped with language editing.

Corresponding authors

Correspondence to Nils Stein or Nyima Tashi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Journal Peer Review Information: Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Zeng, X., Guo, Y., Xu, Q. et al. Origin and evolution of qingke barley in Tibet. Nat Commun 9, 5433 (2018). https://doi.org/10.1038/s41467-018-07920-5

Received: 26 March 2018
Accepted: 05 December 2018
Published: 21 December 2018
DOI: https://doi.org/10.1038/s41467-018-07920-5
