A footprint of plant eco-geographic adaptation on the composition of the barley rhizosphere bacterial microbiota

The microbiota thriving in the rhizosphere, the thin layer of soil surrounding plant roots, plays a critical role in plant’s adaptation to the environment. Domestication and breeding selection have progressively differentiated the microbiota of modern crops from the ones of their wild ancestors. However, the impact of eco-geographical constraints faced by domesticated plants and crop wild relatives on recruitment and maintenance of the rhizosphere microbiota remains to be fully elucidated. Here we performed a comparative 16S rRNA gene survey of the rhizosphere of 4 domesticated and 20 wild barley (Hordeum vulgare) genotypes grown in an agricultural soil under controlled environmental conditions. We demonstrated the enrichment of individual bacteria mirrored the distinct eco-geographical constraints faced by their host plants. Unexpectedly, Elite varieties exerted a stronger genotype effect on the rhizosphere microbiota when compared with wild barley genotypes adapted to desert environments with a preferential enrichment for members of Actinobacteria. Finally, in wild barley genotypes, we discovered a limited, but significant, correlation between microbiota diversity and host genomic diversity. Our results revealed a footprint of the host’s adaptation to the environment on the assembly of the bacteria thriving at the root–soil interface. In the tested conditions, this recruitment cue layered atop of the distinct evolutionary trajectories of wild and domesticated plants and, at least in part, is encoded by the barley genome. This knowledge will be critical to design experimental approaches aimed at elucidating the recruitment cues of the barley microbiota across a range of soil types.

. Description of the genotypes used in this study. Eco-geographical group; sampling site or type of the Elite material, genotype ID; mean annual rainfall (MAR*), mid-day temperature in January (MDT1*), Elevation, and soil bulk density (Db*), organic matter content (OM*) of the 'B1K' sampling sites from 19,39 . a Missing data.

Taxonomic diversification of the barley microbiota across barley genotypes.
To study the impact of these differential responses on the composition of the barley microbiota we generated 6,646,864 16S rRNA gene sequencing reads from 76 rhizosphere and unplanted soil specimens. These high-quality sequencing reads yielded 11,212 Operational Taxonomic Units (OTUs) at 97% identity (Supplementary Dataset 1: worksheet 2). A closer inspection of the taxonomic affiliation of the retrieved OTUs revealed that members of five bacterial phyla, namely Acidobacteria, Actinobacteria, Bacteroidetes, Proteobacteria and Verrucomicrobia, accounted for more than 97.8% of the observed reads (Fig. 2, Supplementary Dataset 1: worksheet 3). Among these dominant phyla, Bacteroidetes and Proteobacteria were significantly enriched in rhizosphere compared to bulk soil profiles (ANCOM, cut-off 0.6, alpha 0.05, taxa-based corrected, Supplementary Dataset 1: worksheets 4-5).
Next, we investigated the lower ranks of the taxonomic assignments (i.e., OTU level) and computed the Observed OTUs, Chao1 and Shannon indexes for each sample type. This analysis further supported the notion of the rhizosphere as a 'reduced complexity community' , as both the Observed OTUs and Shannon indexes, but not the projected Chao1, identified significantly richer and more even communities in the bulk soil samples compared to plant-associated specimens (P < 0.05, Mann-Whitney U test; Figure S3). Interestingly, when we compared the Chao1 index within rhizosphere samples, we observed that members of the 'Desert 1' group assembled a richer community compared with the other genotypes (P < 0.05, Kruskal-Wallis non-parametric analysis of variance followed by Dunn's post hoc test; Figure S3). www.nature.com/scientificreports/ To gain further insights into the impact of the sample type on the barley microbiota we generated a canonical analysis of principal coordinates (CAP) using the weighted Unifrac distance, which is sensitive to OTU relative abundance and phylogenetic relatedness. This analysis revealed a marked effect of the microhabitat, i.e., either bulk soil or rhizosphere, on the composition of the microbiota as evidenced by the spatial separation on the axis accounting for the major variation (Fig. 3). Interestingly, we observed a clustering of bacterial community composition within rhizosphere samples, which was more marked between 'Desert' and 'Elite' samples (Fig. 3). These observations were corroborated by a permutational analysis of variance which attributed ~ 30% of the observed variation to the microhabitat and, within rhizosphere samples, ~ 17% of the variation to the individual eco-geographic groups (Permanova P < 0.001, 5,000 permutations, Table 2). Strikingly similar results were obtained when we computed a Bray-Curtis dissimilarity matrix, which is sensitive to OTUs relative abundance only (Table 2; Figure S4). The dominant phyla of the bulk soil and rhizosphere microbiota are conserved across barley genotypes. Average relative abundance (% of sequencing reads) of the dominant phyla retrieved from the microbial profiles of indicated samples. Only phyla displaying a minimum average relative abundance of 1% included in the graphical representation. Stars depict phyla enriched in and discriminating between rhizosphere and between bulk soil samples (ANCOM, cut-off 0.6, alpha 0.05, taxa-corrected).   www.nature.com/scientificreports/ Taken together, these data indicate that the composition of the barley microbiota is fine-tuned by plant recruitment cues which progressively differentiate between unplanted soil and rhizosphere samples and, within these latter, wild ecotypes from elite varieties.
A footprint of host eco-geographic adaptation shapes the wild barley rhizosphere microbiota. To gain insights into the bacteria underpinning the observed microbiota diversification we performed a series of pair-wise comparisons between 'Elite' genotypes and each group of the wild barley ecotypes. This approach revealed a marked specialisation of the members of the 'Desert' ecotype compared to 'Elite' varieties as evidenced by the number of OTUs differentially recruited between members of these groups (Wald test, P < 0.05, FDR corrected; Fig. 4; Supplementary Dataset 1: worksheets 7-11). Thus, the wild barley 'Ecotype' emerged as an element shaping the recruitment cues of the barley rhizosphere microbiota.   www.nature.com/scientificreports/ A closer inspection of the OTUs differentially recruited between 'Desert' wild barley and 'Elite' varieties revealed that the domesticated material exerted the greatest selective impact on the soil biota, as the majority of the differentially enriched OTUs were enriched in 'Elite' varieties (Wald test, P < 0.05, FDR corrected; Supplementary Dataset 1: worksheets 7 and 8). Next, the taxonomic assignments of these 'Elite-enriched' OTUs versus the 'Desert' microbiota followed distinct patterns: while the comparison 'Elite'-'Desert 1' produced a subset of enriched OTUs assigned predominantly to Actinobacteria, Bacteroidetes and Proteobacteria, the comparison 'Elite'-'Desert 2' displayed a marked bias for members of the Actinobacteria (i.e., 44 out of 104 enriched OTUs, Fig. 5). Consistently, the cumulative abundance of sequencing reads assigned of those Actinobacterial OTUs in 'Elite' samples nearly doubled the one recorded for 'Desert 2' samples ( Figure S5). Within this phylum, we identified a broader taxonomic distribution, as those OTUs were assigned to the families Intrasporangiaceae, Micrococcaceae, Micromonosporaceae, Nocardioidaceae, Pseudonocardiaceae, Streptomycetaceae, as well as members of the order Frankiales. Interestingly, when we inspect intra-ecotype diversification we identified diagnostic OTUs capable of discriminating between 'Desert 1' and 'Desert 2' (Wald test, P < 0.05, FDR corrected; Supplementary Dataset 1: worksheets 12 and 13), while no such a feature was identified discriminating between 'Coast 1' and 'Coast 2' at the statistical test imposed. Taken together, our data indicate that wild barley 'Ecotype' (i.e., the differential effect of 'North' , 'Coast, and 'Desert' versus 'Elite') acts as a determinant for the rhizosphere barley microbiota whose composition is ultimately fine-tuned by a sub-specialisation within the 'Ecotype' itself (i.e., the differential effect of 'Desert 1' and 'Desert 2').
These observations prompted us to investigate whether the differential microbiota recruitment between the tested plants was encoded, at least in part, by the barley genome. We therefore generated a dissimilarity matrix using Single Nucleotide Polymorphisms (SNPs) available for the tested genotypes and we inferred their genetic relatedness using a simple matching coefficient (Supplementary Dataset 1: worksheet 14). With few notable exceptions, this analysis revealed three distinct clusters of genetically related plants, represented by and reflecting the 'Elite' material, the 'Desert' and the 'Coast' wild barley genotypes ( Figure S6). The genetic diversity between domesticated material exceeded their microbial diversity (compare relatedness of "Elite" samples in Fig. 3 with the ones of Figure S6) as further evidenced by the fact that we failed to identify a significant correlation between these parameters (P value > 0.05). However, when we focused the analysis solely on the pool of wild barley genotypes, we obtained a significant correlation between genetic and microbial distances (Mantel test r = 0.230; P value < 0.05; Fig. 6).
Taken together, this revealed a footprint of barley host's adaptation to the environment on the assembly of the bacteria thriving at the root-soil interface. This recruitment cue interjected the distinct evolutionary trajectories of wild and domesticated plants and, at least in part, is encoded by the barley genome.

Discussion
In this study we investigated how plant genotypes adapted to different eco-geographic niches may recruit a distinct microbiota once exposed to a common environment.
As we performed a 'common environment experiment' in a Scottish agricultural soil, we first determined how the chosen experimental conditions related to the ones witnessed by wild barleys in their natural habitats. Strikingly, the aboveground biomass gradient observed in our study, with 'Elite' material almost invariably outperforming wild genotypes and material sampled at the locations designated 'Desert 2' at the bottom of the ranking, "matched" the phenotypic characterisation of members of the 'B1K' collection grown in a 'common garden experiment' in a local Israeli soil 18 . Conversely, belowground resource allocation followed an opposite pattern as evidenced by an increased root:shoot dry weight ratio in wild genotypes compared to 'Elite' varieties. As responses to edaphic stress, such as drought tolerance, may modulate the magnitude of above-belowground resource partitioning in plants 21 and root traits 22 , our data might reflect the adaptation of the wild barley exposure to dry areas. Taken together, these results suggest that adaptive responses to eco-geographic constraints in barley have a genetic inheritance component which can be detected and studied in controlled conditions.
As genetically-inherited root traits have been implicated in shaping the rhizosphere microbiota in barley 23 and other crops 24 , these observations motivated us to examine whether these below-ground differences were reflected by changes in microbiota recruitment. The distribution of reads assigned to given phyla appears distinct in plantassociated communities which are dominated in terms of abundance by members of the phyla Acidobacteria, Actinobacteria, Bacteroidetes and Proteobacteria, with these two latter phyla significantly enriched in rhizosphere samples compared to bulk soil controls. This taxonomic affiliation is consistent with previous investigations in barley in either the same 23 or in a different soil type 15 as well as in other crop plants 25 . In summary, these data indicate that the higher taxonomic ranks of the barley rhizosphere microbiota are conserved across soil types as well as wild and domesticated genotypes.
The characterisation of the microbiota at lower taxonomic ranks, i.e., the OTU-level, revealed a significant effect of the microhabitat (i.e., either bulk soil or rhizosphere) and, within plant-associated communities, a footprint of eco-geographic adaptation. For instance, alpha diversity indexes clearly pointed at selective processes modulating bacterial composition as the number of Observed OTUs and the Shannon index indicate simplified and reduced-complexity communities inhabiting the rhizosphere compared to unplanted soil. This can be considered a hallmark of the rhizosphere microbiota as it has been observed in multiple plant species and across soils 6 . Conversely, within rhizosphere samples, alpha-diversity analysis failed to identify a clear pattern, except for the Chao1 index revealing a potential for a richer community associated with plants sampled at the 'Desert 1' locations. This motivated us to further explore the between-sample diversity, which is beta-diversity. This analysis revealed a clear host-dependent diversification of the bacteria associated to barley plants manifested by ~ 17% of the variance of the rhizosphere microbiota explained by the eco-geographical location of the sampled material. This value exceeded the host genotype effect on the rhizosphere microbiota we previously Scientific RepoRtS | (2020) 10:12916 | https://doi.org/10.1038/s41598-020-69672-x www.nature.com/scientificreports/ observed in wild and domesticated barley plants 15 , but is aligned with the magnitude of host effect observed in the rhizosphere microbiota of modern and ancestral genotypes of rice 26 and common bean 27 . As these studies were conducted in different soil types, our data suggest that the magnitude of host control on the rhizosphere microbiota is ultimately fine-tuned by and in response to soil characteristics. The identification of the bacteria underpinning the observed microbiota diversification led to three striking observations. First, the comparison between 'Elite' varieties and the material representing the 'Desert' ecotype   19,28 it is tempting to speculate that the adaptation to these environmental parameters played a predominant role also in shaping microbiota recruitment. Second, it is the domesticated material which exerted a stronger effect on microbiota recruitment, manifested by the increased number of host-enriched OTUs compared to wild barley genotypes. This suggests that the capacity of shaping the rhizosphere microbiota has not been "lost" during barley domestication and breeding selection. Our findings are consistent with data gathered for domesticated and ancestral common bean genotypes, which revealed that shifts from native soils to agricultural lands led to a stronger host-dependent effect on rhizosphere microbes 29 . Due to the intrinsic limitation of 16S rRNA gene profiles of predicting the functional potential of individual bacteria, it will be necessary to complement this investigation with whole-genome surveys 30,31 and metabolic analyses 16,32 to fully discern the impact of the host genotype on the functions provided by the rhizosphere microbiota to their hosts.
The third observation is the marked quantitative enrichment of OTUs assigned to the phylum Actinobacteria in 'Elite' varieties when compared to members of the 'Desert' ecotype, in particular plants of the 'Desert 2' locations. At first glance, the 'direction' of this bacterial enrichment is difficult to reconcile with the ecogeographic adaptation of wild barleys and, in particular, the fact that Actinobacteria are more tolerant to arid conditions 33 and, consequently, more abundant in desert versus non-desert soils 34 . However, the enrichment of Actinobacteria in modern crops compared to ancestral relatives has recently emerged as a distinctive feature of the microbiota of multiple plant species 35 . Although the ecological significance of this trait of the domesticated microbiota remains to be fully elucidated, studies conducted in rice 36 and other grasses, including barley 37 , indicate a relationship between drought stress and Actinobacteria enrichments. These observations suggest that the wild barley genome has evolved the capacity to recognise microbes specifically adapted to the local conditions and, in turn, to repress the growth of others. For instance, among the bacteria differentially enriched between 'Desert 1' and 'Desert 2' we identified genera, such as Arthrobacter sp., adapted to extreme environments and long-term nutrient starvation 38, possibly reflecting the differential adaptation of 'Desert 1' and 'Desert 2' plants to soil with limited organic matter 39 .
Interestingly, we were able to trace the host genotype effect on rhizosphere microbes to the genome of wild barley. This suggests that, similar to other wild species 11, microbiota recruitment co-evolved with other adaptive traits. Conversely, the genetic diversity in 'Elite' material largely exceeded microbiota diversity. This is reminiscent of studies conducted in maize which failed to identify a significant correlation between polymorphisms in the host genome and alpha-and beta-diversity characteristics of the rhizosphere microbiota 40,41 . Yet, and again similar to maize 42 , our data indicate that the recruitment of individual bacterial OTUs in the 'Elite' varieties, rather than community composition as a whole, is the feature of the rhizosphere microbiota under host genetic control.
Although these findings were gathered from the individual soil tested and further validation across a range of soil types is required, a prediction from these observations is that the host control of the rhizosphere microbiota is exerted by a limited number of loci in the genome with a relatively large effect. This is congruent with our previous observation that mono-mendelian mutations in a single root trait, root hairs, impact on ~ 18% of the barley rhizosphere microbiota 23 .
Likewise, this scenario is compatible with a limited number of genes controlling the biosynthesis and rhizodeposition of defensive secondary metabolites which have been implicated in shaping the plant microbiota 43 . Among these compounds, the indol-alkaloids benzoxazinoids recently gained centre-stage as master regulators of the maize-associated microbial communities [44][45][46] . Interestingly, H. vulgare has evolved a distinct indol-alkaloid compound, gramine 47 , which is preferentially accumulated in the tissues of the wild genotypes compared to 'Elite' varieites 48 and whose physiological properties are comparable to the ones of benzoxazinoids 49 . Whether gramine or other species-specific secondary metabolites contribute, at least in part, to shape the barley microbiota will be the focus of future investigations. www.nature.com/scientificreports/ Since modern varieties have been selected with limited or no knowledge of belowground interactions, how was the capacity of shaping the rhizosphere microbiota retained within the cultivated germplasm? The recent observation that genes controlling reproductive traits display pleiotropic effects on root system architecture 50 could provide a direct link between crop selection and microbiota recruitment in modern varieties. These traits, and in particular genes encoding flower developments, show a marked footprint of eco-geographic adaptation and have been selected during plant domestication and breeding 28 . By manipulating those genes, breeders may have manipulated also belowground traits, and in turn, the microbiota thriving at the root-soil interface. With an increased availability of genetic 51 and genomic 52 resources for wild and domesticated barleys, this hypothesis can now be experimentally tested and the adaptive significance of the barley rhizosphere microbiota ultimately deciphered. Specifically, intraspecific populations within the wild 53 as well as between wild and cultivated 51 germplasm, could be deployed in genetic mapping experiments aimed at identifying barley genetic determinants of the rhizosphere microbiota.

conclusions
Our results revealed a footprint of host's adaptation to the environment on the assembly of the bacteria thriving at the root-soil interface in barley. This recruitment cue layered atop of the distinct evolutionary trajectories of wild and domesticated plants and, at least in part, is encoded by the barley genome. Although our study was limited to the individual soil investigated, our sequencing survey will provide a reference dataset for the development of indexed bacterial collections of the barley microbiota. These can be used to infer causal relationships between microbiota composition and plant traits, as demonstrated for Arabidopsis thaliana 54 and rice 55 . Furthermore, this knowledge is critical for the establishment of reciprocal transplantation experiments aimed at elucidating the adaptive value of crop-microbiota interactions, similar to what has recently been proposed for the model plant A. thaliana 56 . However, for crop plants like barley, this will necessarily be conditioned by two elements: identifying the host genetic determinants of the rhizosphere microbiota and inferring microbial metabolic potential in situ. Ultimately, this will help devising strategies aimed at sustainably enhancing crop production for climate-smart agriculture.

Methods
Soil. The soil was sampled from the agricultural research fields of the James Hutton Institute, Invergowrie, Scotland, UK in the Quarryfield site (56° 27′ 5" N 3° 4′ 29" W; Sandy Silt Loam, pH 6.2; Organic Matter 5%; Table S1). This field was left unplanted and unfertilised in the 3 years preceding the investigations and previously used for barley-microbiota interactions investigations 23 . plant genotypes. Twenty wild barley genotypes (H. vulgare ssp. spontaneum) and four 'Elite' cultivars (H. vulgare ssp. vulgare) were used and described in Table 1. Wild barley genotypes were selected representing eco-geographical variation of the 'B1K' collection 18,19 . The 'Elite' genotypes were selected as a representation of different types of spring barley in plant genetic studies. The cultivar 'Morex' is an American six-row malting variety whose genome was the first to be sequenced 57 . The cultivars 'Bowman' and 'Barke' are two-row varieties, developed in US for feed and in Germany for malting, respectively, whereas Steptoe is an American six-row type used for animal feed 51, 58,59 . Plant growth conditions. Barley seeds were surface sterilized as previously reported 60 and germinated on 0.5% agar plates at room temperature. Seedlings displaying comparable rootlet development after 5 days postplating were sown individually in 12-cm diameter pots containing approximately 500 g of the 'Quarryfield' soil, plus unplanted pots filled with bulk soil as controls. Plants were arranged in a randomised design with this number of replicates: 'Coast1' number of replicates n = 12; 'Coast2' n = 12; 'Desert1' n = 11; 'Desert2' n = 12; 'North' n = 12; 'Elite' n = 13 (Supplementary Dataset 1: worksheet 1). Plants were grown for 5 weeks in a glasshouse at 18/14 °C (day/night) temperature regime with 16 h day length and watered every 2 days with 50 ml of deionized water.
Bulk soil and rhizosphere DNA preparation. At early stem elongation, corresponding to Zadoks stages 30-32 61, plants were pulled from the soil and the stems and leaves were separated from the roots ( Figure S1). Above-ground plant parts were dried at 70 °C for 72 h and the dry weight recorded. The roots were shaken manually to remove excess of loosely attached soil. For each barley plant, the top 6 cm of the seminal root system and the attached soil layer was collected and placed in sterile 50 ml falcon tube containing 15 ml phosphate-buffered saline solution (PBS). Rhizosphere was operationally defined, for these experiments, as the soil attached to this part of the roots and extracted through this procedure. The samples were then vortexed for 30 s and aseptically transferred to a second 50 ml falcon containing 15 ml PBS and vortexed again for 30 s to ensure the dislodging and suspension of the rhizosphere soil. Then, the two falcon tubes with the rhizosphere suspension were mixed and centrifuged at 1,500×g for 20 min, the supernatant was removed, with the rhizosphere soil collected as the pellet, flash frozen with liquid nitrogen and stored at − 80 °C, until further use. After the rhizosphere extraction step, these parts of the roots were combined with the rest of the root system for each plant, thoroughly washed with water removing any attached soil particles and dried at 70 °C for 72 h for root biomass measurement. Bulk soil samples were collected from the 6 cm below the surface of unplanted pots and subjected to the same procedure as above.
DNA was extracted from the rhizosphere samples using FastDNA SPIN Kit for Soil (MP Biomedicals, Solon, USA) according to the manufacturer's recommendations. The concentration and quality of DNA was checked using a Nanodrop 2000 (Thermo Fisher Scientific, Waltham, USA) spectrophotometer and stored at − 20 °C Scientific RepoRtS | (2020) 10:12916 | https://doi.org/10.1038/s41598-020-69672-x www.nature.com/scientificreports/ until further use. DNA concentration was used as a proxy for the proportion of the sampled microbiota and evaluated across sample type ( Figure S2).

Preparation of 16 rRNA gene amplicon pools.
The hypervariable V4 region of the small subunit rRNA gene was the target of amplification using the PCR primer pair 515F (5′-GTG CCA GCMGCC GCG GTAA-3′) and 806R (5′-GGA CTA CHVGGG TWT CTAAT-3′). The PCR primers had incorporated an Illumina flow cell adapter at their 5′ end and the reverse primers contained 12 bp unique 'barcode' for simultaneous sequencing of several samples 62 . PCR, including No-Template Controls (NTCs) for each barcoded primer, was performed as previously reported with the exception of the BSA at 10 mg/ml concentration per reaction 23 . Only samples whose NTCs yielded an undetectable PCR amplification were retained for further analysis.
Illumina 16S rRNA gene amplicon sequencing. The pooled amplicon library was submitted to the Genome Technology group, The James Hutton Institute (Invergowrie, UK) for quality control, processing and sequencing as previously described 23,63,64 . Briefly, samples were sequenced using an Illumina MiSeq platform with the 2 × 150 bp chemistry.
Sequencing reads processing. Sequencing reads were processed and analysed using a custom bioinformatics pipeline. First, Quantitative Insights into Microbial Ecology (QIIME) software, version 1.9.0, was used to process the FASTQ files following default parameters for each step 65 . The forward and reverse read files from individual libraries were decompressed and merged using the command join_paired_ends.py, with a minimum overlap of 30 bp between reads. Then, the reads were demultiplexed according to the barcode sequences. Quality filtering was performed using the command split_libraries_fastq.py, imposing a minimum acceptable PHRED score '-q' of 20. Next, these high quality reads were truncated at the 250th nucleotide using the function 'fastq_ filter' implemented in USEARCH 66 . Only these high-quality PE, length-truncated reads were used for clustering in Operational Taxonomic Units (OTUs) at 97% sequence identity. OTUs were identified using the 'closed reference' approach against Silva database (version 132) 67 . OTU-picking against the Silva database was performed using the SortMeRNA algorithm 68, producing in an OTU table containing the abundance of OTUs per sample plus a phylogenetic tree. To control for potential contaminant OTUs amplified during library preparation, we retrieved a list of potential environmental contaminant OTUs previously identified in our laboratory 64 and we used this list to filter the results of the aforementioned OTU-enrichment analysis. Additionally, singleton OTUs, (OTUs accounting for only one sequencing read in the whole dataset) and OTUs assigned to chloroplast and mitochondria (taken as plant derived sequences) were removed using the command filter_otus_from_otu_ tables.py. Taxonomy matrices, reporting the number of reads assigned to individual phyla, were generated using the command summarize_taxa.py. The OTU table, the phylogenetic tree and the taxonomy matrix, were further used in R for visualizations and statistical analysis.
Statistical analyses I: univariate datasets and 16S rRNA gene alpha and beta-diversity calculations. Analysis of the data was performed in R 69 using a custom script with the following packages: Phyloseq 70 for processing, Alpha and Beta-diversity metrics, ggplot2 71 for data visualisations, Vegan 72 for statistical analysis of beta-diversity, Ape 73 for phylogenetic tree analysis. For any univariate dataset used (e.g., aboveground biomass, DNA concentration) the normality of the data's distribution was checked using Shapiro-Wilk test. Nonparametric analysis of variance were performed by Kruskal-Wallis Rank Sum Test, followed by Dunn's post hoc test with the functions kruskal.test and the posthoc.kruskal.dunn.test, respectively, from the package PMCMR. For Alpha-diversity analysis, the OTU table was rarefied at 11,180 reads per sample and this allowed us to retain 8,744 OTUs for downstream analyses (Supplementary Dataset 1: worksheet 6). The Chao1, Observed OTUs and Shannon indices calculated using the function estimate richness in Phyloseq package. Beta-diversity was analysed using a normalized OTU table (i.e., not rarefied) for comparison. For the construction of the normalized OTU table, low abundance OTUs were further filtered removing those not present at least 5 times in 20% of the samples, to improve reproducibility. Then, to control for the uneven number of reads per specimen, individual OTU counts in each sample were divided over the total number of generated reads for that samples and converted in counts per million. Beta-diversity was analysed using two metrics: Bray-Curtis that considers OTUs relative abundance and Weighted Unifrac that additionally is sensitive to phylogenetic classification 74 . These dissimilarity matrices were visualized using Canonical Analysis of Principal coordinates (CAP) 75 using the ordinate function in the Phyloseq package and its significance was inspected using a permutational ANOVA over 5,000 permutations.
Beta-diversity dissimilarity matrices were assessed by Permutational Multivariate Analysis of Variance (Permanova) using Adonis function in Vegan package over 5,000 permutations to calculate effect size and statistical significance.
Statistical analyses II: analysis of Phyla and OTUs differentially enriched among samples. The analysis of the Phyla whose abundances differentiated among rhizosphere and bulk soil samples was performed with analysis of composition of microbiomes (ANCOM) 76 imposing 0.6 cut-off and 0.05 alpha value (taxa-based corrected) as previously described 77 .
The analysis of the OTUs whose abundances differentiated among samples was performed (a) between individual eco-geographic groups and bulk soil samples to assess the rhizosphere effect and (b) between the rhizosphere samples to assess the eco-geographic effect. The eco-geographic effect was further corrected for a microhabitat effect (i.e., for each group, only OTUs enriched against both unplanted soil and at least another barley genotype were retained for further analysis). The analysis was performed using the DESeq2 method 78 with Scientific RepoRtS | (2020) 10:12916 | https://doi.org/10.1038/s41598-020-69672-x www.nature.com/scientificreports/ an adjusted P value < 0.05 (False Discovery Rate, FDR corrected). This method was selected since it outperforms other hypothesis-testing approaches when data are not normally distributed and a limited number of individual replicates per condition (i.e., approximately 10) are available 79 . DESeq2 was performed using the eponymous named package in R with the OTU table filtered for low abundance OTUs as an input. The number of OTUs differentially recruited in the pair-wise comparisons between 'Elite' and wild barley genotypes was visualised using the package UpSetR 80 .
The phylogenetic tree was constructed using the representative sequences of the OTUs significantly differentiating 'Elite' genotypes and either 'Desert1' or 'Desert2' samples annotated with iTOL 81 .
Statistical analyses iii: correlation plot genetic distance-microbial distance. To assess the genetic variation on the barley germplasm we used the SNP platform 'BOPA1' 82 comprising 1,536 single nucleotide polymorphisms. We used GenAlex 6.5 83,84 to construct a genetic distance matrix using the simple matching coefficient. Genetic distance for the barley genotypes was visualised by hierarchical clustering using the function hclust in R. Microbial distance was calculated on the average distances for each ecogeographic group using the Weighted Unifrac metric. Correlation between the plant's genetic and microbial distances was performed using a mantel test with the mantel.rtest of the package ade4 in R. The correlation was visualised using the functions ggscatter of the R packages ggpbur.

Data availability
The sequences generated in the 16S rRNA gene sequencing survey are deposited in the European Nucleotide Archive (ENA) under the accession number PRJEB35359. The version of the individual packages and scripts used to analyse the data and generate the figures of this study are available at https ://githu b.com/Bulga relli D-Lab/Barle y_B1K Received: 19 February 2020; Accepted: 15 July 2020