Detection of breeding signatures in wheat using a linkage disequilibrium-corrected mapping approach

Dadshani, Said; Mathew, Boby; Ballvora, Agim; Mason, Annaliese S.; Léon, Jens

doi:10.1038/s41598-021-85226-1

Download PDF

Article
Open access
Published: 09 March 2021

Detection of breeding signatures in wheat using a linkage disequilibrium-corrected mapping approach

Said Dadshani¹,
Boby Mathew²,
Agim Ballvora¹,
Annaliese S. Mason¹ &
…
Jens Léon¹

Scientific Reports volume 11, Article number: 5527 (2021) Cite this article

3850 Accesses
16 Citations
5 Altmetric
Metrics details

Abstract

Marker assisted breeding, facilitated by reference genome assemblies, can help to produce cultivars adapted to changing environmental conditions. However, anomalous linkage disequilibrium (LD), where single markers show high LD with markers on other chromosomes but low LD with adjacent markers, is a serious impediment for genetic studies. We used a LD-correction approach to overcome these drawbacks, correcting the physical position of markers derived from 15 and 135 K arrays in a diversity panel of bread wheat representing 50 years of breeding history. We detected putative mismapping of 11.7% markers and improved the physical alignment of 5.4% markers. Population analysis indicated reduced genetic diversity over time as a result of breeding efforts. By analysis of outlier loci and allele frequency change over time we traced back the 2NS/2AS translocation of Aegilops ventricosa to one cultivar, “Cardos” (registered in 1998) which was the first among the panel to contain this translocation. A “selective sweep” for this important translocation region on chromosome 2AS was found, putatively linked to plant response to biotic stress factors. Our approach helps in overcoming the drawbacks of incorrectly anchored markers on the wheat reference assembly and facilitates detection of selective sweeps for important agronomic traits.

GRAS-Di system facilitates high-density genetic map construction and QTL identification in recombinant inbred lines of the wheat progenitor Aegilops tauschii

Article Open access 08 December 2020

The barley pan-genome reveals the hidden legacy of mutation breeding

Article Open access 25 November 2020

Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints

Article Open access 11 September 2020

Introduction

Wheat has played an essential role in the history of human civilization for centuries, and is currently the third largest staple crop, contributing about 20% of total dietary calories and proteins worldwide¹. It is predicted that by 2050 the production of wheat has to increase by 18% in order to prevent global food insecurity². Moreover, amplified by global climate change, detrimental environmental conditions such as heavy heatwaves followed by repeated precipitation extremes (heavy rain) have had dramatic negative effects on yield during the last few years³, and this may also compromise future food security. Triggered by the Green Revolution (GR), global wheat productivity increased dramatically in the last century (Shiferaw 2012). Main drivers of the GR were implementation of modern and efficient productions systems and breeding of high yielding wheat varieties⁴. During the last decades, the productivity of wheat in Western Europe persistently increased, making this area one of the highest-yielding regions in the world: Europe is a major net exporter of wheat⁵. However, in recent years the productivity of wheat in high-yielding regions reached a plateau. It is assumed that a biophysical yield ceiling for wheat determines the productivity of wheat in the high-yield countries of Northwest Europe (Grassini et al. 2013). Genetic bottlenecks due to past breeding events are recognized as major impediments to crop improvement^6,7. Along with the threat of global climate change, the adverse effects of loss of genetic diversity are a major threat to global food security^8,9.

Population genetics and genome-wide association studies (GWAS) are comprehensive approaches to assess genomic diversity and detect signatures of past or ongoing selection in breeding at the molecular level^10,11,12. Previously, multiple population genomics approaches have been applied to investigate signatures of selection and domestication in plant and livestock breeding as well as for evolutionary or natural selection studies^10,13. In wheat, several local patterns of low genetic variation, denoted as “selective sweeps”, as a consequence of strong directional selection processes have been reported with respect to flowering time and phenology¹⁴. Other selective sweeps have been identified as result from introgressions of resistance genes from relatives of wheat with lower ploidy levels into hexaploid wheat followed by selection for these loci^{15,16,17,18,19,20}. Generally, the detection of loci under selection during crop improvement can contribute to more targeted breeding efforts and the opportunity to improve genomic selection models^14,21.

Association panels which consist of unrelated genotypes with high genetic diversity offer an ideal basis for GWAS by linking molecular marker information with phenotypic information to uncover genes controlling phenotypic variation^22,23. Additionally, association panels incorporating historically important cultivars and modern cultivars are also valuable sources of information on the breeding process and breeding signatures. Historically, the application of available genomic tools in hexaploid wheat has lagged behind their use in other cereals such as maize and rice²⁴. This is mainly due to the complexity of the wheat genome: wheat is an allohexaploid with three closely related subgenomes, a large genome size of (~ 16 Gb) and > 85% repetitive element content²⁵. However, analyzing a set of winter wheat cultivars representing 50 years of German breeding history, Lichthardt et al. (2020) were able to identify a co-evolution effect determining sink and source allocation in plants which consequently affected grain yield potential. Large-scale field experiments with the same association panel also revealed that modern cultivars consistently outperform older varieties with respect to yield parameters, grain quality, nutrient use efficiency and disease resistance (Voss-Fels et al. 2019). Studying changes of allele frequency within these panels in the course of breeding history allows us to trace the origin of beneficial alleles and thus identify valuable sources of genetic variation that may have been neglected during the breeding process. By identification and selection of genotypes containing alleles of interest these alleles can be re-introduced into modern cultivars²⁶.

GWAS is based upon the principle of linkage disequilibrium (LD), the nonrandom association of alleles at different loci. LD is highly affected by population structure, which is a potential confounding factor in all genetic association studies by the existence of differing levels of genetic relatedness in populations^27,28,29. Hence, understanding population structure in an association panel is an essential requirement before undertaking GWAS³⁰. Multiple approaches are applied in order to correct for population structure such as Principal Components Analysis (PCA), Kinship analysis (K) and admixture proportion inference analysis (structure or admixture analysis)^31,32,33. Another prerequisite to efficiently performing GWAS is the availability of accurate (high-density) genetic or physical maps of the applied markers: preferably a reference genome sequence. Along with the presentation of the reference wheat genome assembly IWGSC published a linear reference genome sequence of hexaploid wheat with RefSeq v1.0²⁵. However, as the number of available genome sequences steadily increases in many crops, it is becoming more and more evident that the genome of a single genotype is not sufficient to cover the large amount of presence/absence variations or structural variants (SVs) existing within a species^34,35. The construction of pan-genomes covering a “core” genome and “dispensable” genome containing genes that are not present in one or more genomes of the same species is hence desirable to increase the accuracy of downstream genomic analysis³⁶.

Association panels incorporating historically important cultivars and modern cultivars are an ideal platform to study the process of breeding from historical perspective by detection of loci that were subjected to selection in the frame of breeding process. On the other side combining GWAS and outlier loci analysis allows detection of beneficial alleles that were neglected during the selection process of modern cultivars. The aim of the present study is to estimate the potential of a LD-corrected wheat map for use in GWAS approaches and for detection of signatures that are related to the breeding progress of German cultivars. The detection of loci under selection during breeding process can contribute to more targeted breeding efforts and the opportunity to improve breeding by application of genomic selection models.

Materials and methods

Plant material

The diversity panel utilized in this study comprised 221 bread wheat cultivars, including 165 German cultivars and 56 bread wheat accessions representing global genetic diversity with cultivars from Europe, USA, Mexico, India and Australia (Supplementary Table S1). The German cultivars represented 50 years of German breeding history of bread wheat from 1966 to 2016. These cultivars were selected based on their economic and historic importance. The composition of the utilized population was described in Voss-Fels, et al.³⁷ and Lichthardt, et al.³⁸. Seeds of the bread wheat cultivars were provided by Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) by accepting the terms and conditions of the Standard Material Transfer Agreement (SMTA) of the International Treaty on Plant Genetic Resources for Food and Agriculture (http://www.fao.org/plant-treaty).

Genotyping and data processing

The accessions of the diversity panel were genotyped at SGS TraitGenetics GmbH (Gatersleben, Germany) using the Infinium iSelect 15 K single nucleotide polymorphism (SNP) bead array, which comprises 13 006 polymorphic loci³⁹, as well as by the 135 K Axiom Exome Capture Array carrying 136 780 SNP markers³⁸ .

The alignment tool Bowtie2 version 2.3.4.3⁴⁰ was used to align the sequence of the markers to the reference wheat genome assembly RefSeq v1.0²⁵. Following alignment, filtering was applied in order to detect the best assignment/anchorage to a physical position on the reference genome using the default criteria of Bowtie 2.3.4.3⁴⁰: (1) unique mapping to an unambiguous locus; (2) maximum 1 bp mismatch to the marker sequence, and (3) markers with multiple alignment options were discarded if the second-best hit showed < 3 bp mismatch to the marker sequence (i.e. markers with 2 or more hits (loci) were discarded if there was not at least 3 bp difference between the best and second-best hit). Monomorphic markers were discarded as well as SNPs with MAF (Minor Allele Frequency) less than 5% and more than 5% missing data.

Further filtering was applied for the markers obtaining from the Axiom array following manufacturer’s recommendation⁴¹ and other reports^42,43. Consequently, only markers from the category “PolyHighResolution” and “Off-Target Variants” (OTV) showing distinct clustering of genotype calls were considered for the successive steps of filtering process. Additionally, two cluster quality control metrics, FLD (Fisher’s Linear Discriminant) and HomFLD (Homozygote Fisher's Linear Discriminant), which is a version of FLD computed for the homozygous genotype clusters, were utilized as stringent criteria. Accordingly, the minimum requirements for the corresponding individual FLD and HomFLD, were set to ≥ 4 and ≥ 8, respectively. Some markers showed high LD with markers on other chromosomes or with distant regions on the same chromosome, but no or low LD with nearby markers on the same chromosome (Fig. 1). We improved the map by following the three steps described by Utsunomiya, et al.⁴⁴. First, genome-wide LD between each pair of SNPs was calculated using Plink v1.9⁴⁵. Second, a table of ambiguous SNPs with low LD (r² ≤ 0.5) with other markers of the same chromosome and high LD (r² > 0.5) with SNP on other chromosomes was created. Third, for each marker in this list the second-best alternative alignment, confirming the region indicated by high LD, was selected as the alternative physical position. Ultimately, 24,216 informative SNP marker with defined physical positions remained for further analysis. Missing markers were imputed using the java tool LD-kNNI⁴⁶, which is based on k-nearest neighbors imputation (kNNI) taking the LD between SNPs into account, when choosing the nearest neighbors.

Statistical analysis

The LD between markers across chromosomes was estimated using Tassel program v.5.2.61⁴⁷. The “prcomp” function of R was applied for calculation of marker-based principal components. The “K-means” function in the “stats” package in R v. 4.0.0 RC Team⁴⁸ was used to classify the genotypes of the association panel into clusters by application of the Bayesian information criterion (BIC) as the statistical measure of goodness of fit. The R package “adegenet” v 2.1.2⁴⁹ was used for the Discriminant Analysis of Principal Components (DAPC), which is a frequently used approach to identify and to describe clusters of individuals. DAPC facilitates the detection of genetic variation between groups and within-groups and yields synthetic variables which maximize between-group diversity while minimizing within-group diversity.

Additionally, the genetic structure of the diversity panel and the German cultivars was estimated separately, by using a Bayesian model-based clustering method implemented in STRUCTURE software v.2.3.4^50,51. For this, as required by the software, marker pruning was applied by removing markers with high LD (R² > 0.7) by using Plink v1.9⁴⁵. A set of 5 681 markers remained after the pruning process. The length of the burn-in period and the number of Markov chain Monte Carlo (MCMC) iterations after burn-in were set 50,000 and 100,000, respectively. To choose the best estimate of number of clusters (K), the hypothetical number of K was set from 1 to 10. The number of replications for the MCMC run was ten times for each K.

Based on the best number of K estimated by the software STRUCTURE, Nei’s estimator of pairwise F_ST⁵² for all pairs of sub-populations was calculated using the function “pairwise.fst” included in the “adegenet” package⁴⁹. Outlier locus detection was performed by using the Windows version of ‘BayeScan’ v2.1⁵³. The Bayesian method of BayeScan estimates F_ST for each SNP locus to perform a genomic scan for outlier F_ST values. The following conditions were set for the BayeScan analysis to detect highly significant SNPs affected by the breeding history: the number of MCMC iterations were set to 50,000, pilot run length to 10,000 and additional burn-in to 30,000. SNP-based haplotype analysis was performed using Haploview 4.2⁵⁴. Heat maps of pairwise LD between markers were plotted using the R package “LDheatmap” version 0.99-7⁵⁵. The SNP-density plot was produced using the R package “CMplot”. The blast server of EnsemblPlants http://plants.ensembl.org/Triticum_aestivum/Tools/Blast (Howe et al. 2020) was used to detect genes which were identified as outlier loci in the context of breeding history.

GWAS analysis

A genome-wide association study (GWAS) was performed using the multi-locus mixed linear model (MMLM-P + K) in SAS 9.4 (2015) taking into account population structure (P-matrix) and identity-by-state (IBS) kinship coefficient (K-matrix) between each individual. Iteratively, the forward selection and backward elimination approach described in Bauer et al. (2009) was used to reduce the number of false-positive QTL. The false discovery rate (FDR) was set to p < 0.05 for the iterative multi-locus approach in the QTL model. In order to further reduce the number of false positive QTLs, we applied fivefold cross-validation procedure. he best linear unbiased estimators (BLUE) for the following traits and for each genotype were previously generated by Voss-Fels, et al.³⁷: grain yield (GY), kernels per spike (KPS), kernels per m² (KPM), harvest-index (HI), plant height (PH), heading date (HD), kernel crude protein (KCP), sedimentation value (SD), falling number (FN), protein yield (PY), Nitrogen-use efficiency (NUE), green canopy duration (GCD), radiation interception efficiency (RIE).

The figures were produced using the R package “ggplot2”⁵⁶. The R package “adegenet”⁴⁹ was used to produce the DAPC plot.

Legal statement

The present study complies with relevant institutional, national, and international guidelines and legislation. Seeds of the bread wheat cultivars were provided by Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) with permission to collect the seeds of the plant material by accepting the terms and conditions of the Standard Material Transfer Agreement (SMTA) of the International Treaty on Plant Genetic Resources for Food and Agriculture (http://www.fao.org/plant-treaty/).

Results

Genotyping analysis

The diversity panel comprising 221 genotypes was genotyped using the Infinium iSelect 15 K SNP chip and the 135 K Axiom Exome Capture Array. A total of 81,655 SNP marker were aligned to the wheat reference genome assembly RefSeq v1.0²⁵, of which 54,389 markers aligned to physical positions on the assembled pseudomolecules. After stringent filtering for missing data and minor allele frequency we retained 25,510 SNP markers. However, aligning the markers to the reference genome without taking into account LD with the neighboring markers lead to putative mismapping of 11.7% of the markers. Following the LD-correction approach, the positions of 1388 SNP markers were corrected, whereas 1294 (5.1%) of markers showing ambiguous LD patterns or putative localization to multiple chromosomes were removed from the set. An example of marker misplacement is shown in Fig. 1. Performing GWAS for a simulated data set showed unexpected/spurious QTL on chromosome 1A with high logarithm of odds (LOD) scores. Unexpectedly, adjacent markers (physical distance 1395 bp) are in low LD with the main QTL and show considerably lower LOD scores (Fig. 1A and C). However, the LD-based correction approach suggests the correct location of this marker is on chromosome 1B, where it constitutes a haploblock of eight markers with high LD (Fig. 1D). Subsequent repetition of GWAS with the corrected map assigns the QTL to chromosome 1B surrounded by the markers of the haploblock (Fig. 1B).

Consequently, the LD-corrected map containing 24,216 SNP markers was used for further analysis (For the marker data and the LD-corrected map see Supplementary Table S2; for the genotypic data with imputations for missing marker data see Supplementary Table S3). Finally, 60% of markers (14,492) originated from the 135 K genotyping array, whereas 40% (9724 markers) originated from the 15 K genotyping array. The majority of the markers were localized on the A subgenome (9963 markers) and B subgenome (11,882 markers), whereas 10% of the markers were found on the D subgenome (2 371 markers). Furthermore, the A and B subgenomes had higher marker density per Mbp than the D subgenome, with 2.0 and 2.3 SNP/Mbp, respectively, compared to an average of 0.6 markers per Mbp (Fig. 2; Supplementary Table S4).

Analysis of population structure

Population structure was assessed using the full set of 24,216 SNP markers for cluster analysis using a K-means approach and principal component analysis (PCA). The cluster analysis assigned the accessions into three clusters following the elbow method (Fig. 3).

The first and second principal components of the PCA showed rather low explanatory power with respect to the genotypic variation of the analyzed accessions (6.7 and 5.1%, respectively). Stronger genetic divergence was observed between clusters 1 and 2 of the German panel (F_ST = 0.18). Population structure analysis using the software STRUCTURE suggested K = 3 for the German population (Fig. 4).

Analysis of population structure in the context of 50 years of breeding history

Discriminant Analysis of Principal Components (DAPC) was conducted to investigate the relationship between the German winter wheat cultivars relative to their breeding history. Hence, the German cultivars were assigned into six groups according to the decade in which the variety was released (1960s, 1970s, 1980s, 1990s, 2000s and 2010–2020).

Some grouping of German wheat cultivars by decade of release was apparent (Fig. 5). Specifically, the cultivars show some chronological separation with greater distances between genotypes released further apart in time. The first decade (1960–1970) was distinct from other decades, indicating a stronger differentiation between the genotypes released between 1960 and 1970 and newer genotypes. Notably, the distance between the clusters is shrinking over time, indicating higher genetic similarity among the cultivars registered during the recent decades.

Nei's pairwise F_ST also supported a greater differentiation between cultivars released in the 1960s and cultivars released in other decades, with the greatest similarity between cultivars released in modern decades Fig. 5.

Subsequently, an outlier locus detection approach was applied to detect signatures of breeding among the German winter wheat cultivars registered between 1960 and 2020. Accordingly, the software BayeScan v2.1 was utilized to scan for genomic regions which were putatively affected by breeding and therefore experiencing changes of allele frequency in the German population. Setting the FDR < 0.001 BayeScan identified 22 outlier loci (Fig. 6A). A total of 18 outlier loci were detected on chromosome 2AS spanning a 22.2 Mbp region between the SNP marker AX-158573264 (2,380,829 bp, denoted as M1) and SNP marker AX-158522761 (24,543,742 bp, denoted as M4). The second region harboring three outlier loci was detected on a comparably small genomic region of 0.2 Mbp on chromosome 2Bs between the SNP marker Ra_c2110_494 (146,735,843 bp) and the SNP RAC875_c14105_66 (146,902,161 bp). Surprisingly, the minor allele frequency for the markers between M1 and M4 was strongly shifted from 0% in the old genotypes (registered 1960–1970) to 50–75% in modern varieties registered between 2010 and 2020 (Fig. 6B). The same pattern of allele frequency shift for the markers between M1 and M4 is shown in Fig. 6C. Subsequently, GWAS was performed using precalculated BLUE values of several agronomic and physiological traits obtained from multi-year field trials under two nitrogen levels and with and without fungicide treatment³⁷. The GWAS results indicate a QTL hotspot between M1 and M4 on chromosome 2AS (Fig. 6D). The annual allele frequency change in this hotspot is above 2% and much higher than that of other regions on chromosome 2AS. In contrast, no significant marker by trait interaction was detected for the outlier loci detected on chromosome 2BS, indicating that this region has a minor effect on the analyzed traits.

The summary of GWAS results is presented in (Supplementary Table S5). Notably, the highest significant outlier loci M3 (Excalibur_c21663_145) and M2 (Tdurum_contig63196_123) were linked to a NAC domain and to ATP-sulfurylase PUA-like domain, respectively. Both domains are linked to plant responsive genes towards biotic stress factors, including pathogens.

Further investigation of the M1–M4 selective sweep region

The identified selective sweep between M1– M4 is known to be the target of an introgression from wheat wild relative species Aegilops ventricosa, known as the 2NS/2AS translocation⁵⁷. Among the German panel the cultivar 'Cardos' was the first genotype containing the 2NS/2AS translocation. Since the release of 'Cardos' in year 1998 the numbers of the cultivars containing this introgression increased successively (Fig. 7).

The cultivar 'Biscay' registered in year 2000 was the second genotype containing this introgression. Studying the pedigree information for the cultivars ‘Cardos’ and ‘Biscay’, it is obvious that the genotype VPM-1 containing the 2NS/2AS translocation is the common ancestor of both genotypes (Fig. 8).

Ultimately, the BLUE values of traits that were showing significant QTL under the treatment “without fungicide application” (see Fig. 6D) were visualized in combination with the presence/absence information of the 2NS/2AS translocation (Supplementary Figure S1). Due to the polygenic nature of the analyzed traits the presence of the 2NS/2AS translocation is not essential. However, in most cases recently registered genotypes containing the 2NS/2AS translocation show higher performance than older genotypes without the translocation.

Discussion

LD-based correction of marker positions

Advances in sequencing and genotyping technologies have significantly increased the number of available molecular markers in many crop species. The availability of high-quality reference genomes for major crops like maize, rice, potato, cassava and wheat are great leaps forward in genome research facilitating plant breeding⁵⁸. In this study we aligned the 25,510 SNP markers obtained from genotyping a diversity panel consisting of 221 winter wheat cultivars to the reference wheat genome assembly RefSeq v1.0²⁵. However, we observed a high number of markers with anomalously high LD with markers on other chromosomes but low LD with neighboring markers on the same chromosome. Unusual LD behavior of SNP markers is a serious impediment for studies, such as GWAS, that are dependent on accurate positions of applied markers for candidate gene identification^44,59. Following the LD correction approach, we were able to assign 5.4% of the misplaced markers to corrected physical positions. With the set of 24,216 remaining markers with corrected positions we ran population analysis for the diversity panel containing 221 winter wheat genotypes as well as the panel of 165 German cultivars.

Even though there are other confounding factors affecting LD anomalies (e.g. population stratification), one major source of inconsistency in LD patterns is due to problems with the reference genome assembly. Similar observations were previously described by Money et al.⁵⁹, who also observed a false positive QTL for skin color intensity in apple (Malus domestica Borkh.) on chromosome 3, where a marker was in low LD with nearby SNPs on chromosome 3 but in high LD with markers on chromosome 9. The authors concluded that 10–20% of SNPs in their apple data set had incorrect physical coordinates due to the available reference assembly. In hexaploid wheat, the high content of repetitive elements as well as high similarity among the different genome components are major obstacles for the construction of the reference genome assembly^60,61. Moreover, the importance of structural variants for construction of genome assemblies is becoming increasingly clear^62,63. Accordingly, studying structural variation in the human reference genome, Audano, et al.⁶⁴ concluded that insufficiencies and errors in the human reference genome demand for additional reference genomes to include SV by construction of the human pan-genome. Although the fully annotated reference genome of bread wheat variety 'Chinese Spring' is the current gold standard, the large and complex genome of hexaploid⁶⁵ wheat, consisting of three homoeologous and highly repetitive subgenomes, makes it difficult to construct a high-quality reference genome⁶⁶. Additionally, Walkowiak et al.⁶⁵ pointed out on the large genetic distance of the variety 'Chinese Spring' with the Western breeding material. In future, correction of reference genome assemblies based on the pangenome as well as using different types of mapping populations may prove useful in improving this important genomic resource⁶⁷.

Analysis of population structure in the frame of breeding history

Population structure analysis allowed us to detect breeding signatures in the German panel of wheat cultivars representing 50 years of breeding history. Discriminant Analysis of Principal Components (DAPC) suggested reduced genetic variability in German winter wheat cultivars registered between the year 2000–2010 and the cultivars registered after 2010. This is in line with previous reports, underlining that the intensive selection in modern plant breeding programs within a narrow range of plant germplasm with limited allele introgressions over time is the main cause of loss of genetic diversity^68,69. This is not surprising, as in many cases breeders focus on only a few best-performing varieties, neglecting genetic diversity⁷⁰. Similarly, Voss-Fels, et al.³⁷, examining the same group of genotypes, concluded that breeding has gradually reduced the number of negative or neutral haplotypes in recent decades by focusing on genotypes with favorable haplotypes. Moreover, modern cultivars outperform older cultivars with respect to morphological, physiological as well as agronomic traits under low-input conditions e.g. without fungicide treatment^37,38.

Detection of breeding signatures in German winter wheat

The F_ST-based Bayesian genome scan approach implemented in 'BayeScan' is frequently used for detection of candidate loci under selection by estimating the locus effect as well as the population effect^71,72,73. Using the software 'BayeScan' we were able to detect signals of directional selection in the German cultivars in the course of 50 years of breeding. The identified segment on chromosome 2AS, which was undergoing major effective changes, shows pattern of selective sweeps among the German cultivars registered during last decades. The SNP markers between M1 and M4 of the identified region on chromosome 2AS showed remarkable shift of allele frequencies of the minor allele from 0% to more than 50% among the genotypes of the German panel registered between 1960 and 2020. Similar observations of annual allele frequency changes of outlier loci were reported by Fu and Somers⁷⁴ and N’Diaye, et al.⁷³ studying the breeding history of Canadian spring wheat and durum wheat, respectively. Remarkably, the identified region between M1 and M4 is known to be subject to a large introgression from chromosome 2NS of Aegilops ventricosa Tausch to chromosome 2AS of the wheat line VPM-1¹⁸. The 2NS/2AS translocation is known to harbor large number of genes with resistance against yellow rust, brown rust, powdery mildew, eyespot disease and wheat blast disease^{18,19,20,57,75,76,77}. Several reports underline the superiority of genotypes containing this translocation with respect to resistance towards multiple pathogens, among them the cultivar 'Cardos' which was the first cultivar among the German panel containing the 2NS/AS translocation^19,57,78,79. The cultivar ‘Cardos’ was also for many years the reference resistance genotype among German wheat varieties⁸⁰. Recently⁶⁵, applied genotyping by sequencing approach to detect the 2NS/2AS translocation in three wheat panels. In agreement with our observations, they concluded that the presence of this introgressions was associated with disease resistance and higher grain yield.

Due to the directional selection the identified selective sweep on chromosome 2AS was not only subject to major shift of allele frequency. Additionally, GWAS analysis revealed significant marker by treatment interactions of this region linked to plant responsive genes related to biotic stress factors. Positive or directional selection and selective sweeps in natural populations as well as in long-term breeding approaches (historical breeding programs) have been the focus of population geneticists for many years^74,81,82. Linking shift of allele frequency with QTL results was previously reported for crops like barley⁸³ and maize⁸⁴. However, to our knowledge the present study is among few attempts to link results from GWAS analysis with the directional change of allele frequency in a historical association panel of wheat⁸⁵. Finally, as high-throughput genotyping now allows us to genetically characterize large numbers of genotypes with low costs, now the focus has shifted towards reduction of phenotyping costs. In this frame, identification of beneficial selective sweeps by studying changes of allelic frequency in a historical association panel and their integration into breeding programs will help to reduce phenotyping costs and increase breeding efficiency^86,87. The detected 2NS/2AS translocation, containing effective genes related to disease resistance, should be considered as valuable source of resistance genes for allele mining approaches to produce future varieties with higher disease resistance.

Conclusions

The LD-corrected physical genome sequence of wheat helps to enhance the power of genome wide association studies as well as identification of candidate genes in wheat. Additionally, we were able to trace back the translocation of 2NS/2AS of Aegilops ventricosa Tausch to the cultivar ‘Cardos’ (registered in 1998), which was the first German cultivar among the German panel containing this translocation. Moreover, understanding breeding from the historical perspective by screening for selective sweeps (among the genotypes of a historical association panel) offers an alternative for identifying favorable QTL-regions by means of population genetics, even without phenotyping.

Data availability

The datasets supporting the conclusions of this article are included within the article and its additional files.

References

Shiferaw, B. et al. Crops that feed the world 10. Past successes and future challenges to the role played by wheat in global food security. Food Secur. 5, 291–317. https://doi.org/10.1007/s12571-013-0263-y (2013).
Article Google Scholar
Alexandratos, N. & Bruinsma, J. World agriculture towards 2030/2050: the 2012 revision. (2012).
Rezaei, E. E., Siebert, S. & Ewert, F. Impact of data resolution on heat and drought stress simulated for winter wheat in Germany. Eur. J. Agron. 65, 69–82 (2015).
Article Google Scholar
Perkins, J. H. Geopolitics and the Green Revolution: Wheat, Genes, and the Cold War (Oxford University Press on Demand, 1997).
Google Scholar
Schils, R. et al. Cereal yield gaps across Europe. Eur. J. Agron. 101, 109–120 (2018).
Article Google Scholar
van de Wouw, M., van Hintum, T., Kik, C., van Treuren, R. & Visser, B. Genetic diversity trends in twentieth century crop cultivars: a meta analysis. Theor. Appl. Genet. 120, 1241–1252 (2010).
Article PubMed PubMed Central Google Scholar
Sansaloni, C. et al. Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints. Nat. Commun. 11, 1–12 (2020).
Article CAS Google Scholar
Naylor, R. L. The Many Faces of Food Security. The Evolving Sphere of Food Security (Oxford University Press, 2014).
Book Google Scholar
Cribb, J. The Coming Famine: The Global Food Crisis and What We Can Do to Avoid It (University of California Press, 2010).
Book Google Scholar
Bhati, M., Kadri, N. K., Crysnanto, D. & Pausch, H. Assessing genomic diversity and signatures of selection in Original Braunvieh cattle using whole-genome sequencing data. BMC Genom. 21, 1–14 (2020).
Article CAS Google Scholar
Muñoz, M. et al. Genomic diversity, linkage disequilibrium and selection signatures in European local pig breeds assessed with a high density SNP chip. Sci. Rep. 9, 1–14 (2019).
Article CAS Google Scholar
Wright, S. I. & Gaut, B. S. Molecular population genetics and the search for adaptive evolution in plants. Mol. Biol. Evol. 22, 506–519 (2005).
Article CAS PubMed Google Scholar
Weigand, H. & Leese, F. Detecting signatures of positive selection in non-model species using genomic data. Zool. J. Linn. Soc. 184, 528–583 (2018).
Article Google Scholar
Cavanagh, C. R. et al. Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars. Proc. Natl. Acad. Sci. USA 110, 8057–8062. https://doi.org/10.1073/pnas.1217133110 (2013).
Article ADS PubMed PubMed Central Google Scholar
Sears, E. The transfer of leaf-rust resistance from Aegilops umbellulata to wheat. The transfer of leaf-rust resistance from Aegilops umbellulata to wheat. (1956).
Kimber, G. The incorporation of the resistance of Aegilops ventricosa to Cercosporella herpotrichoides into Triticum aestivum. J. Agric. Sci. 68, 373–376 (1967).
Article Google Scholar
McIntosh, R., Baker, E. & Driscoll, C. Cytogenetical studies in wheat I. Monosomic analysis of leaf rust resistance in the cultivars uruguay and transfer. Aust. J. Biol. Sci. 18, 971–978 (1965).
Article Google Scholar
Maia, N. Obtention des bles tendres resistants au pietin-verse par croisements interspecifiques bles x Aegilops. CR Acad. Agric. Fr. 53, 149–154 (1967).
Google Scholar
Helguera, M. et al. PCR assays for the Lr37-Yr17-Sr38 cluster of rust resistance genes and their use to develop isogenic hard red spring wheat lines. Crop Sci. 43, 1839–1847 (2003).
Article CAS Google Scholar
Cruz, C. et al. The 2NS translocation from Aegilops ventricosa confers resistance to the Triticum pathotype of Magnaporthe oryzae. Crop Sci. 56, 990–1000 (2016).
Article CAS PubMed PubMed Central Google Scholar
Jordan, K. W. et al. A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes. Genome Biol. 16, 1–18 (2015).
Article Google Scholar
Tanksley, S. & Nelson, J. Advanced backcross QTL analysis: a method for the simultaneous discovery and transfer of valuable QTLs from unadapted germplasm into elite breeding lines. Theor. Appl. Genet. 92, 191–203 (1996).
Article CAS PubMed Google Scholar
Visscher, P. M., Brown, M. A., McCarthy, M. I. & Yang, J. Five years of GWAS discovery. Am. J. Human Genet. 90, 7–24 (2012).
Article CAS Google Scholar
Bewley, J. D., Black, M. & Halmer, P. The Encyclopedia of Seeds: Science, Technology and Uses (Cabi, 2006).
Google Scholar
IWGSC. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science 361, eaar7191. https://doi.org/10.1126/science.aar7191 (2018).
Article CAS Google Scholar
Fradgley, N. et al. A large-scale pedigree resource of wheat reveals evidence for adaptation and selection by breeders. PLoS Biol. 17, e3000071 (2019).
Article PubMed PubMed Central CAS Google Scholar
Mathew, B., Sillanpää, M. J. & Léon, J. Advances in Statistical Methods to Handle Large Data Sets for Gwas in Crop Breeding (Burleigh Dodds Science Publishing, 2018).
Google Scholar
Pritchard, J. K., Stephens, M., Rosenberg, N. A. & Donnelly, P. Association mapping in structured populations. Am. J. Human Genet. 67, 170–181 (2000).
Article CAS Google Scholar
Ewens, W. J. & Spielman, R. S. The transmission/disequilibrium test: history, subdivision, and admixture. Am. J. Hum. Genet. 57, 455 (1995).
CAS PubMed PubMed Central Google Scholar
Liu, C.-C., Shringarpure, S., Lange, K. & Novembre, J. Statistical Population Genomics 67–86 (Humana, 2020).
Google Scholar
Putman, A. I. & Carbone, I. Challenges in analysis and interpretation of microsatellite data for population genetic studies. Ecol. Evol. 4, 4399–4428 (2014).
Article PubMed PubMed Central Google Scholar
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 (2006).
Article CAS PubMed Google Scholar
Astle, W. & Balding, D. J. Population structure and cryptic relatedness in genetic association studies. Stat. Sci. 24, 451–471 (2009).
Article MathSciNet MATH Google Scholar
Golicz, A. A., Batley, J. & Edwards, D. Towards plant pangenomics. Plant Biotechnol. J. 14, 1099–1105 (2016).
Article PubMed Google Scholar
Yang, X., Lee, W.-P., Ye, K. & Lee, C. One reference genome is not enough. Genome Biol. 20, 104 (2019).
Article PubMed PubMed Central Google Scholar
Tettelin, H. et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial “pan-genome”. Proc. Natl. Acad. Sci. 102, 13950–13955 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Voss-Fels, K. P. et al. Breeding improves wheat productivity under contrasting agrochemical input levels. Nat. Plants 5, 706–714. https://doi.org/10.1038/s41477-019-0445-5 (2019).
Article PubMed Google Scholar
Lichthardt, C., Chen, T. W., Stahl, A. & Stutzel, H. Co-evolution of sink and source in the recent breeding history of winter wheat in Germany. Front. Plant Sci. 10, 1771. https://doi.org/10.3389/fpls.2019.01771 (2020).
Article PubMed PubMed Central Google Scholar
Muqaddasi, Q. H., Brassac, J., Börner, A., Pillen, K. & Röder, M. S. Genetic architecture of anther extrusion in spring and winter wheat. Front. Plant Sci. 8, 754 (2017).
Article PubMed PubMed Central Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357 (2012).
Article CAS PubMed PubMed Central Google Scholar
Thermo-Fisher-Scientific: Axiom^TM genotyping solution: data analysis guide. (2017).
Rimbert, H. et al. High throughput SNP discovery and genotyping in hexaploid wheat. PLoS ONE 13, e0186329 (2018).
Article PubMed PubMed Central CAS Google Scholar
Liu, S. et al. Development of the catfish 250K SNP array for genome-wide association studies. BMC Res. Notes 7, 135 (2014).
Article PubMed PubMed Central Google Scholar
Utsunomiya, A. T. et al. Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan. BMC Genom. 17, 705. https://doi.org/10.1186/s12864-016-3049-8 (2016).
Article Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, s13742-13748 (2015).
Article CAS Google Scholar
Money, D. et al. LinkImpute: fast and accurate genotype imputation for nonmodel organisms. G3 Genes Genom. Genet. 5, 2383–2390 (2015).
Google Scholar
Bradbury, P. J. et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635 (2007).
Article CAS PubMed Google Scholar
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing (Vienna, Austria); https://www.R-project.org/ (2020).
Google Scholar
Jombart, T. et al. Package ‘adegenet’. (2020).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Article CAS PubMed PubMed Central Google Scholar
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620. https://doi.org/10.1111/j.1365-294X.2005.02553.x (2005).
Article CAS PubMed Google Scholar
Nei, M. Analysis of gene diversity in subdivided populations. Proc. Natl. Acad. Sci. 70, 3321–3323 (1973).
Article ADS CAS PubMed MATH PubMed Central Google Scholar
Foll, M. BayeScan v2.1 user manual. Ecology 20, 1450–1462 (2012).
Google Scholar
Barrett, J. C., Fry, B., Maller, J. & Daly, M. J. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21, 263–265 (2005).
Article CAS PubMed Google Scholar
Shin, J.-H., Blay, S., McNeney, B. & Graham, J. LDheatmap: an R function for graphical display of pairwise linkage disequilibria between single nucleotide polymorphisms. J. Stat. Softw. 16, 1–10 (2006).
Article Google Scholar
Wickham, H., Chang, W. & Wickham, M. H. Package ‘ggplot2’. Create Elegant Data Visualisations Using the Grammar of Graphics. Version 2, 1–189 (2016).
Bariana, H. & McIntosh, R. Cytogenetic studies in wheat XV Location of rust resistance genes in VPM1 and their genetic linkage with other disease resistance genes in chromosome 2A. Genome 36, 476–482 (1993).
Article CAS PubMed Google Scholar
Schreiber, M., Stein, N. & Mascher, M. Genomic approaches for studying crop evolution. Genome Biol. 19, 140 (2018).
Article PubMed PubMed Central CAS Google Scholar
Money, D., Migicovsky, Z., Gardner, K. & Myles, S. LinkImputeR: user-guided genotype calling and imputation for non-model organisms. BMC Genom. 18, 523. https://doi.org/10.1186/s12864-017-3873-5 (2017).
Article Google Scholar
Guan, J. et al. The battle to sequence the bread wheat genome: a tale of the three kingdoms. Genom. Proteom. Bioinform. 18, 221–229 (2020).
Article Google Scholar
Salzberg, S. L. & Yorke, J. A. Beware of mis-assembled genomes. Bioinformatics 21, 4320–4321 (2005).
Article CAS PubMed Google Scholar
Gabur, I., Chawla, H. S., Snowdon, R. J. & Parkin, I. A. Connecting genome structural variation with complex traits in crop plants. Theor. Appl. Genet. 132, 733–750 (2019).
Article PubMed Google Scholar
Mason, A. S. & Wendel, J. F. Homoeologous exchanges, segmental allopolyploidy, and polyploid genome evolution. Front. Genet. 11, 1014 (2020).
Article CAS PubMed PubMed Central Google Scholar
Audano, P. A. et al. Characterizing the major structural variant alleles of the human genome. Cell 176, 663–675 (2019).
Article CAS PubMed PubMed Central Google Scholar
Walkowiak, S. et al. Multiple wheat genomes reveal global variation in modern breeding. Nature. 588, 277–283 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Philippe, R. et al. Whole genome profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome. BMC Genom. 13, 47 (2012).
Article CAS Google Scholar
He, Z. & Bancroft, I. Organization of the genome sequence of the polyploid crop species Brassica juncea. Nat. Genet. 50, 1496–1497 (2018).
Article CAS PubMed Google Scholar
Fu, Y.-B. Understanding crop genetic diversity under modern plant breeding. Theor. Appl. Genet. 128, 2131–2142 (2015).
Article PubMed PubMed Central Google Scholar
Allard, R. W. Genetic changes associated with the evolution of adaptedness in cultivated plants and their wild progenitors. J. Hered. 79, 225–238 (1988).
Article CAS PubMed Google Scholar
Louwaars, N. P. Plant breeding and diversity: a troubled relationship?. Euphytica 214, 114 (2018).
Article PubMed PubMed Central Google Scholar
Mandel, J. R. et al. Association mapping and the genomic consequences of selection in sunflower. PLoS Genet. 9, e1003378 (2013).
Article CAS PubMed PubMed Central Google Scholar
Reinert, S., Osthoff, A., Léon, J. & Naz, A. A. Population genetics revealed a new locus that underwent positive selection in Barley. Int. J. Mol. Sci. 20, 202 (2019).
Article PubMed Central CAS Google Scholar
N’Diaye, A. et al. Haplotype loci under selection in Canadian durum wheat germplasm over 60 years of breeding: association with grain yield, quality traits, protein loss, and plant height. Front. Plant Sci. 9, 1589 (2018).
Article PubMed PubMed Central Google Scholar
Fu, Y.-B. & Somers, D. J. Allelic changes in bread wheat cultivars were associated with long-term wheat trait improvements. Euphytica 179, 209–225 (2011).
Article Google Scholar
Bariana, H. & McIntosh, R. Characterisation and origin of rust and powdery mildew resistance genes in VPM1 wheat. Euphytica 76, 53–61 (1994).
Article Google Scholar
Seah, S., Spielmeyer, W., Jahier, J., Sivasithamparam, K. & Lagudah, E. Resistance gene analogs within an introgressed chromosomal segment derived from Triticum ventricosum that confers resistance to nematode and rust pathogens in wheat. Mol. Plant Microbe Interact. 13, 334–341 (2000).
Article CAS PubMed Google Scholar
Coriton, O. et al. Double dose efficiency of the yellow rust resistance gene Yr17 in bread wheat lines. Plant Breed. 139, 263–271 (2020).
Article CAS Google Scholar
Williamson, V. M., Thomas, V., Ferris, H. & Dubcovsky, J. An Aegilops ventricosa translocation confers resistance against root-knot nematodes to common wheat. Crop Sci. 53, 1412–1418 (2013).
Article PubMed PubMed Central Google Scholar
Kishii, M. An update of recent use of Aegilops species in wheat breeding. Front. Plant Sci. 10, 585 (2019).
Article PubMed PubMed Central Google Scholar
Thiele, A., Schumann, E., Peil, A. & Weber, W. Eyespot resistance in wheat× Aegilops kotschyi backcross lines. Plant Breed. 121, 29–35 (2002).
Article Google Scholar
Koropoulis, A., Alachiotis, N. & Pavlidis, P. Statistical Population Genomics 87–123 (Humana, 2020).
Book Google Scholar
Ergon, Å., Skøt, L., Sæther, V. E. & Rognli, O. A. Allele frequency changes provide evidence for selection and identification of candidate loci for survival in red clover (Trifolium pratense L.). Front. Plant Sci. 10, 718 (2019).
Article PubMed PubMed Central Google Scholar
Pswarayi, A. et al. Changes in allele frequencies in landraces, old and modern barley cultivars of marker loci close to QTL for grain yield under high and low input conditions. Euphytica 163, 435–447 (2008).
Article Google Scholar
Jiao, Y. et al. Genome-wide genetic changes during modern breeding of maize. Nat. Genet. 44, 812–815 (2012).
Article CAS PubMed Google Scholar
Hao, C. et al. Resequencing of 145 landmark cultivars reveals asymmetric sub-genome selection and strong founder genotype effects on wheat breeding in China. Mol. Plant 13, 1733–1751 (2020).
Article CAS PubMed Google Scholar
Vatsiou, A. I., Bazin, E. & Gaggiotti, O. E. Detection of selective sweeps in structured populations: a comparison of recent methods. Mol. Ecol. 25, 89–103 (2016).
Article CAS PubMed Google Scholar
Cavanagh, C. R. et al. Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars. Proc. Natl. Acad. Sci. 110, 8057–8062 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was funded by the German Federal Ministry of Education and Research (BMBF Grant 031A354E/BRIWECS) and by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy, EXC-2070-390732324 (PhenoRob).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute of Crop Science and Resource Conservation (INRES), Plant Breeding, University of Bonn, Bonn, Germany
Said Dadshani, Agim Ballvora, Annaliese S. Mason & Jens Léon
Bayer CropScience, Monheim am Rhein, Germany
Boby Mathew

Authors

Said Dadshani
View author publications
You can also search for this author in PubMed Google Scholar
Boby Mathew
View author publications
You can also search for this author in PubMed Google Scholar
Agim Ballvora
View author publications
You can also search for this author in PubMed Google Scholar
Annaliese S. Mason
View author publications
You can also search for this author in PubMed Google Scholar
Jens Léon
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.D. and J.L. conceived the study and performed the data analysis. S.D., J.L., B.M. and A.S.M. were involved in the interpretation of the results. Reviewing and editing was performed by S.D., J.L., A.S.M., A.B., B.M. All authors have read and approved the final manuscript.

Corresponding authors

Correspondence to Said Dadshani or Jens Léon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure.

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Supplementary Table 4.

Supplementary Table 5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dadshani, S., Mathew, B., Ballvora, A. et al. Detection of breeding signatures in wheat using a linkage disequilibrium-corrected mapping approach. Sci Rep 11, 5527 (2021). https://doi.org/10.1038/s41598-021-85226-1

Download citation

Received: 10 December 2020
Accepted: 25 February 2021
Published: 09 March 2021
DOI: https://doi.org/10.1038/s41598-021-85226-1

This article is cited by

Genetic dissection of root architectural plasticity and identification of candidate loci in response to drought stress in bread wheat
- Nurealam Siddiqui
- Melesech T. Gabi
- Agim Ballvora
BMC Genomic Data (2023)
Identification of genomic regions associated with cereal cyst nematode (Heterodera avenae Woll.) resistance in spring and winter wheat
- Deepti Chaturvedi
- Saksham Pundir
- Shailendra Sharma
Scientific Reports (2023)
Genetic dissection of grain iron and zinc, and thousand kernel weight in wheat (Triticum aestivum L.) using genome-wide association study
- Gopalareddy Krishnappa
- Hanif Khan
- Gyanendra Pratap Singh
Scientific Reports (2022)
Identification of QTLs for wheat heading time across multiple-environments
- Salma Benaouda
- Said Dadshani
- Agim Ballvora
Theoretical and Applied Genetics (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects