The megabase-scale crossover landscape is largely independent of sequence divergence

Lian, Qichao; Solier, Victor; Walkemeier, Birgit; Durand, Stéphanie; Huettel, Bruno; Schneeberger, Korbinian; Mercier, Raphael

doi:10.1038/s41467-022-31509-8

Download PDF

Article
Open access
Published: 02 July 2022

The megabase-scale crossover landscape is largely independent of sequence divergence

Nature Communications volume 13, Article number: 3828 (2022) Cite this article

4771 Accesses
14 Citations
14 Altmetric
Metrics details

Subjects

Abstract

Meiotic recombination frequency varies along chromosomes and strongly correlates with sequence divergence. However, the causal relationship between recombination landscapes and polymorphisms is unclear. Here, we characterize the genome-wide recombination landscape in the quasi-absence of polymorphisms, using Arabidopsis thaliana homozygous inbred lines in which a few hundred genetic markers were introduced through mutagenesis. We find that megabase-scale recombination landscapes in inbred lines are strikingly similar to the recombination landscapes in hybrids, with the notable exception of heterozygous large rearrangements where recombination is prevented locally. In addition, the megabase-scale recombination landscape can be largely explained by chromatin features. Our results show that polymorphisms are not a major determinant of the shape of the megabase-scale recombination landscape but rather favour alternative models in which recombination and chromatin shape sequence divergence across the genome.

Mapping genotypes to chromatin accessibility profiles in single cells

Article 08 May 2024

Efficient gene knockout and genetic interaction screening using the in4mer CRISPR/Cas12a multiplex knockout platform

Article Open access 27 April 2024

A high efficiency precision genome editing method with CRISPR in iPSCs

Article Open access 30 April 2024

Introduction

Meiotic recombination is initiated by the formation of numerous DNA double-strand breaks, a minority of which are repaired as crossovers (COs), resulting in reshuffling of the genetic material between generations. COs are, thus, crucial for diversity, adaptation, evolution and breeding^1,2,3,4. Two pathways have been described for meiotic CO formation (class I and II)^1,3,4. Class I COs represent the vast majority of COs and are subject to interference, the propensity of COs to be widely spaced along chromosomes⁵.

COs are not homogeneously distributed and recombination frequencies vary along chromosomes^6,7. Many different features are correlated with the recombination landscape. One consistent pattern across monocentric species is the suppression of COs at and next to centromeres^3,8,9. The landscape can also differ between the two sexes of the same species, a phenomenon called heterochiasmy^10,11,12,13. Polymorphism between homologues can negatively affect crossovers, as observed very locally at crossover hotspots or even completely suppress crossovers in cases of large polymorphisms, like megabase-scale inversions^{7,14,15,16,17,18,19,20,21}. In contrast, however, heterozygous regions in Arabidopsis thaliana showed increased recombination rates when juxtaposed with homozygous regions, suggesting that the density of small-scale sequence divergence can increase recombination rates²². In addition, increasing single nucleotide polymorphism (SNP) density in hybrids associates positively with COs, and the pericentromeric regions that are dense in polymorphisms are also elevated in COs, potentially due to a positive feedback of mismatch recognition during CO formation²¹. A positive correlation between polymorphisms and recombination landscapes can also be observed in natural populations: in many species, historical recombination landscapes as deduced from linkage disequilibrium are positively correlated with SNP densities^{23,24,25,26,27}. In addition, COs tend to colocalize with gene promoters and with regions of open chromatin and low levels of DNA methylation^3,28,29,30.

To better understand the relationship between polymorphisms and meiotic recombination, we aimed to compare CO distribution along chromosomes in the quasi-absence (inbred lines) and presence (hybrids) of polymorphisms. In hybrids, the numerous DNA polymorphisms can be used to precisely map COs^{6,30,31,32,33,34,35,36,37,38,39}, while this is not an option in homozygous inbred lines. Instead, CO frequency in such lines can be estimated by cytological techniques^40,41,42,43, but this has also some limitations, such as the difficulty in identifying individual chromosomes. Alternatively, fluorescence-tagged lines (FTLs) could be used to measure recombination in intervals flanked by markers conferring fluorescence in seeds or pollen grains, but these FTLs are not suitable for mapping the genome-wide CO landscape^44,45,46.

In this study, we develop a method to analyse genome-wide recombination landscapes in inbred lines. We characterize the crossover landscapes of two Arabidopsis inbred lines and compare them to the hybrid, and with the historical recombination pattern in this species. All these CO landscapes are remarkably similar, with the exception of local suppression due to large heterozygous rearrangements. This shows that polymorphism density, with the exception of large structural variations, is not a major determinant of the CO landscape. We also show that only very few chromatin features, like chromatin accessibility and DNA methylation, are sufficient to explain more than 85% of the megabase-scale recombination landscape in Arabidopsis.

Results

A method to robustly detect crossover genome-wide in pure lines

To investigate the landscape of meiotic recombination in A. thaliana inbred lines, we applied moderate EMS mutagenesis to introduce genetic markers into the genomes of A. thaliana Col-0 and Ler. Independent M₂ mutants were crossed to generate F1*s, and independent F1*s were reciprocally crossed to generate F1 populations, which were used to analyse recombination independently in female and male meiosis. (Fig. 1, Supplementary Fig. 1, Supplementary Table S1, Materials and methods). Through Illumina short-read genome sequencing of F1*s and F1s, we identified 838–955 high-confidence mutations segregating in the Col populations and 471–539 in the Ler populations (Supplementary Table S1), which is a negligible level compared to the natural divergence between these two accessions^14,39,47. The markers were randomly distributed across the chromosomes, which allowed the identification of meiotic CO events (Supplementary Fig. 2A–E). We analyzed four independent pairs of populations from both accessions, with a total of 309 and 309 progenies derived from female and male meiosis in Col and 253 and 251 progenies derived from female and male meiosis in Ler, respectively. Overall, we identified 3155 COs in Col (examples shown in Supplementary Figs. 3 and 4, median resolution 522 kb, Supplementary Data 2) and 2004 (median resolution 855 kb, Supplementary Data 3) COs in Ler. We observed a consistent CO frequency between the replicate populations (Fig. 2A, B), arguing against the unlikely possibility that an EMS mutation dominantly affected CO numbers in the F1*. CO numbers were correlated neither with sequence depth nor with the number of markers, suggesting an absence of bias in CO detection (Supplementary Fig. 5). Altogether, this suggests that our method robustly detects COs in inbred lines.

**Fig. 1: Experimental design for CO identification in Arabidopsis inbred lines.**

**Fig. 2: Analysis of female and male COs in Col, Ler, and F1 hybrid populations.**

A female and male crossover landscape in the Columbia-Landsberg F1 hybrid

To compare the recombination landscapes of Col and Ler with the corresponding F1 hybrid, we sequenced reciprocal back-crosses of Col/Ler F1 hybrids with Col to identify COs in 428 and 294 progenies derived from female and male meiosis, respectively. We identified 1192 COs (median resolution 739 bp) and 1587 COs (median resolution 1019 bp) in female and male hybrids, respectively (Supplementary Fig. 6, Supplementary Data 4). The female and male high-resolution CO distribution that we obtained is consistent with a previous dataset that described female/male CO landscapes with lower resolution¹¹ and CO distribution in the same hybrid in F2s that does not distinguish female and male COs⁷ (Supplementary Fig. 7 and 8, Supplementary Data 5). Comparison of the genomic compartments where COs occurred did not reveal differences between females and males, with COs notably enriched in promotor regions in both sexes. This suggests that the factors driving fine-scale CO placement are similar in female and male meiosis (Supplementary Fig. 7E, F).

Comparing crossover number in Arabidopsis pure lines and hybrids

In all three types of populations, Col, Ler and hybrid, we observed heterochiasmy, i.e., significantly more COs in male compared to female meiosis (Mann-Whitney test, p < 2.2e−16, Fig. 2C). This heterochiasmy was confirmed in the three genotypes by counting MLH1 foci, whose number is consistently higher in male compared to female meiocytes (Supplementary Fig. 9). In male meiosis, both the highest (Col, 6.13) and the lowest (Ler, 4.62) numbers of COs are observed in inbred lines, with the hybrid exhibiting an intermediate number of COs (5.4), consistent with MLH1 foci analysis (Fig. 2C, Supplementary Fig. 9). The observation that the hybrid has an intermediate number of COs compared to the two inbred lines suggests that the global CO frequency in males is mainly genetically controlled in trans and not, or only marginally, driven by sequence polymorphism. In females, the highest CO number is also observed in Col (4.08), with less COs in Ler (3.34, p = 1.2e−07), indicating that the same trans mechanism also influences CO frequency in females. However, an even lower level of COs is observed in the hybrid (2.79, p = 0.0002), suggesting that an additional phenomenon is responsible for the reduced CO frequency specifically in female hybrids. In all contexts, CO interference is more pronounced in females than in males, with the strongest interference observed in female hybrids (Supplementary Figs. 10–12). It should be noted that the CO interference was measured within DNA (Mb) space and that chromosomes are organized along shorter axes in females than in males⁴⁸. When a conversion is applied for analysis in the chromosome axis space (µm)⁴⁹, CO interference appears very similar in female and male meiosis (Supplementary Fig. 12), suggesting that interference propagates at similar µm distances along axes in females and males but that due to higher compaction interference acts over larger DNA distances in females. The anti-correlation observed between CO interference and CO numbers suggests that modulation of CO interference, likely through modulation of axis length, is an important determinant of CO numbers. In both sexes of the three backgrounds, CO number is positively correlated with individual chromosome length, except for the female hybrid where the curve is almost flat at just above 0.5 COs per chromosome per gamete, corresponding to one CO per bivalent and a very strong CO interference (Supplementary Fig. 13).

Polymorphisms does not define megabase-scale crossover distribution

Along chromosomes, a strikingly similar pattern is observed in the three genetic backgrounds. COs are markedly suppressed at the centromeric regions and tend to be frequent at the edge of peri-centromeres in both female and male meiosis. In all three backgrounds, the female and male recombination landscapes tend to diverge with decreasing distance from telomeres, with distal regions exhibiting among the highest recombination intervals in males and the lowest in females (Fig. 2D–F, Supplementary Fig. 14). The female/male difference is less pronounced in Ler, notably in distal regions. This may be due to a generally lower frequency of COs in Ler compared to Col, and because trans-factors (e.g., HEI10) tend to affect more distal regions⁵⁰. However, it should be noted that the Ler profile tends to have a larger interval of confidence, notably at telomeres, because of slightly smaller sample size and marker set than the two other genotypes. Strikingly, CO distributions are more closely correlated between the same sexes across the three different backgrounds than between the two sexes in the same background (Fig. 2, Supplementary Fig. 16). For example, female hybrids are more similar to female Col and Ler (Spearman’s correlation r_s = 0.62 and 0.64) than to male hybrids (r_s = 0.26). Thus, sequence divergence appears to have a far lesser impact on the CO landscape than the sex of the meiocyte.

To compare the contemporary CO landscapes with the historical landscape, we reconstructed a historical recombination map using a set of non-singleton SNPs generated from 2029 accessions (Fig. 3A and Supplementary Fig. 15)^51,52. Confirming previous findings²⁷, the historical CO landscape is strongly correlated with the sequence diversity (Fig. 3A, F, Supplementary Fig. 16). The historical landscape is the result of combined female and male recombination, and we thus compared it to the merged female and male dataset for the inbred lines and to the previously described large Col/Ler F2 dataset (Fig. 3D–F and Supplementary Fig. 16)⁷. To facilitate the comparison of the landscapes independently of total CO numbers, we show both the observed CO density (cM/Mb, Fig. 3B) and the corresponding normalized distribution (Fig. 3A, C). Strikingly, the CO landscape in the two inbred lines and hybrid all appear similar to each other, with co-localization of many peaks and valleys, including large peaks on both sides of the centromeric regions, but also in the middle of the arms (Fig. 3C). This similarity is confirmed by genome-wide correlation analyses (Fig. 3D–F). Correlation between CO levels in intervals in Col vs Ler is 0.66 (Fig. 3E), and 0.74 between Col and the hybrid (Fig. 3D). The coefficient of correlation is even higher if non-linear correlation is used (r_n = 0.73–0.78, Fig. 3F) or when chromosome arms and peri-centromeres are considered separately (Fig. 4A, Supplementary Fig. 18). The historical recombination landscape is also similar to the three contemporary landscapes, with most peaks being conserved (r_n = 0.6–0.7, Fig. 3A, F). This shows that the global CO landscape is largely independent of the presence (hybrid and historical) or quasi-absence (Col and Ler) of polymorphisms between the two chromosomes that recombined.

**Fig. 3: Comparison of the genome-wide CO landscape in Col, Ler, and F2 hybrids with genetic polymorphisms.**

**Fig. 4: Association and prediction of CO distribution with genomic and epigenomic features.**

While the CO landscapes are similar, they are not identical. One notable divergence is observed at position ~2 Mb of chromosome 4, with suppression of CO in the hybrid that is not observed in the inbred lines and the historical landscape (Figs. 2D–F and 3A–E). Accordingly, the corresponding intervals stand out in the correlation analysis (red arrow, Fig. 3D). This region corresponds to a large ~1.2 Mb genomic inversion, which suppresses recombination in the Col/Ler hybrid where it is heterozygous^7,47,53,54. We then asked if the smaller rearranged regions are also depleted for COs. We explored the overlap of COs with the non-syntenic regions by employing permutation tests in F2 hybrids and observed a strong depletion of COs in non-syntenic regions (Supplementary Fig. 17B, p = 0.0002) and CO enrichment in the adjacent regions (Supplementary Fig. 17C, p = 0.0002), confirming that structural arrangements are correlated with inhibition of CO formation in hybrids^7,14. The CO resolution obtained for inbred lines did not allow us to test if these regions recombine normally in Col and Ler, as is the case for the unique large inversion.

Consistent with previous analyses²⁷, we found that the historical recombination rate is highly correlated with sequence diversity along chromosomes (Fig. 3A, F, Supplementary Fig. 16), and contemporary COs in the Col/Ler hybrid are correlated with SNP density between Col and Ler^28,55. As shown above, the CO landscapes all show high similarity to each other. Consequently, the CO landscapes in Col and Ler are correlated with Col/Ler SNPs (r_n = 0.28–0.39) and sequence diversity (r_n = 0.44–0.45), whereas these polymorphisms were absent in the Col and Ler inbred lines where these COs were produced. This strongly argues against the possibility that the polymorphisms shape the CO landscape, as the CO landscape is largely unchanged when polymorphisms are absent (with the notable exception of large rearrangements).

Association of crossover landscape with genomic and epigenomic features

To decipher the contributions of genomic and epigenomic features to shaping the CO landscape, we analyzed the recombination distribution in Col with a total of 17 different features. The CO and the genomic and the epigenomic data were all produced in the same strain, Col. The measured features included GC content; gene and transposable element densities; origins of DNA replication (BrdU-seq)⁵⁶; meiotic DSBs (SPO11-1-oligonucleotides)⁵⁷; chromatin accessibility in flowers (ATAC-seq and DNase-seq)^58,59,60,61; euchromatic (H3K4me1, H3K4me2 and H3K4me3, ChIP-seq)^57,62 and heterochromatin histone modification marks in flower buds (H3K9me2 and H3K27me1, ChIP-seq)⁶²; DNA methylation in male meiocytes (CG, CHG and CHH contexts, BS-seq)⁶³; nucleosome occupancy in buds (micrococcal nuclease sequencing, MNase-seq)⁵⁷; and the meiotic cohesin REC8 occupancy (ChIP-seq, Fig. 4A, Supplementary Figs. 18–20, Supplementary Data 1)⁶².

Genome-wide, CO distribution is correlated with many genetic and epigenetic features, notably positively with open chromatin (ATAC, r_n = 0.71), H3K4me1(r_n = 0.65), gene density (r_n = 0.64) and CHG methylation (r_n = 0.55, Supplementary Figs. 18–20). These correlations are at least partially driven by the centromeric regions, at which COs are abolished and where these features are strongly depleted (Supplementary Figs. 18–20). However, considering only the chromosome arms, the correlations are almost the same between COs and ATAC (r_n = 0.71), H3K4me1(r_n = 0.64), CHG methylation (r_n = 0.59), gene density (r_n = 0.57) and other features (Fig. 4A), suggesting a relationship between CO density and chromatin features beyond the centromere.

We next used a machine-learning algorithm (random forest) to analyse the contribution of the 14 chromatin features to explaining the variation in the crossover landscape in Col. We first developed a model to predict the frequency of meiotic recombination for a given interval with all the chromatin features together and analyzed how the model learned to perform the prediction. Over 95% of the variation can be explained by the random forest predictive model (Fig. 4C, E). As shown in Fig. 4B, D, the most important feature was open chromatin (ATAC), which alone explained 39% of the genome-wide variation (Fig. 4E) and 29% of the variation along chromosome arms (Fig. 4C). We observed that the top three features can explain >85% of the variation, while the top six features and five features can explain ~95% of the variation along chromosome arms and genome-wide, respectively (Fig. 4C, E). In order to further investigate the performance of the random forest model, we used four chromosomes as the training set and the remaining chromosome as the testing set. This analysis was done for each of the five chromosomes, considering the top five features for the entire genome (Fig. 4F) or the top six features for chromosome arms (Supplementary Fig. 21). The model trained with the training set performed well with the test set, resulting in a significant correlation (r_s = 0.58, r_n = 0.65) between the prediction and the observations of the test set (Fig. 4F, Supplementary Fig. 21). Altogether, these results show that it is sufficient to use only a few chromatin-related features including chromatin accessibility and DNA methylation, to predict a large part of the megabase-scale distribution of meiotic recombination in A. thaliana.

Discussion

In this work, we developed a method to analyse the genome-wide recombination landscape in inbred lines and applied it to the Arabidopsis accessions Col and Ler. This method is based on the introduction of a limited number of markers and allows robust detection of COs. The strategy can be applied to any species for which homozygous lines and mutagenesis are available. We expect this method to be particularly useful for exploring the natural variation of recombination landscapes in species that are inbred lines in the wild (e.g., Arabidopsis) and for exploring CO distribution in species where inter-strain crosses are problematic (e.g., in the fission yeast Schizosaccharomyces pombe because of killer meiotic drivers)⁶⁴.

Meiotic recombination frequency has previously been studied in hybrids in many species and varies along chromosomes and positively correlates with the distribution of polymorphisms^{3,7,12,21,23,24,25,26,27,55}. One of the possible causes for these correlations is that heterozygosity may favour the formation of COs, in a process putatively driven by mismatch recognition during DSB repair^21,22. In fact, mutants without mismatch sensor function showed a reshuffling of meiotic recombination towards regions with less polymorphisms²¹, which suggests that polymorphisms are involved in the local placement of COs. However, the broad distribution of COs across the chromosomes was only marginally affected in mismatch recognition mutants, which is in agreement with chromatin being the major determinant of the megabase-scale recombination landscape.

We showed here that the megabase-scale recombination landscape in inbred lines is similar to those of hybrids as well as to historical patterns. Broad conservation of CO distribution was previously suggested in tomato and maize by comparing recombination nodules in inbreds to genetic maps in hybrids^41,42,65. The observation that the CO landscape is maintained in the quasi-absence of polymorphism leads to the conclusion that polymorphisms are not a major determinant of the megabase-scale CO distribution. Polymorphisms, including SNPs and small rearrangements, influence the local recombination pattern^18,19,20,21, but this effect is not manifest at the megabase-scale; at this range, the landscape appears to be largely unaffected by polymorphism density. An important exception is genomic rearrangements, such as the ~1.2 Mb inversion (between Col and Ler) on chromosome 4, where COs are abolished in hybrids, while the corresponding regions are CO-proficient in isogenic lines. Smaller structural variations are also associated with CO depletion in the hybrid and are presumably CO-prone in the inbred lines, though we cannot confirm this because of the relatively low resolution in CO position.

In many species, COs tend to colocalize with nucleosome- and methylation-depleted gene promoters^{3,7,28,30,36,66,67}, consistent with our observation in Col/Ler hybrids. Moreover, in this study, we found that among a total of 14 genomic and epigenomic features, open chromatin (ATAC), DNA methylation in the CHH context, and gene density, are the most potent factors for predicting the distribution of COs along chromosomes arms in inbred Col, which is consistent with previous findings^57,62. Interestingly, these three features were enough to explain ~85% of the variation of the CO distribution along chromosome arms. We do not claim that these three features alone directly control CO positions. For example, if gene density is ignored, the top three features can still explain more than 85% of the variation (ATAC, mCHH and MNase, Supplementary Fig. 22). Our results show that the chromatin context, which can be largely captured using only a few features, can robustly predict megabase CO landscapes. Interestingly, the most predictive feature (open chromatin, ATAC), is largely conserved between different tissues at the megabase-scale (Supplementary Fig. 23). This suggests that the megabase-scale chromatin landscape is stable throughout development and is a major driver of the CO landscape.

Our results suggest that the large-scale CO landscape is not driven by the polymorphism density. Thus, two possibilities may explain the correlation between polymorphism density and recombination observed in hybrid and historical landscapes. First, the recombination landscape could gradually shape the polymorphism density. Indeed, meiotic recombination is mutagenic, which might be an important driver of genetic diversity and genome evolution^{25,31,67,68,69,70,71}. In addition, selection tends to reduce polymorphisms in regions with low recombination rates: both the spread of beneficial mutations and the removal of deleterious mutations by selection reduce polymorphism levels and this effect is larger if recombination is low⁷². A second, not mutually exclusive hypothesis, is that local differences in chromatin features not only influence the distribution of recombination, but that chromatin, independently of recombination, contributes to genomic diversity by shaping differences in local mutation rates along the genome^73,74.

While the recombination landscape is largely conserved between the inbred lines and the hybrid, they differ in the total CO number. Globally, there are more COs in Col than in Ler, with the hybrid having an intermediate number. This is consistent with previous observations in a few crossover reporter intervals and is probably largely driven by an allelic difference in the pro-CO factor HEI10; the Col allele was shown to increase the number of COs compared to the Ler allele in a co-dominant manner⁵⁰. Other trans components, such as the SMC5/6 complex subunit SIN1, probably also contribute to the difference in recombination between Col and Ler⁷⁵.

When female and male recombination are analyzed separately, CO rates and MLH1 foci are always highest in Col and lowest in Ler and always higher in males than in females. The male hybrid exhibits an intermediate recombination rate of CO formation, but, in contrast, the female hybrid has less COs than the two inbred lines. This suggests that some mechanism specifically reduces CO frequency in female hybrids compared to the female inbred lines. One possibility is that class II COs, which represent a minority of COs, are inhibited in the presence of polymorphism and thus reduced in hybrids^53,76. This would have a proportionally larger effect on female meiosis where class I COs are less numerous than in males, and thus account for the very low level of COs in female hybrids. As class II COs are non-interfering, this would also explain why CO interference is stronger in female hybrids than in female inbreds and could especially account for the absence of very closely spaced double-COs (Supplementary Fig. 10)⁴⁹. Interestingly, in female hybrids, the number of COs observed was close to the obligate one crossover per bivalent (0.5 crossovers per chromatid), suggesting that the CO landscape in female hybrids corresponds to the distribution of the obligate CO, which thus occurs highly preferentially in the proximal regions. The most striking contrast between females and males was the pronounced difference in the distal regions, where males tend to recombine more than females in both pure lines and hybrids. This further confirms that the megabase-scale recombination landscape is largely independent of polymorphisms and instead suggests that the cellular environment plays a much more critical role, notably by controlling chromosome organization^49,77.

An improved understanding of the control of meiotic recombination along the chromosome opens the possibility of manipulating COs and increasing recombination rates globally^53,76,78 and in reluctant regions. This would facilitate the reshuffling of genomic material, breaking of the linkage between beneficial and deleterious alleles and allow the combination of favourable alleles in elite varieties.

Methods

Isogenic population construction and sequencing

Plants were grown in greenhouses or growth chambers (16-h day/8-h night, 20 °C). Wild-type Col-0 and Ler-1 are 186AV1B4 and 213AV1B1 from the Versailles A. thaliana stock center (http://publiclines.versailles.inra.fr/). For each accession, seeds were subjected to EMS mutagenesis as described in ref. ⁷⁹, and four independent M2 plants were crossed to produce two independent F1*s, which were consequently heterozygous for a set of EMS mutations (Fig. 1). Then, the two F1* plants were reciprocally crossed to generate two F1 populations. To test the robustness of the results and detect the unlikely possibility that a dominant modifier of recombination was caused by an EMS-induced mutation, two independent replicates of the entire process were performed for each accession. These F1* and F1 plants were then used for CO analysis by whole-genome sequencing (Fig. 1, Supplementary Table S1). Leaf samples from the populations were used for DNA purification and library preparation for 2 × 150bp HiSeq 3000 Illumina sequencing⁸⁰. To detect the markers, we sequenced genomic DNA from the F1*s (~59× and ~16×, in Col and Ler, respectively) and F1s (~4.8× and ~5.0×)

Identifying and genotyping EMS-induced mutations

For each individually sequenced F1* and F1 plant of Arabidopsis Col and Ler accessions, the whole-genome resequencing reads were aligned against the Col-0 TAIR10 reference genome^81,82 and Ler assembled genome⁴⁷ by BWA v0.7.15-r1140⁸³ with default parameters, and variant calling in F1* populations was performed using inGAP-family³⁹, separately. To obtain high-quality mutation marker lists, we first removed non-allelic markers using inGAP-family with input from the tandem replicates and structural variants predicted using Tandem Repeats Finder v4.09⁸⁴ and inGAP-family, respectively, and further filtered variations that did not meet the following criteria: (i) heterozygous genotype with alternative allele frequency from 0.4 to 0.6, (ii) specific to each of the F1*s, and (iii) GA to CT substitution. Then, the read count and genotype map of mutation markers of each F1* was generated from their F1 progenies by inGAP-family, which was subsequently used for mutation phasing and CO identification. In order to properly compare CO landscapes in isogenic and hybrid lines, we transferred the coordinates of mutations in the Ler population to Col-0 by using syntenic alignments identified by SyRI v1.2⁸⁵.

Two additional replicates in Col (C and D) were discarded, because the marker analysis showed that one of the F1*s resulted from an accidental selfing and not from a cross. Two additional replicates in Ler were also discarded (C and H), because the number of detected mutation markers (<350) was insufficient for good genome coverage.

Phasing mutations and CO identification in inbred lines

To phase the EMS-induced mutation markers, we employed a hierarchical clustering-based sliding window method, with a window size of ten mutation markers and step size of one mutation marker (Supplementary Fig. 1). For each window, the genotype map of the mutation markers was constructed and used as input for clustering, resulting in two groups: one consisting of wild-type samples and one comprising mutant samples. The genotype and phase of mutation was evaluated by the voting strategy based on multiple window clustering. During this process, for the first and last 5-9th markers, a support rate of 0.9 was used to impute and correct the genotype of the marker if it was not covered or poorly covered, and for the other non-covered or poorly covered markers in between, a support rate of 0.8 was used. The CO events were defined as consistent switches of phase of mutation markers along chromosome arms, and the border was further refined by examining the wild-type allele of the mutation. For the termini of chromosomes, COs were validated as switches with one well-supported mutant allele or more than ten reads supported by the consecutive wild-type allele of the variant marker. The CO interference was analyzed using MADpattern v.1.1^86,87.

CO analysis in hybrid population

The Col/Ler and wild-type Col plants were reciprocally crossed to construct female and male populations (428 and 294 plants, respectively). Leaf samples of the backcross populations were collected for DNA purification, library preparation and Illumina sequencing⁸⁰. In addition, the raw reads of the Col/Ler F2 population were downloaded from ArrayExpress with the accession number E-MTAB-8165 (https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8165)⁷.

The quality of the raw sequencing datasets was checked using FastQC v0.11.9 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/), and then adaptors and low-quality bases were trimmed using Trimmomatic v0.38⁸⁸, with parameters “LEADING:5 TRAILING:5 SLIDINGWINDOW:5:20 MINLEN:50”. In order to generate a list of high-confidence SNP markers between Col and Ler, we adopted a strategy by combing the whole-genome alignment and short-read mapping^89,90. First, the Col and Ler genomes were aligned to identify syntenic SNPs by SyRI⁸⁵. Then, further checks and filters were applied to avoid the artificial and non-allelic SNPs by inGAP-family^33,39. The sequencing reads of F1 and F2 samples were aligned to the TAIR10 Col reference genome by BWA⁸³. The meiotic CO prediction and filtering of the poorly covered and potentially contaminated samples were performed using a sliding-window-based method^39,89. Each sliding window was genotyped by the supporting reads of Col and Ler alleles. To avoid false genotyping, we selected 0.95 as the threshold allelic ratio for the determination of homozygosity in F1 hybrids. The final CO breakpoint was further refined by checking the genotype information of individual SNPs. Identified COs were manually checked at random using inGAP-family³⁹.

SPO11-1-oligo, BrdU-seq, ChIP-seq, MNase-seq, DNase-seq, and ATAC-seq data analysis

Short reads from public datasets (Supplementary Data 1) were quality-checked with FastQC. Specific 3′ adaptor and 5′ end sequences were trimmed before alignment by Cutadapt v1.9.1⁹¹ as described⁵⁷. For BrdU-seq and ATAC-seq datasets, the reads were processed with Trimmomatic to remove potential adaptor sequences and low-quality bases, with “LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36”. Duplicated reads were removed using BBMap (https://github.com/BioInfoTools/BBMap). Then, clean reads were aligned to the TAIR10 reference genome using Bowtie2 v2.2.8⁹² with settings “–very-sensitive -k 10” for single-end datasets and further settings “–no-discordant –no-mixed” for paired-end datasets. The uniquely mapped reads were kept for subsequent analysis, which were processed by Samtools v1.9⁹³ and Sambamba v0.6.8⁹⁴. For all sequencing data, coverage across the genome was evaluated and normalized with bins per million mapped reads (BPM) in bedGraph format using bamCoverage v3.4.3⁹⁵.

Bisulfite sequencing data analysis

The quality and adaptor sequencing of raw reads were examined by FastQC. The sequencing reads were mapped to the TAIR10 reference genome with Bismark v0.22.0⁹⁶, with the following setting: -q -bowtie2 -N 1 -L 24. Reads that mapped to multiple positions and duplicated alignments were removed. Methylated cytosines in the CG, CHG and CHH contexts and the level of methylation, were extracted for subsequent association analysis.

Genome-wide CO distribution correlation analysis

The chromosomal profiles of COs, genomic and epigenomic features were estimated in 50-kb windows along chromosomes. For a given window, the recombination frequency was normalized with the total CO number within the corresponding chromosome. Then, all of the COs, genomic and epigenomic data were smoothed with 40 nearby windows (total = 2 Mb) using the filter function (stats v3.6.2 package, with default parameter, a moving average strategy) and then normalized using the scale function (base v3.6.2 package) in the R environment. The non-linear correlation matrices were calculated using the nlcor package (https://github.com/ProcessMiner/nlcor)⁹⁷ in R, at the genome, chromosome-arm and pericentromeric scales, respectively. The constitution (peri-centromeres, centromeres and arms) of the TAIR10 reference genome was adopted from Underwood et al.⁶⁶. Here, all the random forest models were trained using randomForest v4.6-14 package in R, with the setting of “mtry=3, importance=TRUE, na.action=na.omit, ntree=2000”.

Estimating nucleotide diversity and historical recombination rate

For the sequence polymorphism data of 2029 Arabidopsis accessions from the 1001 Genomes Project⁵¹ and the RegMap population⁵², we first selected diallelic SNP positions with <20% missing data and >5% minimum allele frequency using VCFtools v0.1.16⁹⁸. Then, we masked SNPs located in (i) tandem repeat regions (Tandem Repeats Finder output), (ii) repetitive elements and low-complexity regions (extracted from the masked TAIR10 reference genome), (iii) transposable elements (TAIR10 annotation) and (iv) centromeric regions (definition adopted from Underwood et al.⁶⁶). Finally, we obtained a collection of 905,613 SNPs from 2029 accessions for CO frequency analysis. FastEPRR v2.0⁹⁹ was employed for estimating population recombination rates (ρ = 4Ner, where Ne is the effective population size and r is the recombination rate of the window), with 50-kb non-overlapping window size. The nucleotide diversity of each 50-kb non-overlapping window along chromosomes was calculated using VCFtools. The geographical distribution of the 2029 Arabidopsis accessions was made by ggplot2 v3.3.5 package in R.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The raw sequencing data of individuals of Col, Ler inbred lines, and the Col/Ler F1 hybrid can be accessed in ArrayExpress under the accession numbers E-MTAB-11248, E-MTAB-11249, E-MTAB-11250, E-MTAB-11251, E-MTAB-11254, respectively. The public datasets used in this study are provided in the Supplementary Data 1. The list of COs identified in Col, Ler, F1 hybrid (female and male) and F2 hybrid can be found in Supplementary Data 2–5. The Col-0 TAIR10 reference genome is downloaded from the TAIR database [https://www.arabidopsis.org/]. The sequence polymorphism data of 2029 Arabidopsis accessions is downloaded from FigShare [https://figshare.com/projects/Imputation_of_3_million_SNPs_in_the_Arabidopsis_regional_mapping_population/72887]. Source data are provided with this paper.

Code availability

The related code is available at GitHub [https://github.com/qclian/EMS_Col_Ler].

References

Mercier, R., Mezard, C., Jenczewski, E., Macaisne, N. & Grelon, M. The molecular biology of meiosis in plants. Annu. Rev. Plant Biol. 66, 297–327 (2015).
Article CAS PubMed Google Scholar
Wang, Y. & Copenhaver, G. P. Meiotic recombination: mixing it up in plants. Annu Rev. Plant Biol. 69, 577–609 (2018).
Article CAS PubMed Google Scholar
Zelkowski, M., Olson, M. A., Wang, M. & Pawlowski, W. Diversity and determinants of meiotic recombination landscapes. Trends Genet. 35, 359–370 (2019).
Article CAS PubMed Google Scholar
Gray, S. & Cohen, P. E. Control of meiotic crossovers: from double-strand break formation to designation. Annu Rev. Genet. 50, 175–210 (2016).
Article CAS PubMed PubMed Central Google Scholar
von Diezmann, L. & Rog, O. Let’s get physical–mechanisms of crossover interference. J Cell Sci. 134, jcs255745 (2021).
Mancera, E., Bourgon, R., Brozzi, A., Huber, W. & Steinmetz, L. M. High-resolution mapping of meiotic crossovers and non-crossovers in yeast. Nature 454, 479–485 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Rowan, B. A. et al. An ultra high-density Arabidopsis thaliana crossover map that refines the influences of structural variation and epigenetic features. Genetics 213, 771–787 (2019).
Article CAS PubMed PubMed Central Google Scholar
Fernandes, J. B., Wlodzimierz, P. & Henderson, I. R. Meiotic recombination within plant centromeres. Curr. Opin. Plant Biol. 48, 26–35 (2019).
Article CAS PubMed Google Scholar
Nambiar, M. & Smith, G. R. Repression of harmful meiotic recombination in centromeric regions. Semin Cell Dev. Biol. 54, 188–197 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lenormand, T. & Dutheil, J. Recombination difference between sexes: a role for haploid selection. PLoS Biol. 3, e63 (2005).
Article PubMed PubMed Central CAS Google Scholar
Giraut, L. et al. Genome-wide crossover distribution in Arabidopsis thaliana meiosis reveals sex-specific patterns along chromosomes. PLoS Genet. 7, e1002354 (2011).
Article CAS PubMed PubMed Central Google Scholar
Stapley, J., Feulner, P. G. D., Johnston, S. E., Santure, A. W. & Smadja, C. M. Variation in recombination frequency and distribution across eukaryotes: patterns and processes. Philos. Trans. R. Soc. Lond. B Biol. Sci. 372, 20160455 (2017).
Sardell, J. M. & Kirkpatrick, M. Sex Differences in the Recombination Landscape. Am. Nat. 195, 361–379 (2020).
Article PubMed Google Scholar
Zapata, L. et al. Chromosome-level assembly of Arabidopsis thaliana Ler reveals the extent of translocation and inversion polymorphisms. Proc. Natl Acad. Sci. USA 113, E4052–E4060 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zetka, M. C. & Rose, A. M. The meiotic behavior of an inversion in Caenorhabditis elegans. Genetics 131, 321–332 (1992).
Article CAS PubMed PubMed Central Google Scholar
Jaarola, M., Martin, R. H. & Ashley, T. Direct evidence for suppression of recombination within two pericentric inversions in humans: a new sperm-FISH technique. Am. J. Hum. Genet. 63, 218–224 (1998).
Article CAS PubMed PubMed Central Google Scholar
Demirci, S., Peters, S. A., de Ridder, D. & van Dijk, A. D. J. DNA sequence and shape are predictive for meiotic crossovers throughout the plant kingdom. Plant J. 95, 686–699 (2018).
Article CAS Google Scholar
Borts, R. H. & Haber, J. E. Meiotic recombination in yeast: alteration by multiple heterozygosities. Science 237, 1459–1465 (1987).
Article ADS CAS PubMed Google Scholar
Baudat, F. & de Massy, B. Regulating double-stranded DNA break repair towards crossover or non-crossover during mammalian meiosis. Chromosome Res. 15, 565–577 (2007).
Article CAS PubMed Google Scholar
Serra, H. et al. Interhomolog polymorphism shapes meiotic crossover within the Arabidopsis RAC1 and RPP13 disease resistance genes. Plos Genet. 14, e1007843 (2018).
Article PubMed PubMed Central CAS Google Scholar
Blackwell, A. R. et al. MSH2 shapes the meiotic crossover landscape in relation to interhomolog polymorphism in Arabidopsis. Embo J. 39, e104858 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ziolkowski, P. A. et al. Juxtaposition of heterozygous and homozygous regions causes reciprocal crossover remodelling via interference during Arabidopsis meiosis. Elife 4, e03708 (2015).
Begun, D. J. & Aquadro, C. F. Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster. Nature 356, 519–520 (1992).
Article ADS CAS PubMed Google Scholar
Nordborg, M. et al. The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol. 3, e196 (2005).
Article PubMed PubMed Central CAS Google Scholar
Spencer, C. C. et al. The influence of recombination on human genetic diversity. PLoS Genet. 2, e148 (2006).
Article PubMed PubMed Central CAS Google Scholar
Gore, M. A. et al. A first-generation haplotype map of maize. Science 326, 1115–1117 (2009).
Article ADS CAS PubMed Google Scholar
Kim, S. et al. Recombination and linkage disequilibrium in Arabidopsis thaliana. Nat. Genet. 39, 1151–1155 (2007).
Article CAS PubMed Google Scholar
Choi, K. H. et al. Arabidopsis meiotic crossover hot spots overlap with H2A. Z nucleosomes at gene promoters. Nat. Genet 45, 1327 (2013).
Article CAS PubMed Google Scholar
Yelina, N. E. et al. DNA methylation epigenetically silences crossover hot spots and controls chromosomal domains of meiotic recombination in Arabidopsis. Gene Dev. 29, 2183–2202 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wijnker, E. et al. The genomic landscape of meiotic crossovers and gene conversions in Arabidopsis thaliana. Elife 2, e01426 (2013).
Article PubMed PubMed Central CAS Google Scholar
Qi, J. et al. Characterization of meiotic crossovers and gene conversion by whole-genome sequencing in Saccharomyces cerevisiae. Bmc Genomics 10, 475 (2009).
Article MathSciNet PubMed PubMed Central CAS Google Scholar
Lu, P. L. et al. Analysis of Arabidopsis genome-wide variations before and after meiosis and meiotic recombination by resequencing Landsberg erecta and all four products of a single meiosis. Genome Res. 22, 508–518 (2012).
Article CAS PubMed PubMed Central Google Scholar
Qi, J., Chen, Y. M., Copenhaver, G. P. & Ma, H. Detection of genomic variations and DNA polymorphisms and impact on analysis of meiotic recombination and genetic mapping. Proc. Natl Acad. Sci. USA 111, 10007–10012 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, H. et al. Causes and consequences of crossing-over evidenced via a high-resolution recombinational landscape of the honey bee. Genome Biol. 16, 15 (2015).
Article CAS PubMed PubMed Central Google Scholar
Si, W. et al. Widely distributed hot and cold spots in meiotic recombination as shown by the sequencing of rice F2 plants. N. Phytol. 206, 1491–1502 (2015).
Article CAS Google Scholar
Kianian, P. M. A. et al. High-resolution crossover mapping reveals similarities and differences of male and female recombination in maize. Nat. Commun. 9, 2370 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Luo, C., Li, X., Zhang, Q. H. & Yan, J. B. Single gametophyte sequencing reveals that crossover events differ between sexes in maize. Nat. Commun. 10, 785 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Dreissig, S. et al. Natural variation in meiotic recombination rate shapes introgression patterns in intraspecific hybrids between wild and domesticated barley. N. Phytol. 228, 1852–1863 (2020).
Article CAS Google Scholar
Lian, Q., Chen, Y., Chang, F., Fu, Y. & Qi, J. inGAP-family: accurate detection of meiotic recombination loci and causal mutations by filtering out artificial variants due to genome complexities. Genom. Proteom. Bioinform. https://doi.org/10.1016/j.gpb.2019.11.014 (2021).
Anderson, L. K. et al. High-resolution crossover maps for each bivalent of Zea mays using recombination nodules. Genetics 165, 849–865 (2003).
Article CAS PubMed PubMed Central Google Scholar
Koo, D. H. et al. Integration of cytogenetic and genetic linkage maps unveils the physical architecture of tomato chromosome 2. Genetics 179, 1211–1220 (2008).
Article CAS PubMed PubMed Central Google Scholar
Anderson, L. K. et al. Integrating genetic linkage maps with pachytene chromosome structure in maize. Genetics 166, 1923–1933 (2004).
Article CAS PubMed PubMed Central Google Scholar
Sherman, J. D. & Stack, S. M. Two-dimensional spreads of synaptonemal complexes from solanaceous plants. VI. High-resolution recombination nodule map for tomato (Lycopersicon esculentum). Genetics 141, 683–708 (1995).
Article CAS PubMed PubMed Central Google Scholar
Emmanuel, E., Yehuda, E., Melamed-Bessudo, C., Avivi-Ragolsky, N. & Levy, A. A. The role of AtMSH2 in homologous recombination in Arabidopsis thaliana. Embo Rep. 7, 100–105 (2006).
Article CAS PubMed Google Scholar
Berchowitz, L. E. & Copenhaver, G. P. Fluorescent Arabidopsis tetrads: a visual assay for quickly developing large crossover and crossover interference data sets. Nat. Protoc. 3, 41–50 (2008).
Article CAS PubMed Google Scholar
Wu, G., Rossidivito, G., Hu, T. Q., Berlyand, Y. & Poethig, R. S. Traffic lines: new tools for genetic analysis in Arabidopsis thaliana. Genetics 200, 35–U53 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jiao, W. B. & Schneeberger, K. Chromosome-level assemblies of multiple Arabidopsis genomes reveal hotspots of rearrangements with altered evolutionary dynamics. Nat. Commun. 11, 989 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Drouaud, J. et al. Sex-specific crossover distributions and variations in interference level along Arabidopsis thaliana chromosome 4. PLoS Genet. 3, e106 (2007).
Article PubMed PubMed Central CAS Google Scholar
Lloyd, A. & Jenczewski, E. Modelling sex-specific crossover patterning in Arabidopsis. Genetics 211, 847–859 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ziolkowski, P. A. et al. Natural variation and dosage of the HEI10 meiotic E3 ligase control Arabidopsis crossover recombination. Genes Dev. 31, 306–317 (2017).
Article CAS PubMed PubMed Central Google Scholar
Alonso-Blanco, C. et al. 1,135 Genomes reveal the global pattern of polymorphism in Arabidopsis thaliana. Cell 166, 481–491 (2016).
Article CAS Google Scholar
Arouisse, B., Korte, A., van Eeuwijk, F. & Kruijer, W. Imputation of 3 million SNPs in the Arabidopsis regional mapping population. Plant J. 102, 872–882 (2020).
Article CAS PubMed PubMed Central Google Scholar
Serra, H. et al. Massive crossover elevation via combination of HEI10 and recq4a recq4b during Arabidopsis meiosis. Proc. Natl Acad. Sci. USA 115, 2437–2442 (2018).
Article CAS PubMed PubMed Central Google Scholar
Schmidt, C. et al. Changing local recombination patterns in Arabidopsis by CRISPR/Cas mediated chromosome engineering. Nat. Commun. 11, 4418 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Horton, M. W. et al. Genome-wide patterns of genetic variation in worldwide Arabidopsis thaliana accessions from the RegMap panel. Nat. Genet. 44, 212–216 (2012).
Article CAS PubMed PubMed Central Google Scholar
Costas, C. et al. Genome-wide mapping of Arabidopsis thaliana origins of DNA replication and their associated epigenetic marks. Nat. Struct. Mol. Biol. 18, 395–U190 (2011).
Article CAS PubMed PubMed Central Google Scholar
Choi, K. et al. Nucleosomes and DNA methylation shape meiotic DSB frequency in Arabidopsis thaliana transposons and gene regulatory regions. Genome Res. 28, 532–546 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zhang, W. L., Zhang, T., Wu, Y. F. & Jiang, J. M. Genome-wide identification of regulatory DNA elements and protein-binding footprints using signatures of open chromatin in Arabidopsis. Plant Cell 24, 2719–2731 (2012).
Article CAS PubMed PubMed Central Google Scholar
Maher, K. A. et al. Profiling of accessible chromatin regions across multiple plant species and cell types reveals common gene regulatory principles and new control modules. Plant Cell 30, 15–36 (2018).
Article CAS PubMed Google Scholar
Alvarez, J. M. et al. Local changes in chromatin accessibility and transcriptional networks underlying the nitrate response in Arabidopsis roots. Mol. Plant 12, 1545–1560 (2019).
Article CAS PubMed Google Scholar
Zhong, Z. et al. DNA methylation-linked chromatin accessibility affects genomic architecture in Arabidopsis. Proc. Natl Acad. Sci. USA 118, e2023347118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Lambing, C. et al. Interacting genomic landscapes of REC8-cohesin, chromatin, and meiotic recombination in Arabidopsis. Plant Cell 32, 1218–1239 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kawakatsu, T. et al. Epigenomic diversity in a global collection of Arabidopsis thaliana accessions. Cell 166, 492–505 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bravo Nunez, M. A., Nuckolls, N. L. & Zanders, S. E. Genetic villains: killer meiotic drivers. Trends Genet. 34, 424–433 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chang, S. B., Anderson, L. K., Sherman, J. D., Royer, S. M. & Stack, S. M. Predicting and testing physical locations of genetically mapped loci on tomato pachytene chromosome 1. Genetics 176, 2131–2138 (2007).
Article CAS PubMed PubMed Central Google Scholar
Underwood, C. J. et al. Epigenetic activation of meiotic recombination near Arabidopsis thaliana centromeres via loss of H3K9me2 and non-CG DNA methylation. Genome Res. 28, 519–531 (2018).
Article CAS PubMed PubMed Central Google Scholar
Halldorsson, B. V. et al. Characterizing mutagenic effects of recombination through a sequence-level genetic map. Science 363, eaau1043 (2019).
Article CAS PubMed Google Scholar
Arbeithuber, B., Betancourt, A. J., Ebner, T. & Tiemann-Boege, I. Crossovers are associated with mutation and biased gene conversion at recombination hotspots. Proc. Natl Acad. Sci. USA 112, 2109–2114 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Rattray, A., Santoyo, G., Shafer, B. & Strathern, J. N. Elevated mutation rate during meiosis in Saccharomyces cerevisiae. Plos Genet. 11, e1004910 (2015).
Article PubMed PubMed Central CAS Google Scholar
Duret, L. & Arndt, P. F. The impact of recombination on nucleotide substitutions in the human genome. PLoS Genet 4, e1000071 (2008).
Article PubMed PubMed Central CAS Google Scholar
Hellmann, I. et al. Why do human diversity levels vary at a megabase scale? Genome Res. 15, 1222–1231 (2005).
Article CAS PubMed PubMed Central Google Scholar
Cutter, A. D. & Payseur, B. A. Genomic signatures of selection at linked sites: unifying the disparity among species. Nat. Rev. Genet. 14, 262–274 (2013).
Article CAS PubMed PubMed Central Google Scholar
Monroe, J. G. et al. Mutation bias reflects natural selection in Arabidopsis thaliana. Nature 602, 101–105 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Belfield, E. J. et al. Thermal stress accelerates Arabidopsis thaliana mutation rate. Genome Res. 31, 40–50 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhu, L. et al. Natural variation identifies SNI1, the SMC5/6 component, as a modifier of meiotic crossover in Arabidopsis. Proc. Natl Acad. Sci. USA 118, e2021970118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Fernandes, J. B., Seguela-Arnaud, M., Larcheveque, C., Lloyd, A. H. & Mercier, R. Unleashing meiotic crossovers in hybrid plants. Proc. Natl Acad. Sci. USA 115, 2431–2436 (2018).
Article CAS PubMed Google Scholar
Kleckner, N., Storlazzi, A. & Zickler, D. Coordinate variation in meiotic pachytene SC length and total crossover/chiasma frequency under conditions of constant DNA length. Trends Genet 19, 623–628 (2003).
Article CAS PubMed Google Scholar
Mieulet, D. et al. Unleashing meiotic crossovers in crops. Nat. Plants 4, 1010–1016 (2018).
Article CAS PubMed Google Scholar
Capilla-Perez, L. et al. The HEM lines: a new library of homozygous Arabidopsis thaliana EMS mutants and its potential to detect meiotic phenotypes. Front Plant Sci. 9, 1339 (2018).
Article PubMed PubMed Central Google Scholar
Rowan, B. A., Patel, V., Weigel, D. & Schneeberger, K. Rapid and inexpensive whole-genome genotyping-by-sequencing for crossover localization and fine-scale genetic mapping. G3 5, 385–398 (2015).
Article PubMed PubMed Central Google Scholar
Arabidopsis Genome I. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–815 (2000).
Article ADS Google Scholar
Lamesch, P. et al. The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res. 40, D1202–D1210 (2012).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
Article CAS PubMed PubMed Central Google Scholar
Goel, M., Sun, H., Jiao, W. B. & Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).
Article PubMed PubMed Central Google Scholar
Zhang, L., Liang, Z., Hutchinson, J. & Kleckner, N. Crossover patterning by the beam-film model: analysis and implications. PLoS Genet. 10, e1004042 (2014).
Article PubMed PubMed Central CAS Google Scholar
White, M. A., Wang, S., Zhang, L. & Kleckner, N. Quantitative modeling and automated analysis of meiotic recombination. Methods Mol. Biol. 1471, 305–323 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Capilla-Perez, L. et al. The synaptonemal complex imposes crossover interference and heterochiasmy in Arabidopsis. Proc. Natl Acad. Sci. USA 118, e2023613118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, H. K. et al. The cohesin loader SCC2 contains a PHD finger that is required for meiosis in land plants. PLoS Genet. 16, e1008849 (2020).
Article CAS PubMed PubMed Central Google Scholar
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnetjournal 17, 3 (2011).
Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Tarasov, A., Vilella, A. J., Cuppen, E., Nijman, I. J. & Prins, P. Sambamba: fast processing of NGS alignment formats. Bioinformatics 31, 2032–2034 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ramirez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic acids Res. 44, W160–W165 (2016).
Article CAS PubMed PubMed Central Google Scholar
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27, 1571–1572 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ranjan, C. & Najari, V. Package “nlcor:” compute nonlinear correlations. Research Gate. https://doi.org/10.13140/RG.2.2.33716.68480 (2020).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gao, F., Ming, C., Hu, W. J. & Li, H. P. New Software for the Fast Estimation of Population Recombination Rates (FastEPRR) in the Genomic Era. G3-Genes Genomes Genet. 6, 1563–1571 (2016).
CAS Google Scholar

Download references

Acknowledgements

We would like to thank the Max Planck Genome centre for DNA extraction, library preparation and sequencing, Hequan Sun and Wen-Biao Jiao for helpful discussions, Charles Underwood, Ian Henderson, and Andrew Tock for help with SPO11-1-oligo analysis and Wayne Crismani and Andrew Lloyd for critical reading of the manuscript. This work was support by core funding from the Max Planck Society and an Alexander von Humboldt Fellowship to Q.L.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Cologne, Germany
Qichao Lian, Victor Solier, Birgit Walkemeier, Stéphanie Durand, Korbinian Schneeberger & Raphael Mercier
Max Planck-Genome-centre Cologne, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829, Cologne, Germany
Bruno Huettel
Faculty of Biology, LMU Munich, 82152, Planegg-Martinsried, Germany
Korbinian Schneeberger

Authors

Qichao Lian
View author publications
You can also search for this author in PubMed Google Scholar
Victor Solier
View author publications
You can also search for this author in PubMed Google Scholar
Birgit Walkemeier
View author publications
You can also search for this author in PubMed Google Scholar
Stéphanie Durand
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Huettel
View author publications
You can also search for this author in PubMed Google Scholar
Korbinian Schneeberger
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Mercier
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.L., K.S. and R.M. designed the research and analyzed the data. V.S. and B.W. generated plant materials. B.H. supervised the whole-genome sequencing work. S.D. performed and analyzed the cytology experiments. Q.L. and R.M. wrote the article with input from K.S.

Corresponding authors

Correspondence to Korbinian Schneeberger or Raphael Mercier.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Ian Henderson, Liangran Zhang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lian, Q., Solier, V., Walkemeier, B. et al. The megabase-scale crossover landscape is largely independent of sequence divergence. Nat Commun 13, 3828 (2022). https://doi.org/10.1038/s41467-022-31509-8

Download citation

Received: 10 January 2022
Accepted: 20 June 2022
Published: 02 July 2022
DOI: https://doi.org/10.1038/s41467-022-31509-8

This article is cited by

Structural variation and DNA methylation shape the centromere-proximal meiotic crossover landscape in Arabidopsis
- Joiselle B. Fernandes
- Matthew Naish
- Ian R. Henderson
Genome Biology (2024)
Meiotic recombination dynamics in plants with repeat-based holocentromeres shed light on the primary drivers of crossover patterning
- Marco Castellani
- Meng Zhang
- André Marques
Nature Plants (2024)
A pan-genome of 69 Arabidopsis thaliana accessions reveals a conserved genome structure throughout the global species range
- Qichao Lian
- Bruno Huettel
- Raphael Mercier
Nature Genetics (2024)
Molecular mechanisms and regulation of recombination frequency and distribution in plants
- Meilin Zou
- Sergey Shabala
- Meixue Zhou
Theoretical and Applied Genetics (2024)
Genetic dissection and identification of stripe rust resistance genes in the wheat cultivar Lanhangxuan 121, a cultivar selected from a space mutation population
- Qimeng Wu
- Lei Liu
- Chunlian Li
Molecular Breeding (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.