Rice 3D chromatin structure correlates with sequence variation and meiotic recombination rate

Golicz, Agnieszka A.; Bhalla, Prem L.; Edwards, David; Singh, Mohan B.

doi:10.1038/s42003-020-0932-2

Article
Open access
Published: 12 May 2020

Agnieszka A. Golicz¹,
Prem L. Bhalla¹,
David Edwards ORCID: orcid.org/0000-0001-7599-6760² &
Mohan B. Singh¹

Communications Biology volume 3, Article number: 235 (2020)

Abstract

Genomes of many eukaryotic species have a defined three-dimensional architecture critical for cellular processes. They are partitioned into topologically associated domains (TADs), defined as regions of high chromatin inter-connectivity. While TADs are not a prominent feature of A. thaliana genome organization, they have been reported for other plants including rice, maize, tomato and cotton, in which TAD formation appears to be linked to transcription and chromatin epigenetic status. Here we show that in the rice genome, sequence variation and meiotic recombination rate correlate with the 3D genome structure. TADs display increased SNP and SV density and higher recombination rate compared to inter-TAD regions. We associate the observed differences with the TAD epigenetic landscape, TE composition and an increased incidence of meiotic crossovers.

Introduction

The mechanism and functional significance of DNA packaging in the nucleus have been a long-standing question in biology. The genomes of animals and plants are partitioned into chromatin domains, referred to as topologically associated domains (TADs)^1,2. These are structural units of chromosome compartmentalization, which emerged as a key feature of higher-order genome organization. They define the regulatory landscapes of chromosomes, form units of co-regulated genes and confine the effect of distal regulatory elements³. The abundance of plant TADs is related to genome size; they are not prominent in the compact genome of Arabidopsis^3,4,5,6, but are more abundant in the larger genomes of rice, maize, tomato, sorghum, foxtail millet and cotton^7,8,9. Rice TAD borders were reported to be associated with active epigenetic marks and high levels of transcription. Previous results also hinted at an asymmetric epigenetic mark and gene-density distribution across TAD borders⁷.

To broaden our understanding of plant TADs, here we asked whether the presence of TADs in the rice genome is associated with changes in nucleotide and structural variant density, using data from the 3000 Rice Genomes Project, the largest publicly available crop plant genome resequencing dataset, as well as the associated publicly available SNP and structural variant calls^10,11,12. We found that the rice genome can be divided into TAD and inter-TAD regions. Compared to inter-TADs, TADs have increased sequence variant density and meiotic recombination rate, and are over-represented in transposable elements (TEs) and silencing epigenetic marks. Genes found in TADs are shorter, have lower expression levels and are overrepresented in functions related to signalling and response to environmental stimuli.

Results

TAD discovery

To date, three Hi-C-based studies of the 3D conformation of the rice genome have been performed^7,8,13. All three studies identified TADs; however, there were substantial differences in domain size, genome coverage by the identified domains and intra-domain interaction strength (Supplementary note, Supplementary Table 1). The differences most likely reflect the TAD discovery algorithms used as well as the underlying TAD definitions adopted. Liu et al.⁷ defined TADs only as regions of very strong interaction signals and the resulting TADs were relatively small (median size 45 to 50 kb), covering about a third of the genome, whereas TADs identified by Dong et al.^8,13 were much larger (median size of 160 kb and 450 kb, respectively), covering a higher proportion of the genome, but also had a much lower median intra-TAD interaction signal (Supplementary Table 1). Together these results suggest that rice TADs, similar to what was found in metazoans¹⁴, could have a hierarchical structure with smaller, densely interacting domains contained within larger-scale domains of hundreds of kilobase pairs. However, the analysis performed by Liu et al.⁷, as well as visual inspection of the interaction maps of the rice genome (Fig. 1a), clearly indicates that there exist smaller regions of increased interactions, which manifest as strong triangular Hi-C signals¹⁵. We were intrigued by those and wondered whether any sequence features distinguish them from the remainder of the rice genome. We therefore used Armatus¹⁴, a software package designed specifically for the discovery of densely interacting domains, which has been shown to outperform both the Arrowhead¹⁶ and DomainCaller¹⁷ algorithms in benchmarking comparisons¹⁸.
We performed TAD discovery using all three available rice Hi-C datasets^7,8,13 but ultimately chose the TADs identified by Armatus from the Liu et al.⁷ dataset for final analysis, based on biological replicate concordance, number of valid pairs identified by Hi-C-Pro, TAD size and interaction strength (Supplementary note). We then compared the coordinates of the 4599 TADs identified in our study with those reported by Liu et al.⁷ and found that the two sets of TADs overlap significantly more than would be expected by chance alone (Fig. 1b, Supplementary data 1). We also investigated the distribution of epigenetic marks at TAD boundaries and found prominent H3K4me3, H3K9ac and H4K12ac peaks centred on TAD boundaries (Fig. 2, Supplementary Fig. 1), which is consistent with previous findings of enrichment of those epigenetic marks at rice TAD boundaries^7,8,13. The TADs discovered had a median size of 35 kb and covered 69.7% of the rice genome, while regions falling outside of identified TADs (inter-TADs) had a median size of 25 kb and covered 30.3% of the rice genome.

**Fig. 1: Distribution of TADs across the rice genome.**

**Fig. 2: Variant profiles across human and rice TAD boundaries.**

We then set out to explore whether there exist any genetic and epigenetic features which distinguish the densely interacting TADs (regions identified as TADs by Armatus) from inter-TAD regions (genomic regions falling outside of TADs identified by Armatus) (Fig. 1a).
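The TAD/inter-TAD partition described above can be illustrated with a minimal sketch, assuming TADs on one chromosome are given as half-open `(start, end)` intervals. The function name `inter_tads` and the toy coordinates are hypothetical and are not taken from the study.

```python
def inter_tads(tads, chrom_length):
    """Return the regions of one chromosome not covered by any TAD.

    tads: iterable of half-open (start, end) intervals (assumed layout).
    chrom_length: chromosome length in bp.
    """
    gaps = []
    cursor = 0  # first position not yet assigned to a TAD or a gap
    for start, end in sorted(tads):
        if start > cursor:
            gaps.append((cursor, start))
        cursor = max(cursor, end)  # tolerate overlapping or nested TADs
    if cursor < chrom_length:
        gaps.append((cursor, chrom_length))
    return gaps


# Toy example: two TADs on a 120 kb chromosome.
print(inter_tads([(10_000, 45_000), (70_000, 105_000)], 120_000))
# prints [(0, 10000), (45000, 70000), (105000, 120000)]
```

Treating inter-TADs as the complement of the TAD set means every base belongs to exactly one of the two classes, which matches the 69.7% / 30.3% genome coverage split reported above.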

Different variation profiles of rice and human TAD borders

A significantly reduced sequence variation is observed at TAD borders within the human genome, and this sequence conservation has been attributed to the requirement for boundary formation and recognition by boundary-binding proteins, including CTCF¹⁹. Deletion and mutation of TAD borders have in turn been linked to pathogenicity, including developmental disorders and cancers^19,20. Although no CTCF homologue has been found in plants, homologues of cohesins are present^2,21. One of the important outstanding questions regarding TAD formation in plants is whether sequence-specific boundary recognition is necessary for TAD formation. Genome-wide analysis of variant density allowed us to compare plant and human TAD boundary variant profiles.

Conservation of human TAD boundaries is reflected in SNP and structural variant distribution across the human genome¹⁹. When SNP and structural variant breakpoint density is plotted across TAD borders, there is a pronounced reduction in density exactly at the TAD border¹⁹ (Fig. 2a). Human TAD borders also display increased gene density¹⁷, which likely contributes to the variation profile observed (Fig. 2a). However, upon partitioning into genic and intergenic sequence, the trend of reduced sequence variation at the TAD boundary persists, supporting the importance of boundary sequence conservation (Supplementary Fig. 2). We then employed the same approach¹⁹ to study the variant profile of TAD borders across the rice genome. If there were a requirement for sequence recognition for boundary formation in rice, we would expect a similar reduction in variant density at the TAD border. However, in rice no immediate reduction in variant density at the TAD boundary was observed (Fig. 2b). We did observe a reduction in variant density at ~5 kb before the TAD boundary, but this most likely corresponds to an increase in gene density because, in contrast to what is observed in human, the 5 kb dip in variant density disappears upon partitioning into genic and intergenic sequence (Supplementary Fig. 2). The finding suggests that sequence conservation at TAD boundaries may be less important for rice than it is for human TAD formation. In addition, we noted an overall uneven distribution of both SNPs and SV breakpoints between TADs and inter-TADs, with TADs appearing to have a higher density of genomic variants.

Rice TADs have higher variant density compared to inter-TADs

Having observed an asymmetry in variant distribution across TAD borders (Fig. 2), we then turned to comparisons of entire TAD and inter-TAD regions. First, we asked if TADs and inter-TADs differ in genomic variant distribution. We investigated the distribution of 6,495,641 SNPs, 3,534,001 deletion and 410,823 insertion breakpoints in the rice genome. We found that both SNP and structural variant breakpoint density is higher within rice TADs compared to inter-TADs (Fig. 3). Rice TADs and inter-TAD regions differ in protein coding gene density (median TAD gene density: 0.11 gene/kb; median inter-TAD gene density: 0.15 gene/kb; two-tailed Wilcoxon rank sum test p < 0.001) and gene bodies demonstrate reduced SNP and SV abundance compared to intergenic regions due to functional constraints^11,12. To avoid bias due to differential gene density between regions, we partitioned the rice genome into exons, introns and intergenic sequence. Comparison of these partitions demonstrated an increased SNP and insertion and deletion breakpoint density within TADs across both genic and intergenic sequences (Fig. 3). The results suggest that some properties of TADs, beyond differences in gene density, may contribute to their increased sequence variation.
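A per-region variant density of the kind compared above can be sketched as follows. This is an illustration only: the function name `density_per_kb`, the half-open interval convention and the toy coordinates are assumptions, and the study itself computed overlaps with bedtools (see Methods).

```python
from bisect import bisect_left


def density_per_kb(positions, regions):
    """Variants per kb for each region on one chromosome.

    positions: sorted list of variant (SNP or SV breakpoint) coordinates.
    regions: list of half-open (start, end) intervals, e.g. TADs or inter-TADs.
    """
    densities = []
    for start, end in regions:
        # Number of positions p with start <= p < end, via binary search.
        n = bisect_left(positions, end) - bisect_left(positions, start)
        densities.append(1000.0 * n / (end - start))
    return densities


# Toy example: 3 variants in a 2 kb region, 2 variants in an 8 kb region.
print(density_per_kb([100, 1500, 1800, 2500, 7000], [(0, 2000), (2000, 10000)]))
# prints [1.5, 0.25]
```

Normalising by region length matters here because TADs and inter-TADs differ in median size (35 kb versus 25 kb), so raw counts would not be comparable.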

**Fig. 3: Variant density in TADs and inter-TAD regions.**

Correlation between rice genomic and epigenomic features

We wanted to investigate the potential causes for the differences in SNP and SV breakpoint density between TAD and inter-TAD regions. To that end we first performed a genome-wide analysis attempting to correlate different sequence and epigenetic features across the rice genome. A similar strategy has previously been used for the analysis of human data and revealed that the chromatin state is a major influence on regional mutation rates²². For example, in human, H3K9 di- and tri-methylation and methylation in CG context have been associated with increased SNV mutation rate^22,23,24,25. In medaka, a Japanese rice fish, DNA hypermethylation is associated with an increase in all types of single nucleotide substitutions, not just spontaneous oxidative de-amination of methylated CpG²⁴. Some of the proposed mechanisms of action include alternative DNA repair pathways and impaired access of repair machinery due to chromatin compaction^24,26. Similarly, our genome-wide analysis revealed a significant positive correlation between SNP density across both genic and intergenic regions, CG and CHG DNA methylation and H3K9 di-methylation (Fig. 4, Supplementary Figs 3 and 4, Supplementary data 2). SNP density in both genic and intergenic regions was also positively correlated with transposable element density (especially CACTA DNA transposons and retrotransposons), revealing a potential link between transposon density, heterochromatin and SNP accumulation. Transposons are subject to heterochromatic silencing associated with high levels of CG and CHG DNA methylation²⁷, and heterochromatic silencing has been shown to spread to the surrounding sequences in maize²⁸ and rice²⁹. As transposon density as well as CG, CHG and H3K9me2 methylation are positively correlated with SNP density, heterochromatic spreading could contribute to SNP accumulation. Condensed chromatin self-organization^30,31 could also impair access of the DNA repair machinery.
In addition, compared with RNA transposons, DNA transposons had stronger positive correlation with deletion breakpoints, while RNA transposons were more strongly correlated with insertion breakpoints, in accordance with the postulated transposon origin of many reported rice SVs¹². Finally, we observed an association between variant density and recombination rate, especially SNP density being positively correlated with recombination rate. The observation is in line with previous findings as increased SNP density was found to be associated with meiotic crossovers^32,33,34.

**Fig. 4: Correlations between different genomic and epigenomic features across the rice genome.**

We then set out to test which of the features identified in this genome-wide analysis could contribute to increased variant density within TADs.

TADs and inter-TADs differ in epigenetic marks, TEs and recombination rate

First, we compared levels of DNA methylation and histone modifications between TADs and inter-TAD regions. We found that TADs are lower in active and higher in repressive epigenetic marks compared to inter-TAD regions (Fig. 5a). TADs have significantly higher CG and CHG DNA methylation and H3K9me2 levels than inter-TADs, a feature which has been shown to be mutagenic in humans²² and was positively correlated with SNP density in our genome-wide analysis, suggesting that it could contribute to single nucleotide mutation accumulation in TADs. The results are also consistent with the asymmetry in epigenetic marks across TAD boundaries observed here (Supplementary Fig. 1) and previously reported⁷. We also observed that the proportion of C to T nucleotide substitutions is higher in TADs than in inter-TADs (median fraction for TADs 0.2419, median fraction for inter-TADs 0.2277, two-tailed Wilcoxon rank sum test p < 0.001), suggesting that spontaneous oxidative de-amination of CpG may also add to the observed differences in sequence variation between TAD and inter-TAD regions. The findings suggest that chromatin state and epigenetic modifications may contribute to the differential single nucleotide variant accumulation across TAD and inter-TAD regions.

**Fig. 5: Comparison of genomic and epigenomic features between TAD and inter-TAD regions.**

Increased presence of silencing marks is often associated with transposable elements (TEs)³⁵ and our genome-wide analysis pointed to correlation between TEs, epigenetic silencing marks and variant density. We therefore investigated the TE composition of TADs and found differential TE partitioning between TAD and inter-TAD regions. We found that TADs are overrepresented in CACTA DNA transposons and retrotransposons (Fig. 5b), which have been found to be positively correlated with SNP density (Fig. 4). Increased TE density in TADs may promote heterochromatic silencing of the surrounding chromatin^27,28 or condensed chromatin self-assembly^30,31, which in turn could contribute to single nucleotide variant accumulation. In addition TEs in rice are associated with insertions and deletions¹² (Fig. 4) and increased TE density in TADs most likely at least partially explains increased density of SV breakpoints.

Increased SNP density was previously found to be associated with meiotic crossovers^32,33,34. Our genome-wide analysis also revealed a significant positive correlation between SNP density and meiotic recombination rate (Fig. 4). We investigated whether TADs have a higher recombination rate and are more likely to overlap meiotic crossover (CO) events. We used two publicly available datasets: 1067 unique breakpoints generated by resequencing of 38 rice F2 indica individuals³³ and genome-wide recombination rates identified from 75 indica and 75 japonica accessions from the 3000 Rice Genomes Project³⁴. We found that TADs have a significantly higher recombination rate compared to inter-TAD regions (Fig. 5c). TADs overlapped more breakpoints than would be expected by chance (Fig. 5d). Interestingly, meiotic crossovers are also known to be associated with open chromatin state and high levels of H3K4me3, H3K9ac and H4K12ac³⁴, which are decreased in TADs compared to inter-TAD regions. On the other hand, retrotransposon methylation was shown to be positively correlated with recombination rate²⁹. A mechanism integrating multiple contributing factors including the underlying genome sequence, epigenetic modifications and 3D chromatin structure likely controls meiotic crossover positioning and recombination rate.

Genes found in TADs and inter-TADs have different functions

Our analysis suggests that the genome is effectively partitioned into TADs overrepresented in silent chromatin marks and transposable elements and inter-TAD regions associated with active chromatin. We were curious if the genes found within TADs and outside of TAD regions differ in any other properties beyond variant density. We found that genes found in inter-TAD regions are on average longer (Fig. 6a) and expressed at higher levels (Fig. 6b). We also searched for functional overrepresentation of TAD and inter-TAD genes. We found the inter-TAD genes to be overrepresented in functions related to translation, oxidative phosphorylation and protein transport (Fig. 6d) and the TAD genes to be overrepresented in functions related to protein phosphorylation and regulation of gene expression (Fig. 6c). The inter-TAD genes appear overrepresented in house-keeping functions while the TAD genes are overrepresented with functions related to signalling and responses to environmental stimuli.

**Fig. 6: Genes in TAD and inter-TAD regions are significantly overrepresented in different functional categories.**

Discussion

Our observations mirror those made in Drosophila melanogaster, where a proposed model suggests that chromatin is divided into TADs and inter-TAD regions, with inter-TADs harbouring active chromatin and genes overrepresented in house-keeping functions³¹. Our observations point to a similar organization of the rice genome, which is also reminiscent of the compacted and loose structural domains identified in Arabidopsis, albeit at a smaller scale^3,4,5. We also demonstrate that in contrast to human TAD borders, rice TAD borders do not show signatures of purifying selection, supporting the hypothesis that sequence recognition by insulating proteins is not necessary for TAD formation and consistent with the Drosophila model of TAD self-assembly due to an ability of inactive chromatin to aggregate³¹. We further extend those findings by analysis of sequence variants and find that TADs have a higher density of variants across both coding and non-coding regions. Our results suggest that, beyond involvement in chromatin folding, heterochromatic TADs may form a mutagenic environment which could contribute to variant accumulation, as increased SNP density was observed across both genic and non-genic regions.

Methods

TAD identification

The three existing rice Hi-C datasets were obtained from the Sequence Read Archive (PRJNA391551, PRJNA354683, PRJNA429927)^7,8,13. Interaction maps were built using Hi-C-Pro³⁶ v2.11.1 at 5 kb resolution using the rice genome MSU7/IRGSP1.0³⁷ as a reference. Interaction maps were first built separately for the biological replicates to investigate concordance between the results obtained (Supplementary note), and biological replicates were then merged to obtain the final interaction maps used for TAD identification. TADs were identified using Armatus¹⁴ v2.3. Several values of the γ parameter, which controls TAD size, were tested and the final value used was 0.4 (Supplementary note, Supplementary Table 2 and Supplementary Table 3).

Rice SNP and SV analysis

Rice genome and genome annotation (MSU7/IRGSP1.0) were downloaded from Phytozome v12³⁸. SNP (3K RG 18mio base SNP Dataset) and SV (RG Large Structural Variants release 1.0) datasets were downloaded from SNP-Seek database³⁹. SNPs were filtered to include only those found in Japonica and Indica lines with a minimum minor allele frequency of 0.01 using VCFtools⁴⁰. SV variants were pre-filtered to include only those found in Japonica and Indica lines. Filtering was performed for consistency with Hi-C data (Japonica lines only) and recombination data (Japonica and Indica lines) available. For deletions, start and end coordinates were used as breakpoints. For insertions, start coordinates were used as breakpoints.
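The breakpoint convention stated above (both ends of a deletion, only the start of an insertion) can be sketched as follows. The record layout `(chrom, start, end, sv_type)` and the labels `"DEL"` and `"INS"` are assumptions made for illustration; the actual SNP-Seek file format may differ.

```python
def sv_breakpoints(svs):
    """Convert structural variants into a sorted list of breakpoints.

    svs: iterable of (chrom, start, end, sv_type) records (hypothetical layout).
    Deletions contribute their start and end coordinates;
    insertions contribute their start coordinate only.
    """
    breakpoints = []
    for chrom, start, end, sv_type in svs:
        if sv_type == "DEL":
            breakpoints.append((chrom, start))
            breakpoints.append((chrom, end))
        elif sv_type == "INS":
            breakpoints.append((chrom, start))
        # Any other SV type is ignored in this sketch.
    return sorted(breakpoints)


# Toy example: one deletion and one insertion on the same chromosome.
print(sv_breakpoints([("chr1", 100, 250, "DEL"), ("chr1", 400, 400, "INS")]))
# prints [('chr1', 100), ('chr1', 250), ('chr1', 400)]
```

Under this convention each deletion yields two breakpoints and each insertion yields one, so breakpoint counts are not the same as variant counts.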

Rice recombination, epigenetic and expression data

Crossover breakpoints were obtained from ref. ³³. Recombination rates across the rice genome were obtained from ref. ³⁴. Histone modification ChIP-Seq data were obtained from PlantDHS⁴¹. DNA methylation data were obtained from MethBank⁴² v3.0. Gene expression data were obtained from ref. ⁷ (PRJNA354683). Expression levels were quantified using Kallisto⁴³ v0.45.0.

Rice TE annotation

The non-redundant set of TEs was obtained from TREP database⁴⁴. RepeatMasker v4.0.7, with default parameters using TREP database, was used to annotate the TEs in the rice genome.

Overlap analysis

The overlaps between TADs, genes, variants, histone and DNA modifications were computed using bedtools⁴⁵ v2.28. The significance of overlap between Liu et al. TADs, crossover breakpoints, TEs and Armatus TADs was evaluated using regioneR⁴⁶ v1.15.2 overlapPermTest function by random region re-distribution across the genome (randomizeRegions).
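The idea behind a region-randomisation overlap test can be sketched in plain Python. This is a simplified stand-in written for illustration and is not regioneR's implementation: it handles a single chromosome, places each region independently at a uniform random position, and reports a one-sided empirical p-value. All function names, defaults and the toy data are assumptions.

```python
import random


def count_hits(points, regions):
    """Number of points that fall inside at least one half-open region."""
    return sum(any(s <= p < e for s, e in regions) for p in points)


def shuffle_regions(regions, chrom_length, rng):
    """Re-place each region at a uniform random start, preserving its length."""
    shuffled = []
    for s, e in regions:
        length = e - s
        new_start = rng.randrange(0, chrom_length - length + 1)
        shuffled.append((new_start, new_start + length))
    return shuffled


def overlap_perm_test(points, regions, chrom_length, n_perm=1000, seed=0):
    """Observed hit count and the empirical P(null hits >= observed hits)."""
    rng = random.Random(seed)
    observed = count_hits(points, regions)
    null = [
        count_hits(points, shuffle_regions(regions, chrom_length, rng))
        for _ in range(n_perm)
    ]
    # Add-one correction keeps the estimated p-value strictly above zero.
    p_value = (1 + sum(x >= observed for x in null)) / (1 + n_perm)
    return observed, p_value


# Toy example: three crossover breakpoints that all sit inside one region.
obs, p = overlap_perm_test([5, 15, 25], [(0, 30)], chrom_length=1000, n_perm=200)
print(obs, p)
```

Randomising the regions rather than the points keeps the region size distribution fixed, so the null distribution reflects only where the regions are placed.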

Relative abundance at TAD boundary

Relative abundance calculation was performed based on the score introduced by Fudenberg et al.¹⁹ with a minor adjustment of operating on densities rather than total counts and using a random re-distribution of windows rather than genome-wide abundance to obtain expected values to account for potential bias resulting from performing calculations on genomic windows.

Re-distribution of TAD boundaries across the rice genome was performed using randomizeRegions function of regioneR v1.15.2.
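A rough sketch of a boundary-centred relative abundance profile is given below. The text does not spell out the exact form of the score, so the observed/expected ratio, the bin layout and every name and default (`profile`, `relative_abundance`, `flank`, `bin_size`, `n_random`) are assumptions. The only points taken from the description above are that the calculation works on densities and that the expectation comes from randomly re-distributed windows.

```python
import random
from bisect import bisect_left


def profile(positions, boundaries, flank, bin_size):
    """Mean variant density (per bp) in bins tiling [-flank, +flank) around boundaries.

    positions must be a sorted list of variant coordinates on one chromosome.
    """
    n_bins = 2 * flank // bin_size
    totals = [0] * n_bins
    for b in boundaries:
        for i in range(n_bins):
            lo = b - flank + i * bin_size
            hi = lo + bin_size
            totals[i] += bisect_left(positions, hi) - bisect_left(positions, lo)
    return [t / (len(boundaries) * bin_size) for t in totals]


def relative_abundance(positions, boundaries, chrom_length,
                       flank=20_000, bin_size=5_000, n_random=100, seed=0):
    """Observed density per bin divided by the mean density at random boundaries."""
    rng = random.Random(seed)
    observed = profile(positions, boundaries, flank, bin_size)
    expected = [0.0] * len(observed)
    for _ in range(n_random):
        # Random boundaries are kept far enough from the chromosome ends
        # that every window lies fully inside the chromosome.
        rand = [rng.randrange(flank, chrom_length - flank) for _ in boundaries]
        for i, d in enumerate(profile(positions, rand, flank, bin_size)):
            expected[i] += d / n_random
    return [o / e if e > 0 else float("nan") for o, e in zip(observed, expected)]


# Toy example: evenly spaced variants give a flat profile with ratios close to 1.
variants = list(range(0, 1_000_000, 100))
print(relative_abundance(variants, [200_000, 500_000], 1_000_000))
```

Values below 1 in the central bins would indicate depletion at the boundary, as reported for human TAD borders, whereas a flat profile would indicate no boundary-specific constraint.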

Genome-wide feature correlation analysis

The genome was split into 40 kb non-overlapping windows. Densities and signal levels for the corresponding genomic and epigenomic features were computed using bedtools⁴⁵ v2.28 intersect and map functions. Pearson correlations were computed for log2-transformed and non-transformed values. Spearman correlations were computed for non-transformed values. Significance was tested using cor.mtest, implemented in the package corrplot (p < 0.05). To avoid log2(0), 0.1 (median value of the entire dataset) was added to all values.
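The correlation step can be illustrated with a short pure-Python sketch, assuming each feature has already been summarised as one value per 40 kb window. The study used R (`cor` and `cor.mtest`); the helper names and toy values below are hypothetical, and only the 0.1 pseudocount is taken from the text.

```python
import math


def log2_pseudo(values, pseudo=0.1):
    """log2-transform after adding a pseudocount, so that zeros are defined."""
    return [math.log2(v + pseudo) for v in values]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation: the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Toy per-window values (hypothetical): SNP density and CG methylation level.
snp = [0.0, 1.2, 3.5, 2.0, 8.1]
cg = [0.05, 0.20, 0.55, 0.30, 0.90]
print(pearson(log2_pseudo(snp), log2_pseudo(cg)), spearman(snp, cg))
```

Spearman is computed on untransformed values because a monotonic transform such as log2 leaves ranks unchanged.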

Gene ontology (GO) analysis

GO annotation was obtained from agriGO v2.0⁴⁷. topGO⁴⁸ v2.35.0 was used to compute functional overrepresentation of genes using method ‘weight’ to adjust for multiple comparisons.

Human genomic diversity analysis

Human 1000 Genomes variants⁴⁹ were downloaded from ENSEMBL and pre-filtered, retaining only single nucleotide variants (SNVs) with minor allele frequency of over 0.01. Coordinates of human TADs were obtained from ref. ¹⁶ (GSE63525). Human deletions were obtained from ref. ⁵⁰ (dbVar nstd100).

Statistics and reproducibility

Differences between TADs and inter-TADs were tested for statistical significance using the two-tailed Wilcoxon rank sum test implemented in R. Overrepresentation of features within TADs was tested for statistical significance using the permutation test implemented in regioneR. Pearson and Spearman correlations were computed using the cor function in R and significance was computed using the cor.mtest function implemented in the package corrplot. Enrichment of GO terms was analyzed using the topGO package, using method ‘weight’ to adjust for multiple comparisons.
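For readers unfamiliar with the rank sum test, a minimal sketch is given below. It uses the normal approximation with tie and continuity corrections, so for small samples it will not reproduce the exact p-values that R's `wilcox.test` can report. The function name and toy inputs are hypothetical.

```python
import math


def rank_sum_test(a, b):
    """Two-sided Wilcoxon rank sum (Mann-Whitney U) test, normal approximation.

    Returns (U statistic for sample a, approximate two-sided p-value).
    """
    n1, n2 = len(a), len(b)
    pooled = sorted([(v, 0) for v in a] + [(v, 1) for v in b])
    rank_sum_a = 0.0
    tie_term = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + j) / 2 + 1          # average rank shared by tied values
        t = j - i + 1                        # size of this tie group
        tie_term += t ** 3 - t
        rank_sum_a += avg_rank * sum(1 for k in range(i, j + 1) if pooled[k][1] == 0)
        i = j + 1
    u = rank_sum_a - n1 * (n1 + 1) / 2
    n = n1 + n2
    mean_u = n1 * n2 / 2
    var_u = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u, 1.0                        # all values identical
    z = max((abs(u - mean_u) - 0.5) / math.sqrt(var_u), 0.0)
    p_value = math.erfc(z / math.sqrt(2))    # equals 2 * (1 - Phi(z))
    return u, p_value


# Toy example: hypothetical per-region gene densities (gene/kb).
tad = [0.08, 0.10, 0.11, 0.12, 0.09, 0.11]
inter = [0.14, 0.15, 0.16, 0.13, 0.17, 0.15]
print(rank_sum_test(tad, inter))
```

A rank-based test is a natural fit for these comparisons because per-region densities are typically skewed and the test makes no normality assumption.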

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data used in the manuscript is publicly available (See “Methods”).

Supplementary data 1 and 2 can be found under https://osf.io/hevzb/.

Code availability

No custom code or mathematical algorithm central to the conclusions was used.

References

Dixon, JesseR., Gorkin, David, U. & Ren, B. Chromatin domains: the unit of chromosome organization. Mol. Cell 62, 668–680 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sotelo-Silveira, M., Chavez Montes, R. A., Sotelo-Silveira, J. R., Marsch-Martinez, N. & de Folter, S. Entering the next dimension: plant genomes in 3D. Trends Plant Sci. 23, 598–612 (2018).
Article CAS PubMed Google Scholar
Feng, S. et al. Genome-wide Hi-C analyses in wild-type and mutants reveal high-resolution chromatin interactions in arabidopsis. Mol. Cell 55, 694–707 (2014).
Article CAS PubMed PubMed Central Google Scholar
Grob, S., Schmid, Marc, W. & Grossniklaus, U. Hi-C analysis in arabidopsis identifies the KNOT, a structure with similarities to the flamenco locus of Drosophila. Mol. Cell 55, 678–693 (2014).
Article CAS PubMed Google Scholar
Wang, C. et al. Genome-wide analysis of local chromatin packing in Arabidopsis thaliana. Genome Res. 25, 246–256 (2015).
Article PubMed PubMed Central CAS Google Scholar
Liu, C. et al. Genome-wide analysis of chromatin packing in Arabidopsis thaliana at single-gene resolution. Genome Res. 26, 1057–1068 (2016).
Article CAS PubMed PubMed Central Google Scholar
Liu, C., Cheng, Y. J., Wang, J. W. & Weigel, D. Prominent topologically associated domains differentiate global chromatin packing in rice from Arabidopsis. Nat. Plants 3, 742–748 (2017).
Article CAS PubMed Google Scholar
Dong, P. et al. 3D chromatin architecture of large plant genomes determined by local A/B compartments. Mol. Plant 10, 1497–1509 (2017).
Article CAS PubMed Google Scholar
Wang, M. et al. Evolutionary dynamics of 3D genome architecture following polyploidization in cotton. Nat. Plants 4, 90–97 (2018).
Article CAS PubMed Google Scholar
Wang, W. et al. Genomic variation in 3,010 diverse accessions of Asian cultivated rice. Nature 557, 43–49 (2018).
Article CAS PubMed PubMed Central Google Scholar
Tatarinova, T. V. et al. Nucleotide diversity analysis highlights functionally important genomic regions. Sci. Rep. 6, 35730 (2016).
Article PubMed PubMed Central Google Scholar
Fuentes, R. R. et al. Structural variants in 3000 rice genomes. Genome Res. 29, 870–880 (2019).
Dong, Q. et al. Genome-wide Hi-C analysis reveals extensive hierarchical chromatin interactions in rice. Plant J. 94, 1141–1156 (2018).
Article CAS PubMed Google Scholar
Filippova, D., Patro, R., Duggal, G. & Kingsford, C. Identification of alternative topological domains in chromatin. Algorithms Mol. Biol. 9, 14 (2014).
Article PubMed PubMed Central Google Scholar
Grob, S. Plants are not so different. Nat. Plants 3, 690–691 (2017).
Article PubMed Google Scholar
Rao, S. S. P. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Article CAS PubMed PubMed Central Google Scholar
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Article CAS PubMed PubMed Central Google Scholar
Forcato, M. et al. Comparison of computational methods for Hi-C data analysis. Nat. Methods 14, 679–685 (2017).
Article CAS PubMed PubMed Central Google Scholar
Fudenberg, G. & Pollard, K. S. Chromatin features constrain structural variation across evolutionary timescales. Proc. Natl Acad. Sci. USA 116, 2175 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kaiser, V. B. & Semple, C. A. When TADs go bad: chromatin structure and nuclear organisation in human disease. F1000Research 6, 314 (2017).
Article CAS Google Scholar
Wood, A. J., Severson, A. F. & Meyer, B. J. Condensin and cohesin complexity: the expanding repertoire of functions. Nat. Rev. Genet. 11, 391–404 (2010).
Article CAS PubMed PubMed Central Google Scholar
Schuster-Böckler, B. & Lehner, B. Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature 488, 504 (2012).
Article PubMed CAS Google Scholar
Carlson, J. et al. Extremely rare variants reveal patterns of germline mutation rate heterogeneity in humans. Nat. Commun. 9, 3753–3753 (2018).
Article PubMed PubMed Central CAS Google Scholar
Qu, W. et al. Genome-wide genetic variations are highly correlated with proximal DNA methylation patterns. Genome Res. 22, 1419–1425 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mugal, C. F. & Ellegren, H. Substitution rate variation at human CpG sites correlates with non-CpG divergence, methylation level and GC content. Genome Biol. 12, R58–R58 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sun, L. et al. Preferential protection of genetic fidelity within open chromatin by the mismatch repair machinery. J. Biol. Chem. 291, 17692–17705 (2016).
Article CAS PubMed PubMed Central Google Scholar
West, P. T. et al. Genomic distribution of H3K9me2 and DNA methylation in a maize genome. PLOS ONE 9, e105267 (2014).
Article PubMed PubMed Central CAS Google Scholar
Eichten, S. R. et al. Spreading of heterochromatin is limited to specific families of maize retrotransposons. PLOS Genet. 8, e1003127 (2012).
Article CAS PubMed PubMed Central Google Scholar
Choi, J. Y. & Purugganan, M. D. Evolutionary epigenomics of retrotransposon-mediated methylation spreading in rice. Mol. Biol. Evolution 35, 365–382 (2017).
Article CAS Google Scholar
Gavrilov, A. A. et al. Unraveling the mechanisms of chromatin fibril packaging. Nucleus 7, 319–324 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ulianov, S. V. et al. Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains. Genome Res. 26, 70–84 (2016).
Article PubMed PubMed Central Google Scholar
Makova, K. D. & Hardison, R. C. The effects of chromatin organization on variation in mutation rates in the genome. Nat. Rev. Genet. 16, 213–223 (2015).
Article CAS PubMed PubMed Central Google Scholar
Si, W. et al. Widely distributed hot and cold spots in meiotic recombination as shown by the sequencing of rice F2 plants. N. Phytol. 206, 1491–1502 (2015).
Article CAS Google Scholar
Marand, A. P. et al. Historical meiotic crossover hotspots fueled patterns of evolutionary divergence in rice. Plant Cell 31, 645 (2019).
Article CAS PubMed PubMed Central Google Scholar
Sigman, M. J. & Slotkin, R. K. The first rule of plant transposable element silencing: location, location, location. Plant Cell 28, 304 (2016).
Article CAS PubMed PubMed Central Google Scholar
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kawahara, Y. et al. Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice 6, 4 (2013).
Article PubMed PubMed Central Google Scholar
Goodstein, D. M. et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 40, D1178–D1186 (2012).
Article CAS PubMed Google Scholar
Mansueto, L. et al. Rice SNP-seek database update: new SNPs, indels, and queries. Nucleic Acids Res. 45, D1075–D1081 (2017).
Article CAS PubMed Google Scholar
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zhang, T., Marand, A. P. & Jiang, J. PlantDHS: a database for DNase I hypersensitive sites in plants. Nucleic Acids Res. 44, D1148–D1153 (2016).
Li, R. et al. MethBank 3.0: a database of DNA methylomes across a variety of species. Nucleic Acids Res. 46, D288–D295 (2018).
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
Wicker, T., Matthews, D. E. & Keller, B. TREP: a database for Triticeae repetitive elements. Trends Plant Sci. 7, 561–562 (2002).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Gel, B. et al. regioneR: an R/Bioconductor package for the association analysis of genomic regions based on permutation tests. Bioinformatics 32, 289–291 (2015).
Tian, T. et al. agriGO v2.0: a GO analysis toolkit for the agricultural community, 2017 update. Nucleic Acids Res. 45, W122–W129 (2017).
Alexa, A., Rahnenführer, J. & Lengauer, T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics 22, 1600–1607 (2006).
The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Coe, B. P. et al. Refining analyses of copy number variation identifies specific genes associated with developmental delay. Nat. Genet. 46, 1063–1071 (2014).

Acknowledgements

We would like to thank Dr Philipp Bayer for valuable comments on the manuscript. This work was supported by the University of Melbourne McKenzie Fellowship.

Author information

Authors and Affiliations

School of Agriculture and Food, Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, 3010, Australia
Agnieszka A. Golicz, Prem L. Bhalla & Mohan B. Singh
School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, 6009, Australia
David Edwards

Contributions

AAG conceived the study, designed the experiments, performed the analysis and wrote the manuscript. PLB and MBS edited the manuscript. DE provided critical comments on the study design and edited the manuscript. This research was supported by Spartan HPC at the University of Melbourne, Australia.

Corresponding author

Correspondence to Agnieszka A. Golicz.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Golicz, A.A., Bhalla, P.L., Edwards, D. et al. Rice 3D chromatin structure correlates with sequence variation and meiotic recombination rate. Commun Biol 3, 235 (2020). https://doi.org/10.1038/s42003-020-0932-2

Received: 29 May 2019
Accepted: 31 March 2020
Published: 12 May 2020
DOI: https://doi.org/10.1038/s42003-020-0932-2
