Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

A pan-grass transcriptome reveals patterns of cellular divergence in crops

Abstract

Different plant species within the grasses were parallel targets of domestication, giving rise to crops with distinct evolutionary histories and traits1. Key traits that distinguish these species are mediated by specialized cell types2. Here we compare the transcriptomes of root cells in three grass species—Zea mays, Sorghum bicolor and Setaria viridis. We show that single-cell and single-nucleus RNA sequencing provide complementary readouts of cell identity in dicots and monocots, warranting a combined analysis. Cell types were mapped across species to identify robust, orthologous marker genes. The comparative cellular analysis shows that the transcriptomes of some cell types diverged more rapidly than those of others—driven, in part, by recruitment of gene modules from other cell types. The data also show that a recent whole-genome duplication provides a rich source of new, highly localized gene expression domains that favour fast-evolving cell types. Together, the cell-by-cell comparative analysis shows how fine-scale cellular profiling can extract conserved modules from a pan transcriptome and provide insight on the evolution of cells that mediate key functions in crops.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Cell and nucleus profiles identify the same markers but show different sensitivities and artifacts.
Fig. 2: Mapping cell identities from maize to sorghum and gene duplicate analysis.
Fig. 3: Detection of dosage compensation and cellular destination of regulatory neofunctionalized genes.
Fig. 4: Differential divergence of cell types in maize compared with Setaria.

Similar content being viewed by others

Data availability

All reference genomes were downloaded from Arabidopsis TAIR10.38, at https://www.arabidopsis.org/, for Maize B73 v4, S. bicolor v3 and S. viridis v2 reference genomes at https://plants.ensembl.org/. All raw single-cell and single-nucleus RNA-seq data, expression matrices and analysed R-Seurat objects are available under Gene Expression Omnibus accession GSE225118. All data used to generate figures are available at https://figshare.com/articles/dataset/Data_for_Guillotin_et_al_/22331002, except for the following figures, for which the data can be found in the following deposited files under GEO accession GSE225118: Arabidopsis_Cells_Nuclei_Seurat_Obj.RData.gz (Fig. 1c and Extended Data Figs. 2c,d and 4a,b), Maize_Sorghum_Setaria_Cells_Nuclei_Seurat_Obj.RData.gz (Extended Data Figs. 3d and 5c,d). Data in Extended Data Figs. 2c,d and 3d, and those in Extended Data Fig. 5c,d are clustered separately. Data on single-cell RNA-seq quality control are provided in Supplementary Table 1. Analysis of single-cell versus single-nucleus RNA-seq data is provided in Supplementary Tables 2 and 3. Cell-specific marker genes for all species, including a shared pan library of marker genes, are provided in Supplementary Table 4. Data on regulon analysis are provided in Supplementary Table 5. Data on duplicate genes are provided in Supplementary Tables 6 and 7. Cellular divergence analysis is provided in Supplementary Table 8 and in situ probe information is provided in Supplementary Table 9.

References

  1. Woodhouse, M. R. & Hufford, M. B. Parallelism and convergence in post-domestication adaptation in cereal grasses. Philos. Trans. R. Soc. B 374, 20180245 (2019).

    Article  CAS  Google Scholar 

  2. Rich-Griffin, C. et al. Single-cell transcriptomics: a high-resolution avenue for plant functional genomics. Trends Plant Sci. 25, 186–197 (2020).

    Article  CAS  PubMed  Google Scholar 

  3. Marioni, J. C. & Arendt, D. How single-cell genomics is changing evolutionary and developmental biology. Annu. Rev. Cell Dev. Biol. 33, 537–553 (2017).

    Article  CAS  PubMed  Google Scholar 

  4. Shafer, M. E. R. Cross-species analysis of single-cell transcriptomic data. Front. Cell Dev. Biol. 7, 175 (2019).

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  5. Kajala, K. et al. Innovation, conservation, and repurposing of gene function in root cell type development. Cell 184, 3333–3348.e19 (2021).

    Article  CAS  PubMed  Google Scholar 

  6. Swigonova, Z. et al. On the tetraploid origin of the maize genome. Comp. Funct. Genomics 5, 281–284 (2004).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Swigonova, Z. Close split of sorghum and maize genome progenitors. Genome Res. 14, 1916–1923 (2004).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Kozlova, L. V., Nazipova, A. R., Gorshkov, O. V., Petrova, A. A. & Gorshkova, T. A. Elongating maize root: zone-specific combinations of polysaccharides from type I and type II primary cell walls. Sci. Rep. 10, 10956 (2020).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  9. Ma, W. et al. The mucilage proteome of maize (Zea mays L.) primary roots. J. Proteome Res. 9, 2968–2976 (2010).

    Article  CAS  PubMed  Google Scholar 

  10. Schittenhelm, S. & Schroetter, S. Comparison of drought tolerance of maize, sweet sorghum and sorghum–sudangrass hybrids. J. Agron. Crop Sci. 200, 46–53 (2014).

    Article  CAS  Google Scholar 

  11. Zhang, Y. et al. Differentially regulated orthologs in sorghum and the subgenomes of maize. Plant Cell 29, 1938–1951 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Zheng, Z. et al. Shared genetic control of root system architecture between Zea mays and Sorghum bicolor. Plant Physiol. 182, 977–991 (2020).

    Article  CAS  PubMed  Google Scholar 

  13. McKain, M. R. et al. Ancestry of the two subgenomes of maize. Preprint at BioRxiv https://doi.org/10.1101/352351 (2018).

  14. Schnable, J. C., Springer, N. M. & Freeling, M. Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss. Proc. Natl Acad. Sci. USA 108, 4069–4074 (2011).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  15. Bawa, G., Liu, Z., Yu, X., Qin, A. & Sun, X. Single-cell RNA sequencing for plant research: insights and possible benefits. Int. J. Mol. Sci. 23, 4497 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Farmer, A., Thibivilliers, S., Ryu, K. H., Schiefelbein, J. & Libault, M. Single-nucleus RNA and ATAC sequencing reveals the impact of chromatin accessibility on gene expression in Arabidopsis roots at the single-cell level. Mol. Plant 14, 372–383 (2021).

    Article  CAS  PubMed  Google Scholar 

  17. Long, Y. et al. FlsnRNA-seq: protoplasting-free full-length single-nucleus RNA profiling in plants. Genome Biol. 22, 66 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Marand, A. P., Chen, Z., Gallavotti, A. & Schmitz, R. J. A cis-regulatory atlas in maize at single-cell resolution. Cell 184, 3041–3055.e21 (2021).

    Article  CAS  PubMed  Google Scholar 

  19. Ortiz-Ramírez, C. et al. Ground tissue circuitry regulates organ complexity in maize and Setaria. Science 374, 1247–1252 (2021).

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  20. Ding, J. et al. Systematic comparison of single-cell and single-nucleus RNA-sequencing methods. Nat. Biotechnol. 38, 737–746 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Ray F. Evert. in Esau’s Plant Anatomy, Meristems, Cells, and Tissues of the Plant Body: their Structure, Function, and Development 3rd edn 99 (Wiley, 2006).

  22. Sorenson, R. S., Deshotel, M. J., Johnson, K., Adler, F. R. & Sieburth, L. E. Arabidopsis mRNA decay landscape arises from specialized RNA decay substrates, decapping-mediated feedback, and redundancy. Proc. Natl Acad. Sci. USA 115, E1485–E1494 (2018).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  23. Lotfollahi, M., Wolf, F. A. & Theis, F. J. scGen predicts single-cell perturbation responses. Nat. Methods 16, 715–721 (2019).

    Article  CAS  PubMed  Google Scholar 

  24. Ferrari, C., Manosalva Pérez, N. & Vandepoele, K. MINI-EX: integrative inference of single-cell gene regulatory networks in plants. Mol. Plant 15, 1807–1824 (2022).

    Article  CAS  PubMed  Google Scholar 

  25. Donner, T. J., Sherr, I. & Scarpella, E. Regulation of preprocambial cell state acquisition by auxin signaling in Arabidopsis leaves. Development 136, 3235–3246 (2009).

    Article  CAS  PubMed  Google Scholar 

  26. Wang, S. et al. RppM, encoding a typical CC-NBS-LRR protein, confers resistance to southern corn rust in maize. Front. Plant Sci. 13, 951318 (2022).

    Article  PubMed  PubMed Central  Google Scholar 

  27. Ingram, G. C., Magnard, J. L., Vergne, P., Dumas, C. & Rogowsky, P. M. ZmOCL1, an HDGL2 family homeobox gene, is expressed in the outer cell layer throughout maize development. Plant Mol. Biol. 40, 343–354 (1999).

    Article  CAS  PubMed  Google Scholar 

  28. Li, Z., Tang, J., Srivastava, R., Bassham, D. C. & Howell, S. H. The transcription factor bZIP60 links the unfolded protein response to the heat stress response in maize. Plant Cell 32, 3559–3575 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Guo, Z. et al. MRG1/2 histone methylation readers and HD2C histone deacetylase associate in repression of the florigen gene FT to set a proper flowering time in response to day-length changes. New Phytol. 227, 1453–1466 (2020).

    Article  CAS  PubMed  Google Scholar 

  30. Grover, C. E. et al. Homoeolog expression bias and expression level dominance in allopolyploids. New Phytol. 196, 966–971 (2012).

    Article  CAS  PubMed  Google Scholar 

  31. Lynch, M. & Force, A. The probability of duplicate gene preservation by subfunctionalization. Genetics 154, 459–473 (2000).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Chaudhary, B. et al. Reciprocal silencing, transcriptional bias and functional divergence of homeologs in polyploid cotton (Gossypium). Genetics 182, 503–517 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Hughes, T. E., Langdale, J. A. & Kelly, S. The impact of widespread regulatory neofunctionalization on homeolog gene evolution following whole-genome duplication in maize. Genome Res. 24, 1348–1355 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Zhao, M., Zhang, B., Lisch, D. & Ma, J. Patterns and consequences of subgenome differentiation provide insights into the nature of paleopolyploidy in plants. Plant Cell 29, 2974–2994 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Li, L. et al. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias. BMC Genomics 17, 1–16 (2016).

    Article  Google Scholar 

  36. Birchler, J. A. & Veitia, R. A. Gene balance hypothesis: connecting issues of dosage sensitivity across biological disciplines. Proc. Natl Acad. Sci. USA 109, 14746–14753 (2012).

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  37. Muyle, A., Marais, G. A. B., Bačovský, V., Hobza, R. & Lenormand, T. Dosage compensation evolution in plants: theories, controversies and mechanisms. Philos. Trans. R. Soc. B 377, 20210222 (2022).

    Article  CAS  Google Scholar 

  38. Walsh, J. R., Woodhouse, M. R., Andorf, C. M. & Sen, T. Z. Tissue-specific gene expression and protein abundance patterns are associated with fractionation bias in maize. BMC Plant Biol. 20, 4 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Renny-Byfield, S., Rodgers-Melnick, E. & Ross-Ibarra, J. Gene fractionation and function in the ancient subgenomes of maize. Mol. Biol. Evol. 34, 1825–1832 (2017).

    Article  CAS  PubMed  Google Scholar 

  40. Xu, X. et al. Single-cell RNA sequencing of developing maize ears facilitates functional analysis and trait candidate gene discovery. Dev. Cell 56, 557–568.e6 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Rastogi, S. & Liberles, D. A. Subfunctionalization of duplicated genes as a transition state to neofunctionalization. BMC Evol. Biol. 5, 28 (2005).

    Article  PubMed  PubMed Central  Google Scholar 

  42. Lee, J., Shah, M., Ballouz, S., Crow, M. & Gillis, J. CoCoCoNet: conserved and comparative co-expression across a diverse set of species. Nucleic Acids Res. 48, W566–W571 (2021).

    Article  Google Scholar 

  43. Van Deynze, A. et al. Nitrogen fixation in a landrace of maize is supported by a mucilage-associated diazotrophic microbiota. PLoS Biol. 16, e2006352 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  44. Galloway, A. F., Knox, P. & Krause, K. Sticky mucilages and exudates of plants: putative microenvironmental design elements with biotechnological value. New Phytol. 225, 1461–1469 (2020).

    Article  PubMed  Google Scholar 

  45. Werker, E. & Kislev, M. Mucilage on the root surface and root Hairs of sorghum: Heterogeneity in structure, manner of production and site of accumulation. Ann. Bot. 42, 809–816 (1978).

    Article  Google Scholar 

  46. Voiniciuc, C., Guenl, M., Schmidt, M. H.-W. & Usadel, B. Highly branched xylan made by IRX14 and MUCI21 links mucilage to Arabidopsis seeds. Plant Physiol. 169, 2481–2495 (2015).

    CAS  PubMed  PubMed Central  Google Scholar 

  47. Wang, B. et al. Genome-wide selection and genetic improvement during modern maize breeding. Nat. Genet. 52, 565–571 (2020).

    Article  PubMed  Google Scholar 

  48. Arendt, D. The evolution of cell types in animals: emerging principles from molecular studies. Nat. Rev. Genet. 9, 868–882 (2008).

    Article  CAS  PubMed  Google Scholar 

  49. Wang, X. et al. Genome alignment spanning major poaceae lineages reveals heterogeneous evolutionary rates and alters inferred dates for key evolutionary events. Mol. Plant 8, 885–898 (2015).

    Article  CAS  PubMed  Google Scholar 

  50. Efroni, I., Ip, P.-L., Nawy, T., Mello, A. & Birnbaum, K. D. Quantification of cell identity from single-cell gene expression profiles. Genome Biol. 16, 9 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  51. Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902 e21 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Hafemeister, C. & Satija, R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 20, 296 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Raju, S. K. K., Ledford, S. M. & Niederhuth, C. E. DNA methylation signatures of duplicate gene evolution in angiosperms. Plant Physiol. kiad220 (2023).

  54. Hernández-Coronado, M. et al. Plant glutamate receptors mediate a bet-hedging strategy between regeneration and defense. Dev. Cell 57, 451–465.e6 (2022).

    Article  PubMed  Google Scholar 

  55. Yanai, I. et al. Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification. Bioinformatics 21, 650–659 (2005).

    Article  CAS  PubMed  Google Scholar 

  56. Crow, M., Paul, A., Ballouz, S., Huang, Z. J. & Gillis, J. Characterizing the replicability of cell types defined by single cell RNA-sequencing data using MetaNeighbor. Nat. Commun. 9, 884 (2018).

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  57. Fischer, S., Crow, M., Harris, B. D. & Gillis, J. Scaling up reproducible research for single-cell transcriptomics using MetaNeighbor. Nat. Protoc. 16, 4031–4067 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Hanley, J. A. & McNeil, B. J. A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 148, 839–843 (1983).

    Article  CAS  PubMed  Google Scholar 

  59. Wolf, F. A., Angerer, P. & Theis, F. J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 19, 15 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  60. Crow, M., Suresh, H., Lee, J. & Gillis, J. Coexpression reveals conserved gene programs that co-vary with cell type across kingdoms. Nucleic Acids Res. 50, 4302–4314 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Huang, T., Guillotin, B., Rahni, R., Birnbaum, K. & Wagner, D. A rapid and sensitive multiplex, whole mount RNA fluorescence in situ hybridization and immunohistochemistry protocol. Preprint at bioRxiv https://doi.org/10.1101/2023.03.09.531900 (2023).

  62. Jackson, D., Veit, B. & Hake, S. Expression of maize KNOTTED1 related homeobox genes in the shoot apical meristem predicts patterns of morphogenesis in the vegetative shoot. Development 120, 405–413 (1994).

    Article  CAS  Google Scholar 

Download references

Acknowledgements

The authors thank M. Purugganan and G. Coruzzi for helpful comments. This work was funded by National Science Foundation (IOS-1934388) to K.D.B., D.J. and J.G., the National Institutes of Health (R35GM136362) to K.D.B., Human Frontiers of Science (LT000972/2018-L) to B.G. and startup funds from the University of California Riverside to S.C.G. In addition, M.P. is funded by the William Randolph Hearst Scholarship from the School of Biological Sciences. J.G. is also supported by the National Institutes of Health (R01 LM012736 and R01 MH113005).

Author information

Authors and Affiliations

Authors

Contributions

B.G. and K.D.B designed the research. B.G. generated all single-cell and single-nucleus RNA-seq data, with early profiles performed by C.O.R. M.A.M. and B.G. designed the single-nucleus RNA-seq protocol. R.R. and B.G. performed the whole-mount in situ hybridization analysis. R.R., X.X. and D.J. performed the tissue preparation and histology for the spatial transcriptomics analysis. S.C.G. and B.G. conceived the analysis strategy and performed the tests for dosage compensation. S.K.R. performed the non-WGD duplication identification. M.P and J.G. performed the MetaNeighbor, MINI-EX, CoCoCoNet and validation analysis. B.G. analysed all the data. K.D.B., B.G. and R.R. wrote the manuscript.

Corresponding author

Correspondence to Kenneth D. Birnbaum.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature thanks Patrick Edger, James Schnable and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Quality control and fidelity analysis of RNA-seq profiles using violin plots.

a Distribution of the number of UMI detected among cells vs. nuclei. b Distribution of the number of genes detected among cells vs. nuclei. c Pearson correlation distributions of gene expression from single-cell or single-nucleus compared to whole-root RNAseq data in Arabidopsis and maize. The distributions are derived by randomly sampling 2,000 genes for correlation analysis between cells and nuclei. The random sampling was repeated 250 times to generate the distribution of correlation values. Violin plots display show the kernel probability density of the data at different values, boxplot inside display as the middle black line is the median, exact media is displayed on the graphs, the lower and upper hinges correspond to the first and third quartiles (Q1,Q3), extreme line shows Q3+1.5xIQR to Q1-1.5xIQR (interquartile range-IQR). Dots beyond the extreme lines shows potential outliers.

Extended Data Fig. 2 Evaluation of agreement in nuclear and cell type profiles.

a, b UMAP clustering in Arabidopsis single-cells (a) and single-nuclei (b) clustered independently, showing clusters with the same diagnosed cell identities. c, d Dot plots showing expression levels per cluster and expression in percent of cells of the same set of cell-type specific markers in cells (c) or nuclei (d). The markers are in the same order in both plots.

Extended Data Fig. 3 Analysis of sensitivity of nuclear and cell profiles in distinguishing clusters and identifying markers.

a Arabidopsis down sampling analysis shows the number of cells needed to resolve different clusters. A branch signifies that a new cluster with a known cell type identity was distinguished at a given sample size. b A similar analysis using the single nucleus RNA-seq dataset, showing that more nuclei are needed to resolve the same number of clusters compared to cells in (a). Tracking the branches of graphs in (a) vs. (b) leads to a rule-of-thumb that two-fold more nuclei than cells are needed to identify clusters. c UMAP of the combined maize single-cell and -nuclei datasets, clusters are colored by cell type identity. d Dotplot of maize marker genes in cells (blue) or in nuclei (red), showing overall concordance of marker gene expression in the two datasets.

Extended Data Fig. 4 Analysis of differentially regulated genes and cell capture efficiency in nuclear vs. cellular profiles.

a, b Heatmaps of genes known to be induced by protoplast generation (Birnbaum et al., 2003) showing their expression in cells (a) vs. nuclei (b). The analysis shows that stress-induced genes also have higher expression in cells vs. nuclei, with a bias in specific cell types. c Distribution of expression levels of genes annotated for mRNA decay in cells or in nuclei, decay values from Sorenson et al., 2018. A significant increase in expression of mRNA decay-related genes was detected in nuclei, (n = 1965 genes, Wilcoxon rank sum test, two-sided, p-value = 1.98e-11), the boxplots display the middle line is the median, the lower and upper hinges correspond to the first and third quartiles (Q1,Q3), extreme line shows Q3+1.5xIQR to Q1-1.5xIQR (interquartile range-IQR). Dots beyond the extreme lines shows potential outliers. d Proportion of cells vs nuclei present in each cell type cluster.

Extended Data Fig. 5 Analysis of marker gene identification in maize single nucleus vs. cell profiles.

a, b UMAPs of maize single-cell and single-nucleus RNA-seq data clustered independently. Only the single nucleus RNA-seq dataset displays a cluster annotated as columella, which is absent in the single-cell dataset. c, d Dotplot of maize marker genes for each cell type cluster, showing expression in cells (c) and in nuclei (d) datasets independently. Markers for columella outlined in the red box are only present in the single nucleus dataset.

Extended Data Fig. 6 Analysis of overall expression similarity among all cellular and nuclear clusters in the three monocot species studied.

a AUROC test comparing every cell type in all species for both cell and nuclei datasets, showing that clusters discovered in either cell or nuclei group by like cell type and not by either species or source of material (cells or nuclei). b–c UMAPs generated by additional integration of the dataset using a Python supervised integration method scGen. This method uses a variational autoencoder to learn the underlying latent space for the cell types. b Different colors represent the clusters identified by the Seurat integration mapped onto the new scGen integration, showing Seurat classification was in relative agreement with the scGen classification. i.e., scGEN clusters have relatively homogenous coloration. c The same UMAP as in (b), this time showing the species distribution. Overall, each cluster has cells from each of the three species.

Extended Data Fig. 7 In-situ hybridization corroborating evidence for marker localization in single cell/nuclei RNA-seq profiles in maize.

a–n in situ hybridization using Hairpin Chain Reaction (HCR) probes labeling various transcripts. Cross sections are on the left and longitudinal sections are on the right. UMAPs showing each transcript’s cluster localization are displayed next to each probe’s fluorescent image. Additionally, spatial transcriptomics imaging data of the same probe is shown in the right column for (c–e). The minimum/maximum values for each fluorescence channel (grey: autofluorescence, magenta: HCR probes) have been adjusted to show the localization more clearly in the merged image.

Extended Data Fig. 8 In-situ hybridization corroborates evidence for localization of marker gene expression from single-cell RNA-seq profiles in sorghum.

a-i In situ hybridization using Hairpin Chain Reaction (HCR) probes labeling various transcripts. Cross sections are on the left and longitudinal sections on the right (a,c,d,e). Longitudinal sections are shown in (f,g,h,i). UMAPs showing each transcript’s cluster localization are shown next to each probe’s fluorescent image. The minimum/maximum values for each fluorescence channel (grey: autofluorescence, magenta: HCR probes) have been adjusted to show the localization more clearly in the merged image.

Extended Data Fig. 9 Regulon conservation across species, and distribution of gene pair expression patterns.

a Conserved regulons found using MINI-EX and their pattern of expression. The regulon is labeled by the transcription factor that putatively regulates it in each row. b–d Distribution of genes pairs on the dominance vs. regulatory subfunctionalization scale for transposed, tandem and proximal duplicate pairs. In blue, neofunctionalized duplicates are shown as a percentage of the bar. e–g Distribution on the dominance to regulatory subfunctionalization scale for dispersed gene duplicate pairs binned in thirds by their Ks value. The graphs suggest that duplicates tend to lose co-expressed patterns and gain dominance over time. h Boxplot of Ks values showing the distribution among all the duplicate classes used in the analysis. In h, statistical analysis was performed using a Kruskal-Wallis one-way ANOVA followed by the Tukey test for all pairwise comparisons. Not sharing a letter represents statistical significance at p < 0.05. In boxplots the middle line is the median, the lower and upper hinges correspond to the first and third quartiles (Q1,Q3), extreme line shows Q3+1.5xIQR to Q1-1.5xIQR (interquartile range-IQR). Dots beyond the extreme lines shows potential outliers. h. n = 10,104 WGD, n = 860 Proximal, n = 3,154 Transposed, n = 7,552 Dispersed, n = 1,448 Tandem.

Extended Data Fig. 10 Overall analysis of expression conservation in duplicate classes and analysis of columella expression across species.

a–c Dosage compensation analysis representing the expression ratios of maize over sorghum orthologous genes in tandem, transposed, and dispersed duplicate pairs. The first two boxplots represent cases in which a sorghum ortholog is expressed in the same homologous cell type as only a single maize duplicate (either M1 or M2). The third and fourth boxplots represent cases in which both homeologs are expressed in the same cell and a sorghum homolog is expressed in a homologous cell type. The last boxplot shows the ratio when both of the co-expressed homeologs are added together in the numerator, showing a mean ratio close to 1. The higher expression in the first two boxplots compared to the second two indicates dosage compensation. d Conservation rate of cis-regulatory elements between WGD homeolog pairs in promoters. The plot shows no major differences between co-expressed and dominant gene pairs, and no major differences among the different classes of duplication. e–h Distribution of maize genes displaying regulatory neofunctionalization of expression into new cell types. Colors signify the cell type of origin. i Expression heatmap of the 443 genes displaying high expression divergence across species in columella cells in maize, according to CoCoCoNet, with the orthologous gene expression in the other two species. j Example of the gene DMR6 switching its expression between columella in maize to epidermis / cortex in sorghum. a-c, statistical analysis was performed using ANOVA followed by the Tukey test for all pairwise comparisons, Not sharing a letter represents statistical significance at p < 0.05. In boxplots the middle line is the median, the lower and upper hinges correspond to the first and third quartiles (Q1,Q3), extreme line shows Q3+1.5xIQR to Q1-1.5xIQR (interquartile range-IQR). Dots beyond the extreme lines shows potential outliers. a–h: n = 10,104 WGD, n = 860 Proximal, n = 3,154 Transposed, n = 7,552 Dispersed, n = 1,448 Tandem.

Supplementary information

Reporting Summary

Peer Review File

Supplementary Table 1

Metric summary and details of different replicates performed in this study, number of genes from cell nucleus profiles and thresholds used.

Supplementary Table 2

Arabidopsis clustering repartition of nucleus or cell profiles alone and merged dataset.

Supplementary Table 3

Differentially expressed genes between nucleus and cell profiles for each species

Supplementary Table 4

Cell-type markers in maize, sorghum and Setaria, including conserved markers for each cell type across species.

Supplementary Table 5

Mini-Ex regulon prediction.

Supplementary Table 6

Metrics of orthologue dominance between M1 and M2, including cis-regulatory element predictions, Ka, Ks and neofunctionalization caracteristics.

Supplementary Table 7

GO_Enrichment_S_SM1_SM2_M1M2_SM1M2 used for Fig. 3e.

Supplementary Table 8

CococoNet Analysis all_functional_scores_calculated_Columella Enrichment Mucilage.

Supplementary Table 9

Probe references for in situ hybridization and molecular cartography.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guillotin, B., Rahni, R., Passalacqua, M. et al. A pan-grass transcriptome reveals patterns of cellular divergence in crops. Nature 617, 785–791 (2023). https://doi.org/10.1038/s41586-023-06053-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41586-023-06053-0

This article is cited by

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research