Megaphylogeny resolves global patterns of mushroom evolution

Mushroom-forming fungi (Agaricomycetes) have the greatest morphological diversity and complexity of any group of fungi. They have radiated into most niches and fulfill diverse roles in the ecosystem, including wood decomposers, pathogens or mycorrhizal mutualists. Despite the importance of mushroom-forming fungi, large-scale patterns of their evolutionary history are poorly known, in part due to the lack of a comprehensive and dated molecular phylogeny. Here, using multigene and genome-based data, we assemble a 5,284-species phylogenetic tree and infer ages and broad patterns of speciation/extinction and morphological innovation in mushroom-forming fungi. Agaricomycetes started a rapid class-wide radiation in the Jurassic, coinciding with the spread of (sub)tropical coniferous forests and a warming climate. A possible mass extinction, several clade-specific adaptive radiations, and morphological diversification of fruiting bodies followed during the Cretaceous and the Paleogene, convergently giving rise to the classic toadstool morphology, with a cap, stalk, and gills (pileate-stipitate morphology). This morphology is associated with increased rates of lineage diversification, suggesting it represents a key innovation in the evolution of mushroom-forming fungi. The increase in mushroom diversity started during the Mesozoic-Cenozoic radiation event, an era of humid climate when terrestrial communities dominated by gymnosperms and reptiles were also expanding.

Explosive diversification events, with intermittent periods of relatively little change, have generated uneven patterns of species richness across the tree of life. Although rapid radiations have been inferred in many clades1-5, few examples exist for fungi, in part because of their limited fossil record, and the lack of comprehensive phylogenies. Advances in modeling the evolutionary process now allow such inferences to be made from phylogenetic trees. Studies of diversification of mushroom-forming fungi have focused on individual clades6-9, yielding hypotheses on ecological opportunities6, the evolution of mutualistic lifestyle7,10 and fruiting body morphologies6,11 as drivers of adaptive evolution in fungi. However, these inferences, based on specific clades, remained untested across larger phylogenetic scales. Fruiting bodies provide support and protection for developing spores, which probably represents a driving force for increasingly more efficient morphologies12. Ancestral fruiting bodies of the Agaricomycotina were probably crust-like, 'resupinate' forms13,14, which then evolved into increasingly more complex forms, including derived 'pileate-stipitate' types, which are differentiated into a cap, stipe and hymenophore (spore-bearing surface). Pileate-stipitate forms have arisen repeatedly from simpler morphologies (e.g. resupinate or coral-like) during evolution12,13,15 and dominate extant agaricomycete diversity (>21,000 described species16), but how this diversity arose, what explains the dominance of pileate-stipitate species in the class and whether fruiting body morphology impacts diversification rates (e.g. as a key innovation) are not known.

Results and Discussion
We investigated broad patterns of diversification in Agaricomycetes using a phylogenetic comparative approach and generated the most comprehensive phylogeny of the group to date, comprising 5,284 species (Fig. 1). Our dataset represents ca. 26% of described Agaricomycetes, the highest sampling density reported for any fungal group to date. Our inferences of phylogenetic trees combine a robust genome-based backbone phylogeny of 104 species (650 genes, 141,951 amino acid characters, Supplementary Fig. 1-3, Supplementary Table 1, Supplementary Note 1,2) with sequence data from up to three nuclear loci for 5,284 taxa (LSU rRNA, rpb2, ef-1a, 5,737 characters). New sequence data were generated for 1,222 species (Supplementary Note 2, Supplementary Data 1), including 9 new genomes of phylogenetically key taxa (Supplementary Note 3). The phylogenetic tree topology is largely congruent with previously published clade-specific phylogenies and resolves 21 orders of Agaricomycetes plus Dacrymycetes and Tremellomycetes as outgroups. Most of the nodes received strong bootstrap support, but we also identified nodes that require more scrutiny and probably denser taxon sampling to be resolved with certainty (for a detailed genome-based overview of agaricomycete phylogeny see Supplementary Note 1). We inferred ten time-calibrated phylogenies for the 5,284 taxa dataset using a 2-stage Bayesian approach and all taxonomically informative, non-conflicting fossil calibration points available in the Agaricomycetes (eight fossils, determined by a fossil cross-validation procedure, see Supplementary Myr) being the oldest. The Agaricomycetidae, the group uniting the Agaricales and Boletales was estimated at ~185 Myr (174-192, Fig. 1, Supplementary Note 4). Because these molecular clock estimates are older than previous phylogenomics-based figures17-19, we performed sensitivity analyses to address the underlying causes, which suggest that the differences are attributable to the higher taxon sampling density in both our genome-based and 5,284-species tree and not to differences in software or choice of priors in fossil-based molecular clock calibrations. Various taxon and fossil sampling schemes, tree (5,284-taxon versus the genome-based phylogeny) and software choice had little impact on inferred ages (Supplementary Data 2, Supplementary Note 4), providing a robust set of age estimates for Agaricomycetes.
We analyzed global rates of speciation and extinction in a macroevolutionary framework using BAMM20. We accounted for incomplete and unequal sampling by using cladespecific sampling fractions and by ensuring that each genus is sampled in proportion to the number of its described species. Analyses of diversification rates in ten time-calibrated phylogenies provided evidence for rate variation through time and across lineages. Diversification rates increased abruptly at ~180 Myr ago in the early Jurassic (Fig. 2), coinciding with the breakup of Pangea, the onset of a warm, humid climate and the spread of (sub)tropical gymnosperm forests. Consistent with this, ancestral state reconstructions suggest that the last common ancestors of most orders were likely associated with gymnosperms as hosts (Fig. 1, Fig. 3/a, Supplementary Data 3, Supplementary Note 5), followed by multiple shifts to and diversification in association with angiosperms as hosts during the Cretaceous, similar to patterns observed in a radiation of herbivorous beetles21. This agrees with observations of fungus-plant co-evolution at smaller scales10,22, frequent host switches during fungal evolution23 and might reflect the dependence of fungi on plants as nutrient sources or mutualistic partners.
Acceleration of speciation is observed across all larger orders (Supplementary Data 4), suggestive of an external underlying cause, such as climate change or nutrient availability, instead of clade-specific events. We find that the rise in speciation rates in the Jurassic is coupled with a slightly delayed increase in extinction rates ~150 mya (Fig. 2). Although estimating extinction rates from phylogenies is notoriously difficult24-26, this signal is also detected in an independent analysis using an episodic speciation/extinction model27 under a variety of conditions (Supplementary Note 6, Supplementary Fig. 6, Supplementary Table  3,4). Together, these data suggest that fungi experienced severe extinction in a period that falls close to a mass extinction event at the Jurassic/Cretaceous boundary28, an era with a fluctuating edaphic conditions, differential extinction of several metazoan groups, and a relatively stable flora. On the other hand, we find no evidence for increased probability of extinction around the Cretaceous-Tertiary (K-T) boundary, a period of severe mass extinction in plant and animal lineages. It has been hypothesized that the massive deforestation after the bolide impact has buffered fungi through the period29, which is supported by a peak in fungal spore and hyphae fossilization right after the cataclysm30 and would explain the lack of mass extinction signal in our data.
The latitudinal diversity gradient (LDG) hypothesis posits that species richness increases from the poles towards the tropics. To test if this general prediction is valid for Agaricomycetes, we analyzed latitude-dependent diversification rates across all 4,429 species for which sufficient geographic data could be obtained (see Methods). Agaricomycetes display their highest species diversity and net diversification rates in the temperate zone (Supplementary Fig. 7,8, Supplementary Table 5,6, Supplementary Note 7), which contrasts with the diversity gradient from the poles towards the tropics in animals and plants31. Although such a pattern could arise as a result of more concentrated efforts of recording fungi in the temperate zone, we find that the zone with highest speciation rate (between 20° and 40° in the temperate zone) does not coincide with the zone of the highest sampling density (30°-60°) in our data. This suggests that our inference of a temperate peak in diversification rates is largely independent from sampling density. A reversed latitudinal diversity gradient has already been implied for major fungal groups by several surveys of environmental sequences32-34 and phylogenetic analyses22,35, in particular for ectomycorrhizal fungi. Although such signals might also be sensitive to biased taxonomic efforts across ecoregions, these data suggest that Agaricomycetes might not conform to broad macroecological patterns that dominate in animal and plant lineages.
To obtain a more resolved picture on diversification histories, we analyzed rate variation across clades. We identified 85 significant shifts in diversification rate (defined as shifts present in ≥50% of trees with marginal odds ratio ≥5) that included clades of diverse ages spread out across the Agaricomycetes (Fig. 1 Note 8). The largest of these comprises the Agaricomycetidae, consistent with results of rate through time and lineages through time analyses, which revealed an acceleration of net diversification rates in the Jurassic (Fig. 2). Order-level shifts to higher diversification rates were inferred in the last common ancestor of the Auriculariales, that of the Cantharellales, the Hymenochaetales (excluding the Repetobasidiaceae), the Russulales and the Agaricales (excluding the hygrophoroid and pleurotoid clades (sensu Matheny15) (Fig. 1). Most of the shifts (67%) were found in the Agaricales and predominantly in the Paleogene, consistent with a dynamic evolutionary history of the order but also with its largest share of diversity in the class. We found accelerated diversification in the genera Coprinellus and Laccaria, as reported previously6,7, but also in Cortinarius, Amanita and Lepiota, among others (for the complete list of shifts see Supplementary Data 5). Some of these represent the most taxonomically diverse agaric genera, such as Cortinarius with >2,00016 species described. Accelerated diversification in these groups is consistent with patterns seen in adaptive radiations36,37. Indeed, many clades show a diversification slowdown through time, a key sign of adaptive radiations36 (Supplementary Data 6), suggesting that many of the detected shifts may feature rapid adaptive filling of available niches. Among the 85 detected shifts in diversification rates, there were several that denoted old clades with low diversity, in which the shift represents a diversification slowdown rather than acceleration. For example, the genus Typhula, comprising species with minute club-shaped fruiting bodies, is an ancient group (dated at ca. 76 myr) with a low diversity and an inferred diversification slowdown along its stem branch. Such parameters may indicate clades of relict lineages rooting deeply in Agaricales history but having only a few extant representatives. Taken together, these analyses provide a global view on bursts and slowdowns in the diversification history of mushroom-forming fungi and should facilitate a better understanding of adaptive diversification events in fungi, of which only a single example has been explicitly demonstrated6.
We next asked how morphological innovations may have contributed to shape the extant diversity of Agaricomycetes. Because fruiting body complexity is an important adaptive trait13,14, we mapped the evolution of fruiting body morphologies on the phylogeny and examined its impact on diversification rates. We performed ancestral state reconstructions and summarized inferred ancestral states through time across ten chronograms ( Fig. 3/b). In line with previous studies13,14, our reconstructions suggest ancestors with resupinate (crustlike) fruiting bodies for most early nodes along the backbone of the Agaricomycetes. The early dominance of resupinate morphologies was rapidly taken over by an array of more complex morphologies in the late Jurassic (~170-180 myr), of which the pileate-stipitate type appears most dominant. This pattern could have been driven by the radiation of Agaricomycetidae, which contains the majority of pileate-stipitate species and started to diversify around this period, although pileate-stipitate species evolved in 12 orders as well, including the Russulales, Cantharellales and Gloeophyllales, among others. In the meanwhile, resupinate lineages may have simply continued to diversify at their previous rate, but, because of the radiation of species with pileate-stipitate fruiting bodies their dominance was diminished at the scale of the entire class. Consistent with this view, gains of the pileate-stipitate morphology are estimated to be significantly more likely (7-50 times, logBayes-factor, logBF = 47) than their loss, indicating a trend towards pileate-stipitate morphologies, as reported earlier13. In fact, the pileate-stipitate morphology evolved convergently in at least 33 clades, suggesting it represents a stable attractor in the morphospace. Nevertheless, the probability of the loss of pileate-stipitate morphology is significantly larger than zero (logBF = 157.2), reflecting evolutionary transformations of pileate-stipitate fruiting bodies happening in many groups ( Fig. 3 . This may be because the pileate-stipitate morphology provides support and protection for developing spores (e.g. from precipitation) and enhances their dispersal efficiency by raising sporogenous tissue above ground-level; these advantages may explain how pileate-stipitate fruiting bodies can promote diversification and suggest they represent a key innovation for mushroom-forming fungi.

Conclusions
This study represents a global analysis of the dynamic evolutionary history of mushroomforming fungi. The diversity of Agaricomycetes might have been shaped by both class-wide diversification rate shifts and explosive clade-specific events. We identified a major classwide radiation of Agaricomycetes in the Jurassic, which possibly followed a warming and humid climate in this period. The onset of this radiation coincided with the expansion of angiosperms and followed patterns of gymno-and later angiosperm expansion during the Mesozoic-Cenozoic radiation40. Our analyses provide evidence for several clade-specific adaptive radiations in the class, providing a possible explanation for why some genera Europe PMC Funders Author Manuscripts contain a single while others thousands of species. Our data display strong signal for a mass extinction event coinciding with increased extinction rates at the Jurassic/Cretaceous boundary, however, no such evidence was found in the Agaricomycetes for the much more influential K-T mass extinction event. The evolution of pileate-stipitate fruiting bodies, i.e., morphologies with cap and stipe might be a key driver of evolutionary radiations, suggesting that the convergent evolution of the classic mushroom morphology contributed to shaping extant diversity in the Agaricomycetes. We predict that further data on traits and ecological opportunities that generated the spectacular diversity of mushroom-forming fungi will help to elucidate the general principles of mushroom evolution and integrate this ecologically important group into the record of major biodiversification events.

Methods 1 Molecular biology methods
Genomic DNA was extracted from 0.1 -10 mg of dried herbarium specimens using the DNeasy Plant Mini kit (Qiagen) or the Nucleospin Plant II mini kit (Macherey-Nagel) following the manufacturers' instructions. PCR amplification was performed as described previously6,15 on the large subunit of (LSU) of the nuclear ribosomal RNA with the LR7 and LROR primers. Sequencing was performed commercially using the same primers as used for PCR. Sanger reads were assembled to a contig using the Pregap and Gap4 programs of the Staden package43 and checked manually against the electropherograms.

Taxon sampling
We sampled herbarium specimens so that taxon sampling be proportional across clades and geographic regions. To that end, we initially obtained estimates of the known diversity for each of the genera from Species Fungorum44 (www.speciesfungorum.org, accessed 8.24.2015), the Dictionary of the Fungi16 and from Funga Nordica45. Taxonomic information from these sources was compiled for all genera of Agaricomycetes. Using this compilation, we calculated the number of target species that needed to be sampled for each genus to obtain a final dataset of ~5,000 species (~30% sampling of total diversity) in which sampling of species in each genus is proportional to their described diversity. Although the three sources differed in their completeness, geographic scope (Dictionary of the Fungi and Species Fungorum being global databases and Funga Nordica being taxonomically most stringent of the three) and taxonomic concept, it gave us general guidance on how to keep taxon sampling proportional across genera. While sampling species within genera, attempts have been made to obtain specimens from undersampled geographic regions, in particular Africa, Asia, Australasia and South America.

Sequence alignment
Multiple sequence alignment was carried out for the LSU, ef-1a and RPB2 loci separately using the Probabilistic Alignment Kit (PRANK release 140603)46,47. An iterative alignment refinement strategy as described in Tóth et al.48 was employed: ML gene trees computed from preliminary alignments (using RAxML, see below) were used as guide trees for the next round of multiple alignment for PRANK. After three rounds of iterative refinement, the alignments were further corrected manually using a text editor. Manual curation was restricted to correcting homologous regions erroneously juxtaposed by PRANK. Alignments of individual sequences were concatenated into a superalignment. The concatenated alignment is available at Dryad (Accession Number XXXX to be provided upon publication), GenBank accession numbers for all sequences used are shown in Supplementary Data 1.

Genome and transcriptome sequencing and assembly
4.1 Sequencing-The genomes of Coprinopsis marcescibilis, Crucibulum leave, Dendrothele bispora, Heliocybe sulcata, Peniophora sp., Pluteus cervinus, Polyporus arcularius and Pterula gracilis were sequenced using a combination of fragment and longmate pair Illumina libraries. In addition, the genome of Coprinellus micaceus was sequenced using Pacific Biosciences platform. Fragment Illumina genomic libraries were produced for 100 ng of DNA was sheared to 270 bp using the Covaris LE220 and size selected using SPRI beads (Beckman Coulter). The fragments were treated with end-repair, A-tailing, and ligation of Illumina compatible adapters (IDT, Inc) using the KAPA-Illumina library creation kit (KAPA biosystems). For long-mate pair libraries (Crucibulum laeve, Dendrothele bispora, Heliocybe sulcata, Polyporus arcularius and Pterula gracilis), 5 μg of DNA was sheared using the Covaris g-TUBE(TM) and gel size selected for 4 kb. The sheared DNA was treated with end repair and ligated with biotinylated adapters containing loxP. The adapter ligated DNA fragments were circularized via recombination by a Cre excision reaction (NEB). The circularized DNA templates were then randomly sheared using the Covaris LE220 (Covaris). The sheared fragments were treated with end repair and Atailing using the KAPA-Illumina library creation kit (KAPA biosystems) followed by immobilization of mate pair fragments on strepavidin beads (Invitrogen). Illumina compatible adapters (IDT, Inc) were ligated to the mate pair fragments and 8 cycles or in case of Heliocybe sulcata 10 cycles of PCR were used to enrich for the final library (KAPA Biosystems). For Coprinellus micaceus unamplified libraries were generated using Pacific Biosciences standard template preparation protocol. 5 μg of gDNA was used to generate each library and the DNA was sheared using Covaris g-Tubes(TM) to generate sheared fragments of >10kb in length. The sheared DNA fragments were then prepared using Pacific Biosciences SMRTbell template preparation kit, where the fragments were treated with DNA damage repair, had their ends repaired so that they were blunt-ended, and 5' phosphorylated. Pacific Biosciences hairpin adapters were then ligated to the fragments to create the SMRTbell template for sequencing. The SMRTbell templates were then purified using exonuclease treatments and size-selected using AMPure PB beads.
All transcriptomes in this study were sequenced using Illumina RNA-Seq. Stranded cDNA libraries were generated using the Illumina Truseq Stranded RNA LT kit. mRNA was purified from 1 μg of total RNA using magnetic beads containing poly-T oligos. mRNA was fragmented and reversed transcribed using random hexamers and SSII (Invitrogen) followed by second strand synthesis. The fragmented cDNA was treated with end-pair, A-tailing, adapter ligation, and 10 cycles or in case of Coprinopsis marcescibilis, Crucibulum leave and Pluteus cervinus 8 cycles of PCR. We performed all-versus-all blast using mpiBLAST 1.6.054 with default parameters, then identified gene families using the Markov clustering algorithm MCL 14-13755 with an inflation parameter of 2.0. We first identified gene families that contained 52-208 genes (50-200% of the species included in the study), then performed multiple sequence alignment using PRANK 14060347. Ambiguously aligned regions were removed from the alignments using Gblocks 091b56 with the settings -p=yes -b2=26 -b3=10 -b4=5 -b5=h -t=p -e=.gbl. Next, we screened for gene families that contained a single representative gene for each species, or ones that contained inparalogs but no deep paralogs. Deep paralogs were identified following Nagy et al. 57 Europe PMC Funders Author Manuscripts each species. Gene families in which >=75% of the species were represented were concatenated into a supermatrix.
We used RAxML 8.1.2 to perform ML analysis and bootstrapping on the concatenated dataset, under a WAG model with gamma-distributed rate heterogeneity partitioned by gene. We ran 100 bootstrap replicates using the rapid hill-climbing algorithm. Because in initial analyses Peniophora sp., Tubaria furfuracea, Clavaria fumosa were placed on long terminal branches, we performed the following steps to clean the alignments of overly divergent sequences (either due to sequencing errors or pseudogenes). We screened all individual gene trees and excluded proteins with a terminal branch length >2.75 times longer than the average branch length of the gene tree. Altogether 1,294 (in 542 gene families) protein sequences were excluded from the analyses.

Maximum Likelihood analyses of the 5,284-taxon dataset
Maximum likelihood trees for the 5,284-taxon dataset were inferred using the parallel version of RAxML 8.1.258 under the GTR model with gamma distributed rate heterogeneity (4 categories) with three partitions corresponding to the LSU, ef1-α and rpb2 loci. The phylogenomic tree was used as a backbone monophyly constraint. We performed 245 ML inferences and tested whether these trees adequately represented the plausible set of topologies given the alignment. This was done to ensure that phylogenetic uncertainty is properly taken into account in subsequent comparative analyses. If our tree set contains all plausible topologies, then the rolling average of pairwise Robinson-Foulds (RF) distances should show a saturation as a function of increasing the number of trees. To this end, we computed RF distance for each pair of trees for incrementally larger numbers of trees using R package "phangorn" v.2.0.259. We then plotted the rolling average, maximum and minimum values as a function of the number of trees in R.

Molecular clock dating of the 5,284-taxon dataset
Due to computational limitations, we focused on 10 ML trees that maximize topological diversity (as inferred based on RF distances) as representative trees for molecular clock analysis. These were chosen by first calculating pairwise RF distances between trees, then hierarchically clustering trees using Ward's clustering method (as implemented in the hclust R function). The ten ML trees for subsequent analyses were chosen evenly from the resulting clusters of trees. To overcome the computational limitations of inferring accurate chronograms for thousand-tip phylogenies, we adopted a two-step strategy. First, PhyloBayes 4.1b60 was used to infer parameters of a birth-death model and divergence times of trees on 10% subsampled versions of the 5,284-taxon dataset. Subsampled datasets were obtained by randomly deleting 90% of the species, but forcing key descendants of fossil-calibrated nodes to be retained in the subsampled alignments. Then we used the ages and parameters estimated in PhyloBayes as input parameters for FastDate development version61 (provided by Thomas Flouri, downloaded Jan 5 2017) for analyzing the complete 5,284-species dataset.
PhyloBayes analyses were run using the 10% subsampled dataset, a birth-death prior on divergence times, an uncorrelated gamma multiplier relaxed clock model and a CAT-poisson substitution model with a gamma distribution on the rate across sites. A uniformly distributed prior was applied to fossil calibration times. All analyses were run until convergence, typically 15,000 cycles. Convergence of chains was assessed by visually inspecting the likelihood values of the trees and the tree height parameter. We sampled every tree from the posterior and after discarding the first 7,000 samples as burn-in we summarized the posterior estimates using the readdiv function of PhyloBayes.
Next, we used FastDate, a program that implements the speed dating algorithm described in Akerborg et al. 61. FastDate was run on the complete trees (5,284 species) with the node ages constrained to the values of the 95% highest posterior densities of the ages inferred by PhyloBayes. FastDate analyses were run with time discretized into 1,000 intervals and the ratio of sampled extant individuals set to 0.14.

Fossil calibrations
The extensive sampling density of our tree allowed us to use more fossil calibration points than any study before and to place them more precisely within the tree. Eight fossils were used as calibration points (Supplementary Table 2) for the crown node of the specified taxa.
We excluded a number of prospective well-preserved fossils for reasons of redundantly calibrating an already calibrated node (Appianoporites vancouverensis -Hymenochaetales62, Protomycena electra -marasmioid clade63, Cyathus dominicanus -Nidulariaceae64, Fomes idahoensis -Polyporales65. Geastrum tepexensis 66 was not considered as it could be assigned to either of the two earth-star clades, one in the Phallomycetidae (Geastraceae) or the other in the Boletales (Astraeus). For other mushroom fossils, the taxonomic affiliations were deemed too uncertain.
To investigate if there is any conflict between the fossil calibration points we conducted a fossil cross-validation analysis following Near et al. 67. For this, we ran PhyloBayes analyses on one of the 10% subsampled trees with the same settings as mentioned above, using a single fossil calibration point at a time. This resulted in eight independent molecular dating analyses. To quantify conflict between single fossil calibrations, first we calculated the sum of the square differences between molecular age estimates (MA) and fossil ages (FA): Next, the analysis with the highest SS value was removed and s was recalculated. We continued this process until only two analyses remained. In parallel, we performed onetailed F-tests to check if the removal of the fossil had a significant effect on the variance of s. The principle of this procedure is that during the stepwise removal of fossils s should decrease by small constant values and variance should not decrease significantly.

Genome-based molecular clock analyses
We examined the robustness of our molecular age estimates using phylogenomic methods as an alternative to the 5,284 species phylogeny. Specifically, we tested whether the differences between our and previous17,18 molecular clock estimates for Agaricomycete orders were attributable to differences in the dataset or analytical method used or a difference in how precisely fossils could be placed on the tree. To this end, we performed a series of molecular clock analyses on our phylogenomic dataset that resembled that of Kohler and Floudas in terms of topology and taxon sampling density, but differed in the placement of some fossil calibrations. A key difference was that our phylogenomic tree included the most recent common ancestor (mrca) of the Hymenochaetales and that of the Suillaceae, which allowed us to calibrate the MRCA of these clades, as opposed to their stem nodes by Kohler   ). In all analyses we constrained the age of the root to be between 300 mya and 600 mya.

Penalized likelihood analysis in r8s-
We ran a series of molecular clock analyses in r8s v.1.8170. A cross-validation analysis was performed to determine the optimal smoothing parameter (λ) by testing values across 7 orders of magnitude starting from 10 -3 . The additive penalty function was applied and the optimization was run 25 times starting from independent starting points. In one optimization step, after reaching an initial solution, the solution was perturbed and the truncated Newton (TN) optimization was rerun 20 times. We compared the results of previous studies to that of analyses across seven ancestral nodes in Agaricomycotina (Supplementary Data 2).

Bayesian molecular clock dating-We used the mcmctree method implemented
in PAML version 4.8a71. The independent-rates clock model, a WAG substitution model and approximate likelihood calculation72 were used. The birth rate, the death rate and the sampling fraction of the birth-death process were set to 1, 1 and 0.14 respectively. The shape and the concentration parameter of the gamma-Dirichlet prior for the drift rate coefficient (σ2) was set to 1 and three different scale parameters were tested (10, 100, 1,000) to see their effect on the time estimates. The substitution rates of each gene were estimated by codeml under a global clock model, to set the parameters of the gamma-Dirichlet prior for the overall rate. By calculating the mean substitution rate of the loci and examining the density plot of the rates we set up a prior which reasonably fitted the data: the shape parameter, the scale parameter and the concentration parameter were set to 5, 90.7441 and 1, respectively, resulting in an average substitution rate per site per time unit of 0.055. We set the time unit to 100 myr and applied uniform priors on 8 fossil calibrations with lower and upper hard bounds. MCMC (Markov chain Monte Carlo) analysis was run for 80,000 iterations, discarding the first 20,000 iterations as a burn-in and sampling every 30th tree from the posterior. After three independent analyses were run the convergence of loglikelihood values was visually inspected and the estimated ages were compared between replicates. We coded species' preference for substrate based on literature data (detailed listing of resources used is not given) as either gymnosperm, angiosperm, or both (i.e. generalist species). Species were considered gymno-or angiosperm specialists if >90% of records were for either plant group, or their habitat preferences in taxonomic literature were described by the terms 'exclusively', 'predominantly', 'mostly', or occurring on the least dominant substrate 'rarely', 'very rarely' or 'exceptionally'. On the other hand, species having records on both gymno-and angiosperm substrates, or described in the literature as generalists, or given multiple gymno-or angiosperm substrate plant species were coded as generalists. Species with insufficient data or no data at all were treated as missing data. Soilinhabiting saprotrophs were coded based on their association with either gymno-or angiosperms, if a clear preference was reported in the literature.

Global positioning system data:
We downloaded 5,884,445 fungal GPS records from the GBIF database, representing 4,429 Agaricomycetidae species. For 848 species, a location based on the province, country, and/or biogeographic realm74 was obtained from the literature. In cases where only the province, country or biogeographic realm was available, we took the centroid of the area as a proxy for GPS location. All GPS data were handled and processed in R75. Noise was added to identical GPS coordinates using the jitter function in R.

Analyses of geographic diversification:
State-dependent diversification rates were estimated in two ways: (1) using centroid latitudes as continuous traits (e.g. Sánchez-Ramírez et al. 35), and (2) using discrete areas. We used the data mentioned above to first calculate a centroid coordinate (RGEOS R package) in cases were multiple GPS records exist per species. Then we took the centroid latitude for each species. For singletons, we simply took the latitude of each record. We also used the GPS database to calculate a standard deviation for latitude per species. For species with a single record, we took the mean standard deviation of species with multiple records. We fitted multiple Quantitative

Europe PMC Funders Author Manuscripts
Europe PMC Funders Author Manuscripts State Speciation and Extinction (QuaSSE76) models, using a maximum-likelihood approach, to ten of the 5,284-species chronograms. Models included speciation rate changes as a constant function of the trait (no effect), as a linear function, or as a Gaussian function, keeping the extinction rate constant76. We also added two models in which the extinction rate varied as linear or Gaussian functions, while speciation rate remained constant. To improve the efficiency of the algorithm given the scale of the tree, we applied the following settings to the models: method="fftC", dt.max=1 and nx=256. Akaike information criterion (AIC) values were averaged across phylogenies and compared between different models. We used the best supported model to interpolate speciation rates into a 1 x 1 degree global map made with Natural Earth. Free vector and raster map data @ naturalearthdata.com. . For species with multiple GPS records, we extracted binary information about their presence in both areas. For records that were not found precisely within the geometry of the area, we took the area to which the distance to the GPS point was closer. To assign discrete states to each species with multiple records, our criterion was to assign a "widespread" state (both areas) if records for the area with the minor frequency was higher than 0.2. In any other case, we assigned "tropical" or "extra-tropical" states to taxa in the area with the highest frequency. For taxa that had only biogeographic realm data, we coded them as follows: "widespread": Neotropical, Afrotropical, Oceanian/Australia; "tropical": Panamenian, Oriental; and "extra-tropical": Europe/Palearctic/North Asia, Nearctic, Saharo-Arabian, Sino-Japanese. Using these three states for each taxon, we fitted multiple constrained models and one full model of Geographic State Speciation and Extinction (GeoSSE77) in a maximum-likelihood framework. The full model (a) consisted of seven parameters: (1) speciation rate in the tropics, (2) speciation rate in the extra-tropical zone, (3) the allopatric speciation rate (speciation rate in both tropical and extra-tropical regions), (4) extinction rate in the tropics, (5) extinction rate in the extra-tropical zone, and dispersal rates to (6) the tropics and to (7) the extra-tropical zone. The constrained models consisted of: (a) speciation in the tropics is equal to speciation in the extra-tropical zone; (b) extinction in the tropics is equal to extinction in the extra-tropical zone; (c) dispersal to the tropics occurs at the same rate as dispersal to the extra-tropical zone. In each case, a single parameter for speciation, extinction, and dispersal was respectively estimated, allowing the other parameters of the model to vary. To explore the parameter space better, we reestimated parameters using MCMC on the full model. In a similar way as above, we used AIC scores to rank the models.

Model tests:
To infer substitution models that best describe the evolution of agaricoid/non-agaricoid fruiting bodies we used ML and MCMC approaches implemented in BayesTraits 2.0 Linux 64 Quad Precision alternative build78 and the R package diversitree v.0.9-839. The Binary State Speciation and Extinction (BiSSE) model38 was Europe PMC Funders Author Manuscripts used in diversitree. We performed model tests by first constraining the forward and reverse transition rates between non-agaricoid (state 0) and agaricoid type (state 1) to be equal (q 01 = q 10 ). We also separately constrained each of the rates to be zero (q 01 = 0 or q 10 = 0). Each of the constrained models were compared to the best fit model based on log-likelihood values (ML analyses in BayesTraits), log Bayes factors (MCMC analyses in BayesTraits) or likelihood ratio test and Akaike information criterion scores (ML analyses of BiSSE model in diversitree)38,39,78,79. In BayesTraits, a difference of 2.00 log likelihood units (ML approach) or a difference of 10 log marginal likelihood units (MCMC approach) was considered as significant support for a model over another, while in the BiSSE analyses significance was assessed using likelihood ratio tests (p < 0.05).

BayesTraits analysis:
BayesTraits analyses (ML and MCMC) were performed on 245 phylogenetic trees using the MultiState module of the program. Before the final MCMC analyses, we tried several prior distributions (uniform, exponential, gamma and the hyperprior versions of these) with different settings. Based on preliminary analyses we found that the gamma distribution was most optimal, therefore in further analyses we used a gamma hyper-prior with different prior distributions for each parameter (Supplementary Note 5). All preliminary BayesTraits analyses were conducted with the following settings: 1,010,000 generations, 10,000 generations as burn-in and sampling every 500 th generation. We forced Markov chains to spend 200,000 generations on each tree using the equaltree option, with 100,000 generations as burn-in and sampling every 500 th generation. The marginal likelihood was estimated by the stepping stone method78,80 using 50 stones with a chain length of 5,000. All analyses in BayesTraits were repeated three times to check the congruence of independent runs. MCMC was performed using an exponential prior, defined as 1/(2r), where r is the character independent diversification rate. State specific sampling fractions were defined based on data from Species Fungorum44 (see below: Accounting for non-random incomplete taxon sampling). We first optimized the MCMC sampler's step size argument by running 100 generations, then we ran MCMC analyses for 20,000 generations with burn-in set to 10%. The convergence of the chains was visually checked based on likelihood and parameter values. Ancestral probability distributions of substrate preference for the mrca-s of each of the orders and some additional clades were plotted as pie charts using the nodelabels function of ape v.4.1.84.

Diversification rate analyses
10.5.1 Accounting for random and incomplete taxon sampling: To meet the assumption of random or complete sampling of the BAMM and BiSSE models20,39, we specified the sampling fraction of each genus in accordance with the described number of species based on Species Fungorum44. To gather information on described species, we screened all orders of the Agaricomycetes, Dacrymycetes and Tremellomycetes in Species Fungorum and gathered all species with a custom java program (available from the authors upon request). We took into account taxonomic and nomenclatural synonymy if indicated by Species Fungorum. We accounted for incomplete taxon sampling by two strategies. First, we assigned specific sampling fractions to the character states (BiSSE model) or to genera (BAMM model). In these cases we used the built-in correction of the BiSSE and BAMM models20,85 to account for missing species in our phylogeny. Second, because we could have unintentionally oversampled certain genera, (e.g. because of better availability of specimens), we statistically tested for oversampling and generated a pruned phylogeny in which each genus is represented in proportion of its described diversity. We adjusted taxon sampling in our phylogeny by iteratively deleting species from genera that were oversampled relative to the mean sampling fraction of the tree, until sampling fractions of each genera corresponded their known size as judged by a comparison to Species Fungorum. To do this we performed a hypergeometric test (p<0.05) at each iteration of species removal. Elimination of species was stopped when oversampling disappeared (p>0.05, hypergeometric test). With this procedure we wanted to produce a 'skeletal tree' (Sensu FitzJohn et al. 85), where species were both evenly and randomly sampled from every genera85.

Trait-dependent diversification rate analyses:
We performed analyses under the BiSSE model to examine the effect of agaricoid fruiting body type on diversification rate. BISSE analyses were carried out in diversitree v.09-10 as described above, with the following differences. We performed model tests to examine if the presence of a cap influenced speciation and extinction rates. To this end, we first constrained state-specific speciation or extinction rates to be equal (λ 0 = λ 1 or μ 0 = μ 1 ) then constrained both the speciation rates and extinction rates to be equal (λ 0 = λ 1 and μ 0 = μ 1 ). Models were compared by likelihood ratio test (LRT) in R.

Trait-independent diversification rate analyses:
We used BAMM 2.5.0.
(Bayesian Analysis of Macroevolutionary Mixtures)20, to examine rate heterogeneity across lineages and detect shifts in diversification rates. We analyzed 10 chronograms and ran MCMC analyses for 100 million generations using four independent chains per analysis with 50 million generations as burn-in. Prior parameters were optimized using the setBAMMpriors function in BAMMtools 2.1.6.86, except for the prior on the expected number of shifts, which was set to 270 based on preliminary runs. We accounted for incomplete taxon sampling as described above. We checked the convergence of chains by visually inspecting the convergence of likelihoods, by calculating the effective sample size (ESS) and the Geweke's diagnostic87 of log-likelihoods, numbers of shifts and evolutionary rate parameters, using functions in CODA 0.19-188. Geweke's diagnostic tests for the quasi equality of the means of parameters sampled at different parts of the MCMC analysis; if means at the beginning and end regions of the post-burn-in MCMC sample do not differ to a statistically significant extent (P>0.05), then the Markov chain is considered to be in stationary phase. To ensure that a shift is highly supported by the data and the prior had negligible contribution, we examined only core shifts, i.e. those with a prior-to-posterior marginal odds ratio exceeding 586.
We compared core shifts across 10 chronograms to obtain a consensus view on shifts that can be detected in all or most of the trees. To this end, we first empirically identified taxonomically similar clades for which a core shift was inferred, taking into account topological differences among trees. Then we noted whether the mean diversification rate change after the shift is positive or negative (i.e. rate acceleration or deceleration). There were cases when several core shifts were inferred on adjacent branches around the mrca of a given clade. These come from distinct shift configurations sampled during the MCMC analysis and correspond to the same signal of rate variation in the data but were located to adjacent branches of the tree. In such cases, we chose the shift with the highest posterior probability, noting that the biological reality of increasing rates might be spread out across a few adjacent branches around the one with the highest posterior. Finally, shifts were referred to as congruent core shifts if they were highly congruent across >=4 trees and had strong tree support (mean posterior probability > 0.5).
Next, we assessed whether posterior probabilities of congruent core shifts reached convergence in a fashion similar to the 'cumulative' function of AWTY89. For this, we calculated the posterior probability of each core shift at each generation, as if the analysis was stopped at that point and plotted posterior probabilities as a function of generation using the ggplot2 v.2.2.1 package83. We also took into account the distinct shift configurations by examining whether a rate increase, after a congruent core shift was explained by a rate decrease after another core shift. First, we determined congruent core shift -core shift pairs which could potentially be mutually exclusive to each other. These shift pairs were determined in one tree, usually in the tree where a congruent core shift had high posterior probability. Then we calculated two kinds of posterior proportions: one for co-occurring shift pairs and one for each of the single occurrence of the shifts. We said that two shifts were mutually exclusive to each other if only a negligible co-occurrence was presented and the direction of the diversification rate change was different between the single occurrence posterior samples.
To reveal tree-wide evolutionary patterns we calculated average net rates through time using the getRateThroughTimeMatrix function and plotted by the plotRateThroughTime function of the BAMM package.

Validation of BAMM estimates:
There have been critics of the BAMM method recently90,91. Meyer and Wiens91 found that the method-of-moments estimator (MS) yielded stronger relationship between true and estimated diversification rates than BAMM, particularly in smaller clades. Therefore to validate the BAMM results we used the methodof-moments estimator (MS) on clades with congruent core shifts. The MS approach estimates diversification rates from species richness and clade age while it can incorporate incomplete taxon sampling and extinction rate by assuming a relative extinction fraction (ε = extinction rate / speciation rate)92. We used the bd.ms function in geiger 2.0.6.93. The analyses were performed on all the 85 clades with congruent core shifts, using three different relative extinction fractions (0, 0. 45,9) and calculating with both stem and crown ages. We calculated unsampled species fractions using the genus-specific sampling fractions we determined for BAMM analyses. For the same set of clades we calculated average net diversification and net speciation rates from BAMM data using the getCladeRates function.
Then we performed linear regressions between MS estimates with different settings and average net diversification or net speciation rates from BAMM. We examined the adjusted R-squared and the p-values of the models to evaluate the correspondence between the two methods.

Detecting Mass Extinction Events-
We performed analyses using the compound Poisson process (CPP) on mass extinction times model (CoMET)27,94, to examine tree-wide variation in diversification rate and occurrence of mass extinctions during the evolution of mushroom-forming fungi. First, we conducted a model comparison on ten chronograms using the CoMET model with a constant rate birth-death process implemented in the TESS 2.1.0. R package95. We compared models with and without mass extinction events based on marginal likelihoods. We allowed the occurrence of a mass extinction event along the entire time span of a tree and we set the survival probability of species to 0.1 (corresponding to 10% of the species surviving a mass extinction event). The overall sampling fraction of species was set to 0.14. MCMC analyses were run for 20,000 generations and the first 2,000 posterior samples were discarded as a burn-in. Marginal likelihoods were estimated using stepping stone simulation using 100 stepping stones.
Then, we performed reversible jump Markov Chain Monte Carlo (rjMCMC) analyses under the CoMET model to sample from the space of episodically varying birth-death processes with mass-extinction events. In this analysis, we also tested the significance of the occurrence of a mass extinction event by performing model tests within time intervals.
Bayes factor values BF > 10 or lnBF > 6 were considered strong support for a model, following Höhna et al. 96 and Kass & Raftery97. We examined the sensitivity of the posterior probabilities to different prior settings in preliminary analyses of a randomly chosen tree. We examined models with 2 or 10 expected mass extinction events, with 30, 100 or 270 expected rate changes and with survival probabilities of 0.001, 0.05 or 0.3. All of these preliminary analyses were run for 1.6 million generations with 100,000 generations as burn-in. We used a log-normal prior on speciation and extinction rates with a mean of 0.2 and 0.15 and a standard deviation of 0.5 and 0.5, respectively. In the final analyses we used all 10 chronograms and an empirical hyperprior on rate parameters based on 200,000 iterations with 100,000 burn-in. The priors on the number of expected mass extinction and the expected rate changes were set to 2 and 30 respectively, based on results of preliminary analyses. We set the survival probability to 0.05. Analyses were run for 3 million generations with 1 million generations as burn-in. The convergence of the analyses was checked by visually inspecting the log likelihood values and by computing the effective sample size and the Geweke diagnostic for the log likelihoods, the number of speciation rate shifts, the number of extinction rate shifts and the number of mass extinction events using the effectiveSize and the geweke.diag function of the CODA 0.19.-1 package88, respectively. To check the convergence of interval-specific parameters we used the tess.plot.singlechain.diagnostics function of the TESS package. Phylogenetic relationships and diversification across 5,284 mushroom-forming fungi. One of the 245 analyzed maximum likelihood trees was randomly chosen and visualized. Trees were inferred from nrLSU, rpb2, ef1-a sequences with a phylogenomic backbone constraint of deep nodes. Branches are colored by net diversification (speciation minus extinction) rate inferred in BAMM. Warmer colors denote a higher rate of diversification. Significant shifts in diversification rate are shown by triangles at nodes. Only shifts present on >50% of 10 analyzed trees, with a Bayesian posterior probability >0.5 and a posterior odds ratio >5 are shown. See Data 6 for detailed discussion of shifts. Reconstructed probabilities of ancestral plant hosts for order-level clades are shown as pie charts partitioned by the inferred ancestral

Europe PMC Funders Author Manuscripts
Europe PMC Funders Author Manuscripts probability for gymnosperm (green) and angiosperm host (black). Pie charts are given for the most recent common ancestors of each order plus backbone nodes within the Agaricales -for small orders see Supplementary Data 3. Inner and outer bars around the tree denote extant substrate preference (black -angiosperm, green -gymnosperm, grey -generalist) and the placement of species used for inferring the 650-gene phylogenomic backbone phylogeny. Geological time scale is indicated with gray/white concentric rings.