Computer vision models enable mixed linear modeling to predict arbuscular mycorrhizal fungal colonization using fungal morphology

Zhang, Shufan; Wu, Yue; Skaro, Michael; Cheong, Jia-Hwei; Bouffier-Landrum, Amanda; Torrres, Isaac; Guo, Yinping; Stupp, Lauren; Lincoln, Brooke; Prestel, Anna; Felt, Camryn; Spann, Sedona; Mandal, Abhyuday; Johnson, Nancy; Arnold, Jonathan

doi:10.1038/s41598-024-61181-5

Download PDF

Article
Open access
Published: 13 May 2024

Computer vision models enable mixed linear modeling to predict arbuscular mycorrhizal fungal colonization using fungal morphology

Shufan Zhang¹,
Yue Wu¹,
Michael Skaro¹,
Jia-Hwei Cheong²,
Amanda Bouffier-Landrum¹,
Isaac Torrres¹,
Yinping Guo²,
Lauren Stupp²,
Brooke Lincoln²,
Anna Prestel²,
Camryn Felt²,
Sedona Spann³,
Abhyuday Mandal⁴,
Nancy Johnson³ &
…
Jonathan Arnold²

Scientific Reports volume 14, Article number: 10866 (2024) Cite this article

1 Altmetric
Metrics details

Subjects

Abstract

The presence of Arbuscular Mycorrhizal Fungi (AMF) in vascular land plant roots is one of the most ancient of symbioses supporting nitrogen and phosphorus exchange for photosynthetically derived carbon. Here we provide a multi-scale modeling approach to predict AMF colonization of a worldwide crop from a Recombinant Inbred Line (RIL) population derived from Sorghum bicolor and S. propinquum. The high-throughput phenotyping methods of fungal structures here rely on a Mask Region-based Convolutional Neural Network (Mask R-CNN) in computer vision for pixel-wise fungal structure segmentations and mixed linear models to explore the relations of AMF colonization, root niche, and fungal structure allocation. Models proposed capture over 95% of the variation in AMF colonization as a function of root niche and relative abundance of fungal structures in each plant. Arbuscule allocation is a significant predictor of AMF colonization among sibling plants. Arbuscules and extraradical hyphae implicated in nutrient exchange predict highest AMF colonization in the top root section. Our work demonstrates that deep learning can be used by the community for the high-throughput phenotyping of AMF in plant roots. Mixed linear modeling provides a framework for testing hypotheses about AMF colonization phenotypes as a function of root niche and fungal structure allocations.

A spatial transcriptome map of the developing maize ear

Article 14 May 2024

Seedling root system adaptation to water availability during maize domestication and global expansion

Article 22 May 2024

Land conversion to agriculture induces taxonomic homogenization of soil microbial communities globally

Article Open access 29 April 2024

Introduction

Most vascular land plants have lived in symbiotic association with Arbuscular Mycorrhizal Fungi (AMF) for more than 400 million years¹. The plant provides carbon (C), and in return the AMF provide Nitrogen (N) and Phosphorus (P). This exchange of nutrients is central to tree diversity in forests worldwide², determination of CO₂ in the atmosphere³, and plant tolerance to drought, heat and pathogens^4,5,6. The development of the AMF symbiosis is initiated by a single fungal hypha contacting a neighboring host root^7,8. Insertion of the epidermal layer by the prepenetration apparatus⁹ is followed by intraradical hyphal growth. On reaching the inner cortex, branches arising from the intraradical hyphae could penetrate the cortical cell walls and form arbuscules known as the structure for nutrient transfer between symbionts¹⁰. Post-penetration development includes the differentiation of vesicles¹¹ and spores¹². Vesicles are nutrient storing structures for lipids and carbohydrates obtained from the plant host. AMF reproduce asexually using spores. The extensive extraradical hyphal network uptakes nitrogen, phosphorus, and other ions in soil that were otherwise inaccessible to the plant host. The various types of mycorrhizal structures differentiate from one continuum of fungal hyphae¹³ and can occur simultaneously in plant roots¹⁴. Seminal work has shown that hyphal length, as well as spore counts and density, can vary significantly among conspecific AMF isolates, and that this variation has been shown to be correlated with differences in plant growth¹⁵.

The internal development of the fungus is influenced by the plant genome^16,17,18. An AMF species shows different morphological growth patterns, Arum- vs Paris-types, depending on the species of the plant partner in the association^19,20,21. Large variation in AMF richness and abundance have been characterized in several plant populations, in an effort to identify symbiosis-associated genes^{16,17,22,23,24,25,26}. Plant mutants were generated for biological validation of symbiosis genes²⁷. DELLA proteins were revealed as master regulators that interact with the symbiosis signaling pathway, which provides a mechanism to integrate symbiosis with plant growth and development^28,29. For example, DELLA transcription and protein stabilization serves to restrain plant growth but to promote arbuscule development^30,31. Direct evidence from the greenhouse highlighted that the functioning of colonization depends not only on the plant genotype but also on the identity of AMF genera/species/isolates²³. The relative allocation to selfish versus non-selfish fungal structures³² also depends on the abiotic environmental conditions. Fertilization often reduces allocation to extraradical hyphae and arbuscules relative to other structures³². The genotypes of the organisms involved and the environmental conditions under which they interact determine the functioning of mycorrhizal association along the mutualistic-parasitic continuum^{33,34,35,36,37,38,39,40}. A better understanding of the factors is needed.

The AMF research community is limited by a lack of cost efficient and high-throughput imaging methods to quantitate the abundance of AMF hyphal structures in roots. In 1990, McGonigle et al. developed an unbiased approach for scoring AMF colonized root samples⁴¹. It is the gold standard until now, but it is laborious and demands skilled human scorers. Molecular quantification methods like AMF-specific phospholipid fatty acids (PLFA) approximate the amount of AM fungal biomass. DNA-based methods like quantitative real-time PCR (qPCR) allow quantification of specific AMF taxa in roots and soil. Amplicon sequencing allows the measurement of relative abundance of AMF taxa in root samples. A disadvantage of the PLFA- and DNA-based approaches is that they cannot measure colonization and morphology at the fungal structure level. Microscopy methods are synergistic by quantitating fungal structures and their morphology inside roots^41,42. Imaging, however, requires human scorers and the process is laborious and repetitive. Preparation and visual examination of 1000 AMF slides with 20–30 root segments per slides takes an experienced researcher 2 months to complete. A computer vision model could potentially carry out this task in a few hours.

Machine learning has been applied to fungal image classification even with limited training data in Neurospora crassa^43,44. A deep learning-based software, AMFinder, was developed to automate the process of quantifying AMF colonized root images⁴⁵. The examples demonstrated computer vision as a powerful tool for high-throughput AMF phenotyping. Further improvements remain to quantitate the allocation to AM fungal structures and their morphological phenotypes in the roots using a newly available instance segmentation method of computer vision model.

Instance segmentation using deep learning techniques, like Mask R-CNN⁴⁶, offers an opportunity for accurate and robust detection and per-pixel segmentation of different hyphal structures in root images. With the image analysis on the inferred segmentations, hyphal length/width, hyphal branching frequency, arbuscule length/width, vesicle size, spore size and other morphological traits can be automatically measured. These morphological traits can be correlated with various biological and physical processes of plants, such as photosynthesis, respiration, transpiration, and carbon and nutrient assimilation, which can be very useful for quantitative trait locus (QTL) mapping¹⁶ and Genome-wide Association Studies (GWAS)¹⁷ for symbiotic gene discovery.

Transfer learning is a technique that helps to transfer features learned from one dataset to another. The advancement of transfer learning benefits applications with limited annotated data. As of 2020, Mask R-CNN is one of the few deep learning architectures that can provide a generalist performance for instance image segmentation⁴⁷. Transfer learning-based application of Mask R-CNN have been adopted rapidly for imaging-based plant phenotyping in recent years⁴⁸.

We present a Mask R-CNN based image analysis method that provides the four previously unavailable advantages: (1) requires a minimal training data via transfer learning (2) achieves pixel level identification of multiple AM fungal structures via instance segmentation; (3) works on root samples colonized by a mixed populations of AMF in the field; (4) provides morphological measures on each category of AM fungal structure. We took the quantification and morphological measures from the image analysis to address fundamental questions about the AMF symbiosis: (1) can a mathematical model be developed to predict AMF colonization; (2) does the allocation to AM fungal structures vary between plants; (3) are there differences in the niche within the root system, where AMF structures are found?

To understand AMF symbiosis as part of largescale systems biology studies, we developed a deep learning-based image analysis method to automatically measure AMF colonization intensity and fungal structure morphologies. The mixed linear model was used to provide a framework for testing hypotheses about AMF colonization and the variation in these morphometric measures. The result is a direct connection between the fungal structures present in each root sample and fungal colonization of the roots. This connection will permit the exploration of how AMF affect plant health through allocation to their structures.

Results

Performance of Mask R-CNN on AMF image segmentation

Our Mask R-CNN model can segment AMF colonized root images with satisfying performance (Table 1). The training images and annotations were generated by human scorers using the McGonicle method⁵⁰ on a grid associated with the 192 root intersections per slide (see “Materials and methods”). The pretrained Mask R-CNN model on the COCO dataset was loaded and trained on an in-house dataset with the default augmentation including image random flip and resize and a 0.7 confidence score threshold (Model 1) showing higher performance on our in-house testing images with 25.9 mean average precision (mAP) and 47.5 mean average precision at intersection over union (IoU) threshold of 0.5 (AP50) across classes (see “Materials and methods”). For each class, the average precision (AP) captures both the precision (related to type I error) and recall (power = 1-type II error) for IoU from 0.5 to 0.95 with a 0.05 step interval. Example results recall (power = 1-type II error) for IoU from 0.5 to 0.95 with a 0.05 step interval. Example results presented the agreement between model prediction and the ground truth (Fig. 1). The Mask R-CNN excelled at segmenting sorghum root and spore with AP values larger than 40 (Table 1). Reasonable performance was achieved on arbuscule and vesicle with AP ranging from 20 to 30. The model struggled with predicting instances of intraradical and extraradical hyphae.

Table 1 Performance of the Mask R-CNN compares favorably with published performance on other image datasets. The training set 1 is made of 767 in-house images on the Georgia samples. Training set 2 contains additional images from the AMFinder dataset^45,49. The model performance is measured by average precision (AP) of each class, mean average precision (mAP) and mean average precision at intersection over union threshold of 0.5 (mAP50)⁴⁶.

Full size table

The confidence score distributions of high precision predictions of root, extraradical hypha, vesicle and spore were left-skewed, indicating high certainty on the assigned class labels (Fig. 2). Arbuscules had lower confidence scores in comparison to sorghum roots (Fig. 2) (P < 0.00003 Tukey Multiple Comparison applied to an ANOVA of angular transformation of confidence scores). The low AP value and the high confidence score of extraradical hypha suggest that the main challenge for the Mask R-CNN is the pixel level segmentation of extraradical hypha rather than instance classification. Given the low AP value and confidence score of intraradical hypha, this fungal structure was dropped from latter analyses. Difficulty in arbuscule classification could be driven by the observation that arbuscules present in both isolation and clusters in sorghum roots. The observed frequencies of fungal structures from Mask R-CNN predictions did not differ from the frequencies counted by human scorers on the testing images (p-value = 0.786 with Fisher’s exact test). As the segmentation model produced satisfying results, we chose the best one (Model 1) for inference on a much larger in-house dataset (24,391 images) where images were collected in similar experimental settings.

The pretrained model was also trained and tested with a combined dataset including the AMFinder dataset⁴⁵ to illustrate how the model will be iteratively improved as more data are collected. The AMFinder dataset was made up of images collected from lab grown plants with a single AMF species inoculum and had patterns different from our in-house dataset collected from plants grown in soil from the field in Georgia, USA. With learning rates 0.001 and the default augmentation, we obtained a mAP of 29.6 and mAP50 of 50.2 in the combined test set (see “Materials and methods”). This showed that our Mask R-CNN Model can be expanded and adapted to diverse conditions with different experimental settings as more representative training data are included.

We then tested whether adding the AMFinder images to the training dataset improved model performance on our in-house AMF colonized root images. The model performance was slightly decreased on the original test set. The best model trained on the combined image set had a mAP of 21.4 and mAP50 of 39.7. Whether adding new data with patterns different from the targeted conditions needs further testing with different training schedules and approaches.

From image segmentation to measures of fungal morphology

The best performing computer vision model on in-house images presented above was applied to over 20,000 images of 108 root samples from the top, middle, bottom root regions of 12 sibling sorghum plants to generate pixel-wise segmentations of the five hyphal structures. From the segmentation results, the average quantity and size of fungal structures were computed for each root sample. Paired correlation analyses of the fungal structure morphological traits were examined first to identify whether fungal structures tend to co-occur in sorghum roots.

Positive associations dominated the frequencies of fungal structures. Higher occurrence of extraradical hyphae was consistently associated with higher occurrence of arbuscules. Vesicles and spores were positively correlated in sizes and counts. Larger number of vesicles and spores in a sample were suggestive of smaller arbuscules (Fig. 3).

Using mixed linear models to predict total AMF abundance with fungal structure morphology

From the morphometric data of fungal structures, two measures of total AMF abundance were computed: the percentage root area occupied by AMF (percent colonization) and the density of AMF per root area (count density). Mixed linear model (MLM) analysis of the two phenotypes as response variables and the fungal structure morphology traits as predictors provides a means to test two hypotheses in the study of differential colonization by AMF. One major hypothesis is that AMF presents differential colonization between root sections. The second hypothesis is that allocation to AM fungal structures varies between plants. In this section, background on mixed linear modeling serves as an accessible introduction to how MLMs can be used to test these two hypotheses.

The rationale for mixed effects

Mixed Linear Models (MLMs), also known as multilevel or hierarchical models, feature fixed and random effects⁵¹ (Box 1). Experimental treatments are typically modeled as fixed effects. Individual observations are grouped by random factors. Random factors, therefore, constitute the grouping level. Fixed factors are estimated as the mean effect for a particular factor level. In contrast, if the primary interest lies in estimating between-group variances, variables are modelled as random effects. The estimated values of random factors are shrunk towards the population mean.

The choice of using mixed effects to model AMF colonization is motivated by the experimental design. Our data are inherently hierarchical. AMF colonization was quantified in each of the three root regions within each sorghum plant, and three replicates were taken per root region (Fig. 4). The nested layers are plant, root depth, and replicate. The spatial scales between root samples and sorghum plants are biologically nested⁵². Between-sample variance needs to be evaluated as a random factor. It would be wrong to treat root samples from the same sorghum plant as independent. The twelve sorghum plants are siblings randomly sampled from a RIL population. Between-plant variance is treated as a random factor as well. Root region and AMF structure level phenotypes are the fixed effects. The same model structure can be used to model the AMF count density.

Mixed linear model assumptions

In MLM, both root samples and sibling plants are assumed to be randomly sampled from a population of samples and a population of sorghum recombinant inbred lines, respectively⁵³. It is assumed that the sampling errors (ε_ij) and the plant level random effects (u_0i) are independent and that the random effects and sampling errors have a normal distribution of zero mean and distinct constant variances (Box 1). Another assumption is that the mean and variance of the response variable are not functionally dependent on each other across siblings. Transformation of the response variable is an effective method to remove the dependency. Choice of models are designed to address each of the three questions of the paper laid out at the end of the introduction.

The null model (Model 1)

The Null MLM Model does not include explanatory variables but only the mean overall AMF colonization and the plant level and sample level differences in colonization. The colonization by AMF of a root sample from a plant (y_ij) is equal to the mean colonization in the sorghum population (β₀) plus the plant level random difference from the population mean (u_0i) plus the sample level differences (ε_ij) (Box 1). This simple Null Model shows that MLM partitions the total variance in AMF colonization var(y_ij) into a variance between plants var(u_0i) and a variance between samples var(ε_ij) as shown from Box 1 (Model 1⁵⁴).

$$var\left({y}_{ij}\right)=var\left({u}_{0i}\right)+var\left({\epsilon }_{ij}\right)={\sigma }_{u}^{2}+{\sigma }_{\epsilon }^{2}$$

The between plant variance var(u_0i) was estimated to be 0.0099 and two times that of the between sample variance var(ε_ij) in Table 2. The proportion of the total percent colonization differences can be quantified at the plant level by computing the intraclass correlation (ICC)⁵⁴.

Table 2 Mixed linear models are well predicted for percent colonization by arbuscule count, arbuscule size, region, and plant. Proportional change in variance (PCV) is provided to measure the importance of random effects. Intraclass correlation (ICC) is used to implicate the variance between plants. Varied measures of fit are provided to assess model performance.

Full size table

$$ICC= \frac{var\left({u}_{0i}\right)}{var\left({u}_{0i}\right)+var\left({\epsilon }_{ij}\right)} =\frac{{\sigma }_{{u}_{0}}^{2}}{{\sigma }_{{u}_{0}}^{2}+{\sigma }_{\epsilon }^{2}}$$

In Table 2, the ICC of Null Model was 0.647, which implies that 64.7% of differences in total percent colonization of AMF is at the plant level and could be controlled by the plant genome⁵⁵. An alternative hypothesis would be that this clustering of variance at the plant level might be attributable to the different composition of AMF structures, and this composition could be defined by the plant genome. We will come back to testing the alternative hypothesis in Models 3, 4, and 5.

Random intercept model with experimental design variable (Model 2)

In Model 2, the Null Model (Model 1) is expanded by including the design variable, the root region where the root sample was obtained (Fig. 4), with fixed effect β₁. Root region is a discrete predictor with three levels. The goal of the model is to investigate if AMF percent colonization differs between root regions and to determine the extent to which variance at plant level may change after taking into account differences in colonization in root regions. Proportional change in variance (PCV) at different levels can be computed to evaluate the change using the following equation^56,57,

$${PCV}_{plant}=\frac{var\left({u}_{0i}\right)-va{r}^{\prime}\left({u}_{0i}\right)}{var\left({u}_{0i}\right)}$$

$${PCV}_{sample}=\frac{var\left({\epsilon }_{ij}\right)-va{r}^{\prime}\left({\epsilon }_{ij}\right)}{var\left({\epsilon }_{ij}\right)}$$

where var(u_0i) is the between plant variance in the Null Model and var’(u_0i) is the between plant variance in the new model. Comparing Model 2 to the Null Model 1, the PCV_plant was equal to − 0.0221, and PCV_sample was equal to 0.3648 in Table 2. We concluded that 36.48% of sample variance within plants in the null model is attributed to differences in root regions. By adjusting for the root regions where the sample was obtained, another 2.21% of the variance in percent colonization by AMF was accounted for by plant differences in Table 2.

Random intercept + fixed slope model with AMF structure predictors (Model 3, 4, 5)

As mentioned earlier under the Null Model, an alternative hypothesis to the clustering of total percent colonization of AMF at the plant level is that the plant level differences in AMF abundance can be attributable to the different composition of AMF structures in the plants, which could be controlled by the plant genome^55,58. To test the possibility, the three Models (3, 4, and 5) expand Model 2 by including arbuscule count and/or size as fixed effects determined to be significant by Lasso Regression⁵⁹. By comparing Models 3, 4, and 5 to the Null Model, the changes in plant level variance can be measured using ICCs and PCVs after adding different AMF structure predictors to the model (Table 2). The interpretation of ICCs and PCVs leads to a major conclusion regarding the total AMF percent colonization and the composition of fungal structures, which is discussed in depth in a later section.

The effect of average size of arbuscule (β₂) and the effect of arbuscule count (β₃) per sample are continuous variables that describe the association of the AMF structures with the total degree of root colonization by AMF (y_ij)s. A positive estimate of β₃, for example, indicates a positive linear relation between AMF colonization and average arbuscule size. A larger estimate of β₃ than β₄ means arbuscule size has a stronger effect on total AMF colonization per unit increase than arbuscule count. In all three models, the relations between fixed effects and overall AMF colonization are considered to be the same in all sorghum plants. In other words, the slopes are fixed with respect to plant.

Random intercept + random slope model with AMF structure predictors (Model 6)

In Model 6, the effect of arbuscule count (β₃) on overall AMF colonization may differ between sorghum plants. For examples, in some plants with high AMF colonization in the roots, arbuscules may be the dominant hyphal structure but not in other plants. In Model 6, the regression coefficient of AMF colonization on arbuscule count varies at the plant level to capture this differential effect. By comparing Model 6 to Model 5, it is possible to determine whether the assumption of varying magnitude of association of arbuscule count and percent colonization between plants holds.

The total variance in AMF colonization var(y_ij)is still made up of two parts, a variance between plants var(u_0i, u_1ix_1ij) and a variance between samples var(ε_ij). The variance between plants var(u_0i, u_1ix_1ij), however, partitions into a slope variance var(u_1ix_1ij), intercept variance var(u_0i) and their covariance cov(u_0i, u_1ix_1ij)⁶⁰. This variance is a quadratic function in arbuscule count:

$$\begin{aligned} var\left( {y_{ij} } \right) & = var\left( {u_{0i} ,u_{1i} x_{1ij} } \right) + var\left( {\varepsilon_{ij} } \right) \\ & = var\left( {u_{0i} } \right) + var\left( {u_{1i} x_{1ij} } \right) + 2cov\left( {u_{0i} ,u_{1i} x_{1ij} } \right) + var\left( {\varepsilon_{ij} } \right) \\ & = \sigma_{u0}^{2} + \sigma_{u1}^{2} x_{1ij}^{2} + 2\sigma_{u0u1} x_{1ij} + \sigma_{\varepsilon }^{2} \\ \end{aligned}$$

When there are random slopes in the model, the Variance Partition Coefficient (VPC), a function of arbuscule count, is calculated to measure the relationship of plant level variance to the total variance rather than an ICC⁶⁰:

$$VPC= \frac{var({u}_{0i},{u}_{1i}{x}_{1ij})}{var\left({u}_{0i},{u}_{1i}{x}_{1ij}\right)+var({\epsilon }_{ij})}=\frac{{\sigma }_{{u}_{0}}^{2}+{\sigma }_{{u}_{1}}^{2}{x}_{1ij}^{2}+2{\sigma }_{{u}_{0}{u}_{1}}{x}_{1ij}}{{\sigma }_{{u}_{0}}^{2}+{\sigma }_{{u}_{1}}^{2}{x}_{1ij}^{2}+2{\sigma }_{{u}_{0}{u}_{1}}{x}_{1ij}+{\sigma }_{\epsilon }^{2}}$$

VPC is similar to ICC in terms of interpretation of the result.

Arbuscule morphology is predictive of differential AMF percent root colonization in sibling plants

Looking at how the plant level variance changes as predictors such as, root region, arbuscule size and count were added to the Null Model in Model 2, 3, 4, 5, we observed that, in Model 6, 90.9% total variance in percent colonization is attributed to the plants (PCV_plant = 0.9091). An VPC of 56.6% suggested that even if variance at plant level shrunk, it still explained the majority of differences in percent colonization. This is possible because the sample level variance dropped with the plant level variance by 87.1% as arbuscule count and size were added as predictors.

Model 6 had the highest R-squared^57,61 of 0.9574 and desired lowest information criteria and deviance. A model with random slopes for both arbuscule count and size were fitted but not included in Table 2 as it was not significantly different from Model 6 in a likelihood ratio test. Confidence scores for arbuscule was added as a fixed effect to Model 6 and was tested nonsignificant using a likelihood ratio test. In Fig. 5a, the expected values of AMF colonization from Model 6 were plotted to visualize the fit of the model to the data and the varying slopes between plants. The plant level variance and its VPC is a function of arbuscule count in Model 6 (Fig. 5b). The clustering of plant level variance was stronger when more arbuscules were observed, also reflected by the increasing of VPC (Fig. 5b). For example, N6F3, E37, N66, N102, and N116 had higher slopes and hence higher arbuscule counts than the remaining accessions. The remaining plants, however, had lower arbuscule counts and similar total AMF percent colonization. It was impossible to distinguish the remaining plants by their arbuscule counts. At first sight it seemed strange that arbuscule count should be selected as a predictor when it had lower confidence scores than other traits (Fig. 2), but as shown in Fig. 5B, there were substantial differences in arbuscule count for the first 5 accessions, and that was why arbuscule count was selected for inclusion in the mixed linear model to explain plant level differences.

The plant level variance component in count is stable across models: loss of goodness of fit in random effects is offset by a gain in goodness of fit to fixed effects of fungal structures

The same variable selection and model selection procedure for overall AMF percent colonization was applied to fit overall AMF count density as the response variable. The β vector of fixed effects was modified to include the appropriate AMF structure phenotypes as predictors.

Lasso regression⁵⁹ selected the count measures of all fungal structures as fixed effects in the mixed linear model to predict overall AMF count density in sorghum plants. Forward selection removed the number of intraradical hyphae as an explanatory variable. Root regions and the counts of arbuscules, vesicles, spores and extraradical hyphae were the five fixed effects in Model 3 and 4 for count density. Random slopes were added with respect to the four effects of fungal structures. The effect of extraradical hyphae was the only one that differed between the sorghum plants. A random slope was added to the effect of extraradical hyphae in Model 4. We added the confidence score of extraradical hypha as a fixed effect to count density Model 4 and found that it was not a significant variable.

The proportion of variance at the plant level remained stable as fixed effects were added to models (Table 2). In Table 3, the ICCs and VPC of between plant variance ranged from 0.428 to 0.490, which was less than a 7% difference in variance explained. The proportional changes in plant variance decreased by 70.9%, which was compensated for by a 77.4% drop in sampling variance. What variation in count density lost to the fixed effects was replaced by the improved fit of the model.

Table 3 Mixed linear models are well predicted for count density by counts of hyphal structures, root regions, and sorghum accessions. Proportional change in variance (PCV) is provided to measure the importance of random effects. Intraclass correlation (ICC) is used to implicate the variance between accessions. Varied measures of fit are provided to assess model performance.

Full size table

Differential AMF colonization between sorghum root regions

A significant improvement of Model 2 to the Null Model 1 for both total AMF colonization phenotypes supports AMF colonization to be different between root regions. PCV_sample was 0.365 and 0.205 respectively for AMF percent colonization and count density (Tables 2 and 3). The positive signs of PCVs suggest that the sample variances within plants in the null models are attributed to differences in root regions. The top root region had the highest colonization by t-tests at the 0.05 significance level (Fig. 6a,b). Arbuscule count was a predictor essential for the modeling of both phenotypes in the previous section. It is a reasonable speculation that arbuscule count is a main driving force in the positive correlation of the two total AMF colonization traits. The speculation is sustained by larger arbuscule size (Fig. 6c), higher numbers of arbuscules and extraradical hyphae in the top root region (Fig. 6d), tested significant using Tukey Multiple Comparison tests (Table 4).

Table 4 Tukey multiple comparison contrasts between region levels with their significance (p-values). The top, middle, and bottom root regions were abbreviated as TOP, MID and BOT.

Full size table

Although the morphological traits of other AM fungal structures did not have significant effects, they could still contribute to differential colonization. If the sorghum plants were colonized by equal amounts of AMF, Fig. 7a,b showed how the relative abundance of AM fungal structures in the roots could differ. When the twelve sorghum plants were ranked in decreasing order of AMF percent colonization from left to right, the relative amount of arbuscules and extraradical hyphae trended downward. The same observation held if the panel was divided by root regions.

To quantify the observation, one more phenotype was calculated, the amount of arbuscule and extraradical hyphae divided by the total AMF structures. It measures the proportion of nutrient exchange (PNE) structures³². After logit transformation, a mixed linear model was fitted to PNE with root region as the fixed effect and a plant level random effect. The intraclass correlation for PNE was 0.585. The conditional R-squared of the mixed linear model was 0.624. Tukey multiple comparison test showed that the top 0–15 cm and the middle 15–30 cm tested insignificant to each other, but both were tested significant to the bottom > 30 cm roots (Table 5). Percent colonization and count density had the middle 15–30 cm root region tested insignificant against the bottom root regions. The levels of AMF colonization of the twelve sorghum plants were ranked differently with percent colonization, count density and PNE. Some similarity was found between percent colonization and PNE using Spearman correlation (rho = 0.544, p < 0.001) There was no correlation between proportion NE and AMF count density (rho = − 0.089, p = 0.362) (Fig. 8).

Table 5 Multiple comparison test of AMF colonization along root regions.

Full size table

Discussion

Preferential AMF colonization in sorghum roots

The 15 cm of roots closest to the soil surface presented highest total AMF colonization and proportion of nutrient exchange fungal structures (Figs. 4 and 6, Tables 4 and 5). The total AMF richness and colonization at 0–10 cm soil depth were shown to be higher than at deeper soil layers in Pedicularis kansuensis at a subalpine grassland ecosystem⁶². Kabir et al. found that total colonization, total hyphae density, and spore density were highest at a depth of 0–15 cm of soil in corn roots. Top roots of sorghum produce secondary roots and fine root via branching. The growing tip is at the bottom of the root. One hypothesis is that the AMF preferentially colonize the actively branching sections of sorghum roots. An alternative hypothesis is that the top layer is rich in phosphorus, defining the niche for AMF colonization⁶². One way to test the first hypothesis is to examine the variation in root morphology between inbred lines with the software DIRT⁶³ to see whether or not root morphology has an impact on fungal structure counts. The latter hypothesis could also be tested by using nanodots or phosphorus labeling experiments to track the exchange in the top layer⁶⁴.

Plant level variance of AMF colonization

In the null model, the total variance in AMF colonization was composed of the sampling variance and plant level variance. Expanding the null models transferred the plant level variance from the variance components to the fungal structure morphology traits. One hypothesis for the result is that sorghum plants of different genotypic backgrounds determine overall AMF abundance in roots by manipulating the developmental stage of AMF species after colonization. Alternatively, in lifestyle studies of single AMF species, researchers found that the composition and morphology of their fungal structures vary. Sorghum lines may match with a community of AMF species in the soil that generates different compositions of AM fungal structures in roots⁶⁵.

Our experimental design does not permit direct testing of the amount of variance in AMF colonization that is under the genomic control of plant hosts, because there were no biological replicates of the RILs. If biological replicates were available, plant level variance would provide an estimate of population level variance of AMF colonization and the measurements, ICCs and VPCs, would capture the broad sense heritability including additive, dominant and epistatic effects. The results in Table 2 are suggestive of performing a large scale Genome Wide Association Study (GWAS) and or Recombinant Inbred Lines (RILs) study in Sorghum bicolor to test whether or not sorghum genes play a role in AMF colonization^16,17.

Differential AMF colonization between sorghum inbred lines

Percent colonization was a new measurement of overall AMF abundance by taking the ratio of pixelwise segmentation of fungal structures and sorghum roots of our computer vision model. Count density was a similar measurement of AMF abundance to the output of McGonigle⁵⁰ scoring method. The proportion of variance at plant level of these two phenotypes did not lose to the integration of fungal structure morphology traits into the models, which establishes them as favorable response variables for Genome-wide Association Studies of AMF colonization in sorghum populations. While both phenotypes were modeled with fungal structure morphology traits as fixed effects, the fixed effects differed in their ability to explain variability in the data. Two morphological traits of arbuscule were required to explain 95.7% variability in percent colonization suggestive of a mechanism for how the plant controls AMF colonization. One example of a gene relevant to this mechanism encodes the protein DELLA^31,66.

Count density was modeled with four fungal structures. The final fit to predicting count density explained 87.2% of variability. One could argue that percent colonization is a better phenotype for GWAS for its simplicity, goodness of fit and higher plant level variance. The plant level variance of the phenotype would decrease as the number of plant replicates and sorghum inbred lines grow in scale. Count density should not be discarded as technologies provide newer features to quantify AMF abundance. Finally, the two phenotypes did rank the twelve sorghum inbred lines in different orders. Inbred line L8 had the highest AMF count density but ranked 7th for percent colonization. Ranks for most other inbred lines were comparable. It is recommended to use both phenotypes in future studies of AMF colonization using computer vision.

Role of models across scales to understand the AMF symbiosis with plants

To understand the symbiosis of Sorghum bicolor with AMF it has been necessary here to develop models that operate at multiple scales⁶⁷. First, computer vision models were developed that provided the high throughput data to describe how AMF colonize the plant at the individual plant level. A variety of measures were developed that then can be used in GWAS studies to test the role of the plant genome in shaping the AMF microbiome. To avoid the confounding effects of environmental field variables in a GWAS, it will important to use factorial designs⁶⁸ and blocking to separate the effects of accession from environmental field variables, like Nitrogen and Phosphorus levels. These models operated at the individual level of plants in the study. The computer vision models provided a diverse array of measures to describe the colonization process described above (Tables 2 and 3). We are in the process of not only automating classification and segmentation of fungal structures but full automation of image acquisition as well. A final limitation of the computer vision phenotyping method here is not having live cell images to follow the dynamics of the structures. The Kokkoris laboratory⁶⁹ has been able to follow the dynamics of nuclei in AMF, and it would be useful to adopt a similar approach to other AMF structures to gain a time dimension on colonization. Live cell imaging will also contribute to understanding the prepenetration apparatus⁹, arbuscule formation³¹, and AMF inheritance¹².

On top of the computer vision model for feature extraction, mixed linear models were added to discover relations between measures of overall AMF abundance with fungal structure morphology and root niche. This mixed linear modeling approach provides a framework for GWAS and QTL mapping by suggesting underlying mechanisms by which the plant genome controls the AMF community. For example, arbuscule count entered into the prediction. This relation suggests a mechanistic link to the arbuscules in how the plant genome controls the AMF microbiome. There are likely genes in both sides of the partnership controlling the development of these structures⁷⁰. Other layers to the modeling will need to be added to conceptualize our understanding of this ancient and fundamental symbiosis⁶⁷.

Materials and methods

The Georgia dataset

Plant cultivation

Sorghum plants were derived from Recombinant Inbred Lines (RILs), a mapping population consisting of 191 F3:5 from a cross between an unnamed accession of Sorghum propinquum (William Rooney, Texas A & M University, College Station, TX) and inbred line TX7000 of S. bicolor⁷¹. The seeds were kindly provided by Jeff Bennetzen, one of the PIs who constructed the RIL collection⁷¹. No permissions are needed to use these Sorghum accessions. Three seeds from 15 RILs were planted on October 5, 2020 in steam sterilized Sungro garden soil in 2.5-gallon pots at the UGA Botany Greenhouse. Seedlings were grown on a 11-h light cycle. Plants were fertilized with 1 tablespoon Osmocote. Individual seedling was transferred in 2.5-gallon pots filled with a 4:1 mix of steam sterilized turface and soil from Ironhorse Farm, Watkinsville, GA (Table 1) on day 15. Seedlings were grown to maturity on a 11 h Light/Dark cycle with watering as needed. In addition, one commercial hybrid forage sorghum plant derived from Richardson, TX was harvested at Iron Horse Farm, GA on October 13, 2020, two grain sorghums of accession M72GB7 at Iron Horse Farm, GA on November 12, 2020, a Colby sorghum at UGA Botany Greenhouse. All the methods were carried out in accordance with relevant Institutional guidelines and regulations.

Root imaging

Random samples of 0.25 g of fine roots were taken from the whole roots of Richardson, M72GB7, Colby, E46-W, N88, E24, E46 for training images. The cleaned whole roots were cut into 1 cm pieces. Fine roots with intact cortex were randomly selected and weighed to get 3 cassettes of 0.25 g of samples per plant. Root samples were cleared in 10% alkaline hydrogen peroxide solution for 2 h and in 5% KOH overnight at room temperature. Fungal structures were stained using a modified Ink and Vinegar method⁷². Stained roots were spread and flattened on slides prior for imaging. Mounted root samples were imaged at 200× magnification with a Zeiss Primo Star compound microscope equipped with an Axiocam 105 color camera. Focusing was done locally and manually for every field of view during imaging to increase sharpness, but no post image acquisition processing was involved, such as adjusting contrast. McGonigle method was used to generate images at 192 root intersections. The root intersections were 0.5 cm equidistantly spread across a 75 × 25 mm glass slide. The fungal structures at root intersections were manually scored and annotated for training the computer vision model.

To test for the differential AMF colonization in root regions and in sorghum plants, 12 RIL sorghums of E37, L8, N6-F3, N10, N43, N66, N68, N102, N108, N110, N116, and N162 were sampled from three root regions. The ‘TOP’ region was the first 15 cm of roots below soil surface. The ‘MID’ region is the next 15 cm below. The ‘BOT’ region was roots longer than 30 cm (Fig. 4). Aerial roots were excluded from sampling. From each region of a plant, 3 technical replicates of 0.25 g of fine roots with intact cortex were randomly sampled. Each plant was represented with a total of 9 cassettes or 2.25 g of root samples. A line has only 1 plant as biological replicate. The same clearing, staining and imaging procedures were applied.

The Cambridge dataset

The publicly available Cambridge dataset (zenodo ID https://doi.org/10.5281/zenodo.5118948) included 15 whole slide scanning images acquired with an Epson Perfection flatbed scanner (Epson UK, Hemel Hempstead, UK) using default settings and a resolution of 3200 dots per inch. The images were downloaded from the zenodo dataportal⁷³ using the zenodo-get software method. The 15 whole slide images were in jpg format and 10,389 × 5108 pixels and 96 pixels/inch in size. The original annotations were discarded. The same annotators and annotation standards for the Georgia dataset were used in reannotation to maintain uniformity. The annotated images were tiled and added to the Georgia dataset to create a secondary training set for more and even representation of each AMF class.

Image annotation

The root image annotation was conducted using the VGG annotator tool⁷⁴. The fungal and root structures were manually annotated using the polygon tool. One of seven class labels was assigned to a structure (Table 1). The annotation results were exported as a json file and csv table.

All 746 jpg images in the Georgia dataset were segmented and annotated. We generated 3577 polygon annotations. A total of 14 out of 15 images were selected from the Cambridge dataset and produced 20,588 annotations.

Class	Annotation rules for the Georgia dataset
Root	Plant root
Extradical hypha (exH)	Filamentous structure outside the boundary of a plant root annotation
Intraradical hypha (inH)	Filamentous structure within the boundary of a plant root annotation
Spore (sp)	Circular structures with a solid outline, connected to AMF external hypha, and outside the boundary of plant root
Vesicle (ves)	Circular or rectangular structures with a solid outline and within the boundary of plant root
Arbuscule (arb)	Highly branched hypha with fuzzy outline and connected to intraradical hyphae within the boundary of plant root
Others	Non-AM fungal structures

Data cleaning

The Georgia and the Cambridge datasets were cleaned to produce similar input data. All segmentation shapes were approximated by polygons, including converting polyline to polygon directly and resampling points in circles to produce polygons. Some segmentation shapes, including point, rectangle and ellipse, were removed. Empty and undefined segmentations were also removed. Classes with few representative examples were merged into ‘others’. Class labels were made uniform in their vocabularies. The final class list included root, AMF internal hypha, AMF external hypha, AMF arbuscule, AMF vesicle, AMF spore and others. The cleaned Georgia dataset included 746 jpg images that are 2380 × 1740 pixels and 300 pixels/inch in size and 3577 annotations.

The Cambridge dataset needed additional processing steps. To have comparable input data in size, the 14 images were tiled and subsampled. The images were tiled into squares of 512 × 512 pixels and smaller images on the boundaries. The segmentations were subsampled to fit each tile. New segmentation polygons were produced at the intersection of the tiles and the original segmentations using Shapely⁷⁵. Polygons with self-intersection were dissected into smaller simple polygons. Points and LineStrings were ignored as subsampling results. Indices for segmentation and bounding box were recalculated relative to the new tiled image. Only tiled images with at least one segmentation annotation were kept. The quality of tiling and subsampling were checked by comparing segmentation in the raw images and the tiled small images visually. The resulting Cambridge dataset included 1379 tiled jpg images that are 512 × 512 pixels and 96 pixels/inch in size and 20,558 annotations.

The Georgia dataset were separated into training, validation and testing sets at 8:1:1 ratio. The training set has 598 images and 2874 annotations. To increase the number of examples for each fungal structure, the Cambridge dataset was divided in the same 8:1:1 ratio and merged to the previous Georgia training, validation and testing sets. The secondary training set is made up of 1105 images and 16,417 annotations.

The final prediction set was consisted of 24,391 root images from the ‘TOP’, ‘MID’, and ‘BOT’ regions of 12 sorghum RIL plants. Images with height or width less than 100 pixels in the prediction set were dropped.

Mask R-CNN model training

Mask R-CNN was implemented in Detectron2^46,76 and is composed of the backbone, the region proposal network (RPN), and heads^46,77. The ResNet 50 and FPN (Feature Pyramid Network) backbone extracts feature map from images^46,78. RPN proposes candidate regions⁷⁹. Heads produce bounding box, mask, and class inferences. The Mask R-CNN model was pretrained on the COCO dataset with 3 × schedule^46,76,80. The pretrained model was retrained on the first and secondary training sets for 50 epochs with batch size 2 and the default learning rate schedule.

Different hyperparameters were tested, and each combination was repeated three times with different random seeds. Learning rates of 0.001 and 0.002 were tested. The number of frozen or fine-tuned backbone modules was varied by changing the ‘FREEZE_AT’ parameter from 1 to 3. Two augmentation options were implemented. The default option included image random flip and resize, and the second option added random crop, rotation, and brightness adjustment as augmentation options. Other parameters were set to the defaults in Detectron2 configuration⁷⁶.

Model performance and hyperparameters were evaluated based on mean Average Precision (mAP). The best fine-tuned model for defined hyperparameters was selected based on total loss in validation set during training⁴⁶.

The model quality metric mAP was calculated with varying confidence thresholds and averaged over all classes. In addition, AP50 was calculated at Intersection over union (IoU) level 50%, and AP was averaged over IoU levels from 50 to 95%. Score threshold for inference in test set was set to 0.7.

Mixed linear model prediction and statistical analysis

The best model was used for the prediction set of 24,391 images. Other settings remained the same as training. Inferred segmentations in an image were cross-tabulated by class versus segmentation number and pixel number. For downstream statistical analysis on AMF colonization, three class level statistics were generated using the two outputs above.

Count density of an AMF structure was defined as its segmentation number divided by the root pixel number (count/pixel). Average class size of an AMF structure was its pixel number divided by its segmentation number (pixel/count). Percent colonization by an AMF structure was measured as its pixel number per root pixel (pixel/pixel). The three class level colonization statistics were calculated for every slide. A total of 648 entries was used for regression analysis to test for differential colonization in root regions and sorghum plants.

Mixed effect models in ‘lme4’ R package⁸¹ were used for modeling the three class level AMF colonization statistics. ANOVA and t-tests were used to test for the significance of model parameters. Likelihood ratio test was used to test the significance of a model to a nested model.

Computational resources

Model training and inference was implemented on sapelo2 at the Georgia Advanced Computing Resource Center (GACRC) with one p100 GPU, 4 CPUs, and 20 GB memory. GPUs were used for model training. CPUs were used for model inference. Codes are available in GitHub: https://github.com/Arnold-Lab/image_seg_sorghum_am.

Data availability

Summary data and codes are available in GitHub: https://github.com/Arnold-Lab/image_seg_sorghum_am. The analyses and manuscript are available in RStudio with the exception of the tables, which were converted manually back to Word formatting from image formatting at the request of the publisher. The large collection of over 20,000 raw images are available upon reasonable request from a shared DropBox folder.

References

Bonfante, P. & Genre, A. Mechanisms underlying beneficial plant–fungus interactions in mycorrhizal symbiosis. Nat. Commun. 1, 1–11 (2010).
Article Google Scholar
Zhong, Y. et al. Arbuscular mycorrhizal trees influence the latitudinal beta-diversity gradient of tree communities in forests worldwide. Nat. Commun. 12, 1–12 (2021).
Article ADS Google Scholar
Field, K. J. et al. Contrasting arbuscular mycorrhizal responses of vascular and non-vascular plants to a simulated palaeozoic co2 decline. Nat. Commun. 3, 1–8 (2012).
Article Google Scholar
Gao, C. et al. Co-occurrence networks reveal more complexity than community composition in resistance and resilience of microbial communities. Nat. Commun. 13, 3867. https://doi.org/10.1038/s41467-022-31343-y (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Pozo, M. J. & Azcón-Aguilar, C. Unraveling mycorrhiza-induced resistance. Curr. Opin. Plant Biol. 10, 393–398. https://doi.org/10.1016/j.pbi.2007.05.004 (2007).
Article CAS PubMed Google Scholar
Ruiz-Lozano, J. M., Porcel, R., Azcón, C. & Aroca, R. Regulation by arbuscular mycorrhizae of the integrated physiological response to salinity in plants: New challenges in physiological and molecular studies. J. Exp. Bot. 63, 4033–4044. https://doi.org/10.1093/jxb/ers126 (2012).
Article CAS PubMed Google Scholar
Choi, J., Summers, W. & Paszkowski, U. Mechanisms underlying establishment of arbuscular mycorrhizal symbioses. Annu. Rev. Phytopathol. 56, 135–160 (2018).
Article CAS PubMed Google Scholar
Buee, M., Rossignol, M., Jauneau, A., Ranjeva, R. & Bécard, G. The pre-symbiotic growth of arbuscular mycorrhizal fungi is induced by a branching factor partially purified from plant root exudates. Mol. Plant Microbe Interact. 13, 693–698 (2000).
Article CAS PubMed Google Scholar
Genre, A., Chabaud, M., Faccio, A., Barker, D. G. & Bonfante, P. Prepenetration apparatus assembly precedes and predicts the colonization patterns of arbuscular mycorrhizal fungi within the root cortex of both Medicago truncatula and Daucus carota. Plant Cell. 20, 1407–1420 (2008).
Article CAS PubMed PubMed Central Google Scholar
Pumplin, N. & Harrison, M. J. Live-cell imaging reveals periarbuscular membrane domains and organelle location in Medicago truncatula roots during arbuscular mycorrhizal symbiosis. Plant Physiol. 151, 809–819. https://doi.org/10.1104/pp.109.141879 (2009).
Article CAS PubMed PubMed Central Google Scholar
Smith, S. E. & Read, D. J. Mycorrhizal Symbiosis (Academic Press, 2010).
Marleau, J., Dalpé, Y., St-Arnaud, M. & Hijri, M. Spore development and nuclear inheritance in arbuscular mycorrhizal fungi. BMC Evol. Biol. 11, 1–11 (2011).
Google Scholar
Kokkoris, V., Stefani, F., Dalpé, Y., Dettman, J. & Corradi, N. Nuclear dynamics in the arbuscular mycorrhizal fungi. Trends Plant Sci. 25, 765–778. https://doi.org/10.1016/j.tplants.2020.05.002 (2020).
Article CAS PubMed Google Scholar
Montero, H., Choi, J. & Paszkowski, U. Arbuscular mycorrhizal phenotyping: The dos and don’ts. New Phytol. 221, 1182–1186. https://doi.org/10.1111/nph.15489 (2019).
Article PubMed Google Scholar
Koch, A. M., Croll, D. & Sanders, I. R. Genetic variability in a population of arbuscular mycorrhizal fungi causes variation in plant growth. Ecol. Lett. 9, 103–110. https://doi.org/10.1111/j.1461-0248.2005.00853.x (2006).
Article PubMed Google Scholar
Plouznikoff, K., Asins, M. J., de Boulois, H. D., Carbonell, E. A. & Declerck, S. Genetic analysis of tomato root colonization by arbuscular mycorrhizal fungi. Ann. Bot. 124, 933–946. https://doi.org/10.1093/aob/mcy240 (2019).
Article CAS PubMed PubMed Central Google Scholar
De Vita, P. et al. Genetic markers associated to arbuscular mycorrhizal colonization in durum wheat. Sci. Rep. 8, 1–12 (2018).
CAS Google Scholar
Pawlowski, M. L., Vuong, T. D., Valliyodan, B., Nguyen, H. T. & Hartman, G. L. Whole-genome resequencing identifies quantitative trait loci associated with mycorrhizal colonization of soybean. Theor. Appl. Genet. 133, 409–417 (2020).
Article CAS PubMed Google Scholar
Jacquelinet-Jeanmougin, S. & Gianinazzi-Pearson, V. Endomycorrhizas in the gentianaceae. I. The fungi associated with gentiana lutea l. N. Phytol. 95, 663–666 (1983).
Article Google Scholar
Gerdemann, J. W. Vesicular-arbuscular mycorrhizae formed on maize and tuliptree by endogone fasciculata. Mycologia 57, 562–575. https://doi.org/10.2307/3756732 (1965).
Article Google Scholar
Smith, F. A. & Smith, S. E. Tansley review no. 96. Structural diversity in (vesicular)-arbuscular mycorrhizal symbioses. N. Phytol. 137, 373–388 (1997).
Article CAS Google Scholar
Stahlhut, K. N. et al. Genetic control of arbuscular mycorrhizal colonization by rhizophagus intraradices in Helianthus annuus (l.). Mycorrhiza 31, 723–734. https://doi.org/10.1007/s00572-021-01050-5 (2021).
Article CAS PubMed Google Scholar
Watts-Williams, S. J. et al. Diverse sorghum bicolor accessions show marked variation in growth and transcriptional responses to arbuscular mycorrhizal fungi. Plant Cell Environ. 42, 1758–1774. https://doi.org/10.1111/pce.13509 (2019).
Article CAS PubMed Google Scholar
Pawlowski, M. L., Vuong, T. D., Valliyodan, B., Nguyen, H. T. & Hartman, G. L. Whole-genome resequencing identifies quantitative trait loci associated with mycorrhizal colonization of soybean. Theor. Appl. Genet. 133, 409–417. https://doi.org/10.1007/s00122-019-03471-5 (2020).
Article CAS PubMed Google Scholar
Lehnert, H., Serfling, A., Enders, M., Friedt, W. & Ordon, F. Genetics of mycorrhizal symbiosis in winter wheat (Triticum aestivum). N. Phytol. 215, 779–791. https://doi.org/10.1111/nph.14595 (2017).
Article CAS Google Scholar
Johnson, A. C., Pendergast, T. H., Chaluvadi, S., Bennetzen, J. L. & Devos, K. M. Identification of micrornas responsive to arbuscular mycorrhizal fungi in Panicum virgatum (switchgrass). BMC Genom. 23, 688. https://doi.org/10.1186/s12864-022-08797-x (2022).
Article CAS Google Scholar
MacLean, A. M., Bravo, A. & Harrison, M. J. Plant signaling and metabolic pathways enabling arbuscular mycorrhizal symbiosis. Plant Cell 29, 2319–2335 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gallego-Bartolomé, J. et al. Molecular mechanism for the interaction between gibberellin and brassinosteroid signaling pathways in arabidopsis. Proc. Natl. Acad. Sci. 109, 13446–13451. https://doi.org/10.1073/pnas.1119992109 (2012).
Article ADS PubMed PubMed Central Google Scholar
Davière, J.-M. & Achard, P. Gibberellin signaling in plants. Development 140, 1147–1151. https://doi.org/10.1242/dev.087650 (2013).
Article CAS PubMed Google Scholar
Jiang, C., Gao, X., Liao, L., Harberd, N. P. & Fu, X. Phosphate starvation root architecture and anthocyanin accumulation responses are modulated by the gibberellin-della signaling pathway in arabidopsis. Plant Physiol. 145, 1460–1470. https://doi.org/10.1104/pp.107.103788 (2007).
Article CAS PubMed PubMed Central Google Scholar
Floss, D. S., Levy, J. G., Lévesque-Tremblay, V., Pumplin, N. & Harrison, M. J. Della proteins regulate arbuscule formation in arbuscular mycorrhizal symbiosis. Proc. Natl. Acad. Sci. 110, E5025–E5034 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Johnson, N. C., Rowland, D. L., Corkidi, L., Egerton-Warburton, L. M. & Allen, E. B. Nitrogen enrichment alters mycorrhizal allocation at five mesic to semiarid grasslands. Ecology 84, 1895–1908. https://doi.org/10.1890/0012-9658(2003)084[1895:NEAMAA]2.0.CO;2 (2003).
Article Google Scholar
Johnson, N. C., Graham, J. H. & Smith, F. A. Functioning of mycorrhizal associations along the mutualism-parasitism continuum. N. Phytol. 135, 575–586 (1997).
Article Google Scholar
Graham, J. H. & Abbott, L. K. Wheat responses to aggressive and non-aggressive arbuscular mycorrhizal fungi. Plant Soil 220, 207–218. https://doi.org/10.1023/A:1004709209009 (2000).
Article CAS Google Scholar
Klironomos, J. N. Variation in plant response to native and arbuscular mycorrhizal fungi. Ecology 84, 2292–2301 (2003).
Article Google Scholar
Taylor, J. & Harrier, L. A comparison of nine species of arbuscular mycorrhizal fungi on the development and nutrition of micropropagated Rubus idaeus l. Cv. Glen prosen (red raspberry). Plant Soil 225, 53–61. https://doi.org/10.1023/A:1026519431096 (2000).
Article CAS Google Scholar
Hart, M. M. & Reader, R. J. Does percent root length colonization and soil hyphal length reflect the extent of colonization for all amf?. Mycorrhiza 12, 297–301. https://doi.org/10.1007/s00572-002-0186-5 (2002).
Article PubMed Google Scholar
Munkvold, L., Kjøller, R., Vestberg, M., Rosendahl, S. & Jakobsen, I. High functional diversity within species of arbuscular mycorrhizal fungi. New Phytol. 164, 357–364. https://doi.org/10.1111/j.1469-8137.2004.01169.x (2004).
Article PubMed Google Scholar
Feddermann, N., Finlay, R., Boller, T. & Elfstrand, M. Functional diversity in arbuscular mycorrhiza—The role of gene expression, phosphorous nutrition and symbiotic efficiency. Fungal Ecol. 3, 1–8. https://doi.org/10.1016/j.funeco.2009.07.003 (2010).
Article Google Scholar
Mensah, J. A. et al. High functional diversity within species of arbuscular mycorrhizal fungi is associated with differences in phosphate and nitrogen uptake and fungal phosphate metabolism. Mycorrhiza 25, 533–546. https://doi.org/10.1007/s00572-015-0631-x (2015).
Article CAS PubMed Google Scholar
Mc, G. T., Miller, M. H., Evans, D. G., Fairchild, G. L. & Swan, J. A. A new method which gives an objective measure of colonization of roots by vesicular-arbuscular mycorrhizal fungi. New Phytol. 115, 495–501. https://doi.org/10.1111/j.1469-8137.1990.tb00476.x (1990).
Article Google Scholar
Trouvelot, A., Kough, J. & Gianinazzi-Pearson, V. Measurement of the va mycorrhization rate of a root system. Research and estimation methods of functional significance. Physiol. Genet. Asp. Mycorrhizae Dijon. 217–221 (1985).
Krach, E. K., Wu, Y., Skaro, M., Mao, L. & Arnold, J. Wild isolates of Neurospora crassa reveal three conidiophore architectural phenotypes. Microorganisms https://doi.org/10.3390/microorganisms8111760 (2020).
Article PubMed PubMed Central Google Scholar
Krach, E. K., Skaro, M., Wu, Y. & Arnold, J. Characterizing the gene-environment interaction underlying natural morphological variation in neurospora crassa conidiophores using high-throughput phenomics and transcriptomics. G3 (Bethesda). https://doi.org/10.1093/g3journal/jkac050 (2022).
Article PubMed PubMed Central Google Scholar
Evangelisti, E. et al. Artificial intelligence enables the identification and quantification of arbuscular mycorrhizal fungi in plant roots. bioRxiv https://doi.org/10.1101/2021.03.05.434067 (2021).
Article Google Scholar
He, K. M., Gkioxari, G., Dollar, P. & Girshick, R. Mask r-cnn. 2017 IEEE International Conference on Computer Vision (Iccv), 2980–2988, https://doi.org/10.1109/Iccv.2017.322 (2017).
He, K., Gkioxari, G., Doll\'ar, P. & Girshick, R. Mask r-cnn. arXiv pre-print serve., ar*xiv:1703.06870 (2018).
Ferguson, J. N. et al. Machine learning-enabled phenotyping for gwas and twas of wue traits in 869 field-grown sorghum accessions. Plant Physiol. 187, 1481–1500. https://doi.org/10.1093/plphys/kiab346 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tukey, J. W. Comparing individual means in the analysis of variance. Biometrics 5, 99–114 (1949).
Article MathSciNet CAS PubMed Google Scholar
McGonigle, T. P., Miller, M. H., Evans, D. G., Fairchild, G. L. & Swan, J. A. A new method which gives an objective measure of colonization of roots by vesicular–arbuscular mycorrhizal fungi. N. Phytol. 115, 495–501. https://doi.org/10.1111/j.1469-8137.1990.tb00476.x (1990).
Article CAS Google Scholar
Searle, S. R., Casella, G. & McCulloch, C. E. Variance Components. (Wiley, 2009).
Schielzeth, H. & Nakagawa, S. Nested by design: Model fitting and interpretation in a mixed model era. Methods Ecol. Evol. 4, 14–24. https://doi.org/10.1111/j.2041-210x.2012.00251.x (2013).
Article Google Scholar
Brenton, Z. W. et al. A genomic resource for the development, improvement, and exploitation of sorghum for bioenergy. Genetics 204, 21–33 (2016).
Article CAS PubMed PubMed Central Google Scholar
Juan, M., Basile, C., Min, Y., John, L. & Lennart, R. A brief conceptual tutorial of multilevel analysis in social epidemiology: Linking the statistical concept of clustering to the idea of contextual phenomenon. J. Epidemiol. Community Health. 59, 443. https://doi.org/10.1136/jech.2004.023473 (2005).
Article Google Scholar
Deng, S. et al. Genome wide association study reveals plant loci controlling heritability of the rhizosphere microbiome. ISME J. 15, 3181–3194. https://doi.org/10.1038/s41396-021-00993-z (2021).
Article CAS PubMed PubMed Central Google Scholar
Juan, M., Min, Y., Basile, C., John, L. & Lennart, R. A brief conceptual tutorial on multilevel analysis in social epidemiology: Investigating contextual phenomena in different groups of people. Journal of Epidemiology and Community Health 59, 729. https://doi.org/10.1136/jech.2004.023929 (2005).
Article Google Scholar
Nakagawa, S. & Schielzeth, H. A general and simple method for obtaining r2 from generalized linear mixed-effects models. Methods Ecol. Evol. 4, 133–142. https://doi.org/10.1111/j.2041-210x.2012.00261.x (2013).
Article Google Scholar
Merlo, J., Yang, M., Chaix, B., Lynch, J. & Råstam, L. A brief conceptual tutorial on multilevel analysis in social epidemiology: Investigating contextual phenomena in different groups of people. J Epidemiol. Community Health. 59, 729–736. https://doi.org/10.1136/jech.2004.023929 (2005).
Article PubMed PubMed Central Google Scholar
Groll, A. & Tutz, G. Variable selection for generalized linear mixed models by l1-penalized estimation. Stat. Comput. 24, 137–154. https://doi.org/10.1007/s11222-012-9359-z (2014).
Article MathSciNet Google Scholar
Subramanian, S. V., Jones, K. & Duncan, C. In Neighborhoods and Health (eds Kawachi, I. & Berkman, L. F.) (Oxford University Press, 2003).
Johnson, P. C. D. Extension of nakagawa & schielzeth’s r2glmm to random slopes models. Methods Ecol. Evol. 5, 944–946. https://doi.org/10.1111/2041-210X.12225 (2014).
Article PubMed PubMed Central Google Scholar
Wang, X. et al. N-p fertilization did not reduce amf abundance or diversity but alter amf composition in an alpine grassland infested by a root hemiparasitic plant. Plant Divers. 40, 117–126. https://doi.org/10.1016/j.pld.2018.05.001 (2018).
Article ADS PubMed PubMed Central Google Scholar
Das, A. et al. Digital imaging of root traits (dirt): A high-throughput computing and collaboration platform for field-based root phenomics. Plant Methods 11, 1–12 (2015).
Article Google Scholar
Whiteside, M. D. et al. Mycorrhizal fungi respond to resource inequality by moving phosphorus from rich to poor patches across networks. Curr. Biol. 29, 2043–2050 (2019).
Article CAS PubMed PubMed Central Google Scholar
Martignoni, M. M. et al. Co-inoculation with arbuscular mycorrhizal fungi differing in carbon sink strength induces a synergistic effect in plant growth. J. Theor. Biol. 531, 110859 (2021).
Article CAS PubMed Google Scholar
Yu, N. et al. A della protein complex controls the arbuscular mycorrhizal symbiosis in plants. Cell Res. 24, 130–133 (2014).
Article CAS PubMed Google Scholar
Johnson, N. C. et al. From lilliput to brobdingnag: Extending models of mycorrhizal function across scales. BioScience 56, 889–900. https://doi.org/10.1641/0006-3568(2006)56[889:FLTBEM]2.0.CO;2 (2006).
Article Google Scholar
Fisher, R. A. The Design of Experiments (Oliver and Boyd, 1935).
Cornell, C. et al. The arbuscular mycorrhizal fungus Rhizophagus irregularis harmonizes nuclear dynamics in the presence of distinct abiotic factors. Fungal Genet. Biol. 158, 103639. https://doi.org/10.1016/j.fgb.2021.103639 (2022).
Article CAS PubMed Google Scholar
Ivanov, S., Austin, J., Berg, R. H. & Harrison, M. J. Extensive membrane systems at the host–arbuscular mycorrhizal fungus interface. Nat. Plants 5, 194–203. https://doi.org/10.1038/s41477-019-0364-5 (2019).
Article PubMed Google Scholar
Govindarajulu, R. et al. Integration of high-density genetic mapping with transcriptome analysis uncovers numerous agronomic qtl and reveals candidate genes for the control of tillering in sorghum. G3 Genes Genomes Genet. 11, 024. https://doi.org/10.1093/g3journal/jkab024 (2021).
Article CAS Google Scholar
Vierheilig, H. & Piché, Y. A modified procedure for staining arbuscular mycorrhizal fungi in roots. Zeitschrift für Pflanzenernährung und Bodenkunde 161, 601–602. https://doi.org/10.1002/jpln.1998.3581610515 (1998).
Article CAS Google Scholar
Evangelisti, E. et al. Deep learning-based quantification of arbuscular mycorrhizal fungi in plant roots. N. Phytol. (2021).
Dutta, A. & Zisserman, A. The via annotation software for images, audio and video. In Proceedings of the 27th ACM International Conference on Multimedia (Mm'19), 2276–2279. https://doi.org/10.1145/3343031.3350535 (2019).
Gillies, S. et al. Shapely: Manipulation and Analysis of Geometric Objects. https://github.com/shapely/shapely (2007).
Wu, Y., Kirillov, A., Massa, F., Lo, W.-Y. & Girshick, R. Detectron2. https://github.com/facebookresearch/detectron2 (2019).
Wu, Y., Kirilov, A., Massa, F., Lo, W.-Y. & Girschick, R. Detectron 2. https://github.com/facebookresearch/detectron2 (2019).
Lin, T.-Y. et al. In European Conference on Computer Vision.
Ren, S., He, K., Girshick, R. & Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. https://doi.org/10.1109/TPAMI.2016.2577031 (2015).
Article PubMed Google Scholar
Lin, T. Y. et al. Microsoft coco: Common objects in context. Comput. Vis. Eccv 2014 Pt V 8693, 740–755. https://doi.org/10.1007/978-3-319-10602-1_48 (2014).
Article Google Scholar
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48. https://doi.org/10.18637/jss.v067.i01 (2015).
Article Google Scholar

Download references

Acknowledgements

This work was supported by DOE DE-SC0021386 and NSF DBI 1946937 and NSF DMS-2311186.

Author information

Authors and Affiliations

Institute of Bioinformatics, University of Georgia, Athens, GA, USA
Shufan Zhang, Yue Wu, Michael Skaro, Amanda Bouffier-Landrum & Isaac Torrres
Genetics Department, University of Georgia, Athens, GA, USA
Jia-Hwei Cheong, Yinping Guo, Lauren Stupp, Brooke Lincoln, Anna Prestel, Camryn Felt & Jonathan Arnold
School of Earth and Sustainability and Department of Biological Sciences, North Arizona University, Flagstaff, AZ, USA
Sedona Spann & Nancy Johnson
Statistics Department, University of Georgia, Athens, GA, USA
Abhyuday Mandal

Authors

Shufan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yue Wu
View author publications
You can also search for this author in PubMed Google Scholar
Michael Skaro
View author publications
You can also search for this author in PubMed Google Scholar
Jia-Hwei Cheong
View author publications
You can also search for this author in PubMed Google Scholar
Amanda Bouffier-Landrum
View author publications
You can also search for this author in PubMed Google Scholar
Isaac Torrres
View author publications
You can also search for this author in PubMed Google Scholar
Yinping Guo
View author publications
You can also search for this author in PubMed Google Scholar
Lauren Stupp
View author publications
You can also search for this author in PubMed Google Scholar
Brooke Lincoln
View author publications
You can also search for this author in PubMed Google Scholar
Anna Prestel
View author publications
You can also search for this author in PubMed Google Scholar
Camryn Felt
View author publications
You can also search for this author in PubMed Google Scholar
Sedona Spann
View author publications
You can also search for this author in PubMed Google Scholar
Abhyuday Mandal
View author publications
You can also search for this author in PubMed Google Scholar
Nancy Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Arnold
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.Z. and J.A. performed experimental design. S.Z., L.S., C.F., and J.C. carried out image acquisition. S.Z., L.S., B.L., J.C., A.B., Y.G., A.P., C.F., S.S., and N.J. worked on image annotation. L.S., B.L., and I.T. executed data analysis. Y.W., M.S., and S.Z. implemented computer vision model. S.Z., A.M., and J.A. developed mixed linear models. S.Z., Y.W., M.S. and J.A. wrote the manuscript. Y.G. and J.A. managed the project.

Corresponding author

Correspondence to Jonathan Arnold.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, S., Wu, Y., Skaro, M. et al. Computer vision models enable mixed linear modeling to predict arbuscular mycorrhizal fungal colonization using fungal morphology. Sci Rep 14, 10866 (2024). https://doi.org/10.1038/s41598-024-61181-5

Download citation

Received: 29 June 2023
Accepted: 02 May 2024
Published: 13 May 2024
DOI: https://doi.org/10.1038/s41598-024-61181-5

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A spatial transcriptome map of the developing maize ear

Seedling root system adaptation to water availability during maize domestication and global expansion

Land conversion to agriculture induces taxonomic homogenization of soil microbial communities globally

Introduction

Results

Performance of Mask R-CNN on AMF image segmentation

From image segmentation to measures of fungal morphology

Using mixed linear models to predict total AMF abundance with fungal structure morphology

The rationale for mixed effects

Mixed linear model assumptions

The null model (Model 1)

Random intercept model with experimental design variable (Model 2)

Random intercept + fixed slope model with AMF structure predictors (Model 3, 4, 5)

Random intercept + random slope model with AMF structure predictors (Model 6)

Arbuscule morphology is predictive of differential AMF percent root colonization in sibling plants

The plant level variance component in count is stable across models: loss of goodness of fit in random effects is offset by a gain in goodness of fit to fixed effects of fungal structures

Differential AMF colonization between sorghum root regions

Discussion

Preferential AMF colonization in sorghum roots

Plant level variance of AMF colonization

Differential AMF colonization between sorghum inbred lines

Role of models across scales to understand the AMF symbiosis with plants

Materials and methods

The Georgia dataset

Plant cultivation

Root imaging

The Cambridge dataset

Image annotation

Data cleaning

Mask R-CNN model training

Mixed linear model prediction and statistical analysis

Computational resources

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

Search

Quick links