Fully-automated root image analysis (faRIA)

Narisetti, Narendra; Henke, Michael; Seiler, Christiane; Junker, Astrid; Ostermann, Jörn; Altmann, Thomas; Gladilin, Evgeny

doi:10.1038/s41598-021-95480-y


Article
Open access
Published: 06 August 2021


Scientific Reports volume 11, Article number: 16047 (2021)

Abstract

High-throughput root phenotyping in soil has become an indispensable quantitative tool for assessing the effects of climatic factors and molecular perturbation on plant root morphology, development and function. To efficiently analyse large amounts of structurally complex soil-root images, advanced methods for automated image segmentation are required. Because the intensities of fore- and background regions often unavoidably overlap, simple thresholding methods are generally not suitable for the segmentation of root regions. Higher-level cognitive models such as convolutional neural networks (CNN) can segment roots from heterogeneous and noisy background structures; however, they require a representative set of manually segmented (ground truth) images. Here, we present a GUI-based tool for fully automated quantitative analysis of root images using a pre-trained CNN model, which relies on an extension of the U-Net architecture. The developed CNN framework was designed to efficiently segment root structures of different size, shape and optical contrast using low-budget hardware systems. The CNN model was trained on a set of 6465 masks derived from 182 manually segmented near-infrared (NIR) maize root images. Our experimental results show that the proposed approach achieves a Dice coefficient of 0.87 and outperforms existing tools (e.g., SegRoot, with a Dice coefficient of 0.67) when applied not only to NIR but also to other imaging modalities and plant species, such as barley and arabidopsis soil-root images from LED-rhizotron and UV imaging systems, respectively. In summary, the developed software framework enables users to efficiently analyse soil-root images in an automated manner (i.e. without manual interaction with data and/or parameter tuning), providing quantitative plant scientists with a powerful analytical tool.

Introduction

Image-based high-throughput phenotyping of roots is one of the emerging disciplines in plant phenomics. It aims to extract plant morphological and physiological properties in a non-destructive manner in order to study plant performance under given conditions¹. Traditional approaches to root phenotyping have relied on destructive methods and artificial growth media such as liquids or gels^2,3. However, root growth is known to depend on physical conditions⁴, and roots in such studies have shown responses that are not typical of roots grown in soil^5,6.

More recently, non-destructive methods such as X-ray computed tomography^7,8, nuclear magnetic resonance (NMR) microscopy⁹ and laser scanning¹⁰ have provided unique insights into the 3D organization of living root architecture; however, their throughput capabilities are presently rather limited. Moreover, minirhizotrons^11,12 and rhizotron systems^13,14 have gained popularity because they enable non-invasive imaging of roots in a soil environment. However, minirhizotrons require repeated photographing of roots through the transparent surface of below-ground observation tubes¹⁵. In contrast, rhizotron systems contain rectangular glass pots, which require only a single photograph of the roots¹⁶. Recently, near-infrared (NIR) imaging of roots growing along transparent pots was presented in our previous works^17,18. These systems contain special low pass filters to block root exposure to visible light, and the images are taken by a NIR camera under suitable illumination.

Due to the high level of optical soil heterogeneity, soil-root images exhibit a relatively low contrast between back- and foreground structures. Consequently, at the local scale, root and soil pixels cannot be distinguished on the basis of their intensity values alone. Several root image analysis solutions were suggested in the past; however, most of them were designed for a specific imaging system^19,20,21,22,23. Examples of general-purpose semi-automated tools include GiA Roots²⁴, IJ-Rhizo²⁵ and our previously published saRIA software²⁶. All these tools rely on thresholding and morphological filtering techniques to segment the roots from the background. Other root phenotyping solutions like SmartRoot^27,28 require manual segmentation by placing multiple landmarks along the roots that are subsequently interconnected to form the root skeleton. All the above software solutions are time-consuming, have limited throughput capabilities, and require expertise in parameter tuning.

To overcome the limitations of existing methods, automated solutions are required for high-throughput root image segmentation and phenotyping. In the last 5 years, deep learning has gained considerable attention, especially in computer vision applications, because of its ability to directly extract and train relevant multi-level features from data without prior knowledge and human effort in feature design. Convolutional neural networks (CNNs) are a class of deep learning approaches that have been shown to outperform traditional methods in many computer vision applications associated with higher-level cognitive abilities²⁹. CNNs have been shown to outperform conventional approaches when applied to traditionally difficult tasks of image analysis, including pattern detection and object segmentation in biomedical images^30,31, traffic scenes³² and remote sensing³³. In recent years, they have also been used for high-throughput plant phenotyping, such as detection of wheat roots grown on germination paper³⁴, segmentation of roots from soil in X-ray tomography³⁵ and segmentation of spikes in wheat plants³⁶. However, most of these works present exemplary applications and/or computational frameworks that can hardly be handled by end-users without advanced programming skills.

The focus of this work is on semantic segmentation of soil-root images, by which root pixels are automatically segmented from soil regions. For this kind of approach, CNNs often use an encoder–decoder architecture. To date, several papers have been published on this type of CNN architecture for biomedical^30,31 and areal applications^32,33. Moreover, architectures of this type are constantly being improved by cascading or fusing CNNs in biomedical^37,38 and remote sensing applications³⁹.

Application of CNNs to automated image analysis and plant phenotyping has become an emerging trend in quantitative plant sciences in recent years⁴⁰. However, reliable software tools suitable for a particular plant type are rarely available due to the large variability of optical plant appearance, differences between experimental setups^35,40, and the absence of labelled ground truth data^41,42. Consequently, only a few software tools for high-throughput plant image analysis and phenotyping are presently known.

Previously published state of the art encoder–decoder CNN solutions for root image segmentation include RootNav 2.0⁴³, SegRoot⁴⁴ and RootNet⁴⁵. Among those, RootNav 2.0 and RootNet tools were primarily developed for particular experimental setups such as roots grown on germination paper with high contrast between root and (blue) background pixels, and, thus, cannot be expected to perform accurately by application to other imaging modalities such as noisy soil-root images in this work.

Among the above-mentioned tools, SegRoot appears to be the most suitable one for soil-root image segmentation, as it was previously shown to be capable of segmenting roots from the soil background in minirhizotron systems. Moreover, the architecture of SegRoot is somewhat similar to U-Net, and it transfers the location of feature maps to the decoder for image segmentation. However, this approach failed to detect fine, blurry and low-contrast roots, which, in turn, compromises the accuracy of resulting phenotypic traits such as estimated root biomass and other geometric features. To overcome these limitations, we adopted a U-Net³⁰ based encoder–decoder architecture which transfers both location and pixel information of the feature maps to the decoder. This architecture is also especially useful when obtaining a large amount of manually annotated data is challenging, as is often the case in biomedical applications.

The aim of this work is to develop an efficient and handy tool for fully automated root image segmentation and quantification using a pre-trained deep CNN framework which can be used in a straightforward manner even by unskilled users. Although our approach relies on supervised model training, for end-users such model-based image analysis is performed in a fully automated manner (i.e. without interaction with data and/or parameter tuning), in contrast to purely manual or semi-automated image segmentation approaches where such interactions are required. Consequently, we termed this approach fully-automated root image analysis (faRIA). The main contributions of this work include:

Development of a CNN approach to automated root image segmentation based on the U-Net architecture from³⁰,
Training and application of the CNN model for efficient segmentation of root structures of different size, shape and optical contrast on low budget hardware systems using image masking approach,
Evaluation and comparison of our CNN model vs. other state-of-the-art tools for root image analysis using the Dice similarity metrics,
Evaluation of our CNN framework performance on images of different root imaging modalities,
Development of a GUI based front-end for efficient handling of the algorithmic framework suitable also for IT-unskilled users.

The paper is structured as follows: first, we describe the methodological framework of the proposed U-Net based deep learning algorithm and the performance metrics for soil-root segmentation. Then, the experimental setup, consisting of data preparation, training and prediction procedures, is briefly discussed. Next, the results of the experimental investigation are presented, including a comparison of faRIA performance with other image segmentation tools, performance on resized images and robustness when applied to other image modalities and plant species. In the discussion, we summarize the results of an evaluation study using faRIA image segmentation and present its GUI implementation for efficient application in high-throughput root phenotyping.

Methods

Deep CNN model for root image segmentation

The proposed CNN architecture is derived from the original U-Net³⁰, which provides a versatile framework for semantic image segmentation consisting of encoder and corresponding decoder units. Our CNN model has a depth of 3, which is less than the original U-Net depth of 4, due to the smaller input image size. Further, in our approach batch normalization⁴⁶ is applied after each convolutional layer, which was not the case in the original U-Net architecture. Batch normalization was chosen because it is known to make model training faster and more stable^46,47. Furthermore, the original U-Net³⁰ used a dropout layer, which we avoided because in some cases the combination of batch normalization and dropout layers can cause worse results⁴⁸. Also, the kernel size of the convolutional layers was set larger in our approach than in the original U-Net to improve the continuity in segmentation of roots⁴⁹. The details of the convolutional parameters in comparison to the original U-Net are summarized in Table 1.

Table 1 Convolutional parameters of the original U-Net and proposed modifications.

Motivated by the encoder–decoder architecture of U-Net, a network framework for soil-root image segmentation was constructed, see Fig. 1. In particular, our network was designed to be trained on patches of input images in original resolution. This enables model training on a larger amount of ground truth data using consumer GPUs while preserving high-frequency image information, which would otherwise be lost either by restricting the training set to the maximum capacity of the GPU RAM or by image downscaling. Furthermore, training of a CNN on image patches instead of full-size images is known to be more advantageous for learning local features⁵⁰. Therefore, the architecture was designed with input and output layers of size $256\times 256$. In what follows, the details of the network encoder and decoder layers are described.

Encoder network: The encoder network consists of 3 encoder blocks. The first encoder block takes image patches of size $256\times 256$ as input and produces corresponding feature maps of size ($256\times 256\times 16$) as output. The feature maps are then forwarded to the second and third encoder blocks to generate further feature maps for root detection. Each encoder block consists of two convolutional layers that learn feature maps at the respective level, where each convolutional layer consists of a $7\times 7$ convolution filter followed by batch normalization⁴⁶ and a non-linear activation function called Rectified Linear Unit (ReLU)⁵¹. Here, batch normalization improves the network performance and stability by normalizing the feature maps at the respective levels⁴⁶. Each encoder block is followed by a max-pooling operation with the commonly used window size of $2\times 2$^50,52, which down-samples the feature maps to half of their original size. As a result, aggregate features are generated more efficiently. The three encoder blocks use depths of 16, 32 and 64, respectively, to detect diverse root features. The details of each encoder block and the corresponding max-pool layers are given in Table 2.

The encoder network is followed by a bridge encoder block without a max-pooling layer, which generates 128 feature maps, each of size $32\times 32$.
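The feature-map sizes described above can be traced with a few lines of arithmetic. The following is only a minimal sketch, assuming that all convolutions preserve the spatial size (i.e. 'same' padding, which is consistent with the $256\times 256\times 16$ output of the first block) so that only the $2\times 2$ max-pooling changes height and width. The function name `encoder_shapes` is hypothetical and not part of faRIA.

```python
def encoder_shapes(input_size=256, depths=(16, 32, 64), bridge_depth=128):
    """Trace (height, width, channels) through the encoder blocks and the bridge.

    Assumption: convolutions keep the spatial size ('same' padding), so only
    the 2x2 max-pooling after each encoder block halves height and width.
    """
    shapes = []
    size = input_size
    for depth in depths:
        # Two convolutional layers produce `depth` feature maps at this level.
        shapes.append(("encoder", (size, size, depth)))
        # 2x2 max-pooling halves the spatial size and keeps the channel count.
        size //= 2
        shapes.append(("max-pool", (size, size, depth)))
    # The bridge block has no max-pooling, so the spatial size is unchanged.
    shapes.append(("bridge", (size, size, bridge_depth)))
    return shapes


for name, shape in encoder_shapes():
    print(name, shape)
```

With the default arguments the last entry is the bridge output of size $32\times 32\times 128$ stated in the text.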

Decoder network: The output from the bridge encoder ($32\times 32\times 128$) is upsampled using a $3\times 3$ transpose convolution with same padding and stride 2. This means that the size of the feature maps ($32\times 32\times 128$) is doubled to ($64\times 64\times 128$) by applying a filter of size $3\times 3$ to all input elements, with border elements computed using zero padding. The resulting feature map is then concatenated with the corresponding encoder feature maps, which generates feature maps of size ($64\times 64\times 256$). These are passed through convolutional layers like those of an encoder block, but with the channel depth decreased to 64. This process is repeated for the remaining decoder blocks with channel depths decreasing to 32 and 16. The details of each decoder block and the corresponding transpose layer outputs are given in Table 3. Finally, the output of the final decoder block is fed into a convolutional layer of size $1\times 1\times 1$ with “Softmax” activation function⁵³ to classify each pixel as root or non-root at the patch level. The output of the proposed architecture is a predicted mask of size $256\times 256$, like the input image patch, as shown in Fig. 1.

Table 2 Details of all encoder blocks and corresponding max-pool layer output.

Table 3 Details of all decoder blocks and corresponding transpose convolutional layers.

Performance metrics

To evaluate the performance of the proposed U-Net model during the training and testing stages, the Dice coefficient (DC)⁵⁴ is used. It measures the overlap between the model and ground truth segmentations, and its value ranges from 0 to 1, where 1 corresponds to a perfect ($100\%$) match and 0 to a completely false segmentation. The Dice coefficient is defined as:

$$\begin{aligned} DC = \frac{2\,|P \cap G|}{|P| + |G|} = \frac{2\sum _{i}^{N} P_i G_i}{\sum _{i}^{N} P_i + \sum _{i}^{N} G_i}, \end{aligned}$$

(1)

where P and G are the predicted and ground truth binary images, respectively, N is the total number of pixels, and $P_i$ and $G_i$ are the values (0 or 1) of pixel i in the predicted and ground truth binary images, respectively. The above equation can also be rewritten as follows:

$$\begin{aligned} DC = 2 * \frac{{\text {precision}} * {\text {recall}}}{{\text {precision}} + {\text {recall}}}. \end{aligned}$$

(2)

Eq. (2) expresses the DC in terms of precision and recall of the root class. This is relevant because root images typically contain significantly more background pixels than root pixels, so a model is likely to overestimate soil pixels and underestimate root pixels in the segmented image. Here, precision is the ratio of correctly predicted root pixels to the number of pixels predicted to be root, and recall is the ratio of correctly predicted root pixels to the number of actual root pixels in the image.
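As an illustration of Eqs. (1) and (2), the following sketch computes the Dice coefficient once from the pixel sums and once from precision and recall, and shows that both agree on a toy example. It is not the faRIA implementation: the function names and the eight-pixel masks are hypothetical, and the code assumes that both masks contain at least one root pixel (otherwise the denominators would be zero).

```python
def dice_from_masks(pred, truth):
    """Eq. (1): 2 * sum(P_i * G_i) / (sum(P_i) + sum(G_i)) for flat 0/1 masks."""
    intersection = sum(p * g for p, g in zip(pred, truth))
    return 2 * intersection / (sum(pred) + sum(truth))


def dice_from_precision_recall(pred, truth):
    """Eq. (2): harmonic mean of precision and recall of the root class."""
    tp = sum(1 for p, g in zip(pred, truth) if p == 1 and g == 1)
    fp = sum(1 for p, g in zip(pred, truth) if p == 1 and g == 0)
    fn = sum(1 for p, g in zip(pred, truth) if p == 0 and g == 1)
    precision = tp / (tp + fp)  # correct root pixels / predicted root pixels
    recall = tp / (tp + fn)     # correct root pixels / actual root pixels
    return 2 * precision * recall / (precision + recall)


# Hypothetical flattened masks (1: root, 0: non-root), for illustration only.
pred = [1, 1, 0, 0, 1, 0, 0, 0]
truth = [1, 0, 0, 0, 1, 1, 0, 0]

print(dice_from_masks(pred, truth))
print(dice_from_precision_recall(pred, truth))
```

In this example there are two true positives, one false positive and one false negative, so both functions return 2/3. True negatives (correctly predicted background pixels) do not enter either formula.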

Ethical approval

All protocols involving plants adhered to the ethical guidelines for plant usage, which were followed while conducting the experiments.

Experimental setup

Data and image annotation

Near-infrared (NIR) images of maize plant roots grown in soil were captured using the IPK plant phenotyping system for large plants¹⁷. Images were taken by one side-view 12MP monochrome camera (UI-5200SE-M-GL, IDS) with a chip sensitive in the NIR portion of the electromagnetic spectrum and a suitable distortion-free lens (V1228-MPY). The system also includes a homogeneous infrared LED light source (850 nm) and filters preventing reflections during image acquisition. In brief, plants were grown in rhizopots [$342\times 350$ mm (${\mathrm{W}}\times {\mathrm{L}}$)] filled with the potting substrate (Potgrond P, Klasmann).

200 greyscale root images of maize plants acquired with the IPK plant phenotyping system were selected for the ground truth segmentation. This labelling task was performed by agronomists using our previously published software for semi-automated root image analysis (saRIA)²⁶, which provides an efficient graphical user interface for tuning parameters of image segmentation, including intensity threshold, morphology and noise removal, to generate an accurate segmentation of roots in soil. The images acquired with the above imaging system have a resolution of $2345\times 2665$. A detailed root annotation with saRIA took approximately 5–10 min per image, depending on the amount of root pixels in the image. Figure 2 shows an example of IPK plant phenotyping system images and their corresponding binary segmentation using saRIA. This binary mask contains all roots as foreground in white and the remaining pixels as background in black.

To enable application of the proposed model to a broad range of root imaging modalities, the model originally developed for NIR root image segmentation was applied to LED-based rhizotron and ultraviolet (UV) imaging systems^18,26. Such an approach is feasible because root structures in both image modalities exhibit large similarities. The rhizotron system contains a root camera (Allied Vision Prosilica GT 6600) and uses white LED illumination to image the roots growing in soil along plexiglass plates. The UV system contains two monochrome UV-sensitive cameras (UI-5490SE-M-GL, IDS) with two sets of LED illumination panels (UV, 380 nm) in a custom-made imaging box. It is suitable for capturing small plants in transparent pots of size [$77\times 77\times 97$ mm (${\mathrm{W}}\times {\mathrm{L}}\times {\mathrm{H}}$)] filled with the potting substrate (Potgrond P, Klasmann). This system allows non-invasive acquisition of root images in darkness¹⁸.

Training

The proposed U-Net model was developed under Python 3.6.1 using the TensorFlow⁵⁵ library with the Keras API^56,57. Image processing functions such as cropping and morphological operations (dilation, erosion) were implemented using the PIL, Numpy⁵⁸ and Scikit-Image⁵⁹ packages. The model was then trained on a Linux system (Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz) with an NVIDIA Tesla P100-PCIE-16GB graphics card.

Images analysed in this work contain thin and fine root structures that may be only one or a few pixels wide. To preserve such fine structures, the binary masks were dilated, similar to the strategy applied in SegRoot⁴⁴. The original $2345\times 2665$ root images of maize plants are analysed step-wise using $256\times 256$ crop masks. To this end, the original image edges were padded with zeros so that both width and height are divisible by 256, which increases the image size to $2560\times 2816$. Each image is then partitioned into 110 non-overlapping $256\times 256$ crop masks, giving approximately 20,000 crop masks for all images. However, 2/3 of those crop masks contain only background structures, which teach the network nothing but background appearance. To avoid a potential imbalance between plant and non-plant training masks, only $256\times 256$ cropped regions containing both root and background pixels were selected from 182 original images. Each cropped image is then normalized to the range [0, 1] for feature consistency in the CNN.
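The padding and tiling arithmetic of this paragraph can be checked with a short sketch. The helper names (`padded_size`, `tile_count`, `has_root_and_background`) are hypothetical and only illustrate the described steps; they are not taken from the faRIA code.

```python
import math

TILE = 256  # patch size used for training and prediction


def padded_size(dim_a, dim_b, tile=TILE):
    """Round both image dimensions up to the next multiple of the tile size."""
    return (math.ceil(dim_a / tile) * tile, math.ceil(dim_b / tile) * tile)


def tile_count(dim_a, dim_b, tile=TILE):
    """Number of non-overlapping tiles covering the zero-padded image."""
    pad_a, pad_b = padded_size(dim_a, dim_b, tile)
    return (pad_a // tile) * (pad_b // tile)


def has_root_and_background(mask_patch):
    """Keep a patch for training only if its binary mask holds both classes."""
    values = {v for row in mask_patch for v in row}
    return values == {0, 1}


print(padded_size(2345, 2665))  # (2560, 2816)
print(tile_count(2345, 2665))   # 110
```

For 182 images this gives 182 × 110 = 20,020 tiles, in line with the approximately 20,000 crop masks mentioned above; the selection step then discards tiles whose mask is entirely background (or entirely root).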

Subsequently, the data set was partitioned into training and validation sets in the ratio of 85:15. The training set is used to optimize the proposed model with the Adam optimizer⁶⁰ such that the weight parameters improve the segmentation performance of the model. The initial weights of the network were drawn randomly, as proposed by Krizhevsky et al.⁶¹, with mean 0 and standard deviation 0.05. The model was trained for a maximum of 200 epochs with 16 convolutional feature channels and a batch size of 128, as permitted by system constraints. The loss function quantifies the difference between the predicted output and the ground truth generated by saRIA during training, and it is reduced by updating the weights of the network in an iterative manner. Here, the commonly used “binary cross-entropy loss” function⁵⁰ is used to predict the binary class label (i.e., root or non-root) at the patch level. This function compares each pixel prediction (0: non-root, 1: root) to the corresponding ground truth pixel and averages the loss over all pixels to compute the total loss of the image. Therefore, each pixel contributes to the overall objective loss function. The learning rate of the Adam optimizer⁶⁰ was then selected from a range of values (0.00001, 0.0001, 0.001, 0.1, 1 and 10) while monitoring the training and validation Dice coefficient of the model.
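The pixel-wise averaging of the binary cross-entropy loss described above can be written out as follows. This is a plain-Python sketch for illustration, not the Keras loss used for training; the clipping constant `eps` and the example values are assumptions introduced here to avoid taking the logarithm of zero.

```python
import math


def binary_cross_entropy(pred_probs, truth, eps=1e-7):
    """Mean binary cross-entropy over all pixels of a flattened patch.

    pred_probs: predicted root probabilities in [0, 1]
    truth:      ground truth labels (0: non-root, 1: root)
    eps:        clipping constant (an assumption of this sketch)
    """
    total = 0.0
    for p, g in zip(pred_probs, truth):
        p = min(max(p, eps), 1.0 - eps)  # keep log() finite
        total += -(g * math.log(p) + (1 - g) * math.log(1.0 - p))
    return total / len(truth)  # every pixel contributes equally


# Hypothetical two-pixel patch: one root pixel and one soil pixel.
good = binary_cross_entropy([0.9, 0.1], [1, 0])
bad = binary_cross_entropy([0.4, 0.6], [1, 0])
print(good, bad)
```

A confident and correct prediction (`good`) yields a loss of -ln(0.9), about 0.105, whereas the less accurate prediction (`bad`) yields a larger value, which is what the optimizer reduces by updating the weights.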

Prediction

As stated in the image annotation subsection, the images from the IPK plant phenotyping system have an original resolution of $2345\times 2665$, while the proposed U-Net model requires input images of size $256\times 256$. In the preprocessing stage, zero padding is applied to the test images in the same way as in the training stage. Non-overlapping $256\times 256$ masks are then generated. The model makes predictions on these $256\times 256$ masks, which are then combined into one single output image. Finally, the zero-padded pixels are removed, and a segmented image with a resolution identical to the original input image is generated. This complete process is dynamic and automated in the prediction stage, as shown in Fig. 3. Since the output layer is given by the Sigmoid activation function, the predicted segmentation is a probability map with values ranging between 0 and 1. Hence, the generated probability map is converted to a binary image using a threshold T. Here, a relatively high threshold was chosen, i.e. pixels with probability $\ge$ 0.9 are classified as root, to avoid misclassification in the soil-root image segmentation. After fully automated segmentation, the proposed model performs phenotyping of the segmented root structures similar to saRIA²⁶.
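The post-processing steps of the prediction stage (combining patch predictions, removing the padding and thresholding) can be sketched as below. This is an illustrative sketch only: the function names are hypothetical, images are represented as nested lists, and it assumes that patches are ordered row by row and that the zero padding was added at the bottom and right edges, which the text does not specify.

```python
def stitch_patches(patches, rows, cols, tile):
    """Combine row-major ordered tile x tile patches into one padded image."""
    full = []
    for r in range(rows):
        for y in range(tile):
            line = []
            for c in range(cols):
                line.extend(patches[r * cols + c][y])
            full.append(line)
    return full


def crop(image, height, width):
    """Remove padding, assuming it was added at the bottom and right edges."""
    return [row[:width] for row in image[:height]]


def binarize(prob_map, threshold=0.9):
    """Classify pixels with probability >= threshold as root (1)."""
    return [[1 if p >= threshold else 0 for p in row] for row in prob_map]


# Toy example: a 3x3 image padded to 4x4 and predicted as four 2x2 patches.
patches = [
    [[0.95, 0.20], [0.10, 0.90]],
    [[0.50, 0.00], [0.99, 0.00]],
    [[0.30, 0.91], [0.00, 0.00]],
    [[0.89, 0.00], [0.00, 0.00]],
]
prob_map = crop(stitch_patches(patches, rows=2, cols=2, tile=2), 3, 3)
print(binarize(prob_map))  # [[1, 0, 0], [0, 1, 1], [0, 1, 0]]
```

Note that the pixel with probability 0.89 is classified as background, which illustrates how the high threshold favours precision over recall.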

In practice, the end-users prefer to have an easy-to-use software solution including the Graphical User Interface (GUI). Therefore, a user-friendly GUI front-end was developed under the MATLAB 2019b environment⁶² to comfortably operate the complex algorithmic framework of faRIA software. Figure 3 shows the complete workflow involved in faRIA for automatic root segmentation and trait extraction. For import of deep learning models trained under Python the MATLAB interoperability routine importKerasNetwork⁶² was used. According to specification of this function, the U-Net models trained in Python were exported in the so-called h5 file format, which is supported by the recent versions of MATLAB including 2019b.

In addition to training on $256\times 256$ cropped masks, the proposed U-Net model was extended to train on full images. This model has input and output images of size $1024 \times 1024$, as permitted by our system constraints. Accordingly, the original and ground truth images were resized to $1024\times 1024$ using bi-linear interpolation⁶³. The model also contains an additional encoder block and an additional decoder block with convolution masks of size $5\times 5$ in their respective networks. The encoder network therefore reduces the feature maps from size $1024\times 1024 \times 1$ to $32\times 32\times 128$, and the decoder network reverses this. To distinguish the two networks, the proposed U-Net models on 256 and 1024 masks are named faRIA:256 and faRIA:1024, respectively.

Results

Training and validation of faRIA

As discussed above, training and validation of the faRIA:256 model were performed on a total of 6465 image patches, split in the ratio of 85:15 between training and test images, respectively. The performance of the trained model is analysed using binary cross-entropy loss, Dice coefficient, precision and recall at each epoch during the learning stage of the network. Figure 4 shows the training and validation performance of faRIA:256 over 200 epochs. The training loss (Fig. 4a) was minimized and flattened out near zero after epoch number 140. Simultaneously, training DC, precision and recall were maximized and exceeded 90% from epoch number 100 onwards. However, the generalization performance of the model is measured using the validation parameters. Figure 4b shows that the proposed model achieved a maximum validation Dice coefficient of 0.874 and a minimum validation loss of 0.033 at epoch number 71.

Evaluation of faRIA versus SegRoot

To compare the performance of the faRIA:256 model with existing tools, SegRoot⁴⁴ was trained on the same image data set. For this purpose, the SegRoot model was trained on $256\times 256$ image blocks for 200 epochs with the best practical parameters of depth 5 and width 8, as suggested in Wang et al.⁴⁴. In addition, to validate the performance of the proposed model on full images instead of $256\times 256$ blocks (faRIA:256), faRIA:1024 was proposed. The faRIA:1024 model was trained for 200 epochs with training configurations similar to faRIA:256. Tables 4 and 5 show the training parameters and performance measures of faRIA:256 with respect to SegRoot and faRIA:1024.

Table 4 Training parameters of SegRoot, faRIA:1024 and faRIA:256 over 200 epochs.

Table 5 Training performance of SegRoot, faRIA:1024 and faRIA:256 over 200 epochs.

Following the evaluation of training performance, the three models above were applied to an exemplary test image, see Fig. 5. The faRIA:256 model achieved a DC of 0.83, whereas SegRoot and faRIA:1024 achieved 0.42 and 0.44, respectively. The marginal artefacts present in faRIA:1024 and faRIA:256 compared to the ground truth are shown in Fig. 6.

Segmentation of further image modalities

The faRIA:256 model, originally trained on maize plant roots from the IPK plant phenotyping system, was applied to images from the LED-based rhizotron and UV imaging systems for segmentation of roots from soil. Figures 7 and 8 show the DC of the faRIA:256 model for 40 barley and 30 arabidopsis root images from the rhizotron and UV imaging systems, with mean DC of 0.85 and 0.68, respectively. Exemplary segmentations of a rhizotron image (image number 4 in Fig. 7) and a UV image (image number 6 in Fig. 8) are shown in Figs. 9a–c,e and 10a–c,e, respectively. Here, the faRIA:256 model achieved DC of 0.87 and 0.79 for the rhizotron and UV image, respectively, compared to the ground truth generated by saRIA. In addition, the performance of SegRoot on the same rhizotron and UV images compared to the ground truth is shown in Figs. 9d,f and 10d,f, respectively. Here, false negative (green) and false positive (pink) pixels represent the undetected and falsely classified root pixels in the predicted segmentation compared to the ground truth.

Evaluation of phenotypic traits versus saRIA

In addition to the segmentation performance, the phenotypic traits obtained with faRIA were also evaluated in comparison to saRIA. Here, the coefficient of determination $R^2$ and the p value are used to measure how closely the traits calculated by faRIA agree with the ground truth (from saRIA) and to assess the significance of this agreement, respectively. Figure 11 shows the correlation between the saRIA (x-axis) and faRIA (y-axis) outputs for four traits, where each point denotes one particular image out of 40 barley root images from the rhizotron imaging system. Out of 75 traits, only four traits important for root biomass calculation are presented for faRIA evaluation: total root area, total root length, total root surface area and total root volume. Further information on the definition of traits is included in the Supplementary Information, see Table S1. Figure 11 shows that correlations between traits calculated with saRIA and faRIA are highly significant and exhibit $R^2$ values greater than 0.98, 0.97, 0.98 and 0.98 and p values 1.59e−40, 5.01e−38, 7.63e−42, and 5.13e−42, respectively.
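For readers who wish to reproduce this kind of comparison on their own trait tables, a minimal sketch of the $R^2$ computation is given below. It assumes that $R^2$ is the squared Pearson correlation of a simple linear fit between the saRIA and faRIA values of one trait, which the text does not state explicitly. The function name and the trait values are hypothetical and are not data from this study.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two equally long value lists.

    Assumption: R^2 of a simple linear regression of y on x, which equals
    the squared Pearson correlation coefficient.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return (s_xy * s_xy) / (s_xx * s_yy)


# Hypothetical total root area values (pixels) for five images.
saria_area = [1200, 3400, 5100, 7800, 9900]
faria_area = [1150, 3500, 5000, 8000, 9700]
print(r_squared(saria_area, faria_area))
```

Each trait would be evaluated separately in this way, with one value pair per image.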

Graphical user interface and runtime

Figure 12 shows the GUI of faRIA software which is freely available as a precompiled executable program from https://ag-ba.ipk-gatersleben.de/faria.html. In addition to fully automated image segmentation, faRIA calculates 75 root traits that are categorized into 12 feature groups named area (number of root pixels), number of disconnected root objects, total length, surface area, volume, number of branching and ending points, statistical distribution (mean, median, standard deviation, skewness, kurtosis, percentile and bootstrap) of root geometry in horizontal and vertical direction, width, orientation and convex-hull. In the present release, the phenotyping module of faRIA is identical to our saRIA software²⁶. Further information on definition of traits is included in the Supplementary Information, see Table S1.

The faRIA software provides users with an option to select the faRIA:256 or faRIA:1024 model depending on image quality, time and accuracy. The faRIA software can analyse a single image or a large image data set to automatically detect and extract multiple root traits. Regarding timing performance, faRIA segmentation, root tracing and trait calculation together take, on average, 80 s using the faRIA:256 model and 15 s using the faRIA:1024 model to process and analyse a 6-megapixel (cropped) image on a system with an Intel(R) Xeon(R) Gold 6130 CPU @2.10GHz. Therefore, faRIA:1024 is at least 3 times faster than faRIA:256 for root image analysis.

Discussion

Our experimental results on different plant species from different imaging systems have demonstrated a remarkable accuracy of the adopted U-Net model for fully automated soil-root image segmentation. During the training stage, the faRIA:256 model achieved nearly zero loss and $\ge$ 95% accuracy measured by the Dice coefficient (DC) over 200 epochs, see Fig. 4. When applied to the test images, the best performance was found at epoch number 71, with the maximum DC of 0.874 and minimum loss of 0.033. For larger numbers of epochs, the validation error was only marginally higher. However, precision and recall diverge from each other at epochs with low DC, and both reached their maximum at epoch number 71. Therefore, the network weights and optimization parameters at epoch number 71 were adopted as the best model for soil-root image segmentation.

The performance of the faRIA:256 model was compared with that of SegRoot. From the summary in Table 5, it is evident that faRIA:256 significantly outperforms SegRoot on our data set, improving the cross-entropy loss by a factor of 10 and the DC by 20%. We attribute this result to the fact that the SegRoot model transfers only max-pooling indices (i.e., locations in the feature maps) from encoder to decoder for feature concatenation and reconstruction, whereas our U-Net model transfers the complete feature map information (i.e., both locations and pixel values) to the decoder. This leads to the detection of both primary and secondary low-contrast roots and to an improved DC in comparison to SegRoot, see Fig. 5. However, the additional information transferred in U-Net makes the decoder path more expensive, so that the model requires more memory (9.47 MB) than SegRoot (1.49 MB).
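The difference between transferring pooling indices and transferring complete feature maps can be illustrated on a single small feature map. The following NumPy sketch is a simplified illustration under stated assumptions: it uses one channel, $2\times 2$ max pooling and nearest-neighbour upsampling as a stand-in for the learned up-convolution; all function names are hypothetical and do not correspond to the SegRoot or faRIA implementations.

```python
import numpy as np

def max_pool_2x2(x):
    """2x2 max pooling; returns the maxima and the argmax position (0..3) in each window."""
    h, w = x.shape
    windows = x.reshape(h // 2, 2, w // 2, 2).transpose(0, 2, 1, 3).reshape(h // 2, w // 2, 4)
    return windows.max(axis=-1), windows.argmax(axis=-1)

def unpool_with_indices(pooled, idx):
    """Index-only transfer: each maximum returns to its stored position, zeros elsewhere."""
    ph, pw = pooled.shape
    windows = np.zeros((ph, pw, 4), dtype=pooled.dtype)
    np.put_along_axis(windows, idx[..., None], pooled[..., None], axis=-1)
    return windows.reshape(ph, pw, 2, 2).transpose(0, 2, 1, 3).reshape(2 * ph, 2 * pw)

def skip_concat(pooled, encoder_map):
    """Full feature-map transfer: upsampled decoder input stacked with the encoder map."""
    upsampled = np.repeat(np.repeat(pooled, 2, axis=0), 2, axis=1)
    return np.stack([upsampled, encoder_map], axis=0)

# Toy encoder feature map with a faint, low-contrast diagonal structure.
encoder_map = np.array([[0.1, 0.3, 0.0, 0.0],
                        [0.2, 0.4, 0.0, 0.1],
                        [0.0, 0.3, 0.5, 0.0],
                        [0.0, 0.2, 0.6, 0.1]])

pooled, idx = max_pool_2x2(encoder_map)
sparse = unpool_with_indices(pooled, idx)   # what an index-only decoder receives
dense = skip_concat(pooled, encoder_map)    # what a skip-connection decoder receives

print(np.count_nonzero(sparse), "of", np.count_nonzero(encoder_map),
      "non-zero encoder values survive the index-only transfer")
print("skip connection input shape (channels, height, width):", dense.shape)
```

In this toy case only one value per pooling window reaches the decoder through the indices, whereas the concatenated input retains every encoder value at the cost of an additional channel, which mirrors the trade-off between accuracy and memory described above.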

In addition to the faRIA:256 model, which was trained on $256\times 256$ patches of the original large root images, the proposed U-Net architecture was also trained and validated on full images which, due to our hardware limitations, were downscaled to $1024\times 1024$ (the faRIA:1024 model). While both models demonstrated comparable accuracy in the training stage, faRIA:256 exhibits a better balance between precision and recall than faRIA:1024. The imbalance of faRIA:1024 is caused by pixels of intermediate intensity on the boundary between soil and root regions, which correspond to the average values calculated by downscaling. Pixels of intermediate intensity lead to false positive detections (Fig. 5b). This is particularly the case for the segmentation of thin root structures in downscaled images with the faRIA:1024 model.
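The effect of downscaling on thin structures can be reproduced with a few lines of NumPy. The sketch below assumes downscaling by plain block averaging, which is a simplification and not necessarily the resizing method used for faRIA:1024; the function name and the toy image are illustrative.

```python
import numpy as np

def downscale_mean(img, factor):
    """Downscale by averaging non-overlapping factor x factor blocks."""
    h, w = img.shape
    return img.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))

# Idealised 8x8 image: a bright one-pixel-wide root (1.0) on dark soil (0.0).
image = np.zeros((8, 8))
image[:, 3] = 1.0

small = downscale_mean(image, 4)        # 8x8 -> 2x2
mixed = (small > 0) & (small < 1)       # neither pure root nor pure soil

print(small)
print("pixels of intermediate intensity:", int(mixed.sum()))
```

After averaging, the thin root no longer has any pure root pixel: it is represented by pixels of intensity 0.25 that each cover a four-pixel-wide band of the original image. Labelling such a pixel as root marks mostly soil as root, which is one way in which false positives around thin roots can arise.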

Since roots and background regions exhibit similar structural properties in images of different modalities and plant species, our model, originally trained on NIR maize root images, could also be applied to barley and arabidopsis roots from LED-rhizotron and UV imaging systems, respectively. For rhizotron images, it achieved an accuracy of at least 80% for all images with the exception of image number 19 in Fig. 7. The overall mean DC = 0.85 indicates a fairly accurate segmentation of rhizotron images. Image number 19 exhibits a low DC due to the presence of high-intensity noise that resembles root structures. Moreover, our model preserves the thickness and continuity of the secondary roots better than SegRoot, as shown in Fig. 9e,f. As a result, the DC of faRIA for this rhizotron image (0.87) is higher than that of SegRoot (0.73).

For UV images, the accuracy of the faRIA:256 model ranged between 60 and 83% with a mean DC = 0.7, see Fig. 8. The relatively low DC for some UV images is due to the presence of diverse artefacts, including low contrast between the root architecture and heterogeneous soil regions and inhomogeneous scene illumination (i.e., a vertical intensity gradient). These artefacts result in inaccurate segmentation (pink pixels) of low-contrast structures and false detection of high-intensity background structures, as shown in Fig. 10. Nevertheless, faRIA:256 preserved the continuity of the root segmentation along root structures of varying contrast with a DC of 0.80 (Fig. 10e), whereas SegRoot produced discontinuous root structures with a DC of 0.67 (Fig. 10f). Thus, approximately 80% of the root pixels were correctly detected by faRIA:256 compared to the ground truth. Further examples of NIR, rhizotron and UV root image segmentation for juvenile or adult plants are given in the Supplementary Information (see Figs. S1–S6).

Furthermore, a direct comparison between phenotypic traits calculated with the semi-automated (saRIA) and fully automated (faRIA) approaches shows a highly significant correlation, which indicates that root image segmentation and phenotyping with faRIA is practically as good as the human-supervised one.
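A trait-wise comparison of this kind can be sketched as follows. The example assumes that agreement is quantified by the squared Pearson correlation coefficient; the trait values are hypothetical numbers chosen for illustration and are not measurements from this study.

```python
import numpy as np

def r_squared(x, y):
    """Squared Pearson correlation between two vectors of trait values."""
    r = np.corrcoef(np.asarray(x, dtype=float), np.asarray(y, dtype=float))[0, 1]
    return float(r * r)

# Hypothetical values of one trait (e.g. total root length in pixels) for four
# images, as obtained from a supervised and from an automated segmentation.
supervised_trait = [100.0, 200.0, 300.0, 400.0]
automated_trait = [110.0, 190.0, 310.0, 390.0]

r2 = r_squared(supervised_trait, automated_trait)
print(f"R^2 = {r2:.3f}")
```

A high value indicates that the two approaches rank and scale the images consistently; it does not by itself rule out a constant offset between the methods, which would have to be checked separately, for example by inspecting the differences directly.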

Further investigations with extended and/or augmented image data are required to improve the segmentation accuracy for other root images that were not included in the original training set. On the other hand, it cannot be excluded that training dedicated models with a narrow focus on a particular imaging modality and type of image structures could be a more reliable strategy for achieving more accurate results.

Conclusion

Automated segmentation and analysis of a large amount of structurally heterogeneous and noisy soil-root images is a challenging task whose solution is in high demand in quantitative plant science. Here, we present an efficient GUI-based software tool for fully automated soil-root image segmentation which relies on the U-Net CNN architecture trained on a set of 6465 masks derived from 182 manually segmented soil-root images. The proposed algorithmic framework is capable of efficiently segmenting root structures of different size, shape and contrast with a higher accuracy (DC = 0.87) than state-of-the-art solutions (SegRoot: DC = 0.67). Our experimental results showed that training the model with representative patches of root and background structures makes it possible to use a larger amount of ground truth data than training with the original full-size images. Accordingly, the faRIA:256 model trained on smaller masks outperforms the larger-mask model (faRIA:1024) with respect to the overall precision and recall in comparison with ground truth data. In addition to the NIR maize root images that were originally used for CNN model training, the faRIA tool can also be applied to other imaging modalities and plant species that exhibit similar structural properties of root and background regions. Besides root image segmentation, faRIA calculates a number of useful phenotypic traits that in our experimental studies were shown to exhibit a significant correlation ($R^2=0.98$) with the ground truth traits. While the present CNN framework was predominantly trained with regular soil-root images, further investigations are required to address such challenging problems as the segmentation of roots overlaid with large-scale noise (for example, due to water condensation) or the filling of artificial gaps in the root system that occur due to inhomogeneous scene illumination. Possible approaches to addressing these problems include, for example, appropriate augmentation of the training data set and/or alternative CNN models.
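As an illustration of what augmentation of a training set of image patches and masks might look like, the following sketch generates the eight flip and rotation variants of a patch together with its mask. This is a generic geometric augmentation shown under our own assumptions; it is not a procedure evaluated in this work, and the function name and random test patch are hypothetical.

```python
import numpy as np

def augment_pair(image, mask):
    """Yield the 8 rotation/flip variants of an image patch together with its mask."""
    for k in range(4):
        rot_image, rot_mask = np.rot90(image, k), np.rot90(mask, k)
        yield rot_image, rot_mask
        # The same transform is applied to both arrays so that labels stay aligned.
        yield np.fliplr(rot_image), np.fliplr(rot_mask)

# Hypothetical 256x256 patch with a mask derived from it, for demonstration only.
rng = np.random.default_rng(0)
patch = rng.random((256, 256))
mask = patch > 0.9

variants = list(augment_pair(patch, mask))
print(len(variants), "augmented image/mask pairs from one patch")
```

Whether rotated variants are appropriate depends on the data: roots tend to grow downwards, so flips about the vertical axis may be more realistic than arbitrary rotations. Addressing condensation noise or illumination gradients would additionally require intensity-based augmentation, which is not covered by this sketch.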

References

Lynch, J. Root architecture and plant productivity. Plant Physiol. 109, 7 (1995).
Iyer-Pascuzzi, A. S. et al. Imaging and analysis platform for automatic phenotyping and trait ranking of plant root systems. Plant Physiol. 152, 1148–1157. https://doi.org/10.1104/pp.109.150748 (2010).
Trachsel, S., Kaeppler, S. M., Brown, K. M. & Lynch, J. P. Shovelomics: High throughput phenotyping of maize (Zea mays L.) root architecture in the field. Plant Soil 341, 75–87 (2011).
Bengough, A. & Mullins, C. Penetrometer resistance, root penetration resistance and root elongation rate in two sandy loam soils. Plant Soil 131, 59–66 (1991).
Wojciechowski, T., Gooding, M., Ramsay, L. & Gregory, P. The effects of dwarfing genes on seedling root growth of wheat. J. Exp. Bot. 60, 2565–2573 (2009).
Watt, M. et al. A rapid, controlled-environment seedling root screen for wheat correlates well with rooting depths at vegetative, but not reproductive, stages at two field sites. Ann. Bot. 112, 447–455 (2013).
Perret, J., Al-Belushi, M. & Deadman, M. Non-destructive visualization and quantification of roots using computed tomography. Soil Biol. Biochem. 39, 391–399 (2007).
Tracy, S. R. et al. The x-factor: Visualizing undisturbed root architecture in soils using X-ray computed tomography. J. Exp. Bot. 61, 311–313 (2010).
van der Weerd, L. et al. Quantitative NMR microscopy of osmotic stress responses in maize and pearl millet. J. Exp. Bot. 52, 2333–2343 (2001).
Fang, S., Yan, X. & Liao, H. 3d reconstruction and dynamic modeling of root architecture in situ and its application to crop phosphorus research. Plant J. 60, 1096–1108 (2009).
Zeng, G., Birchfield, S. T. & Wells, C. E. Automatic discrimination of fine roots in minirhizotron images. New Phytol. 177, 549–557 (2008).
Johnson, M. G., Tingey, D. T., Phillips, D. L. & Storm, M. J. Advancing fine root research with minirhizotrons. Environ. Exp. Bot. 45, 263–289 (2001).
Van de Geijn, S., Vos, J., Groenwold, J., Goudriaan, J. & Leffelaar, P. The wageningen rhizolab-a facility to study soil–root–shoot–atmosphere interactions in crops. Plant Soil 161, 275–287 (1994).
Huck, M. G. & Taylor, H. M. The rhizotron as a tool for root research. In Advances in Agronomy Vol. 35 (ed. Sparks, D. L.) 1–35 (Elsevier, 1982).
Eshel, A. & Beeckman, T. Plant Roots: The Hidden Half (CRC Press, 2013).
Nagel, K. A. et al. Growscreen-rhizo is a novel phenotyping robot enabling simultaneous measurements of root and shoot growth for plants grown in soil-filled rhizotrons. Funct. Plant Biol. 39, 891–904 (2012).
Junker, A. et al. Optimizing experimental procedures for quantitative evaluation of crop plant performance in high throughput phenotyping systems. Front. Plant Sci. 5, 770. https://doi.org/10.3389/fpls.2014.00770 (2015).
Shi, R., Junker, A., Seiler, C. & Altmann, T. Phenotyping roots in darkness: Disturbance-free root imaging with near infrared illumination. Funct. Plant Biol. 45, 400–411 (2018).
Armengaud, P. et al. Ez-rhizo: Integrated software for the fast and accurate measurement of root system architecture. Plant J. 57, 945–956 (2009).
Pace, J., Lee, N., Naik, H. S., Ganapathysubramanian, B. & Lübberstedt, T. Analysis of maize (Zea mays L.) seedling roots with the high-throughput image analysis tool aria (automatic root image analysis). PLoS ONE 9, e108255 (2014).
Le Bot, J. et al. Dart: A software to analyse root system architecture and development from captured images. Plant Soil 326, 261–273 (2010).
Arsenault, J.-L., Poulcur, S., Messier, C. & Guay, R. WinRHIZO™ a root-measuring system with a unique overlap correction method. HortScience 30, 906 (1995).
Bontpart, T. et al. Affordable and robust phenotyping framework to analyse root system architecture of soil-grown plants. Plant J. 103, 2330–2343. https://doi.org/10.1111/tpj.14877 (2020).
Galkovskyi, T. et al. Gia roots: Software for the high throughput analysis of plant root system architecture. BMC Plant Biol. 12, 116 (2012).
Pierret, A., Gonkhamdee, S., Jourdan, C. & Maeght, J.-L. IJ_Rhizo: An open-source software to measure scanned images of root samples. Plant Soil 373, 531–539 (2013).
Narisetti, N. et al. Semi-automated root image analysis (saRIA). Sci. Rep. 9, 1–10. https://doi.org/10.1038/s41598-019-55876-3 (2019).
Lobet, G., Pagès, L. & Draye, X. A novel image-analysis toolbox enabling quantitative analysis of root system architecture. Plant Physiol. 157, 29–39 (2011).
Cai, J. et al. Rootgraph: A graphic optimization tool for automated image analysis of plant roots. J. Exp. Bot. 66, 6551–6562 (2015).
Zheng, L., Yang, Y. & Tian, Q. Sift meets CNN: A decade survey of instance retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 40, 1224–1244 (2017).
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention (Springer, 2015).
Bai, W. et al. Human-level CMR image analysis with deep fully convolutional networks. arXiv https://arxiv.org/abs/1710.09289 (2018).
Badrinarayanan, V., Kendall, A. & Cipolla, R. Segnet: A deep convolutional encoder–decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39, 2481–2495 (2017).
Marmanis, D. et al. Semantic segmentation of aerial images with an ensemble of CNNs. ISPRS Ann. Photogr. Remote Sens. Spatial Inf. Sci. 2016(3), 473–480 (2016).
Pound, M. P. et al. Deep machine learning provides state-of-the-art performance in image-based plant phenotyping. Gigascience 6, gix083 (2017).
Douarre, C., Schielein, R., Frindel, C., Gerth, S. & Rousseau, D. Transfer learning from synthetic data applied to soil-root segmentation in X-ray tomography images. J. Imaging 4, 65 (2018).
Misra, T. et al. Spikesegnet-a deep learning approach utilizing encoder-decoder network with hourglass for spike segmentation and counting in wheat plant from visual imaging. Plant Methods 16, 1–20 (2020).
Wang, R., Cao, S., Ma, K., Zheng, Y. & Meng, D. Pairwise learning for medical image segmentation. Med. Image Anal. 67, 101876. https://doi.org/10.1016/j.media.2020.101876 (2021).
Karani, N., Erdil, E., Chaitanya, K. & Konukoglu, E. Test-time adaptable neural networks for robust medical image segmentation. Med. Image Anal. 68, 101907. https://doi.org/10.1016/j.media.2020.101907 (2021).
Khan, S. et al. Deepsmoke: Deep learning model for smoke detection and segmentation in outdoor environments. Exp. Syst. Appl. https://doi.org/10.1016/j.eswa.2021.115125 (2021).
Jiang, Y. & Li, C. Convolutional neural networks for image-based high-throughput plant phenotyping: A review. Plant Phenom. 2020, 22 (2020).
Zhu, Y. et al. Data augmentation using conditional generative adversarial networks for leaf counting in Arabidopsis plants. In BMVC (2018).
Chen, J. & Shi, X. A sparse convolutional predictor with denoising autoencoders for phenotype prediction. In Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, 217–222 (2019).
Yasrab, R. et al. RootNav 2.0: Deep learning for automatic navigation of complex plant root architectures. GigaScience 8, Giz123. https://doi.org/10.1093/gigascience/giz123 (2019).
Wang, T. et al. Segroot: A high throughput segmentation method for root image analysis. Comput. Electron. Agric. 162, 845–854 (2019).
Yasrab, R., Pound, M. P., French, A. P. & Pridmore, T. P. Rootnet: A convolutional neural networks for complex plant root phenotyping from high-definition datasets. bioRxiv https://doi.org/10.1101/2020.05.01.073270 (2020).
Ioffe, S. & Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint https://arxiv.org/abs/1502.03167 (2015).
Santurkar, S., Tsipras, D., Ilyas, A. & Madry, A. How does batch normalization help optimization? https://arxiv.org/abs/1805.11604 (2019).
Li, X., Chen, S., Hu, X. & Yang, J. Understanding the disharmony between dropout and batch normalization by variance shift. https://arxiv.org/abs/1801.05134 (2018).
Peng, C., Zhang, X., Yu, G., Luo, G. & Sun, J. Large kernel matters—improve semantic segmentation by global convolutional network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017).
Jha, R. R., Jaswal, G., Gupta, D., Saini, S. & Nigam, A. Pixisegnet: Pixel-level iris segmentation network using convolutional encoder–decoder with stacked hourglass bottleneck. IET Biom. 9, 11–24 (2020).
Agostinelli, F., Hoffman, M., Sadowski, P. & Baldi, P. Learning activation functions to improve deep neural networks. arXiv preprint https://arxiv.org/abs/1412.6830 (2014).
Wang, L., Guo, S., Huang, W. & Qiao, Y. Places205-vggnet models for scene recognition. arXiv preprint https://arxiv.org/abs/1508.01667 (2015).
Dunne, R. A. & Campbell, N. A. On the pairing of the softmax activation and cross-entropy penalty functions and the derivation of the softmax activation function. In Proceedings of the 8th Aust. Conference on the Neural Networks, Melbourne, vol. 181, 185 (Citeseer, 1997).
Zou, K. H. et al. Statistical validation of image segmentation quality based on a spatial overlap index1: Scientific reports. Acad. Radiol. 11, 178–189 (2004).
Abadi, M. et al. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint https://arxiv.org/abs/1603.04467 (2016).
Tian, C., Xu, Y. & Zuo, W. Image denoising using deep CNN with batch renormalization. Neural Netw. 121, 461–473. https://doi.org/10.1016/j.neunet.2019.08.022 (2020).
Tian, C. et al. Deep learning on image denoising: An overview. Neural Netw. 131, 251–275. https://doi.org/10.1016/j.neunet.2020.07.025 (2020).
Walt, S., Colbert, S. C. & Varoquaux, G. The numpy array: A structure for efficient numerical computation. Comput. Scie. Eng. 13, 22–30 (2011).
Van der Walt, S. et al. Scikit-image: Image processing in python. PeerJ 2, e453 (2014).
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. arXiv preprint https://arxiv.org/abs/1412.6980 (2014).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, 1097–1105 (2012).
Mathworks. Matlab and Statistics Toolbox Release 2019b (The MathWorks, 2019).
Bovik, A. C. Chapter 3—Basic gray level image processing. In The Essential Guide to Image Processing (ed. Bovik, A.) 43–68 (Academic Press, 2009). https://doi.org/10.1016/B978-0-12-374457-9.00003-2.

Acknowledgements

This work was performed within the German Plant-Phenotyping Network (DPPN) which is funded by the German Federal Ministry of Education and Research (BMBF) (project identification number: 031A053). M.H. was supported from European Regional Development Fund-Project “SINGING PLANT” (No. CZ.02.1.01/0.0/0.0/16_026/0008446).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Michael Henke
Present address: Plant Sciences Core Facility, CEITEC-Central European Institute of Technology, Masaryk University, 62500, Brno, Czech Republic

Authors and Affiliations

Leibniz Institute of Plant Genetics and Crop Plant Research, OT Gatersleben, Corrensstr. 3, 06466, Seeland, Germany
Narendra Narisetti, Michael Henke, Christiane Seiler, Astrid Junker, Thomas Altmann & Evgeny Gladilin
Institute for Information Processing (TNT), Leibniz University of Hannover, Appelstr. 9A, 30167, Hannover, Germany
Jörn Ostermann

Contributions

N.N., M.H., and E.G. conceived, designed and performed the computational experiments, analysed the data, wrote the paper, prepared figures, and tables, and reviewed drafts of the paper. C.S. and A.J. executed the laboratory experiments, acquired image data, and reviewed drafts of the paper. J.O. co-supervised the computational data analysis and reviewed drafts of the paper. T.A. co-conceptualized the project and reviewed drafts of the paper.

Corresponding author

Correspondence to Narendra Narisetti.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Narisetti, N., Henke, M., Seiler, C. et al. Fully-automated root image analysis (faRIA). Sci Rep 11, 16047 (2021). https://doi.org/10.1038/s41598-021-95480-y

Received: 24 February 2021
Accepted: 27 July 2021
Published: 06 August 2021
DOI: https://doi.org/10.1038/s41598-021-95480-y
