RDI Calculator: An Analysis Tool to Assess RNA Distributions in Cells

Stueland, Michael; Wang, Tianhong; Park, Hye Yoon; Mili, Stavroula

doi:10.1038/s41598-019-44783-2

Published: 04 June 2019

Scientific Reports volume 9, Article number: 8267 (2019)

Abstract

Localization of RNAs to various subcellular destinations has emerged as a widely used mechanism that regulates a large proportion of transcripts in polarized cells. A number of methodologies have been developed that allow detection and imaging of RNAs at single-molecule resolution. However, methodologies to quantitatively describe RNA distributions are limited. Such approaches usually rely on the identification of cytoplasmic and nuclear boundaries which are used as reference points. Here, we describe an automated, interactive image analysis program that facilitates the accurate generation of cellular outlines from single cells and the subsequent calculation of metrics that quantify how a population of RNA molecules is distributed in the cell cytoplasm. We apply this analysis to mRNAs in mouse and human cells to demonstrate how these metrics can highlight differences in the distribution patterns of distinct RNA species. We further discuss considerations for the practical use of this tool. This program provides a way to facilitate and expedite the analysis of subcellular RNA localization for mechanistic and functional studies.

Introduction

RNA molecules are transcribed in the nucleus and exported to the cytoplasm where they usually serve as messengers for decoding the genetic information into protein products. In the cytoplasm, RNAs are either uniformly distributed or they can become localized to various subcellular destinations through mechanisms including motor-based active transport or diffusion followed by anchoring^1,2. It has been increasingly appreciated that a large proportion of the mammalian transcriptome becomes differentially distributed in the cytoplasm through such mechanisms^3,4,5,6,7. Hundreds to thousands of RNAs are localized in axons or dendrites of neurons^8,9,10, apical or basal surfaces of epithelial cells^11,12, or the leading edge and protrusive regions of mesenchymal migrating cells^13,14,15. Importantly, changes in RNA localization are linked to physiological responses and have functional consequences^14,16,17,18,19, highlighting the importance of understanding and exploiting the underlying mechanisms.

While a number of methods are available for high-resolution, single-molecule imaging of RNAs either in fixed or live cells^20,21, relatively fewer options exist for describing the observed RNA distributions in an unbiased, objective manner and in quantifiable terms. In highly polarized cells, such as neurons, quantitative assessment of RNA localization from imaging data can be done by assigning RNAs into easily distinguishable compartments such as dendrites, axonal shafts or growth cones^8,22. Such distinctions become much harder and inherently biased in smaller cell types which have either irregular and varied morphologies or lack clear boundaries of functional locations. For example, there is no objective way of defining the exact spatial boundaries of the leading edge of a migrating cell or the apical and basal surfaces of an epithelial cell^11,12,16,23. Furthermore, such demarcations apply a binary choice onto the description of RNA distributions, thus precluding the ability to detect and differentiate distributions which result from more gradual RNA concentration gradients.

To address these issues, various quantitative metrics have been used to describe cytoplasmic RNA distributions^14,24,25,26. These approaches identify RNAs within a specified cell boundary and usually describe their spatial distribution in relation to particular cellular features such as the plasma membrane, the nuclear envelope or the cell centroid. It is therefore important that these features are accurately identified during image analysis. One case in point concerns RNAs targeted to peripheral protrusive regions of mesenchymal migrating cells^13,14. Such protrusions are thin and because they contain a relatively small cytoplasmic volume are less efficiently detected by fluorescent stains commonly used to demark the cell volume. Failure to include them during the identification of the cell boundary could lead to omission of RNAs and thus inaccurate calculation of distribution metrics. Automated cell segmentation methods might not always accurately identify all relevant features, especially in images with low signal-to-noise ratios.

We report here an interactive analysis tool (named RDI Calculator, for RNA Distribution Index Calculator) that automatically identifies cellular and nuclear boundaries, but additionally allows their easy editing and correction when necessary. The program then calculates different RNA distribution metrics from single cells, thus facilitating an accurate, unbiased analysis of large sample sizes. We employ this program to demonstrate how this analysis can quantitatively highlight differences in the distribution patterns of various endogenous RNAs. We further present practical considerations for the implementation of this tool.

Materials and Methods

Cell culture

NIH/3T3 mouse fibroblast cells (ATCC) were grown in DMEM supplemented with 10% calf serum, sodium pyruvate and penicillin/streptomycin (Invitrogen) at 37 °C, 5% CO₂. MDA-MB-231 breast cancer cells (ATCC) were grown in Leibovitz’s L15 media supplemented with 10% fetal bovine serum and penicillin/streptomycin at 37 °C in atmospheric air. Cell lines have been tested yearly for mycoplasma and are free of contamination.

RNA in situ hybridization

For in situ hybridization, cells were plated on fibronectin-coated polyacrylamide gels, or on fibronectin-coated 20 μm micropatterned line tracks (CYTOOchips Motility, CYTOO), or on fibronectin-coated glass coverslips for 2–3 hours and subsequently fixed in 4% paraformaldehyde for ten minutes. FISH was performed with the QuantiGene ViewRNA ISH Cell Assay kit (Affymetrix, cat# QVCM0001) according to the manufacturer’s instructions. The Affymetrix probe sets used were: Ddr2 cat# VB1-14375, RhoA cat# VB6-14572, Cyb5r3 cat# VB1-18647, Arpc3 cat# VB6-14571. To detect polyA RNAs, LNA-modified oligodT probes (30 nucleotides) labeled with ATTO-655 were added during the hybridization, pre-amplification, amplification and last hybridization steps of the QuantiGene ViewRNA ISH Cell Assay. Nuclei were stained using DAPI and cells were additionally stained with Cell mask stain (Thermo Fisher Scientific) to obtain cell outlines.

Preparation of polyacrylamide gel substrates

Thin polyacrylamide gels were prepared on glass coverslips as described previously¹⁴. Gels were coated with 0.1 mg ml⁻¹ fibronectin (Sigma-Aldrich) overnight.

Image acquisition

Images were obtained using a Leica SP8 or a Zeiss 780 confocal microscope (equipped with an HC PL APO 63x oil CS2 objective, 1.40 NA or a HC PL APO 40x oil CS2 objective, 1.30 NA). Z-stacks through the cell volume were obtained and files were analyzed using RDI Calculator as detailed below.

Statistics

Prism 7 by GraphPad software was used for graph generation and statistical analyses. For multiple comparisons, one-way ANOVA was used with Tukey’s multiple comparisons test. For comparison of two data sets, Student’s t-test was used. Significance level was set to p < 0.05.

Automated Analysis Algorithm

Overview and image requirements

The input files are 4-channel images and their format can be either .lif or .tif. The images can be either confocal z-stacks or single-plane images. If stacks are provided, the program will generate a maximum intensity, z-projected file. The subsequent calculations first require determination of the outer boundaries of the cell being analyzed, as well as of the position of the nucleus within that cell. Therefore, usually, one channel is used to acquire an image of the cell nucleus (for example through DAPI staining) and a second channel is used to acquire an image of the whole cell volume (for example through staining with appropriate cell stain reagents or through immunostaining, or fluorescence imaging, of diffuse cytoplasmic proteins). The additional two channels are used for imaging of the RNAs to be analyzed through single molecule in situ hybridization (see Materials and Methods). Binary masks of the whole cell and nucleus are subsequently generated and displayed for user approval. If accepted, the binary mask files are automatically named and saved; otherwise a number of options are presented for manual editing of individual masks. RNA distribution metrics (PDI, PI, DI), mean RNA intensity and cell area are then calculated and the results are exported and saved in a .csv spreadsheet format (Fig. 1). Step-by-step instructions for running the analysis are detailed in Supplementary file 1. Supplementary source files include test images and scripts for running the analysis.

Images can be any bit-depth but 12-bit or higher are recommended. As usual for images used for quantitative signal analysis, using the full-dynamic range of the detector and acquiring non-saturated images is important. This program has been generated and tested in MATLAB 9.1 and later versions and runs in both Windows and Mac operating systems.

Cell and nucleus identification

To generate a binary mask of the cell area, images of the whole cell volume are acquired, aiming for good contrast between the cell and the background. A Sobel operator is then applied to find the edges of the cell. Specifically, a 3 × 3 filter is applied across the image in both X and Y dimensions to measure the gradient of signal intensity. A threshold is then applied to this gradient, resulting in a binary image highlighting the boundaries of the cell. Next, a slight dilation (of one pixel in all directions) is applied to the binary image. A flood-fill operation is further applied to produce a filled-in binary mask of the cell. Note that the algorithm recognizes the cell that covers the largest area in an image. For images with multiple cells, individual cells could be cropped out, or alternatively different cells could be selected by rejecting the automatically selected cell and redefining the cell mask with the available editing options (see below).
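The edge-detection steps described above can be sketched as follows. This is only an illustrative Python outline, not the program's MATLAB code: the function names are hypothetical, the threshold is passed in directly rather than calculated automatically, images are plain nested lists to keep the sketch dependency-free, and selection of the largest cell is omitted.

```python
import math

def sobel_magnitude(img):
    """Gradient magnitude from 3x3 Sobel kernels; border pixels are left at 0."""
    h, w = len(img), len(img[0])
    kx = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
    ky = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]
    mag = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = sum(kx[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            gy = sum(ky[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            mag[y][x] = math.hypot(gx, gy)
    return mag

def dilate(mask):
    """Grow a binary mask by one pixel in all eight directions."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if mask[y][x]:
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        yy, xx = y + dy, x + dx
                        if 0 <= yy < h and 0 <= xx < w:
                            out[yy][xx] = 1
    return out

def fill_holes(mask):
    """Flood-fill the background from the image border; unreached pixels are inside."""
    h, w = len(mask), len(mask[0])
    outside = [[False] * w for _ in range(h)]
    stack = [(y, x) for y in range(h) for x in range(w)
             if y in (0, h - 1) or x in (0, w - 1)]
    while stack:
        y, x = stack.pop()
        if outside[y][x] or mask[y][x]:
            continue
        outside[y][x] = True
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            yy, xx = y + dy, x + dx
            if 0 <= yy < h and 0 <= xx < w:
                stack.append((yy, xx))
    return [[0 if outside[y][x] else 1 for x in range(w)] for y in range(h)]

def cell_mask_from_edges(img, threshold):
    """Sobel gradient -> threshold -> one-pixel dilation -> flood fill."""
    mag = sobel_magnitude(img)
    edges = [[1 if v > threshold else 0 for v in row] for row in mag]
    return fill_holes(dilate(edges))

# Synthetic example: an 11x11 image containing a bright 5x5 square.
demo = [[100 if 3 <= y <= 7 and 3 <= x <= 7 else 0 for x in range(11)]
        for y in range(11)]
demo_mask = cell_mask_from_edges(demo, threshold=50)
```

Because of the dilation step, the resulting mask extends slightly beyond the bright region, which mirrors the intent of capturing faint signal at the cell periphery.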

To generate a binary mask of the nuclear area, nuclear images (e.g. through DAPI staining) are acquired, aiming again for a good contrast between the nucleus and background. First, a Wiener filter is applied to the nuclear channel. Specifically, across the image, variance is computed in a 5 × 5 pixel area. Pixel intensity is then smoothed out in inverse relationship to variance. This preserves the edges in the image, while eliminating random background noise. Next, an automatic Otsu threshold is applied to the image, producing a binary image as a result. This binary image of the nucleus is then multiplied by the binary image of the cell mask, which removes any nuclear signal not inside the cell mask, thus eliminating any nuclei in the original image that are not within the cell being analyzed. Finally, a flood-fill operation is applied, producing a nuclear mask for only the cell of interest.
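The thresholding and masking steps of the nuclear segmentation can be illustrated with a short Python sketch. It is a simplified outline under stated assumptions rather than the program's implementation: the function names are hypothetical, the image is assumed to hold integer grey levels below 256, and the Wiener filtering and final flood-fill steps are left out.

```python
def otsu_threshold(pixels, levels=256):
    """Return the grey level t that maximises between-class variance.

    Pixels with values greater than t are treated as foreground.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * c for i, c in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0, sum0 = 0, 0
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (m0 - m1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t

def nuclear_mask(nuc_img, cell_mask):
    """Otsu-threshold the nuclear channel, then multiply by the cell mask."""
    t = otsu_threshold([p for row in nuc_img for p in row])
    return [[(1 if p > t else 0) * m for p, m in zip(nrow, mrow)]
            for nrow, mrow in zip(nuc_img, cell_mask)]

# Two bright nuclei; only the top-left one lies inside the cell mask.
nuc = [[200, 200, 10, 10],
       [200, 200, 10, 10],
       [10, 10, 10, 10],
       [10, 10, 200, 200]]
cell = [[1, 1, 1, 1],
        [1, 1, 1, 1],
        [1, 1, 1, 1],
        [0, 0, 0, 0]]
nuc_mask = nuclear_mask(nuc, cell)
```

Multiplying by the cell mask is what discards the second nucleus in this example, since its pixels fall where the cell mask is 0.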

User input on cell and nuclear mask identification

The outlines of the cell and nuclear masks, which are generated through the above-mentioned process, are then displayed and overlaid with the corresponding image channels or with the RNA channels, so that the user can assess whether they are suitable for use in downstream analysis. For images with adequate signal-to-noise, these steps are usually sufficient, and the generated masks can be automatically saved. However, in certain cases the standard method of identifying the boundaries of the cell may not be sufficient. For these scenarios, a number of methods have been built in to allow the user to modify the image, modify the Sobel edge detection method, or modify the generated mask itself.

Wiener noise removal

This allows the user to pre-process the cell mask channel prior to the Sobel edge detection. As mentioned above, this uses a 5 × 5 filter to detect signal variance across the image, and smooth out pixel intensity in areas of relatively low variance. This can be repeated any number of times to increase the noise-removal effect, though for most purposes once should be adequate.

Image sharpening

This allows the user to pre-process the cell mask channel, prior to the Sobel edge detection, with unsharp masking. This increases the contrast at the edges of signal and non-signal. This may help if the cell staining is faint, especially at cell peripheries. This can also be iterated multiple times.

Dilation

This allows the user to dilate the generated mask by one pixel in all directions. This can be done multiple times for increased dilation. This function can be useful in ensuring that RNAs found at peripheral edges of the cell are included in the masked area used for analysis.

Threshold for edge detection

This allows the user to set the threshold for the Sobel edge detection method. Specifically, the gradient of signal intensity for each pixel is given a value. A potential threshold is automatically calculated, and the user can multiply that by a given number. The default multiplier is 0.5, with lower numbers detecting more potential edges and larger numbers being more stringent.

Using a polygon tool to alter the cell mask

The following sections denote ways that the user can manually draw a polygon to alter the cell mask. The user has the option to draw a polygon on top of an overlay of the original cell mask channel, and this is then used to modify the calculated cell mask boundaries. Calculation of the boundaries of the nuclear mask is done after this step, so if a new region is highlighted with nuclear signal, the nuclear mask may change.

Draw: In this option, the user draws a polygon to approximate the boundaries of the cell. This will entirely replace the original cell mask with the drawn polygon.

Limit: In this option, the user draws a rough polygon only around the region they want analyzed. This will multiply the polygon by the calculated cell mask, so that anything not circled by the polygon will be excluded.

Expand: In this option, the user draws a polygon connecting to the generated cell mask, around any additional area to be analyzed. This will add the polygon to the generated mask.
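In terms of binary masks, the three polygon options correspond to simple pixel-wise operations. The Python sketch below illustrates this, assuming the user-drawn polygon has already been rasterized into a binary mask of the same size as the cell mask; the function name and the mode strings are hypothetical.

```python
def apply_polygon(cell_mask, polygon_mask, mode):
    """Combine a rasterized polygon with the calculated cell mask.

    mode "draw":   the polygon replaces the cell mask entirely
    mode "limit":  pixel-wise product, keeps only the mask inside the polygon
    mode "expand": pixel-wise union, adds the polygon area to the mask
    """
    if mode == "draw":
        return [row[:] for row in polygon_mask]
    if mode == "limit":
        return [[c * p for c, p in zip(crow, prow)]
                for crow, prow in zip(cell_mask, polygon_mask)]
    if mode == "expand":
        return [[1 if (c or p) else 0 for c, p in zip(crow, prow)]
                for crow, prow in zip(cell_mask, polygon_mask)]
    raise ValueError("mode must be 'draw', 'limit' or 'expand'")

cell = [[1, 1, 0],
        [1, 1, 0],
        [0, 0, 0]]
poly = [[0, 1, 1],
        [0, 1, 1],
        [0, 0, 0]]
limited = apply_polygon(cell, poly, "limit")
expanded = apply_polygon(cell, poly, "expand")
drawn = apply_polygon(cell, poly, "draw")
```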

Output metrics

Calculations of RNA distribution indexes are performed using maximum-intensity projections of z-stacked images. Intensity values, above a user-determined threshold, are used for analysis. The program does not attempt to identify discrete RNA particles or assign numbers of RNA molecules contained within clusters or diffraction-limited RNA spots. Instead, pixel intensities are used in order to accommodate analysis in cases where single RNA identification cannot be confidently performed, such as in the case of abundant RNA species, or when imaging polyadenylated RNA. We note that we have compared intensity-based measurements with measurements after single RNA spot detection and found the calculated metrics to be very similar.

Three different indexes are calculated: a polarization index, a dispersion index and a peripheral distribution index. Each index describes a distinct aspect of how a population of RNA molecules is distributed within the cell cytoplasm^14,24. The Polarization Index (PI) is calculated by identifying the centroid of the RNA signal and measuring its displacement from the centroid of the cell. This displacement is divided by the radius of gyration, calculated as the root-mean-square distance of all pixels to the centroid of the cell, in order to normalize the polarization to the size and elongation of the cell²⁴.

$$PI=\frac{\sqrt{{({\bar{x}}_{RNA}-{\bar{x}}_{cell})}^{2}+{({\bar{y}}_{RNA}-{\bar{y}}_{cell})}^{2}}}{R{g}_{cell}},$$

where ${\bar{x}}_{RNA}$, ${\bar{y}}_{RNA}$ are the coordinates of the RNA centroid and ${\bar{x}}_{cell}$, ${\bar{y}}_{cell}$ are the coordinates of the centroid of the cell in the two dimensional image, and $R{g}_{cell}$ is the radius of gyration. PI values increase with increased polarization of the RNA signal and with increased distance from the cell centroid (Fig. 2A).
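As a worked illustration of the PI formula, the following Python sketch computes the index on small synthetic images. The function names are hypothetical and the images are plain nested lists; it assumes the RNA centroid is intensity-weighted and restricted to pixels inside the cell mask.

```python
import math

def weighted_centroid(img):
    """Intensity-weighted centroid (x, y) of a 2D image or binary mask."""
    total = sum(v for row in img for v in row)
    cx = sum(x * v for row in img for x, v in enumerate(row)) / total
    cy = sum(y * v for y, row in enumerate(img) for v in row) / total
    return cx, cy

def radius_of_gyration(mask):
    """Root-mean-square distance of all mask pixels to the mask centroid."""
    cx, cy = weighted_centroid(mask)
    n = sum(v for row in mask for v in row)
    sq = sum(((x - cx) ** 2 + (y - cy) ** 2) * v
             for y, row in enumerate(mask) for x, v in enumerate(row))
    return math.sqrt(sq / n)

def polarization_index(rna, cell_mask):
    """Displacement of the RNA centroid from the cell centroid, divided by Rg."""
    masked = [[r * m for r, m in zip(rrow, mrow)]
              for rrow, mrow in zip(rna, cell_mask)]
    rx, ry = weighted_centroid(masked)
    cx, cy = weighted_centroid(cell_mask)
    return math.hypot(rx - cx, ry - cy) / radius_of_gyration(cell_mask)

# 5x5 square "cell": uniform RNA versus RNA confined to one corner pixel.
cell = [[1] * 5 for _ in range(5)]
uniform_rna = [[1] * 5 for _ in range(5)]
corner_rna = [[1 if (x, y) == (0, 0) else 0 for x in range(5)] for y in range(5)]
pi_uniform = polarization_index(uniform_rna, cell)
pi_corner = polarization_index(corner_rna, cell)
```

In this toy case the uniform signal gives a PI of 0, while signal confined to a corner gives a PI above 1 because the corner lies farther from the centroid than the radius of gyration.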

To derive the Dispersion Index (DI), the second moment of RNA pixel intensity positions relative to the centroid of the total RNA signal is calculated.

$${\mu }_{2}=\,\sum _{i,j}{r}_{ij}^{2}\frac{{I}_{ij}}{{\sum }_{i,j}{I}_{ij}},$$

where ${r}_{ij}$ is the distance of the pixel (i,j) to the centroid of the RNA signal and ${I}_{ij}$ is the intensity value of pixel (i,j) in the two dimensional image. To normalize for differences in cell morphology, the second moment of the RNA is divided by the second moment of a hypothetical uniform distribution, which is derived as the second moment of all pixels within the binary cell mask image²⁴. A completely diffuse RNA has a DI value of 1. The DI value of an RNA that is concentrated in any region within the cell is less than 1 and is inversely correlated with the degree of RNA concentration. An RNA that is distributed towards the cell periphery exhibits a DI value larger than 1, but this is affected by the degree of polarization (Fig. 2B).
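The DI calculation can be illustrated with a minimal Python sketch. The function names are hypothetical. One detail is an assumption made here and not specified in the text: the second moment of the uniform reference is taken about the centroid of the cell mask itself.

```python
def weighted_centroid(img):
    """Intensity-weighted centroid (x, y) of a 2D image or binary mask."""
    total = sum(v for row in img for v in row)
    cx = sum(x * v for row in img for x, v in enumerate(row)) / total
    cy = sum(y * v for y, row in enumerate(img) for v in row) / total
    return cx, cy

def second_moment(img, center):
    """Intensity-weighted mean squared distance of pixels from `center`."""
    cx, cy = center
    total = sum(v for row in img for v in row)
    return sum(((x - cx) ** 2 + (y - cy) ** 2) * v
               for y, row in enumerate(img) for x, v in enumerate(row)) / total

def dispersion_index(rna, cell_mask):
    """Second moment of the RNA signal divided by that of a uniform signal."""
    masked = [[r * m for r, m in zip(rrow, mrow)]
              for rrow, mrow in zip(rna, cell_mask)]
    mu_rna = second_moment(masked, weighted_centroid(masked))
    # Assumption: the uniform reference is measured about the mask centroid.
    mu_uniform = second_moment(cell_mask, weighted_centroid(cell_mask))
    return mu_rna / mu_uniform

cell = [[1] * 5 for _ in range(5)]
uniform_rna = [[1] * 5 for _ in range(5)]
clustered_rna = [[1 if (x, y) == (1, 1) else 0 for x in range(5)] for y in range(5)]
corners_rna = [[1 if x in (0, 4) and y in (0, 4) else 0 for x in range(5)]
               for y in range(5)]
di_uniform = dispersion_index(uniform_rna, cell)
di_clustered = dispersion_index(clustered_rna, cell)
di_corners = dispersion_index(corners_rna, cell)
```

The three toy distributions reproduce the qualitative behavior described above: a value of 1 for the diffuse case, below 1 for a concentrated signal and above 1 for a symmetric peripheral signal.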

The Peripheral Distribution Index (PDI) is calculated similarly to the dispersion index, but in this case the second moment of RNA pixel intensity positions is calculated relative to the centroid of the nucleus¹⁴. This metric is not affected by the polarization of the RNA distribution. The PDI value is 1 for a completely diffuse RNA, less than 1 for a perinuclear RNA and more than 1 for a peripherally distributed RNA (Fig. 2C).
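A corresponding Python sketch for the PDI is given below. The names are hypothetical, and it assumes that both the RNA moment and the uniform reference moment are measured about the centroid of the nuclear mask, so that a diffuse signal yields 1 even when the nucleus is off-center.

```python
def centroid(mask):
    """Centroid (x, y) of a binary mask."""
    total = sum(v for row in mask for v in row)
    cx = sum(x * v for row in mask for x, v in enumerate(row)) / total
    cy = sum(y * v for y, row in enumerate(mask) for v in row) / total
    return cx, cy

def second_moment(img, center):
    """Intensity-weighted mean squared distance of pixels from `center`."""
    cx, cy = center
    total = sum(v for row in img for v in row)
    return sum(((x - cx) ** 2 + (y - cy) ** 2) * v
               for y, row in enumerate(img) for x, v in enumerate(row)) / total

def peripheral_distribution_index(rna, cell_mask, nucleus_mask):
    """Second moment about the nuclear centroid, normalized to a uniform signal."""
    masked = [[r * m for r, m in zip(rrow, mrow)]
              for rrow, mrow in zip(rna, cell_mask)]
    center = centroid(nucleus_mask)
    return second_moment(masked, center) / second_moment(cell_mask, center)

cell = [[1] * 5 for _ in range(5)]
# Off-center nucleus occupying a single pixel at (x=1, y=1).
nucleus = [[1 if (x, y) == (1, 1) else 0 for x in range(5)] for y in range(5)]
uniform_rna = [[1] * 5 for _ in range(5)]
perinuclear_rna = [[1 if abs(x - 1) <= 1 and abs(y - 1) <= 1 else 0
                    for x in range(5)] for y in range(5)]
far_rna = [[1 if (x, y) == (4, 4) else 0 for x in range(5)] for y in range(5)]
pdi_uniform = peripheral_distribution_index(uniform_rna, cell, nucleus)
pdi_perinuclear = peripheral_distribution_index(perinuclear_rna, cell, nucleus)
pdi_far = peripheral_distribution_index(far_rna, cell, nucleus)
```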

In addition to the RNA distribution indexes described above, the program also reports the cellular area, in μm² units, utilizing the pixel dimension information from the metadata of the provided images, as well as the average RNA signal within the cell being analyzed. For this, each RNA channel is multiplied by the cell mask. This multiplies all RNA signal outside the cell by 0, and all signal inside the cell by 1. Every pixel of the RNA channel is then summed and divided by the pixel sum of the cell mask. This is the total RNA signal divided by cell area, or the average RNA signal across the area of the cell.
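The mean-signal and area calculations amount to a few sums, as in this short Python sketch. The function names and the `pixel_size_um` parameter are hypothetical; in the program the pixel dimensions come from the image metadata.

```python
def mean_rna_signal(rna, cell_mask):
    """Total RNA signal inside the mask divided by the number of mask pixels."""
    total_signal = sum(r * m for rrow, mrow in zip(rna, cell_mask)
                       for r, m in zip(rrow, mrow))
    area_px = sum(m for row in cell_mask for m in row)
    return total_signal / area_px

def cell_area_um2(cell_mask, pixel_size_um):
    """Mask area in square micrometers, assuming square pixels."""
    area_px = sum(m for row in cell_mask for m in row)
    return area_px * pixel_size_um ** 2

rna = [[4, 4],
       [4, 100]]
cell = [[1, 1],
        [1, 0]]
mean_signal = mean_rna_signal(rna, cell)   # the bright pixel lies outside the mask
area = cell_area_um2(cell, pixel_size_um=0.5)
```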

Results

RDI analysis can quantitatively differentiate distinct RNA distributions in cell populations

To illustrate how this analysis can be used to describe the distribution of distinct RNA species, we used single molecule in situ hybridization to detect mRNAs in mouse fibroblast cells. We visualized RNAs that exhibit distinct distribution patterns (Fig. 3A,B). Specifically, these include: (1) the P4hb mRNA, encoding the beta subunit of prolyl-4-hydroxylase, which functions as an ER chaperone²⁷, (2) the Arpc3 mRNA, encoding subunit 3 of the actin-nucleating Arp2/3 complex²⁸, (3) the RhoA mRNA, encoding the RhoA GTPase involved in organization of the actin cytoskeleton²⁹, and (4) the Cyb5r3 mRNA, encoding cytochrome b5 reductase 3 involved in fatty acid and cholesterol metabolism³⁰. These RNAs exhibit visibly distinct patterns of distribution in the cytoplasm (Fig. 3). Specifically, the P4hb RNA exhibits a perinuclear distribution and does not reach to the cell periphery, likely reflecting the ER association of its encoded protein. The Arpc3 RNA appears diffuse, exhibiting a more uniform distribution throughout the cell body. Nevertheless, it is not prominently found within peripheral protrusive regions. The RhoA RNA is diffusely distributed similar to Arpc3 RNA. The Cyb5r3 RNA exhibits a peripheral distribution being relatively absent from the perinuclear region and concentrating within peripheral protrusions (Fig. 3A,B).

Using the RDI Calculator, PDI, PI and DI values were derived for each of the above RNAs from multiple individual cells (Fig. 3C–E). The P4hb RNA has the lowest PDI value (with a mean value of 0.36), reflecting its perinuclear distribution. This value is significantly different from the PDI values of Arpc3 or RhoA RNAs. Both Arpc3 and RhoA RNAs have similar PDI values (mean values of 0.66 for Arpc3 and 0.67 for RhoA), consistent with their indistinguishable distribution. Furthermore, indicating the absence of both of these RNAs from peripheral protrusions, the PDI value for both is lower than 1. The Cyb5r3 RNA has the highest PDI value (mean value of 1.86), reflecting its pronounced accumulation at peripheral protrusive regions (Fig. 3C). Therefore, differentially distributed RNAs can be robustly distinguished using the PDI metric.

Relationship between DI and PDI values varies depending on the context

As shown in Fig. 3C,E, DI values for the tested RNAs parallel the PDI values and reveal the same differences in distribution. This concordance in PDI and DI values is predicted and observed also in the simulated images (Fig. 2B,C). This concordance is expected when RNA distributions are non-polarized. Indeed, the cells analyzed in Fig. 3 are spreading briefly on uniformly fibronectin-coated surfaces and thus do not exhibit any obvious morphological or functional polarization. Consistently, RNA distributions are non-polarized, reflected in the low PI values (ranging from 0.12 to 0.24) (Fig. 3D). Under such conditions, the DI and PDI metrics largely agree.

By contrast, if RNA distributions are polarized, DI and PDI metrics are expected to diverge and to be increasingly different with increasing distance of the RNAs from the nucleus (Fig. 2B,C). To illustrate this point, we imaged mouse fibroblast cells migrating on micropatterned substrates consisting of 20 μm-wide lines. On these defined line-patterns, a large proportion of the cells polarize to form a leading edge and a trailing, retracting tail (Fig. 4A). We visualized, in these cells, the Cyb5r3 RNA as well as the Ddr2 RNA, encoding a collagen receptor, and performed RDI analysis from multiple images (Fig. 4). Both RNAs are peripherally localized and belong to a co-regulated group that depends on the APC (Adenomatous Polyposis Coli) protein for their peripheral localization¹⁴. Indeed, both Ddr2 and Cyb5r3 RNAs had PDI values higher than 1, indicating that they are peripherally localized (Fig. 4B). Interestingly however, the PDI metric revealed a significant difference between the two RNAs. The Cyb5r3 RNA had a significantly higher PDI value compared to Ddr2 RNA, indicating a higher concentration at the periphery. In contrast, the DI metric was similar between Cyb5r3 and Ddr2 RNAs. Therefore, in these polarized cells, the DI and PDI metrics are not in concordance and thus reveal different aspects of subcellular RNA distributions (Fig. 4B). As mentioned above, this differentiation is expected when RNA distributions are polarized. In agreement with that, under these conditions, the Cyb5r3 RNA has a significantly higher PI index compared to Ddr2 (mean PI value 0.28 for Ddr2 and 0.51 for Cyb5r3). Therefore, the three metrics obtained through the RDI analysis can provide useful quantitative measures to describe and differentiate between cytoplasmic RNA patterns within cell populations and study their mechanistic underpinnings.

Relationships between the three metrics can distinguish subsets of cells in a population

Furthermore, pairwise correlation of the three metrics for each RNA revealed an interesting feature. The difference between the Cyb5r3 and Ddr2 RNAs in their peripheral accumulation and polarization (detected in the graphs of Fig. 4B) did not result because the Cyb5r3 RNA has overall higher PDI and PI values in all cells of the population. Specifically, in a fraction of the cells (enclosed by the dotted line in Fig. 4C,D) the Cyb5r3 RNAs exhibit the same range of values as those exhibited by Ddr2. Apart from that population of cells, however, a distinct subset of cells (red data points in Fig. 4C) exhibit increased polarization of Cyb5r3 RNA (PI > 0.8) and increased peripheral distribution. Ddr2 RNA metrics in the same cells do not show any similar corresponding bias (red data points in Fig. 4D). This observation suggests that the difference observed between Cyb5r3 and Ddr2 RNAs is driven by the behavior of a subset of cells. Biologically, this could indicate that in this subset of cells, a particular mechanism operates that leads to specific polarized clustering of Cyb5r3 at the cell periphery. Even though in other regards the two RNAs are co-regulated (i.e. the regulation of their localization by APC), the Ddr2 RNA is not subject to this polarized clustering, at least not to the same degree as Cyb5r3 RNA. Correlating the increased polarization of Cyb5r3 RNA with activities or distributions of candidate factors could provide the basis for testable hypotheses. Thus, these results highlight how obtaining quantitative information of RNA distributions from multiple cells in a population can also serve to discern patterns observed in subsets of cells, and thus point towards biologically relevant insights.

Effect of noise on RDI-calculated metrics

We discuss below certain considerations that should be heeded in order to obtain RDI values that accurately reflect the distributions of the RNAs being analyzed. Given that the reported values are calculated from intensity distributions within the cell of interest, it is important to exclude from these calculations any signal originating from background noise. This is especially relevant in cases of low-abundance RNAs. As displayed in Fig. 5A, when detecting RNA species with only a few copies per cell, only a small proportion of pixels report specific signal (Fig. 5A, left panel). The vast majority of pixels correspond to background noise, which usually is uniformly present throughout the cell or exhibits a perinuclear bias (Fig. 5A, right panel and graph). Even though this background signal is lower in intensity, it cumulatively shifts the calculated metrics towards values that would describe more perinuclear or diffuse RNAs. For example, in the case of the peripherally localized RNA Ddr2, applying no background subtraction returns PDI values close to 1, indicative of a diffuse distribution (Fig. 5B). Increasing the background threshold leads to a gradual increase in PDI values which eventually reach a plateau at values that more accurately reflect the distribution of the RNA being analyzed (Fig. 5B). Such a response curve can provide the basis for assessing the appropriate degree of noise subtraction. Given that this value can vary depending on the specificity of probes used, the acquisition settings and detector properties (compare Fig. 5B,C), the background value to be subtracted is requested by the program as a user-defined input and can be separately specified for individual channels. In our experience, in situ hybridization protocols using Z-probes for signal amplification lead to good signal-to-noise. If hybridization and image acquisition conditions are maintained, then subtracting a constant amount, usually 5–15% off the lower end of the dynamic range, is sufficient for most cases.
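A threshold response curve of the kind described above could be generated along the following lines. This Python sketch is illustrative only: the function names are hypothetical, and it assumes that pixels at or below the background threshold are simply excluded (set to zero) before the PDI is computed about the nuclear centroid.

```python
def centroid(mask):
    """Centroid (x, y) of a binary mask."""
    total = sum(v for row in mask for v in row)
    cx = sum(x * v for row in mask for x, v in enumerate(row)) / total
    cy = sum(y * v for y, row in enumerate(mask) for v in row) / total
    return cx, cy

def second_moment(img, center):
    """Intensity-weighted mean squared distance of pixels from `center`."""
    cx, cy = center
    total = sum(v for row in img for v in row)
    return sum(((x - cx) ** 2 + (y - cy) ** 2) * v
               for y, row in enumerate(img) for x, v in enumerate(row)) / total

def pdi_after_threshold(rna, cell_mask, nucleus_mask, threshold):
    """PDI using only in-cell pixels whose intensity exceeds `threshold`."""
    kept = [[(r if r > threshold else 0) * m for r, m in zip(rrow, mrow)]
            for rrow, mrow in zip(rna, cell_mask)]
    if sum(map(sum, kept)) == 0:
        return float("nan")  # no signal survives this threshold
    center = centroid(nucleus_mask)
    return second_moment(kept, center) / second_moment(cell_mask, center)

def threshold_sweep(rna, cell_mask, nucleus_mask, thresholds):
    """Return (threshold, PDI) pairs for plotting a response curve."""
    return [(t, pdi_after_threshold(rna, cell_mask, nucleus_mask, t))
            for t in thresholds]

# Toy image: dim uniform background (5) plus bright spots (100) at the corners.
cell = [[1] * 5 for _ in range(5)]
nucleus = [[1 if (x, y) == (2, 2) else 0 for x in range(5)] for y in range(5)]
rna = [[100 if x in (0, 4) and y in (0, 4) else 5 for x in range(5)]
       for y in range(5)]
curve = threshold_sweep(rna, cell, nucleus, thresholds=[0, 10, 50])
```

In this toy example the PDI rises once the background is excluded and then stays constant across higher thresholds, which is the plateau behavior one would look for when choosing a background value.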

Consideration of 3D cellular geometry for interpretation of RDI-calculated metrics

Analysis of simulated images (Fig. 2) predicts certain values for the calculated indexes. For example, a completely diffuse RNA would have PDI and DI values of 1. This is true in the case of 2-dimensional simulations. However, when cellular material is being observed, two-dimensional images are generated from signal originating from a three-dimensional volume. Importantly, as detailed in this section, variations in 3D morphology affect the values of the calculated metrics and should be considered for correct interpretation of derived values.

To image different 3D morphologies of the same cell population, we plated fibroblast cells on fibronectin-coated polyacrylamide substrates of varying degrees of stiffness. On soft substrates (1 kPa Young’s modulus) cells remain mostly round and do not spread efficiently (Fig. 6A). They are thus characterized by a small spreading area (Fig. 6B) and an increased height, evident by a broad intensity peak along the z-imaging axis (Fig. 6C). On substrates of increasing stiffness (3, 5, 13 and 280 kPa) cells spread gradually more (Fig. 6A,B) and the cell height is reduced, seen by a narrower intensity peak along the z-axis (Fig. 6C). On a 2-dimensional z-projected image, cells exhibiting different degrees of spreading will have a quite distinct distribution of the bulk cytoplasm. In less spread cells, cytoplasmic material will appear uniform throughout the cell body. By contrast, in highly spread cells, the cytoplasm will appear unevenly distributed in central versus peripheral regions (with more cytoplasmic material around the nucleus and a gradual decrease towards the periphery, with a thin layer of cytoplasm reaching into the mostly membranous protrusions). Consequently, a molecule that is freely diffusing in the cytoplasmic volume would appear more perinuclear in a projected image of a spread cell and the signal would have a PDI or DI value less than 1.

To assess these predictions, we plated cells on substrates of varying stiffness and detected polyadenylated RNA (through oligo-dT hybridization) as a surrogate of the cytoplasmic volume accessible to RNA molecules (Fig. 6A). Indeed, consistent with the above expectations, in 2D-projected images, polyA RNA appears uniformly diffuse in the cytoplasm of less spread cells (PDI and DI values close to 1) but appears gradually more perinuclear (evident by decreasing PDI and DI values) as the spreading area of the cells increases (Fig. 6D,E). This does not mean that polyA RNAs are actively excluded from protrusions, but rather reflects the changes in the spatial distribution of the cytoplasmic volume. Therefore, while the 2D simulations provide some reference points, the actual reported values should be interpreted in the context of the 3D geometry of the cells being observed. We suggest that the distribution of polyadenylated RNA would be a useful internal control against which distributions of individual RNAs could be compared and interpreted.

Discussion

We report here an automated, interactive analysis method for the rapid calculation of metrics that quantitatively describe RNA distributions in cell populations. This MATLAB-based program allows the automated identification of cell boundaries, thus reducing the amount of time required for analysis of a series of images. To increase accuracy, intermediate steps in the analysis are presented to the user for validation to avoid errors in the final output. Overall, these features will facilitate the rapid analysis of large datasets that can better reflect the overall cell populations being examined. They will also allow for a more confident assessment of the uniformity exhibited within a sampled population, or of the existence of potentially interesting behaviors manifested in subsets of cells.

We have also highlighted considerations that should be taken into account when interpreting these metrics and for comparative analyses. Given sufficient elimination of background noise, the metrics calculated with this program can robustly compare distributions of different RNAs within the same cells. To signify that, the script labels one RNA species as ‘localized’ and the other as ‘control’. The identity of the RNA that can serve as an appropriate control can vary depending on the cell type, the particular conditions and the goal of the experiment. We suggest that a useful comparison is against the distribution of the general polyadenylated RNA population in the cytoplasm of the cells being analyzed. This can be detected through oligo-dT hybridization and can provide a measure of the overall RNA distribution against which specific RNAs can be compared.

Apart from comparisons of different RNAs within the same cells, understanding the mechanisms that localize individual RNAs to particular compartments requires comparing RNA distributions across experimental conditions. It is important to note that, in these types of experiments, any observed change in RDI metrics could result either from perturbation of cellular mechanisms acting specifically on the RNA of interest or, indirectly, from changes in 3D cellular morphology, which, as detailed in the results above, can affect the calculated values. To provide a means of assessing changes in 3D cellular morphology among cell populations, the program reports the cellular area of each observed cell. For cells of similar volume, changes in their 2D footprint would indicate potential changes in 3D geometry. We emphasize, however, that area values are only indicative and cannot by themselves support conclusive inferences regarding 3D architecture. Imaging the distribution of polyadenylated RNA could provide a more direct indication of whether any observed differences are specific to the RNA of interest or result from changes in the relative distribution of the bulk cytoplasm within the cells being observed. An alternative approach could involve the use of substrates of defined size and shape¹⁶. Such micropatterned substrates can be used to confine cells within a particular shape and thus circumvent the confounding effects of drastically different morphologies.
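As a minimal sketch of how reported cell areas might be used to check for morphology differences between conditions, the code below computes an area from a binary cell mask and compares the median area of two populations. The function names, the pixel size and the area values are hypothetical, and no threshold for a meaningful change is implied; Python is used purely for illustration and this is not the program's own code.

```python
from statistics import median


def cell_area(mask, pixel_size_um):
    """Area of a binary cell mask in square micrometers."""
    n_pixels = sum(1 for row in mask for inside in row if inside)
    return n_pixels * pixel_size_um ** 2


def relative_area_change(areas_reference, areas_treated):
    """Fractional change in median cell area relative to the reference."""
    ref = median(areas_reference)
    return (median(areas_treated) - ref) / ref


# Tiny hypothetical mask: 4 in-cell pixels at an assumed 0.5 um per pixel.
toy_mask = [
    [True, True, False],
    [True, True, False],
]
toy_area = cell_area(toy_mask, pixel_size_um=0.5)

# Made-up per-cell areas (square micrometers) for two conditions.
areas_reference = [100.0, 120.0, 110.0]
areas_treated = [150.0, 170.0, 165.0]
change = relative_area_change(areas_reference, areas_treated)

print(toy_area, change)
```

A large shift in median area between conditions would be a prompt to examine the polyA RNA distribution before attributing a change in RDI metrics to the RNA of interest.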

We note that while we have implemented this program for the study of RNA distributions, the same analysis and metrics can be used to quantitatively assess the cellular distribution of any molecule or activity detected through microscopy-based imaging.

Data Availability

The datasets generated and analyzed in the current study are available from the corresponding author on reasonable request.

References

1. Meignin, C. & Davis, I. Transmitting the message: intracellular mRNA localization. Curr Opin Cell Biol 22, 112–119 (2010).
2. Besse, F. & Ephrussi, A. Translational control of localized mRNAs: restricting protein synthesis in space and time. Nat Rev Mol Cell Biol 9, 971–980 (2008).
3. Holt, C. E. & Bullock, S. L. Subcellular mRNA localization in animal cells and why it matters. Science 326, 1212–1216 (2009).
4. Holt, C. E. & Schuman, E. M. The central dogma decentralized: new perspectives on RNA function and local translation in neurons. Neuron 80, 648–657 (2013).
5. Mili, S. & Macara, I. G. RNA localization and polarity: from A(PC) to Z(BP). Trends Cell Biol 19, 156–164 (2009).
6. Buxbaum, A. R., Haimovich, G. & Singer, R. H. In the right place at the right time: visualizing and understanding mRNA localization. Nat Rev Mol Cell Biol 16, 95–109 (2015).
7. Medioni, C., Mowry, K. & Besse, F. Principles and roles of mRNA localization in animal development. Development 139, 3263–3276 (2012).
8. Zivraj, K. H. et al. Subcellular profiling reveals distinct and developmentally regulated repertoire of growth cone mRNAs. The Journal of neuroscience: the official journal of the Society for Neuroscience 30, 15464–15478 (2010).
9. Cajigas, I. J. et al. The local transcriptome in the synaptic neuropil revealed by deep sequencing and high-resolution imaging. Neuron 74, 453–466 (2012).
10. Taliaferro, J. M. et al. Distal Alternative Last Exons Localize mRNAs to Neural Projections. Molecular cell 61, 821–833 (2016).
11. Moor, A. E. et al. Global mRNA polarization regulates translation efficiency in the intestinal epithelium. Science (2017).
12. Nagaoka, K., Udagawa, T. & Richter, J. D. CPEB-mediated ZO-1 mRNA localization is required for epithelial tight-junction assembly and cell polarity. Nat Commun 3, 675 (2012).
13. Mili, S., Moissoglu, K. & Macara, I. G. Genome-wide screen reveals APC-associated RNAs enriched in cell protrusions. Nature 453, 115–119 (2008).
14. Wang, T., Hamilla, S., Cam, M., Aranda-Espinoza, H. & Mili, S. Extracellular matrix stiffness and cell contractility control RNA localization to promote cell migration. Nat Commun 8, 896 (2017).
15. Condeelis, J. & Singer, R. H. How and why does beta-actin mRNA target? Biol Cell 97, 97–110 (2005).
16. Yasuda, K., Clatterbuck-Soper, S. F., Jackrel, M. E., Shorter, J. & Mili, S. FUS inclusions disrupt RNA localization by sequestering kinesin-1 and inhibiting microtubule detyrosination. The Journal of cell biology 216, 1015–1034 (2017).
17. Liu-Yesucevitz, L. et al. Local RNA translation at the synapse and in disease. The Journal of neuroscience: the official journal of the Society for Neuroscience 31, 16086–16093 (2011).
18. Yoon, Y. J. et al. Glutamate-induced RNA localization and translation in neurons. Proceedings of the National Academy of Sciences of the United States of America (2016).
19. Katz, Z. B. et al. beta-Actin mRNA compartmentalization enhances focal adhesion stability and directs cell migration. Genes & development 26, 1885–1890 (2012).
20. Kim, S. H., Vieira, M., Shim, J. Y., Choi, H. & Park, H. Y. Recent progress in single-molecule studies of mRNA localization in vivo. RNA Biol (2018).
21. Tutucci, E., Livingston, N. M., Singer, R. H. & Wu, B. Imaging mRNA In Vivo, from Birth to Death. Annu Rev Biophys 47, 85–106 (2018).
22. Fallini, C., Donlin-Asp, P. G., Rouanet, J. P., Bassell, G. J. & Rossoll, W. Deficiency of the Survival of Motor Neuron Protein Impairs mRNA Localization and Local Translation in the Growth Cone of Motor Neurons. The Journal of neuroscience: the official journal of the Society for Neuroscience 36, 3811–3820 (2016).
23. Latham, V. M., Yu, E. H., Tullio, A. N., Adelstein, R. S. & Singer, R. H. A Rho-dependent signaling pathway operating through myosin localizes beta-actin mRNA in fibroblasts. Curr Biol 11, 1010–1016 (2001).
24. Park, H. Y., Trcek, T., Wells, A. L., Chao, J. A. & Singer, R. H. An unbiased analysis method to quantify mRNA localization reveals its correlation with cell motility. Cell Rep 1, 179–184 (2012).
25. Battich, N., Stoeger, T. & Pelkmans, L. Image-based transcriptomics in thousands of single human cells at single-molecule resolution. Nature methods 10, 1127–1133 (2013).
26. Samacoits, A. et al. A computational framework to study sub-cellular RNA localization. Nat Commun 9, 4584 (2018).
27. Noiva, R. Protein disulfide isomerase: the multifunctional redox chaperone of the endoplasmic reticulum. Semin Cell Dev Biol 10, 481–493 (1999).
28. Mingle, L. A. et al. Localization of all seven messenger RNAs for the actin-polymerization nucleator Arp2/3 complex in the protrusions of fibroblasts. Journal of cell science 118, 2425–2433 (2005).
29. Lessey, E. C., Guilluy, C. & Burridge, K. From mechanical force to RhoA activation. Biochemistry 51, 7420–7432 (2012).
30. Lund, R. R. et al. NADH-Cytochrome b5 Reductase 3 Promotes Colonization and Metastasis Formation and Is a Prognostic Marker of Disease-Free and Overall Survival in Estrogen Receptor-Negative Breast Cancer. Mol Cell Proteomics 14, 2988–2999 (2015).

Acknowledgements

We thank all members of the Mili lab for useful discussions and suggestions. This work was supported by the Intramural Research Program of the Center for Cancer Research, NCI, National Institutes of Health (S.M.).

Author information

Authors and Affiliations

Laboratory of Cellular and Molecular Biology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD, USA
Michael Stueland, Tianhong Wang & Stavroula Mili
Department of Physics and Astronomy, Seoul National University, Seoul, Korea
Hye Yoon Park

Authors

Michael Stueland
Tianhong Wang
Hye Yoon Park
Stavroula Mili

Contributions

M.S. developed the image analysis algorithm and analyzed data. T.W. performed experiments. HY.P. provided image analysis code. S.M. performed experiments and analyzed data. S.M. and M.S. wrote the manuscript. All authors reviewed and edited the text.

Corresponding author

Correspondence to Stavroula Mili.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

RDI Calculator_User Manual

Dataset 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Stueland, M., Wang, T., Park, H.Y. et al. RDI Calculator: An Analysis Tool to Assess RNA Distributions in Cells. Sci Rep 9, 8267 (2019). https://doi.org/10.1038/s41598-019-44783-2

Received: 13 March 2019
Accepted: 20 May 2019
Published: 04 June 2019
DOI: https://doi.org/10.1038/s41598-019-44783-2
