Data-analysis strategies for image-based cell profiling

Caicedo, Juan C; Cooper, Sam; Heigwer, Florian; Warchal, Scott; Qiu, Peng; Molnar, Csaba; Vasilevich, Aliaksei S; Barry, Joseph D; Bansal, Harmanjit Singh; Kraus, Oren; Wawer, Mathias; Paavolainen, Lassi; Herrmann, Markus D; Rohban, Mohammad; Hung, Jane; Hennig, Holger; Concannon, John; Smith, Ian; Clemons, Paul A; Singh, Shantanu; Rees, Paul; Horvath, Peter; Linington, Roger G; Carpenter, Anne E

doi:10.1038/nmeth.4397

Download PDF

Review Article
Open access
Published: 01 September 2017

Data-analysis strategies for image-based cell profiling

Juan C Caicedo¹,
Sam Cooper²,
Florian Heigwer ORCID: orcid.org/0000-0002-8230-1485³,
Scott Warchal⁴,
Peng Qiu⁵,
Csaba Molnar⁶,
Aliaksei S Vasilevich⁷,
Joseph D Barry⁸,
Harmanjit Singh Bansal⁹,
Oren Kraus¹⁰,
Mathias Wawer¹¹,
Lassi Paavolainen¹²,
Markus D Herrmann¹³,
Mohammad Rohban¹,
Jane Hung^1,14,
Holger Hennig ORCID: orcid.org/0000-0002-4272-2445¹⁵,
John Concannon¹⁶,
Ian Smith¹⁷,
Paul A Clemons¹¹,
Shantanu Singh¹,
Paul Rees^1,18,
Peter Horvath^6,12,
Roger G Linington ORCID: orcid.org/0000-0003-1818-4971¹⁹ &
…
Anne E Carpenter ORCID: orcid.org/0000-0003-1555-8261¹

Nature Methods volume 14, pages 849–863 (2017)Cite this article

86k Accesses
404 Citations
96 Altmetric
Metrics details

Subjects

Abstract

Image-based cell profiling is a high-throughput strategy for the quantification of phenotypic differences among a variety of cell populations. It paves the way to studying biological systems on a large scale by using chemical and genetic perturbations. The general workflow for this technology involves image acquisition with high-throughput microscopy systems and subsequent image processing and analysis. Here, we introduce the steps required to create high-quality image-based (i.e., morphological) profiles from a collection of microscopy images. We recommend techniques that have proven useful in each stage of the data analysis process, on the basis of the experience of 20 laboratories worldwide that are refining their image-based cell-profiling methodologies in pursuit of biological discovery. The recommended techniques cover alternatives that may suit various biological goals, experimental designs, and laboratories' preferences.

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Srinivas Niranj Chandrasekaran, Beth A. Cimini, … Anne E. Carpenter

Simultaneous single-cell three-dimensional genome and gene expression profiling uncovers dynamic enhancer connectivity underlying olfactory receptor choice

Article Open access 15 April 2024

Honggui Wu, Jiankun Zhang, … X. Sunney Xie

scGPT: toward building a foundation model for single-cell multi-omics using generative AI

Article 26 February 2024

Haotian Cui, Chloe Wang, … Bo Wang

Main

Image analysis is heavily used to quantify phenotypes of interest to biologists, especially in high-throughput experiments^1,2,3. Recent advances in automated microscopy and image analysis allow many treatment conditions to be tested in a single day, thus enabling the systematic evaluation of particular morphologies of cells. A further revolution is currently underway: images are also being used as unbiased sources of quantitative information about cell state in an approach known as image-based profiling or morphological profiling⁴. Herein, the term morphology will be used to refer to the full spectrum of biological phenotypes that can be observed and distinguished in images, including not only metrics of shape but also intensities, staining patterns, and spatial relationships (described in 'Feature extraction').

In image-based cell profiling, hundreds of morphological features are measured from a population of cells treated with either chemical or biological perturbagens. The effects of the treatment are quantified by measuring changes in those features in treated versus untreated control cells⁵. By describing a population of cells as a rich collection of measurements, termed the 'morphological profile', various treatment conditions can be compared to identify biologically relevant similarities for clustering samples or identifying matches or anticorrelations. This profiling strategy contrasts with image-based screening, which also involves large-scale imaging experiments but has a goal of measuring only specific predefined phenotypes and identifying outliers.

Similarly to other profiling methods that involve hundreds of measurements or more from each sample^6,7, the applications of image-based cell profiling are diverse and powerful. As reviewed recently^8,9, these applications include identifying disease-specific phenotypes, gene and allele functions, and targets or mechanisms of action of drugs.

However, the field is currently a wild frontier, including novel methods that have been proposed but not yet compared, and few methods have been used outside the laboratories in which they were developed. The scientific community would greatly benefit from sharing methods and software code at this early stage, to enable more rapid convergence on the best practices for the many steps in a typical profiling workflow (Fig. 1).

**Figure 1: Representative workflow for image-based cell profiling.**

Here, we document the options at each step in the computational workflow for image-based profiling. We divide the workflow into eight main steps (Fig. 1). For each step, we describe the process, its importance, and its applicability to different experimental types and scales. We present previously published methods relevant to each step, provide guidance regarding the theoretical pros and cons for each alternative option, and refer to any prior published comparisons of methods. We do not cover the upstream steps (sample preparation and image-acquisition recommendations)^1,2 or computational practicalities such as the necessary information-technology infrastructure to store and process images or data. The workflow's starting point is a large set of images. The assays can be specifically designed for profiling, such as Cell Painting^10,11, but any image-based assays can be used, including a panel of multiple parallel image-based assays¹², or time-lapse microscopy for analyzing dynamics¹³ or even whole organisms¹⁴.

This paper is the result of a 'hackathon', in which the authors met to discuss and share their expertise in morphological profiling. Hands-on data-analysis challenges and the accompanying discussions helped to identify the best practices in the field and to contribute algorithms to a shared code base.

We hope to provide a valuable foundation and framework for future efforts and to lower the barrier to entry for research groups that are new to image-based profiling. The detailed workflows used by each individual laboratory contributing to this article can be found online (https://github.com/shntnu/cytomining-hackathon-wiki/wiki/).

Step 1: image analysis

Image analysis transforms digital images into measurements that describe the state of every single cell in an experiment. This process makes use of various algorithms to compute measurements (often called features) that can be organized in a matrix in which the rows are cells in the experiment, and the columns are extracted features.

Field-of-view illumination correction. Every image acquired by a microscope exhibits inhomogeneous illumination mainly because a nonuniform light source or optical path often yields shading around edges. This effect is often underestimated; however, intensities usually vary by 10–30%, thus corrupting accurate segmentation and intensity measurements¹⁵. Illumination correction is a process to recover the true image from a distorted one. There are three main approaches to illumination correction:

Prospective methods. These methods build correction functions from reference images, such as dark and bright images with no sample in the foreground. The approach requires careful calibration at the time of acquisition and relies on assumptions that are often inappropriate, thus yielding an incomplete correction in practice¹⁶.

Retrospective single-image methods. These methods calculate the correction model for each image individually^17,18,19. However, the result can change from image to image and thus may alter the relative intensity.

Retrospective multi-image methods. These methods build the correction function by using the images acquired in the experiment. These methods are often based on smoothing¹⁶, surface fitting²⁰, or energy-minimization models¹⁵.

Illumination correction is an important step for high-throughput quantitative profiling; the strategy of choice in most of our laboratories is a retrospective multi-image correction function. This procedure produces more robust results, particularly when separate functions are calculated for each batch of images (often with a different function for each plate and always with a different function for different imaging sessions or instruments). We recommend use of prospective and single-image methods for only qualitative experiments.

Segmentation. Typically, each cell in the image is identified and measured individually; that is, its constituent pixels are grouped to distinguish the cell from other cells and from the background. This process is called 'segmentation' (Fig. 2), and there are two main approaches:

**Figure 2: Methods used for image analysis.**

Model based. The experimentalist chooses an appropriate algorithm and manually optimizes parameters on the basis of visual inspection of segmentation results. A common procedure is first to identify nuclei, as can often be done easily, and then to use the results as seeds for the identification of the cell outline. A priori knowledge (i.e., a 'model') is needed, such as the objects' expected size and shape²¹. Model-based approaches typically involve histogram-based methods, such as thresholding, edge detection, and watershed transformation²².

Machine learning. A classifier is trained to find the optimal segmentation solution by providing it with ground-truth data and manually indicating which pixels of an image belong to different classes of objects²³. This approach typically involves applying various transformations to the image to capture different patterns in the local pixel neighborhood. Segmentation is ultimately achieved by applying the trained model to new images to classify pixels accordingly.

Both approaches are used in profiling experiments. The model-based approach is most common (for example, in CellProfiler²⁴); it performs well for fluorescence microscopy images of cultured cells²². However, it requires manual parameter adjustment for each new experimental setup. Machine-learning-based segmentation (for example, in Ilastik²³) can perform better on difficult segmentation tasks, such as highly variable cell types or tissues. It does not require as much computational expertise, but it does require manual labeling of training pixels for each experimental setup and sometimes even for each batch of images. The creation of ground-truth data in the process of labeling allows for quantitative performance assessment.

Feature extraction. The phenotypic characteristics of each cell are measured in a step called feature extraction, which provides the raw data for profiling. The major types of features are:

Shape features. These features are computed on the boundaries of nuclei, cells, or other segmented compartments. These include standard size and shape metrics such as perimeter, area, and roundness^25,26.

Intensity-based features. These features are computed from the actual intensity values in each channel of the image on a single-cell basis, within each compartment (nucleus, cell, or other segmented compartments). These metrics include simple statistics (for example, mean intensity, and maximum intensity).

Texture features. These features quantify the regularity of intensities in images, and periodic changes can be detected by using mathematical functions such as cosines and correlation matrices. These features have been extensively used for single-cell analysis^27,28,29,30.

Microenvironment and context features. These features include counts and spatial relationships among cells in the field of view (on the basis of the number of and distance to cells in a neighborhood) as well as its position relative to a cell colony^31,32,33. Segmented regions are not limited to nuclei, and cells and may also include subcellular structures that can be quantified as measurements (for example, speckles within a nucleus or distances between the nucleus and individual cytoplasmic vesicles).

Whereas screening experiments typically measure one or two features of interest to quantify specific effects³⁴, cell profiling involves computing as many features as possible to select robust, concise, and biologically meaningful features to increase the chances of detecting changes in the molecular states of cells. The most common practice is to measure hundreds or even thousands of features of many varieties; the details are typically described in the software's documentation^24,35,36.

Step 2: image quality control

It is largely impossible to manually verify image quality in high-throughput experiments, so automated methods are needed to objectively flag or remove images and cells that are affected by artifacts. These methods seek to decrease the risk of contaminating the data with incorrect values.

Field-of-view quality control.

Images can be corrupted by artifacts such as blurring (for example, improper autofocusing) or saturated pixels (for example, debris or aggregations that are inappropriately bright). Typically, statistical measures of image intensity are used for quality control.

Metrics can be computed to detect blurring, including the ratio of the mean and the s.d. of each image's pixel intensities, the normalized measure of the intensity variance³⁷, and the image correlation across subregions of the image³⁸. The log–log slope of the power spectrum of pixel intensities is another effective option, because the high-frequency components of an image are lost as it becomes more blurred³⁹; this procedure has been found to be the most effective in a recent comparison for high-throughput microscopy⁴⁰. For detecting saturation artifacts, the percentage of saturated pixels has been found to be the best among all tested metrics.

We recommend computing various measures that represent a variety of artifacts that might occur in an experiment to increase the chance of artifact identification. Then, with data-analysis tools, these measurements can be reviewed to identify acceptable quality-control thresholds for each measure⁴⁰. It is also possible to use supervised machine-learning algorithms to identify problematic images^41,42, but these algorithms require example annotations and classifier training and validation, and thus may require more effort and introduce a risk of overfitting.

Cell-level quality control. Outlier cells may exhibit highly unusual phenotypes but may also result from errors in sample preparation, imaging, image processing, or image segmentation. Errors include incorrectly segmented cells, partly visible cells at image edges, out-of-focus cells, and staining artifacts. Although errors are best decreased through careful techniques and protocols, there are several strategies for detecting outlier cells:

Model-free outlier detection. This strategy includes methods to define normal limits by using statistics. Data points represented with a single variable (for example, distance values or single features) can be analyzed with univariate statistical tools, including the 3- or 5-s.d. rules, Winsorizing, and the adjusted box-plot rule⁴³. Robust statistics based on estimators such as the median and the median absolute deviation⁴⁴ can also be used and extended to multivariate situations⁴⁵. Additional multivariate methods include principal component analysis (PCA) and Mahalanobis-based outlier detection⁴⁶.

Model-based outlier detection. This strategy involves training a model of normal samples to aid in detecting outlier cells⁴⁷. For instance, if a linear regression among features is suitable, outliers can be detected as data points with a large residual that does not follow the general trend⁴⁸. Alternately, a supervised-machine-learning classifier can be trained by providing examples of outliers^49,50,51.

After they are detected, outlier cells can be removed, or when the number of outliers in the sample is too high, the entire sample can be examined manually or omitted from analysis^47,52. Importantly, cell-outlier detection should be performed at the whole-population level; that is, it should not be separately configured per well, per replicate, or per plate. Extreme caution is recommended, to avoid removing data points that represent cells and samples with interesting phenotypes^53,54. Samples can be composed of various subpopulations of cells, and outlier-detection methods may incorrectly assume normality or homogenous populations (Fig. 3). For this reason, most laboratories skip outlier detection at the level of individual cells, other than to check for segmentation problems.

**Figure 3: Diversity of feature distributions in morphological profiling.**

Step 3: preprocessing extracted features

Preparing extracted cell features for further analysis is a delicate step that can enhance the observation of useful patterns or can corrupt the information and lead to incorrect conclusions.

Missing values. Feature-extraction software may yield non-finite symbols (such as NaN and INF) representing incomputable values. In general, use of these symbols is preferred to assigning a numerical value that could be interpreted as having a phenotypic meaning. The presence of non-finite symbols poses challenges to applying statistics or machine-learning algorithms. There are three alternate solutions for handling missing values:

Removing cells. If a small proportion of cells have missing values, excluding them can be considered. However, those cells may indicate a valid and relevant phenotype, a possibility that should be assessed carefully (described in 'Cell-level quality control').

Removing features. If a large proportion of cells have a missing value for a particular feature, they might be removed on the grounds that the feature is insufficiently informative. Again, this removal should be assessed carefully for its effect on unexpected cell phenotypes.

Applying imputation. If the proportion of cells with missing values for certain features is relatively small, several statistical rules may be applied to complete these values. The use of zeros or the mean value is common in general statistical analysis but should not be the default option for single-cell profiling. If too many values are artificially added to the data matrix, the downstream analysis may be affected or biased by false data.

Deciding how to proceed with missing values is primarily dependent on experimental evaluations and empirical observations. Removing cells or features is more common than applying imputation. However, there is no single rule that applies in all cases, and the best practice is to collect convincing evidence supporting these decisions, especially with the use of quality measures and replicate analysis (described in 'Downstream analysis').

Plate-layout-effect correction. High-throughput assays use multiwell plates, which are subject to edge effects and gradient artifacts. Concerns regarding spatial effects across each plate are not unique to imaging and have been widely discussed in both the microarray-normalization and high-throughput-screening literature^{44,55,56,57,58}. They can be decreased to some degree at the sample-preparation step⁵⁹.

We recommend checking for plate effects to determine whether any artifacts are present within plates or across multiple batches. The simplest method is a visual check, through plotting a measured variable (often cell count or cell area) as a heat map in the same spatial format as the plate; this procedure allows for easy identification of row and column effects as well as drift across multiple plates.

We recommend using a two-way median polish to correct for positional effects. This procedure involves iterative median smoothing of rows and columns to remove positional effects, then dividing each well value by the plate median absolute deviation to generate a B score⁶⁰. However, this procedure cannot be used on nonrandom plate layouts such as compound titration series or controls placed along an entire row or column⁵⁴. Other approaches include 2D polynomial regression and running averages, both of which correct spatial biases by using local smoothing⁶¹. Notably, image-based profiling is often sufficiently sensitive to distinguish among different well positions containing the same sample. Thus, to mitigate these positional effects, samples should be placed in random locations with respect to the plate layout. However, because such scrambling of positions is rarely practical, researchers must take special care to interpret results carefully and to consider the effects that plate-layout effects might have on the biological conclusions.

Batch-effect correction. Batch effects are subgroups of measurements that result from undesired technical variation (for example, changes in laboratory conditions, sample manipulation, or instrument calibration) rather than constituting a meaningful biological signal (Fig. 4). Batch effects pose a major challenge to high-throughput methodologies, and correction is an important preliminary step; if undetected, batch effects can lead to misinterpretation and false conclusions⁶².

**Figure 4: Example diagnostic plots for detecting batch effects and plate-layout effects.**

We recommend identifying batch effects by inspecting correlations among profiles (described in 'Single-cell data aggregation'). Specifically, by plotting heat maps of the correlation between all pairs of wells within an experiment, sorted by experimental repeat, batch effects can be identified as patterns of high correlation corresponding to technical artifacts (Fig. 4a). As a quantitative check, within-plate correlations should be in the same range as across-plate correlations.

When correction is needed, standardization and quantile normalization, as discussed in 'Feature transformation and normalization', can be applied within plates rather than to the entire screen⁶³. This procedure should be performed only if samples are relatively randomly distributed across plates. Canonical correlation analysis can also be used to transform data to maximize the similarity between technical replicates across experiments^64,65. Nonetheless, care should be taken to ensure that batch effects have been correctly decreased without false amplification of other sources of noise.

Feature transformation and normalization. Morphological profiles include features that display varying shapes of statistical distributions⁶⁶. It is therefore essential to transform feature values with simple mathematical operations, such that the values are approximately normally distributed and mean centered and have comparable s.d. Normal distributions make it easier to work with numeric values from a mathematical, statistical, and computational point of view. We highlight three key steps in this process:

Distribution testing. The need for transforming feature values can be evaluated for each feature on the basis of diagnostic measures and plots (Fig. 3). Graphical methods such as histograms, cumulative distribution curves, and quantile–quantile plots allow for visual identification of features that deviate from symmetric distributions. Analytical tests can also be used, including the Kolmogorov–Smirnov (KS) test and the Kullback–Leibler divergence, both of which aim to compute ratios of deviation from normality.

Logarithmic transformations. These transformations are often used to obtain approximate normal distributions for features that have highly skewed values or require range correction^67,68. Transformations include the generalized logarithmic function⁶⁸ and other adaptations that use shrinkage terms to avoid problems with nonpositive and near-zero feature values^69,70, as well as the Box–Cox transformation⁶⁷.

Relative normalization. This procedure consists of computing statistics (for example, median and median absolute deviation) in one population of samples, and then centering and scaling the rest with respect to that population. Ideally, features are normalized across an entire screen in which batch effects are absent; however, normalization within plates is generally performed to correct for batch effects (described in 'Batch-effect correction'). When choosing the normalizing population, we suggest the use of control samples (assuming that they are present in sufficient quantity), because the presence of dramatic phenotypes may confound results. This procedure is good practice regardless of the normalization being performed within plates or across the screen. Alternately, all samples on a plate can be used as the normalizing population when negative controls are unavailable, too few, or unsuitable for some reason, and when samples on each plate are expected to not be enriched in dramatic phenotypes.

We recommend applying normalization across all features. Normalization can be applied even if features are not transformed, and it is preferable to remove biases while simultaneously fixing range issues. z-score normalization is the most commonly used procedure in our laboratories. Normalization also aligns the range of different features, thus decreasing the effects of unbalanced scales when computing similarities (described in 'Measuring profile similarity') or applying analysis algorithms (described in 'Downstream analysis'). It is advisable to compare several transformation and normalization methods, because their performance can vary significantly among assays⁷¹.

Step 4: dimensionality reduction

At this point in the workflow, it can be useful to ask which of the measured features provide the most value in answering the biological question being studied.

Dimensionality reduction aims to filter less informative features and/or merge related features in the morphological profiles, given that morphological features calculated for profiling are often relatively redundant. The resulting compact representation is computationally more tractable, and it additionally avoids overrepresentation of similar features, that is, having a subgroup of features that measure similar or redundant properties of cells. Redundant features can diminish the signals of other more complementary features that are underrepresented, thus confounding downstream analysis.

Feature selection. Feature selection reduces dimensionality by discarding individual features while leaving the remainder in their original format (and thus retaining their interpretability). Options include:

Finding correlated features. One feature is selected from a subgroup that is known to be correlated. For instance, some texture features are highly correlated; thus, not all of them are needed, because they may represent the same underlying biological property. The feature–feature correlation matrix is computed, and pairs with a correlation exceeding a given threshold are identified iteratively. At each step, the feature with the largest mean absolute correlation with the rest of the features is removed.

Filtering on the basis of replicate correlation. Features that provide the highest additional information content^69,70 on the basis of replicate correlation are iteratively selected as follows. An initial set of features is selected, and each of the remaining features is regressed on the selected set. The resulting residual data vector represents the additional information not already present in the selected features. The correlation of this residual vector across replicates is used to quantify information content. As a separate step, features with low replicate correlation are often excluded from analysis because they are too noisy^69,72.

Minimum redundancy–maximum relevance. A subset of features can have high replicate correlation without contributing substantially new information. To prevent selecting redundant features, minimum redundancy–maximum relevance⁷³ adds a constraint based on mutual information to the selection algorithm. The resulting selected features have high replicate correlation while preserving a diverse set of measurements⁷⁴.

Support-vector-machine-based recursive-feature elimination. A support vector machine is trained to implicitly weigh useful features in a classification task. Then, the features with the lowest weight are iteratively removed until the accuracy of the classification task begins to decline⁷⁵. In profiling applications, it may be desirable to select the features that best separate the treatments from the negative controls^76,77; the selected features would then be those that maximally differentiate phenotypes.

No previous studies have compared these options. Most groups use the filter method based on replicate correlation^69,70,72, and some add more powerful algorithms despite the computational cost. A combination of methods could be used, especially in tandem with the replicate-correlation strategy. There are other methodologies that may be useful, such as rescaling features in correlated groups such that their sum is one or selecting the features that contribute to most of the variance in the first two principal components.

Linear transformation. Methods of linear transformation seek lower-dimensional subspaces of higher-dimensional data that maintain information content. Linear transformation can be performed on single-cell profiles and aggregated sample-level profiles. Unlike feature selection, transformations can combine individual features, thus making the resulting features more powerful and information rich but potentially impeding their interpretability. Linear transformation across all samples in the experiment is often needed for downstream analysis, to avoid overrepresentation of related features. Options used in morphological profiling are:

PCA. This procedure maximizes variance in successive orthogonal dimensions. PCA has been shown to outperform other dimensionality-reduction methods, such as random-forest selection for discriminating small-molecule-inhibitor effects⁷⁸, and independent component analysis and statistical image moments (Zernike/Fourier) for separating cell lines and preserving cell morphology after reconstruction from a lower-dimensional space⁷⁹.

Factor analysis and linear discriminant analysis. Factor analysis, which is closely related to PCA, finds nonorthogonal combinations of features representing frequent patterns in the data⁸⁰. Linear discriminant analysis finds a projection that maximizes the separation between positive and negative controls⁸¹. Both procedures have been successfully used in morphological profiling.

Among our laboratories, and in data science more generally, PCA is the most commonly used choice. Its simplicity and ability to retain a large amount of information in fewer dimensions probably explains its popularity. One comparative analysis using image-based profiling data has shown that factor analysis, compared with some alternate transformations, can identify a compact set of dimensions and improve downstream analysis results⁷⁷.

Step 5: single-cell data aggregation

Profiles are data representations that describe the morphological state of an individual cell or a population of cells. Population-level (also called image-level or well-level) representations are obtained by aggregating the measurements of single cells into a single vector to summarize the typical features of the population, so that populations can be compared (Fig. 5).

**Figure 5: Single-cell data aggregation.**

Simple aggregations. There are three simple and commonly used strategies for creating aggregated population-level profiles from all individual cell profiles in the sample:

Mean profile. Assuming a normal distribution of features, a profile built from the means of each feature for all cells in the population can provide a useful summary. This method has been used for compound classification^77,82. The profile length is sometimes doubled by also computing the s.d. of each feature.

Median profile. Taking the median for each feature over all the cells in a sample (and optionally the median absolute deviation) can be more robust to non-normal distributions and can mitigate the effects of outliers. If outliers are artifacts or errors, this procedure is useful, but the median may misrepresent populations with rare phenotypes by considering them as undesired outliers.

KS profile. This profile compares the probability distribution of a feature in a sample with respect to negative controls by using the KS nonparametric statistical test⁸³. The resulting profile is the collection of KS statistics for the features, which reveal how different the sample is with respect to the control.

There are other tests that may perform well but have not been evaluated for morphological profiling. Such tests include the Anderson–Darling statistic and the Mann–Whitney U test. Other aggregation strategies can be designed by using bootstrap estimators previously used for phenotype classification⁸⁴.

The median profile has been found to have better performance than other profiling strategies in two different studies^16,77 and is the preferred choice in most of our laboratories. One choice that varies among groups is whether to construct profiles at the level of images, fields of view, wells, or replicates. One could, for example, calculate a mean profile across all cells in a given replicate (regardless of the image or well) or instead calculate means for each image individually and then calculate means across images to create the replicate-level profile.

Subpopulation identification and aggregation. In most image-based cell-profiling workflows, it is implicitly assumed that ensemble averages of single-cell measurements reflect the dominant biological mechanism influenced by the treatment condition. However, subpopulations of cells are known to exhibit different phenotypes even within the same well^85,86. Classifying populations of single cells on the basis of their shape^87,88,89,90, cell-cycle phase^13,88,91, or signaling state⁹² can aid in interpretation and visualization of cell-profiling data⁹³. Cellular heterogeneity poses practical challenges for effective measurement methods that account for this variability.

Making use of subpopulations usually involves three key steps:

Subpopulation identification. Cells are clustered according to their morphological phenotypes, by using single-cell profiles (from controls or from the whole experiment). Clustering can be supervised, wherein reference phenotypes are selected^94,95,96, or unsupervised, as in k-means clustering^90,97 and Gaussian mixture model fitting⁹².

Classification. Single-cell data points from all treatment conditions are then assigned to one of the subpopulations identified in the previous step. This assignment can be done by using a feature-evaluation rule, such as proximity, similarity, or feature weighting. This step is necessary because subpopulation identification is typically performed only on a subset of cells.

Aggregation. For each treatment condition, vectors are calculated and yield the number (or fraction) of cells within each subpopulation. Thus, the dimensionality of these vectors is the number of identified subpopulations.

An unproven hypothesis in the field is that profiles based on identification of phenotypically coherent subpopulations of cells should improve the accuracy of profiling, given the prevalence of heterogeneity and the existence of small subpopulations that might be ignored in mean or median profiling. In fact, to date, subpopulation-based profiling has not improved separation of treatment conditions^77,98. Nonetheless, defining subpopulations can assist in inferring biological meaning, by identifying over- and underrepresented subpopulations of cells under a given treatment condition⁹⁹ and by improving understanding of the dynamics of how cells transition between different phenotypes^98,100.

Step 6: measuring profile similarity

A key component of downstream analysis is the definition of a metric to compare treatments or experimental conditions. Similarity metrics reveal connections among morphological profiles.

Similarity-metric calculation. With a suitable metric, the similarities among a collection of treatment conditions can facilitate downstream analysis and allow for direct visualization of data structure, for example in distance heat maps (Fig. 6a). Image-based cell-profiling studies use three types of metrics:

**Figure 6: Visualizations for downstream analysis.**

Distance measures. These measures involve calculating how far apart two points are in the high-dimensional feature space. Those used in morphological profiling include Euclidean^72,83, Mahalanobis¹⁰¹, and Manhattan distances. Distance measures are very useful to quantify the difference in magnitude between profiles, because they aggregate the lengths of feature variations regardless of directionality. This procedure is useful to compute estimates of phenotypic strength of treatments with respect to controls.

Similarity measures. These measures involve computing a statistical estimate of the likelihood of a relation between two profiles. Statistics used in morphological profiling include Pearson's correlation¹⁰², Spearman's rank correlation¹⁰³, Kendall's rank correlation⁷⁸, and cosine similarity⁷⁷. Similarity measures quantify the proximity between profiles, because they detect deviations from one sample to another regardless of the absolute magnitude. This procedure is useful in finding relations and groups of samples that share common properties.

Learned similarity measures. These measures involve training machine-learning models that weight features differently according to prior knowledge about samples. The model can be a classifier that systematically identifies differences between two samples by using cross-validation¹⁰⁴ or by determining transformations of features that lead to maximal enrichment of groups of related samples⁸⁹. These strategies can highlight patterns that are not discriminated by regular metrics and that usually require more computational power to be calculated.

The performance of distance and similarity metrics relies on the quality of selected features (described in 'Feature selection'). High-dimensional feature profiles are often prone to the drawback of dimensionality, which consists of a decreasing ability of metrics to discern differences between vectors when the dimensionality increases. Dimensionality reduction can mitigate this effect (described in 'Linear transformations'). However, the choice of the metric can also be crucial, because good metrics better exploit the structure of the available features.

A comparison of metrics on one particular imaging data set has demonstrated that rank correlations (Spearman's and Kendall's) perform best for multiple untransformed feature vectors, whereas Euclidean and Manhattan distances are best for calculating z-prime factor values between positive and negative controls⁷⁸. A comparison of metrics in gene expression data sets has suggested that Pearson's correlation performs best when features are ratios, whereas Euclidean distance is best on other distributions¹⁰⁵.

The consensus from our laboratories is that selecting an optimal metric is probably specific to feature-space dimensionality and distributions that result from prior steps in the pipeline. For a typical pipeline, Pearson's correlation generally appears to be a good choice. Notably, indexes measuring clustering quality¹⁰⁶, for example the Davies–Bouldin Index, silhouette statistic, and receiver operating characteristic–area under the curve can aid in metric choice^78,98.

Concentration-effect handling. In experiments involving chemical perturbations, multiple concentrations are usually tested. Generally, researchers are interested in identifying phenotypic similarities among compounds even if those similarities occur at different doses. The following strategies are used to compute dose-independent similarity metrics:

Titration-invariant similarity score. First, the titration series of a compound is built by computing the similarity score between each dose and negative controls. Then, the set of scores is sorted by increasing dose and is split into subseries by using a window of certain size (for instance, windows of three doses). Two compounds are compared by computing the correlation between their subwindows, and only the maximum value is retained⁸³.

Maximum correlation. For a set of n doses for each compound, the NxN correlation matrix is computed between all pairs of concentrations, and the maximum value is used as the dose-independent similarity score⁷².

The use of the maximum correlation is practical when a small number of concentrations are being tested. Depending on the experimental design, multiple concentrations can be treated differently. For instance, doses that do not yield a profile distinct from those of negative controls can be omitted, and the remaining doses can be combined to yield a single profile for the compound. Alternatively, if all concentrations are expected to have a phenotype, an entire compound can be left out of the analysis when its doses do not cluster together consistently¹⁰⁷. In addition, high doses can be removed if they are observed to be too toxic according to certain criteria, such as a minimum cell count^102,107.

Step 7: assay quality assessment

Assessing quality for morphological profiling assays can be challenging: basing the assessment on a few positive controls is not reassuring, but there are rarely a large number of controls available, nor are there other sources of ground truth. Every measured profile combines a mixture of the signal relating to the perturbation together with unintended effects such as batch effects and biological noise. Tuning the sample-preparation technique, choosing cell lines or incubation times, and choosing among alternatives within the computational pipeline all benefit from use of a quantitative indicator of whether the assay is better or worse as a result of particular design choices. Options include:

Comparison to ground truth. If the expected similarities between pairs of biological treatments are known, they can be used to validate predicted values. For instance, different concentrations of the same compound are expected to cluster together, and computed similarities should reflect that clustering. Similarly, if a subset of biological treatments is known to fall into particular classes, classification accuracy can be an appropriate metric⁷⁷. However, it is challenging to obtain ground-truth annotations at a large scale. To our knowledge, the only publicly available image data set with a large number of class annotations is for human MCF7 breast cancer cells (in this case, various classes of compound 'mechanisms of action')¹⁰⁸. Importantly, for proper evaluation of this data set, one complete compound set, including all concentrations, should be left out of training. A common mistake is to leave out a single dose of a single compound, inappropriately leaving the remaining doses of the same compound available to the classifier for training. Additional benchmarks beyond this data set are greatly needed.

Replicate reproducibility. This is typically measured as the similarity among the profiles of replicate pairs of the same biological treatment, which should be significantly higher than the similarity to profiles of other experimental conditions (controls and/or other biological treatments). This procedure requires at least two replicates of the experiment, a condition usually met for modern morphological profiling experiments. To assess significance, similarity scores are compared with a suitable null distribution. A null distribution is usually built with pairs of samples that are not expected to be highly correlated, and it mainly depends on the hypothesis being tested. For instance, the use of all pairs of biological treatments can provide a diverse null distribution for measuring replicate correlation, and a null formed by random pairs of control samples can be compared against controls grouped by well location to reveal position effects. A P value can be computed nonparametrically by evaluating the probability of random pairs having greater similarity than a particular replicate pair.

Effect size. The difference between positive and negative controls, also known as the effect size, can be used as a measure of quality. This measure can be computed with a wide variety of statistical formulations, including univariate and multivariate methods, and also by assuming parametric and nonparametric models^109,110. The disadvantage of this approach is that maximizing effect size alone may cause a bias toward detecting only those phenotypes that distinguish the control while ignoring other phenotypes.

Exploratory approaches. Several methods have not been tested but might prove useful. Clustering can be used to ascertain the overall structure of relationships among samples in the experiment: a pipeline that produces substructures or many distinct clusters is likely to be preferable over one in which the distances between all pairs of samples are similar. The cumulative variance of the principal components is a metric not yet applied to morphological profiling experiments. Highly diverse signals from different biological treatments should require more components to explain a predefined fraction of variance (for example, 99%).

Currently, replicate reproducibility is the most commonly used method, given that ground truth is rarely available. Specifically, methods are often optimized to maximize the percentage of replicates that are reproducible relative to a null (under suitable cross validation). Using a null comprising pairwise correlations between different treatments is safer than using a null comprising correlations between treatments and negative controls; in the latter case, it is possible to optimize the assay to distinguish samples from negative controls while diminishing important differences among samples.

Step 8: downstream analysis

Downstream analysis is the process of interpreting and validating patterns in the morphological profiles. The most important readouts are the similarities and relationships among the experimental conditions tested. Visualization of the relationships and the use of machine learning can help to uncover biologically meaningful structures and connections among various treated samples. Most laboratories use a combination of these strategies; generally, unsupervised clustering is a good starting point for exploring the data. From there, the goals of the study strongly influence the combination of approaches used.

Clustering. Finding clusters is one of the most effective ways of extracting meaningful relationships from morphological profiles. Clustering algorithms can be used for identifying new associations among treatments as well as validating known connections and ruling out batch effects. There are several ways of clustering a data set. Hierarchical clustering, the most widely adopted strategy, is used to identify groups of highly correlated experimental conditions⁸⁷ and to identify treatments with unexpected positive or negative connections⁹⁹. Although it is not discussed in detail here, examining relationships among features rather than among samples can yield useful biological insights: for example, the amount of mitochondrial material in cells is generally proportional to cell size, thus revealing stereotyped control of these parameters, but certain chemical perturbants can disrupt this relationship¹¹¹.

Hierarchical clustering is computed by using a similarity matrix that contains the similarity values for all pairs of samples (described in 'Measuring profile similarity'). This similarity matrix can be visualized as a heat map to reveal patterns in the data for several or up to hundreds of samples. The heat maps' rows and columns are typically sorted by using the hierarchical structure discovered by the clustering algorithm. This hierarchical structure is known as a dendrogram, which links samples together according to their proximity in the feature space, and is usually visualized together with the heat map to highlight negative and positive correlations in the data (Fig. 6a). Bootstrapping has been used to evaluate the statistical significance of the results obtained with hierarchical clustering, as well as other probabilistic algorithms used in the analysis of single-cell populations³². Resampling methods can generally be used to estimate variance, error bars, or other statistical properties of the data and can aid in making more accurate predictions and interpretations.

Visualization of high-dimensional data. Visualizations are useful to reveal the distribution and grouping of high-dimensional data points by using a 2D (and sometimes 3D) map layout that approximates their positions in the feature space. The relationships among points are implicitly encoded in how close together or far apart they are in the visualization. This method can be used to study cell heterogeneity by using single-cell data points, or sample relations by using aggregated profiles. Single-cell data are usually downsampled for practical reasons: to decrease data size and identify rare cell types^112,113. The following are the most common approaches for data visualization:

Data projections. A projection of the data matrix is displayed in a 2D (or 3D) scatter plot that approximates the geometry of the original point cloud. The most common methods include PCA (Fig. 6b), Isomap¹¹⁴, t-distributed stochastic neighbor embedding (tSNE)¹¹⁵ (Fig. 6c), and viSNE¹¹⁶.

Hierarchical visualizations. Plots are used to find structures in the data and reveal relationships between samples (Fig. 6d,e). The most commonly used choices are spanning-tree progression analysis of density-normalized events (SPADE)^113,117 and minimum spanning trees¹¹⁸, which allow for relationships among hierarchical groups of single cells or samples to be identified by using branches that may represent phenotypes or treatments.

In many cases, data points in a visualization are colored on the basis of positive controls or otherwise known labels in the data, a common practice in analysis of single-cell flow cytometry data^116,119,120. The color code can also illustrate other information in the data set, such as cell phenotypes, compound doses, values of measured features, or treatment conditions (Fig. 6e). Visualizations can be more effective if they are interactive, thereby allowing researchers to create and test hypotheses ad hoc. Software packages such as Shiny, GGobi, iPlots in R, Bokeh in Python, and D3.js in Javascript provide interactive plotting capacities, most of which can also be deployed in server-client environments for dissemination to the public.

Classification. Classification rules can be useful for transferring labels from annotated samples to unknown data points, for example, classifying the mechanism of action of new compounds in a chemical library. As such, classification strategies require prior knowledge in the form of annotations for at least some of the data points in the collection. Given examples of data points that belong to different classes of interest, supervised classification algorithms learn a rule that computes the probability of each unknown data point falling into one of the classes.

It is relatively uncommon to have a large number of annotated samples in morphological profiling, because most experiments are designed to be exploratory. However, when this information is available, a classification strategy can provide informative insights into the treatments. The most commonly used classification rule in morphological profiling experiments is the nearest-neighbors algorithm, which finds the closest data points in the collection of annotated samples and recommends a label for the new sample. For instance, this algorithm has been used for classifying the mechanism of action in a compound library⁷⁷. Other supervised prediction models can also be used to learn relations between morphological features and biological activity assays, such as Bayesian matrix factorization, neural networks, and random forests¹²¹.

The classification performance is validated in a holdout test using precision, recall, and accuracy measures. It is absolutely critical for confidence in these metrics that the holdout test set not overlap with any data points in the training set. The most recommended practice is to use samples treated in a different experimental batch to create the holdout test set (other ground-truth recommendations are described in 'Assay quality assessment').

Sharing

Both authors and the scientific community benefit from sharing code and data¹²². Numerous tools currently exist that address the steps outlined in this paper (Box 1); these tools can be useful both for beginners to experiment with and learn from and for experts to integrate into pipelines and build upon. Although data must be kept confidential for sensitive patient material, intellectual-property concerns are generally not the major issue with sharing; the primary hurdle in the process is usually the often substantial time and effort required of the authors. We do not consider code or data labeled 'available upon request' to qualify as being openly shared, given the poor efficacy statistics^123,124. We therefore recommend the following options to make code and data available publicly online.

Code sharing. Options for sharing code include:

Step-by-step narrative. For software with only a graphical user interface, a detailed walkthrough of each step of the workflow can be provided; however, this option is suboptimal.

Online code repository. The code should preferably be publicly hosted rather than being provided on a university website or as journal supplemental files. The options range from repositories such as Github and BitBucket to tools such as Jupyter notebooks and knitr documents¹²⁵, which allow for reproducible reports containing code, documentation, and figures to be shared within a single document.

Packaging. Researchers can capture and share the computational environment used to create the results, such as providing virtual machines or Docker containers. Doing so ensures that all code, dependencies and data are available in a single container^126,127, which is convenient for the user and also protects against changes in software libraries and dependencies.

Data sharing. In image-based cell profiling, publicly available data are valuable not only for reproducing results but also for identifying completely new biological findings. Options include:

Sharing processed data only. Sharing only processed data (for example, extracted features) has been common, often through supplemental data files available via the journal or via a general-purpose data repository such as Dryad (http://datadryad.org/).

Sharing images and data online. Few raw-image sets have been made available online, primarily because of the large size of the image data (tens of gigabytes for each 384-well plate) and therefore the high cost of maintaining the data on public servers. However, recent initiatives are decreasing this cost for authors, including the Image Data Resource (IDR; https://idr-demo.openmicroscopy.org/)¹²⁸, which accepts cellular images at the scale of high-throughput image profiling experiments. Generally, smaller sets of annotated images for testing image analysis methods are available in the Broad Bioimage Benchmark Collection (https://data.broadinstitute.org/bbbc/)¹⁰⁸ and the Cell Image Library (http://www.cellimagelibrary.org/). Some resources, such as IDR, support using an ontology for describing phenotypes¹²⁹. Before these public resources became available, some laboratories provided the data through their institutional servers^{13,32,52,89,103,130,131}. Tools such as OMERO¹³² and openBIS¹³³ have been used to create project-specific portals for easy online data exploration^32,52,130, but bulk download of very large data sets can remain challenging.

We strongly encourage sharing of both data and images online, given how rapidly feature-extraction methods are changing, particularly via deep-learning methods (described in 'Alternate workflows').

Box 1: SOFTWARE TOOLS

A large range of software tools and libraries currently exist that seek to address the steps outlined in this paper. For each step, the alternatives are usually several software packages or programming languages that require either parameterization or coding.

Tools for image-analysis software have been previously reviewed¹⁵⁰, and the variety in functionalities and platforms can fit a diverse range of workflows. Some of the open-source alternatives include CellProfiler²⁴ and EBImage³⁵, whereas Columbus and MetaXpress are commercial solutions.

After collection of features or measurements with image-analysis software, the next steps in the workflow may require a combination of tools and programming languages. Statistical packages such as R have proven to be very useful for single-cell data analysis, including cytominer, which is specific to morphological profiling. Other programming languages such as Python, Matlab and shell scripts can be used to process data with specific algorithms, including machine learning, data transformation, or simple data filtering and selection.

Each step may require specialized methods or may be solved with off-the-shelf implementations. The field is constantly changing, and the next breakthroughs in theory and practice may require new tools not yet available. In either case, the practice of sharing code is highly valued, to ensure rapid implementation of techniques, optimization of pipelines, and reproducibility of the results by others.

Alternate workflows

The data-processing workflow and recommendations presented in this paper have evolved as a result of years of efforts in different laboratories. They have been robustly used in various studies and have proven to be successful in making biological discoveries^8,9. However, the field is eager to adapt as the computer-vision and machine-learning communities make progress in designing new algorithms for processing image data. Some of our laboratories are already exploring alternate workflows, such as those described below.

Segmentation-free classical-feature extraction. Instead of identifying single cells that are measured and characterized, this strategy computes classical features from whole field-of-view images or from discrete tiles within images. Examples of these include PhenoRipper^134,135 and WND-Charm/CP-CHARM^136,137,138.

Deep-learning feature extraction. Deep learning techniques have recently and dramatically come to dominate the state-of-the-art performance in various computer vision tasks¹³⁹. The most relevant model for image analysis is currently the convolutional neural network (CNN), which learns to extract useful features directly from raw pixel data by using multiple nonlinear transformations, in contrast to the classical features described in 'Feature extraction'. This model has been used for segmentation and classification of biomedical images^140,141, for phenotype discovery in single-cell images from imaging flow cytometry¹⁴², and more recently for deep-learning approaches for morphological profiling: morphological profiling^143,144. The following are the most relevant deep-learning approaches for morphological profiling:

Learning features from raw pixels. This approach has been used for problems in which phenotypes of interest are predefined, and a set of categorized examples is needed to train the network. This approach has been successfully used for protein-localization problems^145,146,147 and mechanism-of-action prediction¹⁴⁴. Input images can be single cells^146,147 or full fields of view^144,145.

Transferring learned features from other domains. Using a CNN trained on a large data set for other tasks different from the original is known as transfer learning. CNNs pretrained with natural images have been evaluated as feature extractors for full image profiling of compounds; its accuracy matches the results of classical features without requiring segmentation or training¹⁴³. The preprocessing steps described in 'Field-of-view quality control' and 'Field-of-view illumination correction' are still likely to be necessary for obtaining improved results. If there are few annotations available for phenotype-classification tasks, transfer learning can also be used to improve performance¹⁴⁶.

Learning transformations of classical features. feature transformations similar to those described in 'Linear transformations' can be obtained with a technique known as the autoencoder. Deep autoencoders have been evaluated for high-content morphology data, thus suggesting that they may potentially have better performance for downstream analysis according to homogeneity of clusters¹⁴⁸. Another study has evaluated deep autoencoders for profiling and has also obtained competitive performance¹⁴⁹.

Using full images results in a loss of single-cell resolution but offers several advantages: the avoidance of the segmentation step eliminates the sometimes tedious manual tuning of segmentation and feature extraction algorithms, saves computation time, avoids segmentation errors, and may better capture visual patterns resulting from multiple cells. Using single-cell images explicitly captures heterogeneity and may offer improved accuracy with less training.

Although segmentation-free classical-feature extraction can be helpful for quality control, we generally consider it to be incapable of accomplishing most profiling tasks. Deep-learning techniques, although not yet proven to be more powerful than the standard workflow, are nonetheless very promising. We are actively pursuing optimized workflows based on deep learning and are gaining an understanding of how these techniques can be adapted for improving the computation and interpretation of useful image features.

We caution that it is possible to obtain excellent results on a ground-truth data set with a method that fails in realistic-use cases. This phenomenon may be especially true for machine-learning-based methods with millions of internal parameters and again reinforces the need for new and disparate sets of ground-truth data in the field.

Conclusions

It is an exciting time for the field of image-based cell profiling, as methods are rapidly evolving and applications leading to major biological discoveries are beginning to be published. We see the collection and sharing of large biologically interesting image sets, the organizing of benchmark ground-truth data sets, and the testing of new methods to be the major areas in which effort is currently most needed.

In future work, as a community, we aim to build shared codebases, namely toolboxes of algorithms in R and Python. The beginnings of this effort can be found online (https://github.com/CellProfiler/cytominer/), and we welcome additional contributors as well as participants in the cytomining hackathon, which will be held annually. A shared codebase will facilitate the development and dissemination of novel methods and the comparison of alternative methods, particularly as additional ground-truth data become publicly available.

Data availability

This work did not analyze new data. The plots and figures presented in the manuscript were obtained by processing the BBBC021 image collection, which is publicly available in https://data.broadinstitute.org/bbbc/BBBC021/.

References

Boutros, M., Heigwer, F. & Laufer, C. Microscopy-based high-content screening. Cell 163, 1314–1325 (2015).
Article CAS PubMed Google Scholar
Mattiazzi Usaj, M. et al. High-content screening for quantitative cell biology. Trends Cell Biol. 26, 598–611 (2016).
Article CAS PubMed Google Scholar
Fetz, V., Prochnow, H., Brönstrup, M. & Sasse, F. Target identification by image analysis. Nat. Prod. Rep. 33, 655–667 (2016).
Article CAS PubMed Google Scholar
Pennisi, E. 'Cell painting' highlights responses to drugs and toxins. Science 352, 877–878 (2016).
Article CAS PubMed Google Scholar
Grys, B.T. et al. Machine learning and computer vision approaches for phenotypic profiling. J. Cell Biol. 216, 65–71 (2017).
Article CAS PubMed PubMed Central Google Scholar
Feng, Y., Mitchison, T.J., Bender, A., Young, D.W. & Tallarico, J.A. Multi-parameter phenotypic profiling: using cellular effects to characterize small-molecule compounds. Nat. Rev. Drug Discov. 8, 567–578 (2009).
Article CAS PubMed Google Scholar
Mader, C.C., Subramanian, A. & Bittker, J. Multidimensional profile based screening: understanding biology through cellular response signatures. in High Throughput Screening Methods: Evolution and Refinement (eds. Bittker, J.A. & Ross, N.T.) 214–238 (RSC Publishing, 2016).
Caicedo, J.C., Singh, S. & Carpenter, A.E. Applications in image-based profiling of perturbations. Curr. Opin. Biotechnol. 39, 134–142 (2016).
Article CAS PubMed Google Scholar
Bougen-Zhukov, N., Loh, S.Y., Lee, H.K. & Loo, L.-H. Large-scale image-based screening and profiling of cellular phenotypes. Cytometry A 91, 115–125 (2017).
Article PubMed Google Scholar
Gustafsdottir, S.M. et al. Multiplex cytological profiling assay to measure diverse cellular states. PLoS One 8, e80999 (2013).
Article PubMed PubMed Central CAS Google Scholar
Bray, M.-A. et al. Cell Painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes. Nat. Protoc. 11, 1757–1774 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kang, J. et al. Improving drug discovery with high-content phenotypic screens by systematic selection of reporter cell lines. Nat. Biotechnol. 34, 70–77 (2016).
Article CAS PubMed Google Scholar
Neumann, B. et al. Phenotypic profiling of the human genome by time-lapse microscopy reveals cell division genes. Nature 464, 721–727 (2010).
Article CAS PubMed PubMed Central Google Scholar
Hasson, S.A. & Inglese, J. Innovation in academic chemical screening: filling the gaps in chemical biology. Curr. Opin. Chem. Biol. 17, 329–338 (2013).
Article CAS PubMed PubMed Central Google Scholar
Smith, K. et al. CIDRE: an illumination-correction method for optical microscopy. Nat. Methods 12, 404–406 (2015).
Article CAS PubMed Google Scholar
Singh, S., Bray, M.-A., Jones, T.R. & Carpenter, A.E. Pipeline for illumination correction of images for high-throughput microscopy. J. Microsc. 256, 231–236 (2014).
Article CAS PubMed PubMed Central Google Scholar
Likar, B., Maintz, J.B., Viergever, M.A. & Pernus, F. Retrospective shading correction based on entropy minimization. J. Microsc. 197, 285–295 (2000).
Article CAS PubMed Google Scholar
Lévesque, M.P. & Lelièvre,, M. Evaluation of the iterative method for image background removal in astronomical images. (TN 2007-344) (DRDC Valcartier, 2008).
Babaloukas, G., Tentolouris, N., Liatis, S., Sklavounou, A. & Perrea, D. Evaluation of three methods for retrospective correction of vignetting on medical microscopy images utilizing two open source software tools. J. Microsc. 244, 320–324 (2011).
Article PubMed Google Scholar
Can, A. et al. Multi-modal imaging of histological tissue sections. in 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro 288–291 (2008).
Molnar, C. et al. Accurate morphology preserving segmentation of overlapping cells based on active contours. Sci. Rep. 6, 32412 (2016).
Article CAS PubMed PubMed Central Google Scholar
Stoeger, T., Battich, N., Herrmann, M.D., Yakimovich, Y. & Pelkmans, L. Computer vision for image-based transcriptomics. Methods 85, 44–53 (2015).
Article CAS PubMed Google Scholar
Sommer, C., Straehle, C., Köthe, U. & Hamprecht, F.A. Ilastik: interactive learning and segmentation toolkit. in 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro 230–233 (2011).
Carpenter, A.E. et al. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 7, R100 (2006).
Article PubMed PubMed Central CAS Google Scholar
Rodenacker, K. & Bengtsson, E. A feature set for cytometry on digitized microscopic images. Anal. Cell. Pathol. 25, 1–36 (2003).
Article PubMed PubMed Central Google Scholar
Wählby, C. Algorithms for applied digital image cytometry PhD thesis. Uppsala University (2003).
Haralick, R.M., Shanmugam, K. & Dinstein, I. Textural features for image classification. IEEE Trans. Syst. Man Cybern. SMC-3, 610–621 (1973).
Article Google Scholar
Turner, M.R. Texture discrimination by Gabor functions. Biol. Cybern. 55, 71–82 (1986).
CAS PubMed Google Scholar
Boland, M.V., Markey, M.K. & Murphy, R.F. Automated recognition of patterns characteristic of subcellular structures in fluorescence microscopy images. Cytometry 33, 366–375 (1998).
Article CAS PubMed Google Scholar
Coelho, L.P. et al. Determining the subcellular location of new proteins from microscope images using local features. Bioinformatics 29, 2343–2349 (2013).
Article CAS PubMed PubMed Central Google Scholar
Snijder, B. et al. Population context determines cell-to-cell variability in endocytosis and virus infection. Nature 461, 520–523 (2009).
Article CAS PubMed Google Scholar
Snijder, B. et al. Single-cell analysis of population context advances RNAi screening at multiple levels. Mol. Syst. Biol. 8, 579 (2012).
Article PubMed PubMed Central CAS Google Scholar
Sero, J.E. et al. Cell shape and the microenvironment regulate nuclear translocation of NF-κB in breast epithelial and tumor cells. Mol. Syst. Biol. 11, 790 (2015).
Article PubMed CAS Google Scholar
Singh, S., Carpenter, A.E. & Genovesio, A. Increasing the content of high-content screening: an overview. J. Biomol. Screen. 19, 640–650 (2014).
Article PubMed PubMed Central CAS Google Scholar
Pau, G., Fuchs, F., Sklyar, O., Boutros, M. & Huber, W. EBImage: an R package for image processing with applications to cellular phenotypes. Bioinformatics 26, 979–981 (2010).
Article CAS PubMed PubMed Central Google Scholar
Schneider, C.A., Rasband, W.S. & Eliceiri, K.W. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods 9, 671–675 (2012).
Article CAS PubMed PubMed Central Google Scholar
Groen, F.C., Young, I.T. & Ligthart, G. A comparison of different focus functions for use in autofocus algorithms. Cytometry 6, 81–91 (1985).
Article CAS PubMed Google Scholar
Haralick, R.M. Statistical and structural approaches to texture. Proc. IEEE 67, 786–804 (1979).
Article Google Scholar
Field, D.J. & Brady, N. Visual sensitivity, blur and the sources of variability in the amplitude spectra of natural scenes. Vision Res. 37, 3367–3383 (1997).
Article CAS PubMed Google Scholar
Bray, M.-A., Fraser, A.N., Hasaka, T.P. & Carpenter, A.E. Workflow and metrics for image quality control in large-scale high-content screens. J. Biomol. Screen. 17, 266–274 (2012).
Article CAS PubMed Google Scholar
Goode, A. et al. Distributed online anomaly detection in high-content screening. in 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro 249–252 (2008).
Lou, X., Fiaschi, L., Koethe, U. & Hamprecht, F.A. Quality classification of microscopic imagery with weakly supervised learning. in Machine Learning in Medical Imaging (eds. Wang, F., Shen, D., Yan, P. & Suzuki, K.) 176–183 (Springer Berlin Heidelberg, 2012).
Bamnett, V. & Lewis, T. Outliers in statistical data (Wiley, 1994).
Malo, N., Hanley, J.A., Cerquozzi, S., Pelletier, J. & Nadon, R. Statistical practice in high-throughput screening data analysis. Nat. Biotechnol. 24, 167–175 (2006).
CAS PubMed Google Scholar
Liberali, P., Snijder, B. & Pelkmans, L. Single-cell and multivariate approaches in genetic perturbation screens. Nat. Rev. Genet. 16, 18–32 (2015).
Article CAS PubMed Google Scholar
Prastawa, M., Bullitt, E., Ho, S. & Gerig, G. A brain tumor segmentation framework based on outlier detection. Med. Image Anal. 8, 275–283 (2004).
Article PubMed Google Scholar
Hulsman, M. et al. Analysis of high-throughput screening reveals the effect of surface topographies on cellular morphology. Acta Biomater. 15, 29–38 (2015).
Article CAS PubMed Google Scholar
Rousseeuw, P.J. & Leroy, A.M. Robust Regression and Outlier Detection (Wiley, 2005).
Rämö, P., Sacher, R., Snijder, B., Begemann, B. & Pelkmans, L. CellClassifier: supervised learning of cellular phenotypes. Bioinformatics 25, 3028–3030 (2009).
Article PubMed CAS Google Scholar
Horvath, P., Wild, T., Kutay, U. & Csucs, G. Machine learning improves the precision and robustness of high-content screens: using nonlinear multiparametric methods to analyze screening results. J. Biomol. Screen. 16, 1059–1067 (2011).
Article CAS PubMed Google Scholar
Dao, D. et al. CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets. Bioinformatics 32, 3210–3212 (2016).
Article CAS PubMed PubMed Central Google Scholar
Liberali, P., Snijder, B. & Pelkmans, L. A hierarchical map of regulatory genetic interactions in membrane trafficking. Cell 157, 1473–1487 (2014).
Article CAS PubMed Google Scholar
Zhu, Y., Hernandez, L.M., Mueller, P., Dong, Y. & Forman, M.R. Data acquisition and preprocessing in studies on humans: what is not taught in statistics classes? Am. Stat. 67, 235–241 (2013).
Article PubMed PubMed Central Google Scholar
Mpindi, J.-P. et al. Impact of normalization methods on high-throughput screening data with high hit rates and drug testing with dose-response data. Bioinformatics 31, 3815–3821 (2015).
CAS PubMed PubMed Central Google Scholar
Kluger, Y., Yu, H., Qian, J. & Gerstein, M. Relationship between gene co-expression and probe localization on microarray slides. BMC Genomics 4, 49 (2003).
Article PubMed PubMed Central Google Scholar
Yu, H. et al. Positional artifacts in microarrays: experimental verification and construction of COP, an automated detection tool. Nucleic Acids Res. 35, e8 (2007).
Article PubMed Google Scholar
Makarenkov, V. et al. An efficient method for the detection and elimination of systematic error in high-throughput screening. Bioinformatics 23, 1648–1657 (2007).
Article CAS PubMed Google Scholar
Homouz, D., Chen, G. & Kudlicki, A.S. Correcting positional correlations in Affymetrix genome chips. Sci. Rep. 5, 9078 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lundholt, B.K., Scudder, K.M. & Pagliaro, L. A simple technique for reducing edge effect in cell-based assays. J. Biomol. Screen. 8, 566–570 (2003).
Article CAS PubMed Google Scholar
Brideau, C., Gunter, B., Pikounis, B. & Liaw, A. Improved statistical methods for hit selection in high-throughput screening. J. Biomol. Screen. 8, 634–647 (2003).
Article PubMed Google Scholar
Reisen, F. et al. Linking phenotypes and modes of action through high-content screen fingerprints. Assay Drug Dev. Technol. 13, 415–427 (2015).
Article CAS PubMed Google Scholar
Leek, J.T. et al. Tackling the widespread and critical impact of batch effects in high-throughput data. Nat. Rev. Genet. 11, 733–739 (2010).
Article CAS PubMed Google Scholar
Bolstad, B.M., Irizarry, R.A., Astrand, M. & Speed, T.P. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19, 185–193 (2003).
Article CAS PubMed Google Scholar
Vaisipour, S. Detecting, correcting, and preventing the batch effects in multi-site data, with a focus on gene expression microarrays. PhD thesis University of Alberta (2014).
Stein, C.K. et al. Removing batch effects from purified plasma cell gene expression microarrays with modified ComBat. BMC Bioinformatics 16, 63 (2015).
Article PubMed PubMed Central CAS Google Scholar
Haney, S.A. Rapid assessment and visualization of normality in high-content and other cell-level data and its impact on the interpretation of experimental results. J. Biomol. Screen. 19, 672–684 (2014).
Article PubMed Google Scholar
Durbin, B.P., Hardin, J.S., Hawkins, D.M. & Rocke, D.M. A variance-stabilizing transformation for gene-expression microarray data. Bioinformatics 18 (Suppl. 1), S105–S110 (2002).
Article PubMed Google Scholar
Huber, W., von Heydebreck, A., Sültmann, H., Poustka, A. & Vingron, M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 18 (Suppl. 1), S96–S104 (2002).
Article PubMed Google Scholar
Laufer, C., Fischer, B., Billmann, M., Huber, W. & Boutros, M. Mapping genetic interactions in human cancer cells with RNAi and multiparametric phenotyping. Nat. Methods 10, 427–431 (2013).
Article CAS PubMed Google Scholar
Fischer, B. et al. A map of directional genetic interactions in a metazoan cell. eLife 4, e05464 (2015).
Article PubMed Central CAS Google Scholar
Birmingham, A. et al. Statistical methods for analysis of high-throughput RNA interference screens. Nat. Methods 6, 569–575 (2009).
Article CAS PubMed PubMed Central Google Scholar
Woehrmann, M.H. et al. Large-scale cytological profiling for functional analysis of bioactive compounds. Mol. Biosyst. 9, 2604–2617 (2013).
Article CAS PubMed Google Scholar
Ding, C. & Peng, H. Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3, 185–205 (2005).
Article CAS PubMed Google Scholar
Ng, A.Y.J. et al. A cell profiling framework for modeling drug responses from HCS imaging. J. Biomol. Screen. 15, 858–868 (2010).
Article CAS PubMed Google Scholar
Guyon, I., Weston, J., Barnhill, S. & Vapnik, V. Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002).
Article Google Scholar
Loo, L.-H., Wu, L.F. & Altschuler, S.J. Image-based multivariate profiling of drug responses from single cells. Nat. Methods 4, 445–453 (2007).
Article CAS PubMed Google Scholar
Ljosa, V. et al. Comparison of methods for image-based profiling of cellular morphological responses to small-molecule treatment. J. Biomol. Screen. 18, 1321–1329 (2013).
Article CAS PubMed Google Scholar
Reisen, F., Zhang, X., Gabriel, D. & Selzer, P. Benchmarking of multivariate similarity measures for high-content screening fingerprints in phenotypic drug discovery. J. Biomol. Screen. 18, 1284–1297 (2013).
Article CAS PubMed Google Scholar
Pincus, Z. & Theriot, J.A. Comparison of quantitative methods for cell-shape analysis. J. Microsc. 227, 140–156 (2007).
Article CAS PubMed Google Scholar
Young, D.W. et al. Integrating high-content screening and ligand-target prediction to identify mechanism of action. Nat. Chem. Biol. 4, 59–68 (2008).
Article CAS PubMed Google Scholar
Kümmel, A. et al. Integration of multiple readouts into the Z′ factor for assay quality assessment. J. Biomol. Screen. 15, 95–101 (2010).
Article PubMed CAS Google Scholar
Adams, C.L. et al. Compound classification using image-based cellular phenotypes. Methods Enzymol. 414, 440–468 (2006).
Article CAS PubMed Google Scholar
Perlman, Z.E. et al. Multidimensional drug profiling by automated microscopy. Science 306, 1194–1198 (2004).
Article CAS PubMed Google Scholar
Candia, J. et al. From cellular characteristics to disease diagnosis: uncovering phenotypes with supercells. PLoS Comput. Biol. 9, e1003215 (2013).
Article CAS PubMed PubMed Central Google Scholar
Altschuler, S.J. & Wu, L.F. Cellular heterogeneity: do differences make a difference? Cell 141, 559–563 (2010).
Article CAS PubMed PubMed Central Google Scholar
Snijder, B. & Pelkmans, L. Origins of regulated cell-to-cell variability. Nat. Rev. Mol. Cell Biol. 12, 119–125 (2011).
Article CAS PubMed Google Scholar
Bakal, C., Aach, J., Church, G. & Perrimon, N. Quantitative morphological signatures define local signaling networks regulating cell morphology. Science 316, 1753–1756 (2007).
Article CAS PubMed Google Scholar
Jones, T.R. et al. CellProfiler Analyst: data exploration and analysis software for complex image-based screens. BMC Bioinformatics 9, 482 (2008).
Article PubMed PubMed Central CAS Google Scholar
Fuchs, F. et al. Clustering phenotype populations by genome-wide RNAi and multiparametric imaging. Mol. Syst. Biol. 6, 370 (2010).
Article PubMed PubMed Central CAS Google Scholar
Sailem, H., Bousgouni, V., Cooper, S. & Bakal, C. Cross-talk between Rho and Rac GTPases drives deterministic exploration of cellular shape space and morphological heterogeneity. Open Biol. 4, 130132 (2014).
Article PubMed PubMed Central CAS Google Scholar
Mukherji, M. et al. Genome-wide functional analysis of human cell-cycle regulators. Proc. Natl. Acad. Sci. USA 103, 14819–14824 (2006).
Article CAS PubMed PubMed Central Google Scholar
Singh, D.K. et al. Patterns of basal signaling heterogeneity can distinguish cellular populations with different drug sensitivities. Mol. Syst. Biol. 6, 369 (2010).
Article PubMed PubMed Central CAS Google Scholar
Sailem, H.Z., Cooper, S. & Bakal, C. Visualizing quantitative microscopy data: History and challenges. Crit. Rev. Biochem. Mol. Biol. 51, 96–101 (2016).
Article PubMed PubMed Central Google Scholar
Kiger, A.A. et al. A functional genomic analysis of cell morphology using RNA interference. J. Biol. 2, 27 (2003).
Article CAS PubMed PubMed Central Google Scholar
Yin, Z. et al. Online phenotype discovery in high-content RNAi screens using gap statistics. in Proc. Int. Symposium on Computational Models of Life Sciences Vol. 952 (eds. Pham, T.D. & Zhou, X.), 86–95 (AIP Publishing, 2007).
Jones, T.R. et al. Scoring diverse cellular morphologies in image-based screens with iterative feedback and machine learning. Proc. Natl. Acad. Sci. USA 106, 1826–1831 (2009).
Article CAS PubMed PubMed Central Google Scholar
Volz, H.C. et al. Single-cell phenotyping of human induced pluripotent stem cells by high-throughput imaging. Preprint at http://www.biorxiv.org/content/early/2015/09/16/026955/ (2015).
Cooper, S., Sadok, A., Bousgouni, V. & Bakal, C. Apolar and polar transitions drive the conversion between amoeboid and mesenchymal shapes in melanoma cells. Mol. Biol. Cell 26, 4163–4170 (2015).
Article CAS PubMed PubMed Central Google Scholar
Rohban, M.H. et al. Systematic morphological profiling of human gene and allele function via Cell Painting. eLife 6, e24060 (2017).
Article PubMed PubMed Central Google Scholar
Gordonov, S. et al. Time series modeling of live-cell shape dynamics for image-based phenotypic profiling. Integr. Biol. 8, 73–90 (2016).
Article CAS Google Scholar
Caie, P.D. et al. High-content phenotypic profiling of drug response signatures across distinct cancer cells. Mol. Cancer Ther. 9, 1913–1926 (2010).
Article CAS PubMed Google Scholar
Schulze, C.J. et al. “Function-first” lead discovery: mode of action profiling of natural product libraries using image-based screening. Chem. Biol. 20, 285–295 (2013).
Article CAS PubMed PubMed Central Google Scholar
Singh, S. et al. Morphological profiles of RNAi-induced gene knockdown are highly reproducible but dominated by seed effects. PLoS One 10, e0131370 (2015).
Article PubMed PubMed Central CAS Google Scholar
Zhang, X. & Boutros, M. A novel phenotypic dissimilarity method for image-based high-throughput screens. BMC Bioinformatics 14, 336 (2013).
Article PubMed PubMed Central Google Scholar
Gibbons, F.D. & Roth, F.P. Judging the quality of gene expression-based clustering methods using gene annotation. Genome Res. 12, 1574–1581 (2002).
Article CAS PubMed PubMed Central Google Scholar
Rendón, E., Abundez, I. & Arizmendi, A. Internal versus external cluster validation indexes. Int. J. Computers Communications 5, 27–34 (2011).
Google Scholar
Vial, M.-L. et al. A grand challenge. 2. Phenotypic profiling of a natural product library on Parkinson's patient-derived cells. J. Nat. Prod. 79, 1982–1989 (2016).
Article CAS PubMed Google Scholar
Ljosa, V., Sokolnicki, K.L. & Carpenter, A.E. Annotated high-throughput microscopy image sets for validation. Nat. Methods 9, 637 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hutz, J.E. et al. The multidimensional perturbation value. J. Biomol. Screen. 18, 367–377 (2013).
Article PubMed Google Scholar
Rajwa, B. Effect-size measures as descriptors of assay quality in high-content screening: a brief review of some available methodologies. Assay Drug Dev. Technol. 15, 15–29 (2017).
Article CAS PubMed Google Scholar
Kitami, T. et al. A chemical screen probing the relationship between mitochondrial content and cell size. PLoS One 7, e33755 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zare, H., Shooshtari, P., Gupta, A. & Brinkman, R.R. Data reduction for spectral clustering to analyze high throughput flow cytometry data. BMC Bioinformatics 11, 403 (2010).
Article PubMed PubMed Central Google Scholar
Qiu, P. et al. Extracting a cellular hierarchy from high-dimensional cytometry data with SPADE. Nat. Biotechnol. 29, 886–891 (2011).
Article CAS PubMed PubMed Central Google Scholar
Tenenbaum, J.B., de Silva, V. & Langford, J.C. A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000).
Article CAS PubMed Google Scholar
van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Google Scholar
Amir, A.D. et al. viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia. Nat. Biotechnol. 31, 545–552 (2013).
Article CAS PubMed Central Google Scholar
Anchang, B. et al. Visualization and cellular hierarchy inference of single-cell data using SPADE. Nat. Protoc. 11, 1264–1279 (2016).
Article CAS PubMed Google Scholar
Qiu, P., Gentles, A.J. & Plevritis, S.K. Discovering biological progression underlying microarray samples. PLoS Comput. Biol. 7, e1001123 (2011).
Article CAS PubMed PubMed Central Google Scholar
Bendall, S.C. et al. Single-cell mass cytometry of differential immune and drug responses across a human hematopoietic continuum. Science 332, 687–696 (2011).
Article CAS PubMed PubMed Central Google Scholar
Haghverdi, L., Buettner, F. & Theis, F.J. Diffusion maps for high-dimensional single-cell analysis of differentiation data. Bioinformatics 31, 2989–2998 (2015).
Article CAS PubMed Google Scholar
Simm, J. et al. Repurposed high-throughput images enable biological activity prediction for drug discovery. Preprint at http://www.biorxiv.org/content/early/2017/03/30/108399/ (2017).
Carpenter, A.E., Kamentsky, L. & Eliceiri, K.W. A call for bioimaging software usability. Nat. Methods 9, 666–670 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ince, D.C., Hatton, L. & Graham-Cumming, J. The case for open computer programs. Nature 482, 485–488 (2012).
Article CAS PubMed Google Scholar
Collberg, C., Proebsting, T. & Warren, A.M. Repeatability and Benefaction in Computer Systems Research (Technical Report 14-04) (University of Arizona, 2015).
Shen, H. Interactive notebooks: sharing the code. Nature 515, 151–152 (2014).
Article CAS PubMed Google Scholar
Boettiger, C. An introduction to Docker for reproducible research. Oper. Syst. Rev. 49, 71–79 (2015).
Article Google Scholar
Beaulieu-Jones, B.K. & Greene, C.S. Reproducibility of computational workflows is automated using continuous analysis. Nat. Biotechnol. 35, 342–346 (2017).
Article CAS PubMed PubMed Central Google Scholar
Williams, E. et al. Image Data Resource: a bioimage data integration and publication platform. Nat. Methods 14, 775–781 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jupp, S. et al. The cellular microscopy phenotype ontology. J. Biomed. Semantics 7, 28 (2016).
Article PubMed PubMed Central Google Scholar
Breinig, M., Klein, F.A., Huber, W. & Boutros, M. A chemical-genetic interaction map of small molecules using high-throughput imaging in cancer cells. Mol. Syst. Biol. 11, 846 (2015).
Article PubMed PubMed Central CAS Google Scholar
Badertscher, L. et al. Genome-wide RNAi Screening identifies protein modules required for 40S subunit synthesis in human cells. Cell Rep. 13, 2879–2891 (2015).
Article CAS PubMed Google Scholar
Allan, C. et al. OMERO: flexible, model-driven data management for experimental biology. Nat. Methods 9, 245–253 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bauch, A. et al. openBIS: a flexible framework for managing and analyzing complex data in biology research. BMC Bioinformatics 12, 468 (2011).
Article PubMed PubMed Central Google Scholar
Rajaram, S., Pavie, B., Wu, L.F. & Altschuler, S.J. PhenoRipper: software for rapidly profiling microscopy images. Nat. Methods 9, 635–637 (2012).
Article CAS PubMed Google Scholar
Pavie, B. et al. Rapid analysis and exploration of fluorescence microscopy images. J. Vis. Exp. e51280 (2014).
Shamir, L. et al. Wndchrm: an open source utility for biological image analysis. Source Code Biol. Med. 3, 13 (2008).
Article PubMed PubMed Central Google Scholar
Orlov, N. et al. WND-CHARM: multi-purpose image classification using compound image transforms. Pattern Recognit. Lett. 29, 1684–1693 (2008).
Article PubMed PubMed Central Google Scholar
Uhlmann, V., Singh, S. & Carpenter, A.E. CP-CHARM: segmentation-free image classification made accessible. BMC Bioinformatics 17, 51 (2016).
Article PubMed PubMed Central CAS Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article CAS PubMed Google Scholar
Kraus, O.Z. & Frey, B.J. Computer vision for high content screening. Crit. Rev. Biochem. Mol. Biol. 51, 102–109 (2016).
Article PubMed Google Scholar
Van Valen, D.A. et al. Deep learning automates the quantitative analysis of individual cells in live-cell imaging experiments. PLoS Comput. Biol. 12, e1005177 (2016).
Article PubMed PubMed Central CAS Google Scholar
Eulenberg, P., Koehler, N., Blasi, T., Filby, A. & Carpenter, A.E. Deep learning for imaging flow cytometry: cell cycle analysis of Jurkat cells. Preprint at http://www.biorxiv.org/content/early/2016/10/17/081364/ (2016).
Pawlowski, N., Caicedo, J.C., Singh, S., Carpenter, A.E. & Storkey, A. Automating morphological profiling with generic deep convolutional networks. Preprint at http://www.biorxiv.org/content/early/2016/11/02/085118/ (2016).
Godinez, W.J., Hossain, I., Lazic, S.E., Davies, J.W. & Zhang, X. A multi-scale convolutional neural network for phenotyping high-content cellular images. Bioinformatics (2017).
Kraus, O.Z., Ba, J.L. & Frey, B.J. Classifying and segmenting microscopy images with deep multiple instance learning. Bioinformatics 32, i52–i59 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kraus, O.Z. et al. Automated analysis of high-content microscopy data with deep learning. Mol. Syst. Biol. 13, 924 (2017).
Article PubMed PubMed Central CAS Google Scholar
Pärnamaa, T. & Parts, L. Accurate classification of protein subcellular localization from high throughput microscopy images using deep learning. G3 (Bethesda) 7, 1385–1392 (2017).
Article CAS Google Scholar
Zamparo, L. & Zhang, Z. Deep autoencoders for dimensionality reduction of high-content screening data. Preprint at https://arxiv.org/abs/1501.01348/ (2015).
Kandaswamy, C., Silva, L.M., Alexandre, L.A. & Santos, J.M. High-content analysis of breast cancer using single-cell deep transfer learning. J. Biomol. Screen. 21, 252–259 (2016).
Article CAS PubMed Google Scholar
Eliceiri, K.W. et al. Biological imaging software tools. Nat. Methods 9, 697–710 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The cytomining hackathon 2016 was supported in part by a grant from the National Institutes of Health BD2K program (U54 GM114833). Work on this paper was supported in part by NSF CAREER DBI 1148823 (to A.E.C.).

Author information

Authors and Affiliations

Imaging Platform, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
Juan C Caicedo, Mohammad Rohban, Jane Hung, Shantanu Singh, Paul Rees & Anne E Carpenter
Imperial College London, London, UK
Sam Cooper
German Cancer Research Center and Heidelberg University, Heidelberg, Germany
Florian Heigwer
Institute of Genetics & Molecular Medicine, University of Edinburgh, Edinburgh, UK
Scott Warchal
Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, Georgia, USA
Peng Qiu
Synthetic and System Biology Unit, Hungarian Academy of Sciences, Szeged, Hungary
Csaba Molnar & Peter Horvath
Laboratory for Cell Biology–Inspired Tissue Engineering, MERLN Institute, Maastricht University, Maastricht, The Netherlands
Aliaksei S Vasilevich
Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
Joseph D Barry
National Centre for Biological Sciences, Bangalore, India
Harmanjit Singh Bansal
Electrical and Computer Engineering, University of Toronto, Toronto, Ontario, Canada
Oren Kraus
Chemical Biology and Therapeutics Science Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
Mathias Wawer & Paul A Clemons
Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
Lassi Paavolainen & Peter Horvath
Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
Markus D Herrmann
Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
Jane Hung
Department of Systems Biology & Bioinformatics, University of Rostock, Rostock, Germany
Holger Hennig
Department of Chemical Biology and Therapeutics, Novartis Institutes for Biomedical Research, Cambridge, Massachusetts, USA
John Concannon
Connectivity Map Project, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
Ian Smith
College of Engineering, Swansea University, Swansea, UK
Paul Rees
Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada
Roger G Linington

Authors

Juan C Caicedo
View author publications
You can also search for this author in PubMed Google Scholar
Sam Cooper
View author publications
You can also search for this author in PubMed Google Scholar
Florian Heigwer
View author publications
You can also search for this author in PubMed Google Scholar
Scott Warchal
View author publications
You can also search for this author in PubMed Google Scholar
Peng Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Csaba Molnar
View author publications
You can also search for this author in PubMed Google Scholar
Aliaksei S Vasilevich
View author publications
You can also search for this author in PubMed Google Scholar
Joseph D Barry
View author publications
You can also search for this author in PubMed Google Scholar
Harmanjit Singh Bansal
View author publications
You can also search for this author in PubMed Google Scholar
Oren Kraus
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Wawer
View author publications
You can also search for this author in PubMed Google Scholar
Lassi Paavolainen
View author publications
You can also search for this author in PubMed Google Scholar
Markus D Herrmann
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Rohban
View author publications
You can also search for this author in PubMed Google Scholar
Jane Hung
View author publications
You can also search for this author in PubMed Google Scholar
Holger Hennig
View author publications
You can also search for this author in PubMed Google Scholar
John Concannon
View author publications
You can also search for this author in PubMed Google Scholar
Ian Smith
View author publications
You can also search for this author in PubMed Google Scholar
Paul A Clemons
View author publications
You can also search for this author in PubMed Google Scholar
Shantanu Singh
View author publications
You can also search for this author in PubMed Google Scholar
Paul Rees
View author publications
You can also search for this author in PubMed Google Scholar
Peter Horvath
View author publications
You can also search for this author in PubMed Google Scholar
Roger G Linington
View author publications
You can also search for this author in PubMed Google Scholar
Anne E Carpenter
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to writing the manuscript and editing the text.

Corresponding author

Correspondence to Anne E Carpenter.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Caicedo, J., Cooper, S., Heigwer, F. et al. Data-analysis strategies for image-based cell profiling. Nat Methods 14, 849–863 (2017). https://doi.org/10.1038/nmeth.4397

Download citation

Received: 19 May 2016
Accepted: 28 July 2017
Published: 01 September 2017
Issue Date: 01 September 2017
DOI: https://doi.org/10.1038/nmeth.4397

This article is cited by

High-content analysis identified synergistic drug interactions between INK128, an mTOR inhibitor, and HDAC inhibitors in a non-small cell lung cancer cell line
- Sijiao Wang
- Juliano Oliveira-Silveira
- Jungseog Kang
BMC Cancer (2024)
Orientation-invariant autoencoders learn robust representations for shape profiling of cells and organelles
- James Burgess
- Jeffrey J. Nirschl
- Serena Yeung-Levy
Nature Communications (2024)
Machine learning-assisted high-content imaging analysis of 3D MCF7 microtissues for estrogenic effect prediction
- Hui Li
- Haitham Seada
- Kim Boekelheide
Scientific Reports (2024)
High-dimensional phenotyping to define the genetic basis of cellular morphology
- Matthew Tegtmeyer
- Jatin Arora
- Soumya Raychaudhuri
Nature Communications (2024)
Fluorescence-based multifunctional light sheet imaging flow cytometry for high-throughput optical interrogation of live cells
- Prakash Joshi
- Prashant Kumar
- Partha Pratim Mondal
Communications Physics (2024)