Standardizing Flow Cytometry Immunophenotyping Analysis from the Human ImmunoPhenotyping Consortium

Finak, Greg; Langweiler, Marc; Jaimes, Maria; Malek, Mehrnoush; Taghiyar, Jafar; Korin, Yael; Raddassi, Khadir; Devine, Lesley; Obermoser, Gerlinde; Pekalski, Marcin L.; Pontikos, Nikolas; Diaz, Alain; Heck, Susanne; Villanova, Federica; Terrazzini, Nadia; Kern, Florian; Qian, Yu; Stanton, Rick; Wang, Kui; Brandes, Aaron; Ramey, John; Aghaeepour, Nima; Mosmann, Tim; Scheuermann, Richard H.; Reed, Elaine; Palucka, Karolina; Pascual, Virginia; Blomberg, Bonnie B.; Nestle, Frank; Nussenblatt, Robert B.; Brinkman, Ryan Remy; Gottardo, Raphael; Maecker, Holden; McCoy, J Philip

doi:10.1038/srep20686

Download PDF

Article
Open access
Published: 10 February 2016

Standardizing Flow Cytometry Immunophenotyping Analysis from the Human ImmunoPhenotyping Consortium

Greg Finak¹^na1,
Marc Langweiler²^na1,
Maria Jaimes³,
Mehrnoush Malek⁴,
Jafar Taghiyar⁴,
Yael Korin⁵,
Khadir Raddassi⁶,
Lesley Devine⁶,
Gerlinde Obermoser⁷,
Marcin L. Pekalski⁸,
Nikolas Pontikos⁸,
Alain Diaz⁹,
Susanne Heck¹⁰,
Federica Villanova¹⁰,
Nadia Terrazzini¹¹,
Florian Kern¹²,
Yu Qian¹³,
Rick Stanton¹³,
Kui Wang¹⁴,
Aaron Brandes¹⁵,
John Ramey¹,
Nima Aghaeepour^4,16,
Tim Mosmann^2,17,
Richard H. Scheuermann¹³,
Elaine Reed⁵,
Karolina Palucka⁷,
Virginia Pascual⁷,
Bonnie B. Blomberg⁹,
Frank Nestle¹⁰,
Robert B. Nussenblatt¹⁸,
Ryan Remy Brinkman^4,19^na2,
Raphael Gottardo¹^na2,
Holden Maecker²⁰^na2 &
…
J Philip McCoy²¹^na2

Scientific Reports volume 6, Article number: 20686 (2016) Cite this article

55k Accesses
184 Citations
26 Altmetric
Metrics details

Subjects

Abstract

Standardization of immunophenotyping requires careful attention to reagents, sample handling, instrument setup, and data analysis, and is essential for successful cross-study and cross-center comparison of data. Experts developed five standardized, eight-color panels for identification of major immune cell subsets in peripheral blood. These were produced as pre-configured, lyophilized, reagents in 96-well plates. We present the results of a coordinated analysis of samples across nine laboratories using these panels with standardized operating procedures (SOPs). Manual gating was performed by each site and by a central site. Automated gating algorithms were developed and tested by the FlowCAP consortium. Centralized manual gating can reduce cross-center variability, and we sought to determine whether automated methods could streamline and standardize the analysis. Within-site variability was low in all experiments, but cross-site variability was lower when central analysis was performed in comparison with site-specific analysis. It was also lower for clearly defined cell subsets than those based on dim markers and for rare populations. Automated gating was able to match the performance of central manual analysis for all tested panels, exhibiting little to no bias and comparable variability. Standardized staining, data collection, and automated gating can increase power, reduce variability, and streamline analysis for immunophenotyping.

Validation of a hybrid approach to standardize immunophenotyping analysis in large population studies: The Health and Retirement Study

Article Open access 29 May 2020

DeVon Hunter-Schlichting, John Lane, … Bharat Thyagarajan

Cyto-Feature Engineering: A Pipeline for Flow Cytometry Analysis to Uncover Immune Populations and Associations with Disease

Article Open access 06 May 2020

Amy Fox, Taru S. Dutt, … Marcela Henao-Tamayo

Standardization procedure for flow cytometry data harmonization in prospective multicenter studies

Article Open access 14 July 2020

Lucas Le Lann, Pierre-Emmanuel Jouve, … PRECISESADS Clinical Consortium

Introduction

Flow cytometry is one of the most powerful tools for single-cell analysis of the immune system at a cellular level; yet it suffers from a lack of standardization beyond the simplest clinical assays that count major subsets. In research settings, each study tends to use its own combination of markers and fluorochromes, even when purportedly analyzing similar cell subsets. Sample handling, instrument type and setup, gating and analysis strategies, and ways in which the data are reported can all vary^1,2. Unfortunately, these differences can all affect the results and how they are interpreted^{3,4,5,6,7,8,9}.

The Human Immune Phenotyping Consortium (HIPC) was developed by the Federation of Clinical Immunology Societies (FOCIS) to address these issues by promoting standardization of flow cytometry immunophenotyping in clinical studies, so that data could be compared across sites and studies. As part of these efforts, the HIPC immunophenotyping panel was developed². The HIPC panels consist of five eight-color antibody cocktails, designed to phenotype major immune cell subsets in peripheral blood mononuclear cells (T cells, Treg, Th1/2/17, B cells, and NK/dendritic cells/monocytes). These panels were designed to standardize routine immunophenotyping in humans while still being compatible with widely available clinical flow cytometers. Although they were not designed to represent the full complexity of cutting-edge research, the cocktails were designed to be easily expanded with additional colors to serve that purpose. The Euroflow consortium^7,10,11,12 and the ONE Study¹³ have successfully developed standardized immunophenotyping panels and procedures for Leukemia and Lymphoma diagnostics and whole blood immunophenotyping, respectively¹³.

Here we demonstrate that an automated data analysis strategy can be integrated into a workflow utilizing a standardized staining panel.

Following development and testing of the HIPC panels, lyophilized reagent cocktails in 96-well plates were developed (BD Lyoplate, BD Biosciences, San Diego, CA). The use of lyophilized reagent cocktails is a proven method for improving standardization^3,14,15, in that it protects against errors of reagent addition or mis-titration, provides improved reagent stability, and simplifies assay setup.

In addition to antibodies and reagent differences, analysis strategies for flow cytometry data remain highly non-standardized making results difficult to reproduce and compare across experiments. Traditionally, the majority of flow cytometry experiments have been analyzed visually, either by serial manual inspection of one or two dimensions at a time (a process termed “gating”, with boundaries or “gates” defining cell populations of interest). However, these visual approaches are labor intensive and highly subjective, and they neglect information present in the data that are not visible to the human eye, thus representing a major obstacle to the automation and reproducibility of research. For example, in a study of Intracellular Cytokine Staining (ICS) standardization involving 15 institutions, the mean inter-laboratory coefficient of variation ranged from 17 to 44%, even though the cell preparation was standardized and the testing was performed by using the same samples and reagents at each site³. Most of the variation observed was attributed to gating, even though experts in the field had conducted the analyses. It was concluded that the analysis, particularly gating, was a significant source of variability, and it was suggested that analysis strategies should be standardized.

Over the past eight years, there has been a surge in the development and application of computational methods for flow cytometry data analysis in an effort to overcome limitations in manual analysis¹⁶ and the importance of automated, high-dimensional analysis was highlighted in a recent position paper¹⁷. Pedreira et al. showed significant correlation between automated gating and manual data analysis of PBMC subsets and could discriminate between normal and reactive samples and B-cell chronic lymphoproliferative disorders^18,19. Fiser et al. showed how hierarchical clustering with a Mahalanobis distance metric could be used to classify PBMCs into different phenotypic subsets with good agreement to manual analysis²⁰. Although their approach was limited to relatively small numbers of events due to computational limitations, it demonstrated the utility of an unsupervised approach that takes into account the information in the full multidimensional data. Recently, the FlowCAP (Flow Cytometry: Critical Assessment of Population Identification Methods) consortium provided an objective approach to compare computational methods with both manual gating and external variables using statistical performance measures²¹. Based on the results of these study, Aghaeepour et al. concluded that computational methods had reached a sufficient level of maturity and accuracy for reliable use in flow cytometry data analysis.

Based on these encouraging results, we hypothesized that computational algorithms could be used to improve the standardization of flow cytometry results beyond what can be accomplished by the standardization of the wet lab component alone. In order to select the best computational methods for this task, we leveraged the FlowCAP project to compare and select the best performing algorithms based on a pilot dataset. The best-performing algorithms were combined using the OpenCyto framework²² to leverage the best features of each, and compared to a central manual analysis in terms of variability and bias on four staining panels using both lyophilized and cryopreserved control cells.

Materials and Methods

Cells

Lyophilized control PBMC (CytoTrol, Beckman Coulter, Miami, FL) were reconstituted and used according to the vendor’s instructions. Cryopreserved PBMC from three donors were frozen in replicate vials of 10⁷ cells per vial and obtained from Precision R&D (Frederick, MD).

Staining cocktails and lyophilized reagent plates

The HIPC Immunophenotyping panels are listed in Table 1. These staining panels were designed to identify the major subsets of B cells, T cells, T-helper cells, and dendritic cells, monocytes, and NK cells². All reagents were first tested and optimal titers determined among three of the nine participating HIPC laboratories.

Table 1 The HIPC antibody panel, specificities and clones.

Full size table

The lyophilized reagent plates, along with a consensus staining protocol, were distributed to nine international laboratories for cross-site testing. The protocol included fluorescence target channels for use with pre-stained single-color control beads included in the reagent plates. Two experiments were performed: one with lyophilized control cells (CytoTrol, Beckman Coulter, Miami, FL), and the other with replicate vials of cryopreserved PBMC from three healthy subjects (Precision R&D). Data collected included manual analysis for the specified cell subsets, as performed at each site, FCS files, from which central analyses (manual and automated) were performed, and instrument setup parameters. The cell subsets pre-specified for evaluation in the study are shown in Table 2. The number of cell events per FCS file for each staining panel varied widely across centers. They are summarized here by their median (min, max): T-cell: 125,900 (44,680, 483,300), DC/Mono/NK: 108,300 (49,330, 474,600), B-cell: 141,600 (38,900, 449,200), T-regulatory: 135,500 (54,340, 458,400).

Table 2 Cell populations evaluated by the HIPC panels.

Full size table

Design of inter-laboratory experiments

In the initial cross-site experiment, nine sites stained four replicates of lyophilized control cells (CytoTrol) in lyophilized reagent plates. Lyophilized control cells were chosen in order to eliminate variability, as cryopreservation and thawing of PBMC were expected to introduce considerable staining variability. However, lyophilization was also found to alter the staining profile of certain markers, compromising the assessment of some populations (e.g., those involving IgD). All sites used either a Fortessa or LSR cytometer (Becton Dickinson, San Jose, CA).

In the second cross-site experiment, nine sites stained three replicates of each of three cryopreserved PBMC samples, to assess variability in the context of real-life samples (cryopreserved and thawed PBMC). The same lot of lyophilized reagent plates was used for both experiments. Sites provided results of gating for defined cell subsets using their own gating schema based on general instructions for the experiment from lyophilized samples (provided in Supplementary Material), as well as FCS files for a centralized analysis from both experiments. Only eight sites returned data for the second experiment, and one of the eight was excluded since they did not collect one of the required markers/channels in each of the panels.

Central manual analysis

FCS files submitted by each participating site consisted of triplicates of each of three samples stained with the five cocktails included in the lyoplate.

These were accompanied by FCS files of the single-stained compensation bead samples in the lyoplate, To optimize compensation for centralized analysis, post-acquisition data analyses were performed using FlowJo (Tree Star Inc., version 9.6.3). Tube-specific matrices were constructed for each site, necessitated by the tandem (APC-H7, PE-Cy7) conjugates associated with each of the five cocktails.

Initial filtering of data from each cocktail delineated lymphoid or mononuclear populations using FSC-A/SSC-A profiles and excluded doublets using FSC-A/FSC-H profiles and dead cells using FSC-A/fixable green live-dead profiles. Subsequent gating was designed to identify major lymphocyte and monocyte cell populations specified previously (2). The design of the lyoplate did not offer the opportunity to establish gate placement using Fluorescence-Minus-One (FMO) controls. Therefore, guidance for gate placement was accomplished by setting up FMO controls using the same liquid reagents that were used in lyophilized form in the lyoplate.

In two instances (B-cell, Treg) Boolean gates were constructed to aid in identifying several populations. Gating schemes for all panels can be found in Supplementary Figures 1–4, and live visualization of the manual (and automated) gates for each sample can be found online at the ImmuneSpace portal.

The flowWorkspace (v 3.15.17) package²⁰ was used to import the manually gated data into R for further analysis¹⁹. Manual gate import scripts can be found online at ImmuneSpace²³. Of the nine centers, one center failed to submit results, and one center was excluded from the analysis because markers in the FCS file were mislabeled and could not be matched to the expected panels.

Automated analysis algorithms

The two top performing gating algorithms - OpenCyto (v. 1.7.4)²², flowDensity (v. 1.4.0)²⁴ - in a study run by the FlowCAP consortium aimed at selecting the best performing algorithms for this larger study were chosen for the analysis presented in this paper. (See Supplementary Figures 5–6). Gating was performed using OpenCyto plug-in algorithms^22,24, enabling different gating algorithms to be selected for different steps of the gating pipeline for each panel, depending on their strengths.

OpenCyto is a BioConductor framework for constructing robust and reproducible end-to-end flow data analysis pipelines. The framework can handle large data sets in a memory efficient manner and allows the incorporation of domain-specific knowledge by encoding hierarchical relationships between cell populations as part of the pipeline, making it ideal for reproducing hierarchical manual analysis. Pipeline templates are defined through a text-based csv file, promoting reusability and eliminating the need to write data-set specific code. OpenCyto supports several general purpose data-driven gating approaches natively, as well as user-defined methods via a plug-in framework.

flowDensity is based on a supervised sequential bi-variate clustering approach that generates a set of pre-defined cell populations. It chooses the best cut-off for individual markers using characteristics of the density distribution and takes just seconds to run per file. flowDensity is available as R/BioConductor package, and is integrated into OpenCyto as a plug-in.

The Thelper panel was excluded in preliminary analysis as too variable to be usable. OpenCyto gating templates for four of the lyoplate panels (B-cell, T-cell, T-regulatory, and DC/Mono/NK)), as well as R code used to perform the automated gating, import manually gated data, data cleaning, statistical analysis and plotting are available through the ImmuneSpace portal (https://www.immunespace.org/project/HIPC/Lyoplate/begin.view). The gating of the FCS files from templates takes approximately 45 minutes, although setting up the initial templates is an iterative process requiring substantially more time. Automated and manual gates can be visualized across samples and centers through an interactive web application built within the ImmuneSpace database. In addition, all analyses performed here are provided as R reports that can be rerun by any users on the web-server providing complete transparency of code and results including gating and statistical modeling of gated population statistics. Raw and processed data can also be easily downloaded, which can be used to reproduce the analyses locally or perform novel analyses. Automated and manual gate definition can also be exported in Gating-ML²⁵, an open standard extensible markup language for describing flow cytometry gating, as well as CLR format .

**Figure 1: Individual and central manual analysis of B-cell, T-reg, T-cell subsets.**

**Figure 2: Example of inter- and intra-site variability from experiment 1 (lyophilized cells).**

Statistical Analysis

Cell population statistics were extracted from the manual and automated gating approaches using R’s flow cytometry tools and analyzed as described below.

**Figure 3: Center, biological and residual variability per population and gating method for the B-cell panel.**

Different sources of variability (center, sample, and residual) were assessed by fitting a linear mixed effects model to the proportion of cells identified in each cell population in each staining panel. For a fixed staining panel, cell subset, and gating method, we let p_rij represent the proportion of cells in replicate r from sample i, and center j. We transformed the proportion, (1): y_rij=logit(p_rij), and model (2):

where μ_i are the intercepts, α_i are the sample-level random effects, β_j are the center-level random effects, and ∈_rij are the residual technical errors, with . The estimates of the ’s from the model are the components of variance due to the different sources of variability (Fig. 4). Sample-level estimates were obtained by replacing the sample-level random effects, α_i , in the above model, with fixed effects for which we obtained confidence intervals in Fig. 5.

**Figure 4: Estimated cell proportions from each population and gating method in the B-cell panel.**

**Figure 5: Power analysis comparing site-specific and central gating for the B-cell panel.**

We defined bias as the difference between sample-level estimates of population proportions for automated gating and sample-level estimates of population proportions for manual gating, after adjusting for center-to-center variability, and taking into account the 95% confidence intervals on those estimates. For a given population and sample i , bias is defined as .

Results

Individual versus central manual analysis

Central manual analysis significantly reduced the variability in comparison with individual site analysis (Fig. 1A). This was not unexpected based on previous studies 3 , and given that the individual site analysis was done without a shared gating template and with only general instructions as to how each particular cell subset (e.g., CD3+CD4+ lymphocytes) was to be gated 2 . We further compared the two experiments using only the data from central manual analysis (Fig. 1B). In general, except for those subsets that could not be effectively identified using lyophilized cells, the CV’s were similar or slightly lower for the lyophilized cells compared to the cryopreserved PBMC (Fig. 1B).

Findings from central manual analysis

The within-site replicates for both experiments were very good, for essentially all cell subsets. In general, consistency between sites was more variable than within sites (representative examples of inter and intra-site variability from the T-cell panel are shown in Fig. 2A,B, respectively). The within-site coefficients of variability for the different cell populations and panels were reduced by between 94% and 43% (mean 73%, IQR 18%) compared to the between-site CVs for the same panels and populations. While larger, more easily identified subsets (e.g., CD3+ and CD4+ T cells) tended to have CV’s of <10% across sites, subsets that were difficult to identify due to dim staining, and/or that required multiple successive gates, had higher CV’s. While these results are not surprising, they do highlight the challenges of cross-site flow cytometry data analysis and the need for more standardized and objective data analysis approaches.

Automated analysis of cryopreserved PBMCs reduces technical (center-to-center) variability for some subsets

We assessed which experimental factors had the largest impact on the variability of estimated population statistics from the three gating methods using a linear mixed model. In the T, B and T-regulatory panels, the majority of measured cell populations exhibited biological variation that was larger than technical variation. In contrast, for the DC panel, technical variability was the primary source of variation in the data for the majority of measured cell populations (Fig. 3, and Supplementary Figures 7–9). The residual variation captures variability due to other sources not explicitly captured by the model. We examined the performance of individual panels more closely.

In the T-cell and B-cell panels, the OpenCyto and flowDensity methods generated cell population estimates with lower variability compared to manual gating for some cell populations. Specifically the transitional B-cell and plasmablast populations, and the CD4 effector, CD4 effector memory, CD8 central memory and CD8 effector populations were improved (Fig. 3 and Supplementary Figure 7).

The CD8 activated and effector memory cell subsets were problematic for automated methods, as seen in scatterplots of manual vs. automated cell population estimates (Supplementary Figure 10). Both cell populations exhibited poor concordance between automated and manual gating across multiple centers, and larger total variation than manual gating. Likewise in the B-cell panel, naive and IgD-containing cell subsets (memory IgD+ and IgD−) had larger total variability for automated versus manual gating and poor concordance across centers for low abundance (low proportion) cell subsets.

Automated algorithms recapitulate manual analyses with low bias

In addition to variability as a metric of performance, we are also interested in evaluating the bias (i.e. whether the point estimates differ significantly between manual and automated gating). Figure 4 shows population proportion estimates and 95% confidence intervals for each subset, method, and donor in the cryopreserved PBMC B-cell panel. In general, the estimates are comparable manual and automated gating as evidenced by the overlapping confidence intervals across methods, indicating that any differences between point estimates are not significant.

Cell subsets that did show differences were investigated further (memory IgD+/− and transitional B cells). The increased variability in the cell populations defined by the IgD can be explained by the poor resolution of IgD in some centers where there is little information in the data to delineate positive and negative cells (Supplementary Figure 11). An example is center G, where the naive and IgD-CD27- cell population estimates are outliers compared to the other centers (Supplementary Figures 12, 13). In other instances the upstream manual gating could be identified as sub-optimal in some samples, impacting downstream population estimates (e.g., Plasmablasts, Supplementary Figure 14).

Low abundance cell populations were not always problematic for automated gating. In the T-regulatory cell panel, automated gating performed surprisingly well relative to central manual gating (Supplementary Figures 15,16). The cell population estimates showed little to no bias (Supplementary Figure 17). While the T-regulatory cell populations were amongst the cell subsets with the lowest proportions considered, automated methods performed well, indicating that the success of automated gating depends on the ability of a panel to resolve cell subpopulations, perhaps more so than the prevalence of the cell subsets within the panel.

Despite large technical variability, the DC/Mono/NK panel was entirely consistent with manual gating and the population estimates were relatively unbiased (Supplementary Figures 18,19 and 20). Unfortunately, the substantial technical variation overwhelmed the biological variation, rendering the panel impractical for detecting changes in cell frequency due to biological effects.

The T-cell panel performance was consistent with the T-regulatory and B-cell panels. Most cell population estimates were comparable to manual gating with little bias (Supplementary Figures 21, 22). Problematic populations included the CD8 activated cell subset, which was based on poorly resolved markers and had low abundance, making it difficult to identify in a data-driven manner (Supplementary Figure 10), as well as CD8 effector and CD4 effector memory T cells. These cell populations showed some bias compared to manual gates, but examination of the automated gates demonstrated that their placement was, nonetheless, reasonable and the observed bias is due to an accumulation of subtle differences in the upstream gating.

Reagent and analysis standardization won’t replace good laboratory practices

By examining the cell population statistics from centralized manual gating and comparing them to automated analysis, we identified centers that were outliers for certain cell populations in certain staining panels. One such example is the previously mentioned B-cell subsets in center G.

A number of other centers had outlier populations in the T-cell panel, including Center F for CD4 effector cells from sample 12828, Center B for CD8 effector memory cells from sample 1369, and Center C for CD8 naive cells from sample 12828. Closer examination of pairwise event plots for the relevant samples from these centers identified data quality issues possibly related to protocol adherence.

Center B failed to collect the scatter channels that allowed for gating of singlets for the T-cell panel, thus none of the samples from Center B had a singlet gate for the centralized manual gating scheme (Supplementary Figure 23). Inspection of the dot plots did not immediately reveal the cause of the difference, but samples from Center B did exhibit poor resolution in the CCR7 dimension (Supplementary Figure 24). Samples from Center C appeared to have problematic compensation in the CD45RA and CD197 (CCR7) dimensions (Supplementary Figure 25), leading to drastically different cell population distributions for the CD8 effector / memory T-cell subsets from other centers. One of the replicates from Center B sample 12828 exhibited a trimodal CD3 distribution (Supplementary Figure 26), accounting for the outlier nature of this sample. While standardization of reagents (via lyoplates) and harmonization of analysis pipelines can sometimes address data quality issues caused by differences in protocol adherence between centers or inadequate quality control and compensation issues), such problems are still best addressed through detailed SOPs, quality control, and proficiency testing.

Power analysis indicates centralized gating can help control for technical effects

In order to assess the relative importance of different sources of variability and their impact on statistical power, we performed a power analysis for each staining panel. We calculated the minimum detectable effect size at 80% power (the probability of detecting a difference in the cell population proportion due to treatment if one truly exists) for varying sample sizes with different assumptions about the observed sources of variability (i.e., assuming the data are gated locally, gated centrally, or data are generated from a single center). We evaluated the minimum effect size for each cell subset in each panel, using an average estimate of the variability from the different centralized (manual and automated) gating approaches, rather than any specific gating method. Variance estimates due to technical effects (center-to-center), biological effects (sample-to-sample) and differences in local vs. central gating,were drawn from mixed effects model fits. Our results demonstrate that most of the benefit in increased power comes from centralized and standardized gating (Fig. 5 and Supplementary Figures 27,28), and that additional benefits from eliminating center-specific effects are relatively minor, but can be moderated by increasing sample size. These results also show which cell subsets in each panel exhibit significant technical variability.

Discussion

The HIPC created a set of lyophilized standard 8-color immunophenotyping cocktails that allow for standardized cell subsetting. Since the data were acquired on high-end instruments, differences in laser power and filters can contribute to site-to-site variability. A standard protocol for use of these plates was developed. Together with detailed target values for setting PMT voltages, we hypothesized that this approach would provide the ability for highly reproducible immunophenotyping across sites. While this was achieved for most basic cell subsets, it is clear that optimal reagent and instrument performance is needed for consistent results with minor and “dim” subsets. It is not entirely clear in advance which fluorochromes/antibodies will work well in a dried down cocktail. In case of IgD the liquid format was not giving optimal resolution and consequently the staining of the lyophilized reagent was also poor. The poor resolution of IgD has a trigger effect on all its children populations. Replacement of this reagent with one that yields more distinct staining would improve reproducibility. Additional checks on instrument performance and adherence to staining and acquisition protocols would also likely increase the reproducibility for these more difficult to analyze cell subsets. More detailed gating instructions to centers could help reduce the impact of local gating on reproducibility, but would likely not achieve the precision of central gating since one analysts would have to observe samples gated at other centers.

Our analysis of these multi-site data indicates that central analysis is more reproducible than individual site analysis, as evidenced by significantly lower coefficients of variability (Fig. 1), and that automated algorithms can reproduce manual central analysis with comparable reproducibility and little to no bias. In most cases (e.g. B-cell and T-cell panels), automated analysis provided matched or lowered variability compared to manual analysis (e.g. plasmablasts, transitional B-cells, CD8 central memory, CD8 effector, CD4 effector memory), demonstrating that automated analysis can improve upon existing manual methods.

When manual and automated methods showed significant disagreement, this appeared to be associated with rare cell subsets (e.g. CD8 activated cells in the T-cell panel), or poorly resolved populations (e.g. IgD+ cells in the B-cell panel). In some cases, variability decrease appeared to be due to improved performance of the automated gating approach. For example, in the B-cell panel plasmablast cell subset, visual inspection of the event-level data showed that the automated gates for the upstream CD20- parent populations were more reasonable than the manual ones (Supplementary Figure 14). In other cases, disagreement could be traced back to centers not adhering to experimental protocols, resulting in problematic data quality (e.g. problems with compensation, or potential problems with staining or marker resolution. Such issues can sometimes be resolved by automated analysis, but highlight the important role of careful adherence to experimental protocol, quality control, detailed SOPs, and proficiency testing in cross-center studies.

The bias observed in some cell subsets in automated vs. manual gating could be traced back to an accumulation of subtle differences in the upstream gating. Importantly, none of these upstream gates were problematic. This raises an important issue with hierarchical gating. The dependencies between cell population definitions enable differences in upstream gates to propagate through to downstream populations. Data-driven automated gating can mitigate this issue through consistency and reproducibility.

Using the standardized lyoplates combined with a unified gating strategy utilizing automated methods it was possible to resolve biological variation between samples for the T-cell, B-cell, and T-regulatory panels, while the technical variability in the DC/Mono/NK panel was too large to reliably resolve biological differences between samples. Particular care is needed if utilizing this panel in a cross-center setting. It is important to note that the automated gating strategy proposed for these standardized panels could likely be replaced by an alternate gating strategy to define the same cell populations with comparable results. We stress that the important factor for success is consistency in the gating strategy and consistency in the application of experimental protocols. While considerable effort is required to perform a centralized manual analysis of large cross-center data sets, this work shows that manual analysis efforts can be reduced as automated gating analysis can be applied with confidence using the methods profiled here.

In addition to being automated, and thus less time-consuming, computational methods lead to analyses that are objective, reproducible, and reusable across data sets that utilize common staining panels. These tools coupled with standardized experimental standard operating procedures should make it possible to more easily compare and integrate data across multiple sites, which will open the door to novel cross-center studies that would not be possible otherwise.

This study follows the “open science” trend by providing complete transparency of data and results, ensuring that reproducibility can be verified^21,22,26. All materials, including primary data files, processed data, workspaces and analysis code, are made freely available using existing data standards and providing a valuable resource to the experimental and computational communities.

Additional Information

How to cite this article: Finak, G. et al. Standardizing Flow Cytometry Immunophenotyping Analysis from the Human ImmunoPhenotyping Consortium. Sci. Rep. 6, 20686; doi: 10.1038/srep20686 (2016).

References

Maecker, H. T. & McCoy, J. P. A model for harmonizing flow cytometry in clinical trials. Nat Immunol. 11, 975–978 (2010).
Article CAS Google Scholar
Maecker, H. T., McCoy, J. P. & Nussenblatt, R. Standardizing immunophenotyping for the Human Immunology Project. Nat. Rev. Immunol. 12, 1–10 (2012).
Article Google Scholar
Maecker, H. T. et al. Standardization of cytokine flow cytometry assays. BMC Immunol. 6, 13; doi: 10.1186/1471-2172-6-17 (2005).
Article CAS PubMed PubMed Central Google Scholar
Nomura, L., Maino, V. C. & Maecker, H. T. Standardization and optimization of multiparameter intracellular cytokine staining. Cytom Part A. 73A, 984–991 (2008).
Article Google Scholar
McNeil, L. K. et al. A harmonized approach to intracellular cytokine staining gating: Results from an international multiconsortia proficiency panel conducted by the Cancer Immunotherapy Consortium (CIC/CRI). Cytom Part A. 83, 728–738 (2013).
Article Google Scholar
Owens, M. A., Vall, H. G., Hurley, A. A. & Wormsley, S. B. Validation and quality control of immunophenotyping in clinical flow cytometry. J. Immunol. Methods. 243, 33–50 (2000).
Article CAS Google Scholar
Kalina, T. et al. EuroFlow standardization of flow cytometer instrument settings and immunophenotyping protocols. Leukemia. 26, 1986–2010, (2012).
Article CAS Google Scholar
Perfetto, S. P., Ambrozak, D., Nguyen, R., Chattopadhyay, P. & Roederer, M. Quality assurance for polychromatic flow cytometry. Nat Protoc. 1, 1522–1530 (2006).
Article CAS Google Scholar
McCoy, J. P., Carey, J. L. & Krause, J. R. Quality control in flow cytometry for diagnostic pathology. I. Cell surface phenotyping and general laboratory procedures. Am. J. Clin. Pathol. 93, S27–37 (1990).
PubMed Google Scholar
Lecrevisse, Q. et al. Euro flow flow cytometry software tools for improving characterization of haematological malignancies. Int J Lab Hematol, 32, 34–36 (2010).
Article Google Scholar
van Dongen, J. J. M. et al. EuroFlow antibody panels for standardized n-dimensional flow cytometric immunophenotyping of normal, reactive and malignant leukocytes. Leukemia. 26, 1908–1975 (2012).
Article CAS Google Scholar
Van Dongen, J. J. M. & Orfao, A. EuroFlow: Resetting leukemia and lymphoma immunophenotyping. Basis for companion diagnostics and personalized medicine. Leukemia 26, 1899–1907 (2012).
Article CAS Google Scholar
Streitz, M. et al. Standardization of whole blood immune phenotype monitoring for clinical trials: panels and methods from the ONE study. Transplant Res. 2, 17; doi: 10.1186/2047-1440-2-17 (2013).
Article CAS PubMed PubMed Central Google Scholar
Dunne, J. & Maecker, H. H. Automation of cytokine flow cytometry assays. J Lab Autom. 9, 5–9 (2004).
CAS Google Scholar
Jaimes, M. C. et al. Quality assurance of intracellular cytokine staining assays: Analysis of multiple rounds of proficiency testing. J. Immunol. Methods. 363, 143–157. (2011).
Article CAS Google Scholar
O’Neill, K., Aghaeepour, N., Špidlen, J. & Brinkman, R. Flow Cytometry Bioinformatics. PLoS Comput Biol. 9, e1003365; doi: 10.1371/journal.pcbi.1003365 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Kvistborg, P. et al. Thinking outside the gate: single-cell assessments in multiple dimensions. Immunity. 42, 591–592 (2015).
Article CAS Google Scholar
Pedreira, C. E., Costa, E. S., Arroyo, M. E., Almeida, J. & Orfao, A. A multidimensional classification approach for the automated analysis of flow cytometry data. IEEE Trans. Biomed. Eng. 55, 1155–1162 (2008).
Article Google Scholar
Costa, E. S. et al. A new automated flow cytometry data analysis approach for the diagnostic screening of neoplastic B-cell disorders in peripheral blood samples with absolute lymphocytosis. Leukemia 20, 1221–1230 (2006).
Article CAS Google Scholar
Fišer, K. et al. Detection and monitoring of normal and leukemic cell populations with hierarchical clustering of flow cytometry data. Cytom Part A 81, 25–34 (2012).
Article Google Scholar
Aghaeepour, N. et al. Critical assessment of automated flow cytometry data analysis techniques. Nat Methods. 10, 228–38 (2013).
Article CAS Google Scholar
Finak, G. et al. OpenCyto: An Open Source Infrastructure for Scalable, Robust, Reproducible, and Automated End-to-End Flow Cytometry Data Analysis. PLoS Comput Biol 10(8), e1003696; doi: 10.1371/journal.pcbi. 1003806 (2014).
Article PubMed PubMed Central Google Scholar
Finak et al. ImmuneSpace Web Portal Lyoplate Project Analyses (2015) (https://www.immunespace.org/project/HIPC/Lyoplate/begin.view?)
Malek, M. et al. flowDensity: Reproducing manual gating of flow cytometry data by automated density-based cell population identification. Bioinformatics, 31, 606–607 (2015).
Article CAS Google Scholar
Spidlen, J. & Moore, W. ISAC Data Standards Task Force, Brinkman RR. ISAC’s Gating-ML 2.0 data exchange standard for gating description. Cytom Part A 87, 683–687 (2015).
Article Google Scholar
Finak, G. et al. High-throughput flow cytometry data normalization for clinical trials. Cytom Part A, 85, 277–286 (2013).
Article Google Scholar

Download references

Acknowledgements

This work was supported by grant 5U19AI090019, and [R01 EB008400] from the National Institutes of Health, and the Human Immunology Project Consortium (HIPC) [U19 AI089986], a Wellcome Trust Strategic Award (100140) to the Cambridge Institute for Medical Research (CIMR) and from the intramural research program of NHLBI, NIH. The authors thank Meena Malipatlolla, Shannon Opiela, PhD, and Kay Kayembe for technical assistance. GF is an ISAC scholar. The authors would like to acknowledge all of the member of the FlowCAP consortium and extend our thanks for their participation.

Author information

Greg Finak and Marc Langweiler: These authors contributed equally to this work.
Ryan Remy Brinkman, Raphael Gottardo, Holden Maecker and J Philip McCoy: These authors jointly supervised this work.

Authors and Affiliations

Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, WA, 98109, Seattle
Greg Finak, John Ramey & Raphael Gottardo
Hematology Branch, National Institutes of Health, Maryland, Bethesda, USA
Marc Langweiler & Tim Mosmann
BD Biosciences, CA, San Jose, USA
Maria Jaimes
Terry Fox Laboratory , British Columbia Cancer Agency, V3J 4W6, Canada
Mehrnoush Malek, Jafar Taghiyar, Nima Aghaeepour & Ryan Remy Brinkman
UCLA Pathology and Laboratory Medicine, Los Angeles, CA
Yael Korin & Elaine Reed
Dept of Neurology, Yale School of Medicine, New Haven, CT
Khadir Raddassi & Lesley Devine
Baylor Institute for Immunology Research, Dallas, TX
Gerlinde Obermoser, Karolina Palucka & Virginia Pascual
University of Cambridge, JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, Cambridge, UK
Marcin L. Pekalski & Nikolas Pontikos
Dept Microbiology & Immunology, University of Miami Miller School of Medicine, Miami, FL
Alain Diaz & Bonnie B. Blomberg
Guys and St Thomas Hospital, Guy’s Hospital, London, UK
Susanne Heck, Federica Villanova & Frank Nestle
School of Pharmacy and Biomolecular Sciences, University of Brighton, Brighton, BN2 4GJ, United Kingdom
Nadia Terrazzini
Division of Medicine, Brighton and Sussex Medical School, Brighton, BN1 9PS, United Kingdom
Florian Kern
Department of Informatics, J. Craig Venter Institute, La Jolla, CA, 92037
Yu Qian, Rick Stanton & Richard H. Scheuermann
School of Mathematics and Physics, University of Queensland, Brisbane, Australia
Kui Wang
The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Aaron Brandes
Baxter Laboratory in Stem Cell Biology, Stanford University, California, 94305, Stanford, USA
Nima Aghaeepour
University of Rochester Medical Center, School of Medicine and Dentistry, Rochester, 14642, NY
Tim Mosmann
Laboratory of Immunology, National Eye Institute, National Institutes of Health, Maryland, Bethesda, USA
Robert B. Nussenblatt
Department of Medical Genetics, University of British Columbia, Canada
Ryan Remy Brinkman
Institute for Immunity, Transplantation, and Infection, Stanford University School of Medicine, Stanford, CA, 94305
Holden Maecker
NHLBI Flow Cytometry Core, NIH, Bethesda, MD., 20892
J Philip McCoy

Authors

Greg Finak
View author publications
You can also search for this author in PubMed Google Scholar
Marc Langweiler
View author publications
You can also search for this author in PubMed Google Scholar
Maria Jaimes
View author publications
You can also search for this author in PubMed Google Scholar
Mehrnoush Malek
View author publications
You can also search for this author in PubMed Google Scholar
Jafar Taghiyar
View author publications
You can also search for this author in PubMed Google Scholar
Yael Korin
View author publications
You can also search for this author in PubMed Google Scholar
Khadir Raddassi
View author publications
You can also search for this author in PubMed Google Scholar
Lesley Devine
View author publications
You can also search for this author in PubMed Google Scholar
Gerlinde Obermoser
View author publications
You can also search for this author in PubMed Google Scholar
Marcin L. Pekalski
View author publications
You can also search for this author in PubMed Google Scholar
Nikolas Pontikos
View author publications
You can also search for this author in PubMed Google Scholar
Alain Diaz
View author publications
You can also search for this author in PubMed Google Scholar
Susanne Heck
View author publications
You can also search for this author in PubMed Google Scholar
Federica Villanova
View author publications
You can also search for this author in PubMed Google Scholar
Nadia Terrazzini
View author publications
You can also search for this author in PubMed Google Scholar
Florian Kern
View author publications
You can also search for this author in PubMed Google Scholar
Yu Qian
View author publications
You can also search for this author in PubMed Google Scholar
Rick Stanton
View author publications
You can also search for this author in PubMed Google Scholar
Kui Wang
View author publications
You can also search for this author in PubMed Google Scholar
Aaron Brandes
View author publications
You can also search for this author in PubMed Google Scholar
John Ramey
View author publications
You can also search for this author in PubMed Google Scholar
Nima Aghaeepour
View author publications
You can also search for this author in PubMed Google Scholar
Tim Mosmann
View author publications
You can also search for this author in PubMed Google Scholar
Richard H. Scheuermann
View author publications
You can also search for this author in PubMed Google Scholar
Elaine Reed
View author publications
You can also search for this author in PubMed Google Scholar
Karolina Palucka
View author publications
You can also search for this author in PubMed Google Scholar
Virginia Pascual
View author publications
You can also search for this author in PubMed Google Scholar
Bonnie B. Blomberg
View author publications
You can also search for this author in PubMed Google Scholar
Frank Nestle
View author publications
You can also search for this author in PubMed Google Scholar
Robert B. Nussenblatt
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Remy Brinkman
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Gottardo
View author publications
You can also search for this author in PubMed Google Scholar
Holden Maecker
View author publications
You can also search for this author in PubMed Google Scholar
J Philip McCoy
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.M., J.P.M., R.R.N., V.P., B.B.B., E.R., K.P., Y.K., K.R., L.D., G.O., M.L.P., N.P., A.D., N.T., F.K., M.J., F.N., S.H. and F.V. led the Lyoplate studies, contributed to panel design and data generation. R.G., G.F., R.H.S, R.R.B., T.M. and N.A. designed and lead the FlowCAP challenge. M.M., J.T., A.B., R.S., Y.Q., K.W. and J.R. contributed automated gating results to FlowCAP. G.F., M.L., M.M. contributed to data analysis. G.F., M.L., R.G., J.P.M., H.M., R.H.S. and R.R.B. and M.J. wrote the paper. All authors reviewed the manuscript, provided manuscript feedback and contributed to data interpretation.

Corresponding author

Correspondence to J Philip McCoy.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (PDF 5667 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Finak, G., Langweiler, M., Jaimes, M. et al. Standardizing Flow Cytometry Immunophenotyping Analysis from the Human ImmunoPhenotyping Consortium. Sci Rep 6, 20686 (2016). https://doi.org/10.1038/srep20686

Download citation

Received: 19 August 2015
Accepted: 05 January 2016
Published: 10 February 2016
DOI: https://doi.org/10.1038/srep20686

This article is cited by

Assessment of a multisite standardized biospecimen collection protocol for immune phenotyping in neurodevelopmental disorders
- Shane Cleary
- Grace Teskey
- Jane A. Foster
Scientific Reports (2023)
Deep Immunophenotyping of Human Whole Blood by Standardized Multi-parametric Flow Cytometry Analyses
- Jian Gao
- Yali Luo
- Feng Qian
Phenomics (2023)
Description and optimization of a multiplex bead-based flow cytometry method (MBFCM) to characterize extracellular vesicles in serum samples from patients with hematological malignancies
- Lin Li
- André Görgens
- Helga Schmetzer
Cancer Gene Therapy (2022)
Ulcerative colitis is characterized by a plasmablast-skewed humoral response associated with disease activity
- Mathieu Uzzan
- Jerome C. Martin
- Saurabh Mehandru
Nature Medicine (2022)
Sustained peripheral immune hyper-reactivity (SPIHR): an enduring biomarker of altered inflammatory responses in adult rats after perinatal brain injury
- Yuma Kitase
- Eric M. Chin
- Lauren L. Jantzie
Journal of Neuroinflammation (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.