Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Fast quantification of gut bacterial species in cocultures using flow cytometry and supervised classification

## Abstract

A bottleneck for microbial community experiments with many samples and/or replicates is the fast quantification of individual taxon abundances, which is commonly achieved through sequencing marker genes such as the 16S rRNA gene. Here, we propose a new approach for high-throughput and high-quality enumeration of human gut bacteria in a defined community, combining flow cytometry and supervised classification to identify and quantify species mixed in silico and in defined communities in vitro. We identified species in a 5-species in silico community with an F1 score of 71%. In addition, we demonstrate in vitro that our method performs equally well or better than 16S rRNA gene sequencing in two-species cocultures and agrees with 16S rRNA gene sequencing data on the most abundant species in a four-species community. We found that shape and size differences alone are insufficient to distinguish species, and that it is thus necessary to exploit the multivariate nature of flow cytometry data. Finally, we observed that variability of flow cytometry data across replicates differs between gut bacterial species. In conclusion, the performance of supervised classification of gut species in flow cytometry data is species-dependent, but is for some combinations accurate enough to serve as a faster alternative to 16S rRNA gene sequencing.

## Introduction

Microbial communities play an important role in global and industrial biochemical processes [1] and human health and disease [2,3,4]. Understanding their structure and functionality is key in manipulating microbial communities such that they carry out specific functions such as, among others, fuel production [5], plastic degradation [6], or human microbiota modulation [7].

A commonly encountered bottleneck in microbial community experiments, especially if they involve a large number of replicates, is quantifying the abundance of the different species. Conventional enumeration methods based on colony-forming units (CFU, e.g., [8]) assume that species are culturable and can be differentiated based on biochemistry or morphology. In culture-independent approaches, microbial abundance is commonly estimated by sequencing marker genes such as the 16S rRNA gene or by counting shotgun metagenomic reads mapped to genomes and reference genes [9,10,11]. Despite ongoing automation efforts, the entire process from DNA extraction to sequencing is still time-consuming. Varying 16S rRNA gene copy numbers, nucleic acid extraction and amplification efficiencies as well as PCR primer selectivity introduce biases that need to be corrected [12, 13]. Moreover, some form of normalization is required to adjust for different sequencing depths. The resulting relative taxon abundances do not allow accurate assessment of whether taxa change or stay constant in relation to other taxa, necessitating specific data transformation or network construction techniques to compute associations from 16S rRNA gene sequencing data [14, 15]. Additional measurements are needed to convert relative abundances to count data (e.g., quantitative PCR, quantitative sequencing spike-ins and flow cytometry [16,17,18,19]).

Flow cytometry (FC) is a single-cell technique that records optical characteristics for thousands of cells [20] and is becoming an alternative approach to sequencing for the exploration of microbial communities [21, 22]. Several tools have been developed to partly or fully automatically cluster events in FC data into groups [23,24,25]. In analogy to operating taxonomic units (OTUs) in sequencing data, alpha and beta diversity can be estimated based on the number of these groups [26] and differences in FC groups between samples have served as disease markers [27]. The change of FC groups over time has also been monitored to assess resistance and resilience [28] as well as neutrality [29]. However, FC groups have a disadvantage compared to OTUs: it is difficult to identify the taxa forming these groups.

Since several characteristics are measured per cell, FC data are inherently multivariate. This multivariate nature can be exploited to train a classifier on monocultures which can then be applied to assign cells in communities to different species. Thus, supervised classification techniques have the potential to deliver species-specific counts for community samples without requiring labels. In a pioneering work, neural networks applied to flow cytometry data successfully differentiated between dozens of phytoplankton species [30, 31]. This technique was also advocated by Davey & Kell for bacteria [32]. Recently, Rubbens et al. [33] were able to predict the abundances of soil bacterial species mixed in different proportions in silico as well as in vitro using linear discriminant analysis and random forests. Duygan et al. [34] applied neural networks to flow cytometry data to infer microbial ‘cell type’ diversity in a lake community by comparing samples to signatures of predefined strain and bead standards.

Human gut bacterial communities are known to be involved in gastrointestinal conditions such as inflammatory bowel disease and irritable bowel syndrome. To better understand gut microbial responses to perturbations such as antibiotics or changes in diet and to unravel microbial interactions, artificial gut communities are being studied in vitro. In such studies, species-specific microbial abundances are often assessed through 16S rRNA sequencing [35,36,37]. The main goal here is to evaluate the performance of supervised classification applied to FC data of human gut species and to compare it to 16S rRNA sequencing. Additionally, we look at variability of flow cytometry data across different experiments and between different gut bacteria.

## Methods

### Bacterial strains

Gut bacteria originating from human feces and a common lab strain which was also labeled with a fluorescent protein were selected for this experiment (Table 1).

### Culture conditions

All bacteria were cultivated at 37 °C without agitation under anaerobic conditions in a Don Whitley A135 Anaerobic Workstation with HEPA filter (10% H2, 10% CO2, 80% N2, 55% humidity). 16S rRNA gene sequencing was performed regularly to confirm species identity.

Prior to the experiments, all gut-derived strains were subcultured twice (48 h and 18 h, respectively) in modified Gifu Anaerobic Medium broth (mGAM [38], HyServe), except for F. prausnitzii DSM17677, which was grown in Reinforced Clostridial Medium broth (RCM, Oxoid). All bacterial cultures were sampled in stationary phase, as determined by optical density OD600 in a plate reader (Epoch2, Biotek, Supplementary Fig. 1).

E. coli expressing mCherry was cultivated at 37 °C in RCM broth, in the presence of ampicillin (100 μg/ml), with 200 rpm agitation and under aerobic conditions for 16 hours prior to each experiment.

### Flow cytometry

After 18 hours of growth, the cells were serially diluted 1000× in PBS to an approximate cell density of 106 cells per ml and stained with 1 μl/ml SYBR green I (1:100 dilution in dimethylsulfoxide; 20 min incubation at 37 °C; 10.000 concentrate, Thermo Fisher Scientific) following the protocols described in [18, 33, 34, 39]. Two flow cytometers were used in this study. The setup and experimental use of both instruments are summarized in Table 2. For selected mock communities, we used a benchtop Accuri C6 flow cytometer (BD Biosciences). A threshold value of 2000 was applied to the FL1 channel. The Accuri C6 flow cytometer delivered a multiparametric description of each event in each  sample consisting of 13 parameters (SSC-A, SSC-H, FSC-A, FSC-H, FL1-A, FL1-H, FL2-A, FL2-H, FL3-A, FL3-H, FL4-A, FL4-H and Width). During the study period, the instrument was calibrated daily with Spherotech 8-peak and 6-peak validation beads (BD Biosciences). The four-species community as well as all mock communities involving E. coli with mCherry were measured with a benchtop CytoFLEX S flow cytometer (Beckman Coulter) which, in contrast to Accuri C6 instrument, has the required filters to detect mCherry (mainly 610/20). This resulted in a multiparametric description of each event consisting of 23 parameters (FSC-A, FSC-H, SSC-A, SSC-H, FL1-A, FL1-H, FL2-orange-A, FL2-orange-H, FL3-red-A, FL3-red-H, FL4-A, FL4-H, APC-A750-A, APC-A750H, VSSC-A, VSSC-H, KO525-A, KO252-H, mCherry-A, mCherry-H, PI-A, PI-H and FSC-Width). During the study period, the instrument was calibrated daily with CytoFLEX Daily QC Fluorospheres.

All events were quantified using a volumetric method (measured events/μl).

### In vitro mock communities

Cell densities of all in vitro mock communities were first measured separately with the CytoFLEX, after which the community suspensions were diluted in 0.2 μm filtered PBS to reach final cell densities of approximately 5000 cells/μl. These standardized suspensions were then mixed in intended proportions of 5%, 10%, 20%, 40%, 50%, 60%, 80%, 90 and 95%, in a final volume of 1 ml (Fig. 1). In the case of the mock community with Collinsella aerofaciens and Bacteroides thetaiotaomicron, proportions of 1 and 2% were also prepared. Due to inherent pipetting errors, the mock communities do not reach the intended proportions precisely. For this reason, cell density in each monoculture (for the intended proportions) was counted with the flow cytometer by diluting the cells in 0.2 μm filtered PBS. The proportions based on these measurements differ somewhat from the intended proportions and are referred to here as expected proportions. Proportions predicted through supervised classification are compared to these expected proportions. Removal of debris and/or background was accomplished here by gating for SYBR/mCherry events in FL1 and FL3 channels respectively.

### In vitro co-growth communities

The in vitro co-growth community was composed of four bacteria (Roseburia intestinalis, Blautia hydrogenotrophica, Bacteroides thetaiotaomicron and Faecalibacterium prausnitzii). Monocultures were grown for 18 hours in RCM, after which cells were counted with the CytoFLEX flow cytometer and cultures were diluted to roughly obtain 1*106 cells per ml per species. Next, bacteria were added in equal proportions to obtain a total final concentration of 4*106 cells/ml in 10 mL RCM. Both monocultures and communities were grown for 48 hours in batch, and samples were taken at timepoints 24 h and 48 h. Three biological replicates were prepared in this experiment.

### 16S rRNA gene sequencing

Samples were centrifuged at 12130 × g for 10 min. The supernatant was removed, and the remaining pellets were stored at −80 °C until further processing. DNA extraction was carried out using the MoBio PowerMicrobiome RNA isolation kit as previously described [40]. Next, the V4 region for the 16S rRNA gene was amplified with the primer pair 515 F/806 R and sequencing was performed using the Illumina MiSeq platform to generate paired-end reads of 250 base pairs. After demultiplexing with sdm as part of the LotuS pipeline [41] without allowing for mismatches, fastq sequences were preprocessed using DADA2 pipeline v1.14.1 [42]. The taxonomy was assigned initially using RDP classifier v2.13 but for taxa that were not correctly identified, the sequence variants were aligned to EzBioCloud database [43] to ensure accurate assignment of the species. Taxa proportions were corrected with 16S rRNA gene copy numbers retrieved from the rRNA operon copy number database rrnDB [44] and the National Center for Biotechnology Information (NCBI, Bethesda (MD): National Library of Medicine (US)), and multiplied by total cell count from the sample obtained by flow cytometry.

### CellScanner

CellScanner is a new standalone tool (manuscript in preparation) for the analysis of flow cytometry data that performs gating and uses supervised classification techniques to assign events from cocultures to species indicated by the user (Fig. 1). CellScanner relies on flow cytometry data from monocultures (reference files) to train 10 classifiers (neural networks of 200 layers, using lbfgs solver and the rectified linear unit activation function). For each classifier, 1000 events per species are selected randomly from the corresponding monoculture. Of these, 875 events (7/8) are used to train the neural network and 125 events (1/8) to test the trained neural network. Each trained classifier then assigns a species to each event in the coculture. This procedure is carried out for each co-culture sample separately. For the analyses on Accuri data, all 13 parameters were taken into account. For CytoFLEX, nine parameters representing the area (FSC-A, SSC-A, FL1-A, FL2-orange-A, FL3-red-A, FL4-A, VSSC-A, mCherry-A, PI-A) and FCS-Width were considered. Only the area (A) records were considered since the height (H) records did not provide additional information and increased the calculation time.

For each experiment, monoculture data from the same experiment was used to train the model.

If the mCherry protein is excited, the emission will be detected by the following filters: 660/10, 690/50, 780/60, 585/42, 610/20 and 690/50. For the classification of E. coli and R. intestinalis without taking the red-fluorescence of mCherry into account, only the following parameters were used in CellScanner: FSC-H, FSC-A, SSC-H, SSC-A, FL1-H, FL1-A, VSSC-H, VSSC-A, KO525-H, KO525-A and FSC-Width, thus removing the information from the filters detecting mCherry.

For data obtained by the Accuri flow cytometer, events were identified as background if they met the criteria given by at least one of the following equations, matching the gating described by Vandeputte et al. [18]:

$$FL3A = = 0\,or\,FL1A = = 0$$
$$FL3A \; > \; 0,0241 \times FL1A^{1.0996}$$
$$FSCA \; > \; 100000\,\& \,SSCA \; > \; 10000$$

Because of the stringent threshold settings of the Accuri, very few blank events were detected (<100 per sample). This “line gating” method was thus sufficient to limit the effect of background events on the prediction. For data obtained from CytoFLEX, the considerable amount of detected debris was removed by supervised classification (machine learning). Ten classifiers were trained on blanks and samples from a monoculture respectively to differentiate between debris and cells from a single species. The events classified as debris or as “unknown” were removed. This machine-learning based gating was repeated for each monoculture.

CellScanner was applied to community files that originated either from cocultures in vitro or were compiled in silico from monoculture files (5000 maximum events per species). In the latter case, 1000 events were selected from each monoculture file. Since 10 classifiers are trained, 10 species assignments are made for each event in the coculture. When the ‘unknown’ setting is enabled, and seven out of the ten classifiers agree on a species, then the event is assigned to the corresponding species, else it is classified as unknown. Without the ‘unknown’ setting, each event is classified according to the majority vote of the classifiers. In case of a tie, the event is randomly classified as one of the species.

CellScanner calculates the accuracy (ACC) and the F1 score:

$$Accuracy = \frac{{TP + TN}}{{P + N}}\,F1\,score = \frac{{2TP}}{{2TP + FP + FN}}$$

With TP= True positive, P =Positive values, N = Negatives values, TN = True negative value, FP= False positive value and FN= False negative values.

The removal of events classified as ‘unknown’ reduces the number of false positives, which increases the specificity (TN/(TN+FP)) and the precision (TP/(TP+FP)) (Supplementary Fig. 2). Unless stated otherwise, the “unknown” setting was enabled and proportions were calculated after removal of unknown events.

For ease of comparison, relative abundances are shown. The absolute abundances as events measured by flow cytometry and classified by CellScanner are reported in Supplementary Tables 16.

### In silico communities

All in silico communities consisted of monoculture data from cells grown in mGAM derived from separate FC measurements performed on the Accuri C6 flow cytometer. The in silico communities were generated by CellScanner, using a maximum of 1000 random events from each monoculture file to create a community file.

### Feature importance

LIME (Local Interpretable Model-agnostic Explanation, [45]) was applied to assess the importance of different flow cytometer parameters (= features) for species classification in 66 pairwise species combinations. The importance of each of the 13 Accuri features was estimated for 50 events per classifier, running 10 classifiers per combination (i.e., 500 events). The more predictive a feature is for either species, the higher is the importance of that feature. Importance values receive a positive or negative sign depending on whether the feature contributes to classifying an event as belonging to a species or as not belonging to it. We took the absolute of each importance value and summed feature values coming from the same detector (A+H). We calculated the mean importance for the seven parameters (FSC, SSC, FL1, FL2, FL3, FL4 and Width) for each pair across the 500 events and subsequently for each shape combination.

### Intraspecies variation

To assess intraspecies variation, we analyzed monocultures of four different species (Escherichia coli, Bacteroides thetaiotaomicron, Blautia hydrogenotrophica, Roseburia intestinalis) and the medium alone (mGAM), using seven monocultures from different dates for each. A thousand events from each experiment were selected randomly, except for blank controls, for which all events were considered (0-70 events per file). We ran CellScanner on the five-species in silico community (four species + blank) with the majority rule described above (events on which less than 70% of classifiers agreed were assigned as “unknown”). We calculated pairwise Euclidean distances between events from the Accuri flow cytometer with R function daisy in the cluster package. Because the FL4 parameter is not predictive and highly variable within every monoculture on Accuri, we removed it from the distance calculation to avoid artefacts.

### Statistical analysis

All statistical analyses were performed using R (version 3.6.1, http://www.R-project.org).

## Results

### Identification of gut bacterial species in in silico communities

We first tested how accurately gut bacterial cells can be identified in an in silico mixture, where the true positive and false negative assignment for each event in the community is known. For this, we collected flow cytometry data of monocultures for ten gut species in stationary phase with Accuri C6, mixed them in silico in equal proportions and quantified the accuracy of species identification in these mixtures with CellScanner (Fig. 2A).

For half of the species, more than 50% of the events were not consistently assigned to a single species and thus classified as “unknown”. This suggests that many species overlap in the measured features, which makes them difficult to distinguish by machine learning. Some species, such as Escherichia coli, have distinct features and are thus easy to classify (Supplementary Fig. 3), while the features of other species, e.g., Bl. hydrogenotrophica, overlap with another species, resulting in misclassification. Of note, two species belonging to the same genus (Bact. thetaiotaomicron and Bact. uniformis) are not misclassified as each other but are more commonly misclassified as species in other genera. The overall accuracy of species identification in this ten-species community is 32%, with an F1 score of 39% (Supplementary Fig. 3), including the events assigned as “unknown”. When reducing the community to five species (Fig. 2B), the overall accuracy almost doubles to 62% and the F1 is 71% (Supplementary Fig. 4). With less overlap between the species, the individual classification true positive rate reaches a minimum of 39% for all species, and the proportion classified as ‘unknown’ decreases substantially.

As expected, when selecting the five species that were easiest to recognize in the ten-species community the accuracy of species identification increased. This could help in species selection when designing a consortium.

### Quantification of species in vitro with mock communities

To test whether we can accurately quantify species in a mixture in vitro, we mixed three gut bacterial species grown to stationary phase (E. coli expressing mCherry, R. intestinalis and F. prausnitzii) in different proportions, resulting in three combinations of two species. Labeled E. coli was included as a positive control because the CytoFLEX flow cytometer can easily distinguish the mCherry colored E. coli cells from the SYBR Green stained R. intestinalis or F. prausnitzii cells (Fig. 3). The proportions predicted with CellScanner and with 16S rRNA gene sequencing were compared to the expected proportions. As shown in Fig. 3A and C, the prediction of CellScanner is almost identical to the expected proportions, with an absolute mean difference of 1%. The absolute mean difference of the 16S results is 25% from the expected proportions, with 2% of the number of events classified as ‘unknown’ on average. For ease of comparison, proportions are compared, absolute abundances for all species combinations are reported in supplement (Supplementary Table 1).

Because it is straightforward to distinguish E. coli labeled with mCherry from a non-labeled species, we tested whether CellScanner could still differentiate the two species when the red fluorescence channels were left out in the software, in essence removing the fluorescent label of E. coli. As shown in Figs. 3C and 3D, CellScanner predictions are less accurate without these channels, with events classified as ‘unknown’ increasing to 8% (Supplementary Table 2) but are still close to the expected abundances (absolute mean abundance difference of 3%). Thus, for these species pairs, information from scattered light and the remaining channels was sufficient to accurately identify both species in the mixture.

Next, we tested how well CellScanner could identify unlabeled species in stationary phase in mock communities of known proportions. First, we collected mock community data for 11 ratios of Bact. thetaiotaomicron and C. aerofaciens and found that proportions predicted by CellScanner are relatively close to the expectation, with an absolute mean difference of 19% (Fig. 4A, Supplementary Table 3). We then repeated this experiment for F. prausnitzii and R. intestinalis. For comparison, we also determined mock community proportions through 16S rRNA gene sequencing. In case of F. prausnitzii and R. intestinalis, both sequencing and CellScanner are close to the expected abundance (absolute mean difference of 7% for 16S versus 13 and 18% for F. prausnitzii and R. intestinalis respectively; Fig. 4B and Supplementary Table 4).

For a mock community of three species, the absolute mean difference between the expected abundance and CellScanner’s prediction is 13%, 17 and 23% for R. intestinalis, E. coli and Bact. uniformis respectively (Fig. 4C; Supplementary Table 5 shows the results when keeping ‘unknown’ events). Although the confusion matrix (Fig. 4D) shows that E. coli should be easily distinguished from the other bacteria, it is not always predicted in the correct proportion (e.g., Fig. 4C, Ratio 3). In conclusion, supervised classification works well for some bacterial species combinations but not for others, in agreement with previous findings [33].

The experiments described above were performed with in vitro mock communities with abundance measurements available for each species for each ratio. Finally, we tested whether CellScanner could identify species in a community of four gut bacteria grown together for 48 h. Since 50% of the events were classified as ‘unknown’ using the settings as described in the “Methods” (Supplementary Fig. 5 and Supplementary Table 6), we ran CellScanner without any “unknown” assignment to assess how these previous “unknown” events were classified (Fig. 5). In both cases, CellScanner and 16S rRNA gene sequencing agree on R. intestinalis dominance after 24 and 48 h of growth. Although CellScanner’s accuracy drops with increasing species number, prediction of bacterial dominance is still in agreement with sequencing results in this case.

### Shape and size differences do not explain classification accuracy

Flow cytometry data depends on physical characteristics of a cell such as size (forward scatter) and shape (side scatter). To test whether differences in size and shape improve classification accuracy, we compared all 66 pairwise predictions for 12 gut bacterial species in silico (Supplementary Fig. 6) and found that difference in shape is not sufficient to ensure a high accuracy (>80%, Fig. 6A) and that vice versa, species pairs with the same shape can reach high accuracies. For instance, 89% of the pairwise predictions within the bacillus group resulted in an F1 score greater than 80%. Next, we compared the importance of different features of FC data (i.e., forward scatter, side scatter etc.) for species classification. The feature importance was calculated with Lime (see “Methods”), a program that compares the feature value for each event to the feature values of the training dataset with similar values to assess with which probability this feature represents a specific species. This way, Lime assesses whether a particular feature makes a useful contribution to the classification task.

We assessed feature importance for all 66 pair-wise predictions for 12 bacteria in silico (Fig. 6B) and confirmed that neither size nor shape was sufficient to separate species. For all predictions, forward and side scatter had a lower importance (respectively 10 and 12% of the global importance on average) than the other features together (78% of the global importance on average), but they also contributed to classification. Thus, more than two features are needed to separate the species with high accuracy, emphasizing that multivariate data are necessary for classification. As expected, the feature importance of fluorescence channels decreased with their distance to the SYBR-Green emission spectrum, with FL-1 having the highest and FL-4 the lowest values. In addition, we found that 50 of the pairwise predictions with a forward scatter feature importance value higher than 20% were related to Bif. adolescentis (bifid group). This suggests that the shape of this species is distinct enough from the others to have an impact on species classification.

### Species properties in flow cytometry data differ across biological replicates

To assess the robustness of the predictions to biological variation, we tested whether CellScanner could distinguish monoculture data of the same species across biological replicates. We mixed monoculture data of each species to obtain in silico communities. Bact. thetaiotaomicron biological replicates are separated with an accuracy of 68% and Bl. hydrogenotrophica with 65%, for seven monocultures each. The other species are harder to distinguish between experiments, in particular E. coli for which we observed an accuracy of 36% (Fig. 6C). Differences between monocultures of Bact. thetaiotaomicron are already apparent in the 3D plot, where the clusters are distinct (Supplementary Fig. 7).

To confirm that monocultures of the same species can vary from one experiment to another, and to emphasize that this variation depends on the species, we assessed intra- and inter-cluster variation (Fig. 6C). We computed intra-cluster variation as the mean of all pairwise Euclidean distances between events per experiment and then averaged across all experiments of a species. For inter-cluster variation, we computed the mean pairwise Euclidean distance between experiment centroids, where the centroid is defined as the mean distance between experiment-specific events. A large intra-cluster variation is due to high heterogeneity within experiments whereas a large inter-cluster variation indicates high variation between experiments. We also assessed the variability across technical replicates, which is small compared to biological replicates (Supplementary Fig. 8, Supplementary Table 3). These results confirm that clusters change across biological replicates, and that this change is species-specific (e.g., strong for Bact. thetaiotaomicron and weak for E. coli).

We observed the highest intra- and inter-cluster variation for the blanks (Fig. 6C), which we attribute to the small number of particles and to their high diversity. Because the size and shape of particles in the blanks differ, they do not always form a well-defined cluster. However, their features are distinct enough to separate these particles from events representing bacterial cells using supervised classification.

In summary, heterogeneity across biological replicates is variable and species-specific.

## Discussion

In this study, we evaluated supervised classification applied to FC data as a method to count gut bacterial species in mixtures. Assessment of this method on mock communities in silico and in vitro showed that it can resolve proportions in cocultures, but also that its accuracy depends on the species combination. In addition, in a low-complexity gut community, it reproduced trends seen with 16S rRNA gene sequencing.

Our method has several advantages: it avoids labor-intensive DNA extraction or plating, does not require fluorescent labeling of species, and in contrast to 16S rRNA gene sequencing delivers absolute abundances. However, the method is limited to cocultures and small bacterial communities; we observed a decline in accuracy with increasing species number (Fig. 2).

In the co-growth experiment (Fig. 5), it is of note that Bl. hydrogenotrophica disappears from the second replicate in 16S data, but not in FC-based data. It may still be present in 16S rRNA gene sequencing data but was too rare to be captured during sequencing. Alternatively, in FC data, cells from other species may have been misclassified as Bl. hydrogenotrophica, inflating its abundance in FC-based counts (Supplementary Fig. 9). However, 16S rRNA gene sequencing accuracy in small communities can also be low. For example, the 16S rRNA gene sequencing results differed on average 25% for the expected abundance in the mock community with E. coli expressing mCherry (Gram-negative) and R. intestinalis (Gram-positive), but only 9% in the community with R. intestinalis and F. prausnitzii, where both bacteria are Gram-positive. This variation is in line with previous studies showing that 16S rRNA gene sequencing results of mock communities did not match the expected community compositions [46,47,48]. In the absence of a ground truth for the co-growth experiment, we do not know which technique is closer to the true counts.

We show that FC features linked to cell shape and size are not sufficient to distinguish species. FC-based data are commonly analyzed by incorporating one or two features at the same time (2-D histograms). In the present study, feature importance was often evenly spread across three or more features when classifying species pairs and also included spillover channels. This is in line with previous experiments [49], where the authors show (using the same FC instrument), that FL1, FL2, FSC and FL3 are the channels resulting in the best identification. In addition, they note that with increasing community complexity, more channels are needed for an optimal identification. It is therefore important to use multivariate methods to benefit from all generated data in order to classify each event with higher accuracy. In our experiments, a single non-discriminating dye (SYBR Green) staining all cells was combined with an E. coli strain expressing a fluorescent protein to be able to distinguish it from other species. As expected, a species-specific fluorescent label increased the accuracy of supervised classification (Supplementary Table 7). Likely, further combinations with other fluorescent labels allowing to distinguish different species may lead to an increased accuracy of single-cell predictions in (complex) microbial communities [50, 51]. For instance, a fluorescent polyclonal antibody against F. prausnitzii was developed recently for use in flow cytometry [52].

For some species such as E. coli, variability across biological replicates of monocultures was low, while other bacteria (e.g., Bact. thetaiotaomicron and Bl. hydrogenotrophica) showed considerable variation. We found technical variability to be consistently small (Supplementary Fig. 8), implying that the variability mostly had biological sources. Vives-Rego et al. [53] hypothesized that both cell size diversity and cell cycle variations lie at the origin of experimental variation. This can influence the monoculture data when comparing datasets from different experiments with the same bacterial monoculture measurements. We tried to keep this to a minimum in our experiments by using bacteria in the stationary phase of the growth cycle and using the same medium for each experiment. Another reason for the variability could be explained by bacterial aggregation, which may differ for each species per experiment and could influence the measured parameters [54]. We accounted for this by vortexing the samples, but since we are not sure whether this resolved the issue, it could be further explored in future studies. In the case of Bact. thetaiotaomicron, biological variability can be attributed to cell shape switching between three morphologies resembling the Greek letters θ, ι and ο [55]. However, cell shape variation is probably not the only factor explaining biological variability since heterogeneity is also observed for other bacteria with only one cell shape such as Bl. hydrogenotrophica [56]. The latter species can occur singly or in pairs, which might affect the readings in the flow cytometer if they cross the light beam when still attached to each other. An important additional limitation of our method is the observation that coculturing bacteria can lead to reduced phenotypic heterogeneities [57], and therefore the characteristics measured in monocultures might not always represent the same characteristics when grown in coculture. Future research could identify the traits that affect the features measured by FC. If one of the species in the coculture could be distinctly labeled, classifiers could be trained on these events without taking the channels used for the label into account. Subsequently, these classifiers could be used on a community without labeled species. Although this would require initial labeling of species, the labeling step(s) could be omitted later allowing for a more efficient throughput of samples. The need for training data (and hence monocultures) for different media and physiological states could be overcome with unsupervised clustering approaches. However, such approaches would require a method of linking clusters to species and may not be sufficiently accurate. Alternatively, a publicly available collection of monocultures and trained classifiers built by the research community could address this problem in the long term.

The method was tested on gut bacteria in stationary phase. Since cells may change their physiology throughout the growth curve, it is a limitation of this method that mono- and coculture samples need to be taken from the same growth phase. In addition to using calibration beads to calibrate the flow cytometer, a standardized bacterial mock community could be used to account for differences in sample handling during different FC experiments [58].

The field of flow cytometry is constantly evolving and detection of small particles is getting more accurate. As the resolution of FC instruments increases, we are able to obtain more detailed data for each event, which will increase classification accuracy. In addition, better and smaller cameras improve imaging technologies for FC (IFC), which allows capturing a photo of each individual event that could subsequently be automatically analyzed. IFC can capture multiple cellular parameters such as size, volume, and shape [59,60,61]. Combining the multiparametric data from conventional FC and IFC could further boost the accuracy of supervised classification. In addition, more recent machine learning techniques, such as UMAP, may outperform the classification technique used here [62].

Taken together, our results illustrate that machine learning combined with FC can give accurate abundances for unlabeled species in cocultures and captures trends in small communities. In combination with multiplex labeling, this approach has the potential to become a fast yet accurate technique for differential counting of microorganisms in small communities.

## Data availability

CellScanner is available on GitHub: https://github.com/Clem-Jos/CellScanner. All flow cytometry data is available on flowrepository.org. To open the link, please paste it into a browser. Ratios BT & CA: https://flowrepository.org/id/FR-FCM-Z3TX Ratios RI, FP & EC: https://flowrepository.org/id/FR-FCM-Z3TM Ratios RI, BU & EC https://flowrepository.org/id/FR-FCM-Z3TP Cogrowth of RI, BH, BT & FP: https://flowrepository.org/id/FR-FCM-Z3TQ Monoculture data: https://flowrepository.org/id/FR-FCM-Z3U2

## References

1. Falkowski PG, Fenchel T, Delong EF. The microbial engines that drive earth’s biogeochemical cycles. Science. 2008;320:1034–9.

2. Blumberg R, Powrie F. Microbiota, disease, and back to health: a metastable journey. Sci Transl Med. 2012;4:137rv7.

3. Nicholson JK, Holmes E, Kinross J, Burcelin R, Gibson G, Jia W, et al. Host-gut microbiota metabolic interactions. Science. 2012;336:1262–7.

4. Clemente JC, Ursell LK, Wegener Parfrey L, Knight R. The impact of the gut microbiota on human health: An integrative view. Cell. 2012;148:1258–70.

5. Kazamia E, Aldridge DC, Smith AG. Synthetic ecology—A way forward for sustainable algal biofuel production? J Biotechnol. 2012;162:163–9.

6. Wierckx N, Prieto MA, Pomposiello P, de Lorenzo V, O’Connor K, Blank LM. Plastic waste as a novel substrate for industrial biotechnology. Microb Biotechnol. 2015;8:900–3.

7. Buffie CG, Bucci V, Stein RR, McKenney PT, Ling L, Gobourne A, et al. Precision microbiome reconstitution restores bile acid mediated resistance to Clostridium difficile. Nature. 2015;517:205–8.

8. Saleem M, Fetzer I, Dormann CF, Harms H, Chatzinotas A Predator richness increases the effect of prey diversity on prey yield. Nat Commun. 2012;3:1305.

9. Langille MGI, Zaneveld J, Caporaso JG, McDonald D, Knights D, Reyes JA, et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol. 2013;31:814–21.

10. Costea PI, Zeller G, Sunagawa S, Pelletier E, Alberti A, Levenez F, et al. Towards standards for human fecal sample processing in metagenomic studies. Nat Biotechnol. 2017;35:1069–76.

11. Nissen JN, Johansen J, Allesøe RL, Sønderby CK, Armenteros JJA, Grønbech CH, et al. Improved metagenome binning and assembly using deep variational autoencoders. Nat Biotechnol. 2021;39:555–60.

12. McLaren MR, Willis AD, Callahan BJ. Consistent and correctable bias in metagenomic sequencing experiments. Elife. 2019;8:e46923.

13. Morgan JL, Darling AE, Eisen JA. Metagenomic sequencing of an in vitro-simulated microbial community. PLoS One. 2010;5:422–33.

14. Friedman J, Alm EJ. Inferring Correlation Networks from Genomic Survey Data. PLoS Comput Biol. 2012;8:e1002687.

15. Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ Microbiome datasets are compositional: And this is not optional. Front Microbiol. 2017;8. https://doi.org/10.3389/fmicb.2017.02224.

16. Nadkarni MA, Martin FE, Jacques NA, Hunter N. Determination of bacterial load by real-time PCR using a broad-range (universal) probe and primers set. Microbiology. 2002;148:257–66.

17. Props R, Kerckhof FM, Rubbens P. Vrieze J De, Sanabria EH, Waegeman W, et al. Absolute quantification of microbial taxon abundances. ISME J. 2017;11:584–7.

18. Vandeputte D, Kathagen G, D’Hoe K, Vieira-Silva S, Valles-Colomer M, Sabino J, et al. Quantitative microbiome profiling links gut community variation to microbial load. Nature. 2017;551:507–11.

19. Morton JT, Marotz C, Washburne A, Silverman J, Zaramela LS, Edlund A, et al. Establishing microbial composition measurement standards with reference frames. Nat Commun. 2019;10:2719.

20. Díaz M, Herrero M, García LA, Quirós C. Application of flow cytometry to industrial microbial bioprocesses. Biochem Eng J. 2010;48:385–407.

21. De Roy K, Clement L, Thas O, Wang Y, Boon N. Flow cytometry for fast microbial community fingerprinting. Water Res. 2012;46:907–19.

22. Müller S, Nebe-Von-Caron G. Functional single-cell analyses: Flow cytometry and cell sorting of microbial populations and communities. FEMS Microbiol Rev. 2010;34:554–87.

23. Mosmann TR, Naim I, Rebhahn J, Datta S, Cavenaugh JS, Weaver JM, et al. SWIFT-scalable clustering for automated identification of rare cell populations in large, high-dimensional flow cytometry datasets, Part 2: Biological evaluation. Cytom Part A. 2014;85:422–33.

24. Ludwig J, Zu Siederdissen CH, Liu Z, Stadler PF, Müller S. FlowEMMi: An automated model-based clustering tool for microbial cytometric data. BMC Bioinformatics. 2019;20:1–17.

25. Van Gassen S, Callebaut B, Van Helden MJ, Lambrecht BN, Demeester P, Dhaene T, et al. FlowSOM: Using self-organizing maps for visualization and interpretation of cytometry data. Cytom Part A. 2015;87:636–45.

26. Props R, Monsieurs P, Mysara M, Clement L, Boon N. Measuring the biodiversity of microbial communities by flow cytometry. Methods Ecol Evol. 2016;7:1376–85.

27. Rubbens P, Props R, Kerckhof FM, Boon N, Waegeman W. Cytometric fingerprints of gut microbiota predict Crohn’s disease state. ISME J. 2021;15:354–8.

28. Liu Z, Cichocki N, Bonk F, Günther S, Schattenberg F, Harms H, et al. Ecological Stability Properties of Microbial Communities Assessed by Flow Cytometry A novel approach to determine microbiome stability properties and follow its dynamics using flow cytometric data. Ecol Evol Sci. 2018;3:e00564–17.

29. Liu Z, Cichocki N, Hübschmann T, Süring C, Ofiţeru ID, Sloan WT, et al. Neutral mechanisms and niche differentiation in steady-state insular microbial communities revealed by single cell analysis. Environ Microbiol. 2019;21:164–81.

30. Frankel DS, Olson RJ, Frankel SL, Chisholm SW. Use of a neural net computer system for analysis of flow cytometric data of phytoplankton populations. Cytometry. 1989;10:540–50.

31. Boddy L, Morris CW, Wilkins MF, Al-Haddad L, Tarran GA, Jonker RR, et al. Identification of 72 phytoplankton species by radial basis function neural network analysis of flow cytometric data. Mar Ecol Prog Ser. 2000;195:47–59.

32. Davey HM, Kell DB. Flow cytometry and cell sorting of heterogeneous microbial populations: The importance of single-cell analyses. Microbiol Rev. 1996;60:641–96.

33. Rubbens P, Props R, Boon N, Waegeman W. Flow cytometric single-cell identification of populations in synthetic bacterial communities. PLoS One. 2017;12:e0169754.

34. Özel Duygan BD, Hadadi N, Babu AF, Seyfried M, van der Meer JR. Rapid detection of microbiota cell type diversity using machine-learned classification of flow cytometry data. Commun Biol. 2020;3:379.

35. Oliphant K, Parreira VR, Cochrane K, Allen-Vercoe E. Drivers of human gut microbial community assembly: coadaptation, determinism and stochasticity. ISME J. 2019;13:3080–92.

36. Venturelli OS, Carr AC, Fisher G, Hsu RH, Lau R, Bowen BP, et al. Deciphering microbial interactions in synthetic human gut microbiome communities. Mol Syst Biol. 2018;14:e8157.

37. Das P, Ji B, Kovatcheva-Datchary P, Bäckhed F, Nielsen J. In vitro co-cultures of human gut bacterial species as predicted from co-occurrence network analysis. PLoS One. 2018;13:1–14.

38. Rettedal EA, Gumpert H, Sommer MOA. Cultivation-based multiplex phenotyping of human gut microbiota allows targeted recovery of previously uncultured bacteria. Nat Commun. 2014;5:4714.

39. D’hoe K, Vet S, Faust K, Moens F, Falony G, Gonze D, et al. Integrated culturing, modeling and transcriptomics uncovers complex interactions and emergent behavior in a three-species synthetic gut community. Elife. 2018;7:1–30.

40. Falony G, Joossens M, Vieira-Silva S, Wang J, Darzi Y, Faust K, et al. Population-level analysis of gut microbiome variation. Science. 2016;352:560–4.

41. Hildebrand F, Tadeo R, Voigt AY, Bork P, Raes J. LotuS: An efficient and user-friendly OTU processing pipeline. Microbiome. 2014;2:30.

42. Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods. 2016;13:581–3.

43. Yoon SH, Ha SM, Kwon S, Lim J, Kim Y, Seo H, et al. Introducing EzBioCloud: A taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies. Int J Syst Evol Microbiol. 2017;67:1613–7.

44. Stoddard SF, Smith BJ, Hein R, Roller BRK, Schmidt TM rrnDB: Improved tools for interpreting rRNA gene abundance in bacteria and archaea and a new foundation for future development. Nucleic Acids Res. 2015;43:D593–8.

45. Ribeiro MT, Singh S, Guestrin C “Why Should I Trust You?”: Explaining the Predictions of Any Classifier. In: KDD ’16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. p. 1135–44.

46. Bender JM, Li F, Adisetiyo H, Lee D, Zabih S, Hung L, et al. Quantification of variation and the impact of biomass in targeted 16S rRNA gene sequencing studies. Microbiome. 2018;6:155.

47. Teng F, Darveekaran Nair SS, Zhu P, Li S, Huang S, Li X, et al. Impact of DNA extraction method and targeted 16S-rRNA hypervariable region on oral microbiota profiling. Sci Rep. 2018;8:16321.

48. Fouhy F, Clooney AG, Stanton C, Claesson MJ, Cotter PD. 16S rRNA gene sequencing of mock microbial populations-impact of DNA extraction method, primer choice and sequencing platform. BMC Microbiol. 2016;16:123

49. Rubbens P, Props R, Garcia-Timermans C, Boon N, Waegeman W. Stripping flow cytometry: How many detectors do we need for bacterial identification? Cytom Part A. 2017;91:1184–91.

50. Sträuber H, Müller S. Viability states of bacteria-Specific mechanisms of selected probes. Cytom Part A. 2010;77:623–34.

51. Mason DJ, Shanmuganathan S, Mortimer FC, Gant VA. A fluorescent gram stain for flow cytometry and epifluorescence microscopy. Appl Environ Microbiol. 1998;64:2681–5.

52. Bellais S, Nehlich M, Ania M, Duquenoy A, Mazier W, van den Engh G, et al. Species-targeted sorting and cultivation of commensal bacteria from the gut microbiome using flow cytometry under anaerobic conditions. Microbiome. 2022;10:1–17.

53. Vives-Rego J, Resina O, Comas J, Loren G, Julià O. Statistical analysis and biological interpretation of the flow cytometric heterogeneity observed in bacterial axenic cultures. J Microbiol Methods. 2003;53:43–50.

54. Simón-Soro Á, D’Auria G, Collado MC, Džunková M, Culshaw S, Mira A. Revealing microbial recognition by specific antibodies. BMC Microbiol. 2015;15:132.

55. Distaso A. Contribution à l’étude sur l’intoxication intestinale. Cent Bakteriol Parasit Orig. 1912;62:433.

56. Bernalier A, Willems A, Leclerc M, Rochet V, Collins MD. Ruminococcus hydrogenotrophicus sp. nov., a new H2/CO2-utilizing acetogenic bacterium isolated from human feces. Arch Microbiol. 1996;166:176–83.

57. Heyse J, Buysschaert B, Props R, Rubbens P, Skirtach AG, Waegeman W, et al. Coculturing bacteria leads to reduced phenotypic heterogeneities. Appl Environ Microbiol. 2019;85:e02814–18.

58. Cichocki N, Hübschmann T, Schattenberg F, Kerckhof FM, Overmann J, Müller S. Bacterial mock communities as standards for reproducible cytometric microbiome analysis. Nat Protoc. 2020;15:2788–812.

59. Mikami H, Kawaguchi M, Huang CJ, Matsumura H, Sugimura T, Huang K, et al. Virtual-freezing fluorescence imaging flow cytometry. Nat Commun 2020;11:1–11. Available from: https://www.nature.com/articles/s41467-020-14929-2

60. Han Y, Gu Y, Zhang AC, Lo YH. Review: Imaging Technologies for Flow Cytometry. Lab Chip. 2016;16:4639.

61. Oheim M Advances and challenges in high-throughput microscopy for live-cell subcellular imaging. [Internet]. 2011;6:1299–315. Available from: https://doi.org/10.1517/17460441.2011.637105

62. McInnes L, Healy J, Melville J UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. https://arxiv.org/abs/1802.03426v3 2018.

63. Eggerth AH, Gagnon BH. The Bacteroides of Human Feces. J Bacteriol. 1933;25:389–413.

64. Kim MS, Roh SW, Bae JW. Bifidobacterium stercoris sp. nov., isolated from human faeces. Int J Syst Evol Microbiol. 2010;60:2823–7.

65. Baron EJ, Summanen P, Downes J, Roberts MC, Wexler H, Finegold SM. Bilophila wadsworthia, gen. nov. and sp. nov., a unique gram-negative anaerobic rod recovered from appendicitis specimens and human faeces. J Gen Microbiol. 1989;135:3405–11.

66. Liu C, Finegold SM, Song Y, Lawson PA. Reclassification of Clostridium coccoides, Ruminococcus hansenii, Ruminococcus hydrogenotrophicus, Ruminococcus luti, Ruminococcus productus and Ruminococcus schinkii as Blautia coccoides gen. nov., comb. nov., Blautia hansenii comb. nov., Blautia hydroge. Int J Syst Evol Microbiol. 2008;58:1896–902.

67. Kageyama A, Benno Y, Nakase T. Phylogenetic and phenotypic evidence for the transfer of Eubacterium aerofaciens to the genus Collinsella as Collinsella aerofaciens gen. nov., comb. nov. Int J Syst Bacteriol. 1999;49:557–65.

68. National Research Counsil (US) Steering Group for the Workshop on Size Limits of Very Small Microorganisms, Riley M. Size Limits of Very Small Microorganisms - Proceedings of a workshop. 1999.

69. Duncan SH, Hold GL, Harmsen HJM, Stewart CS, Flint HJ. Growth requirements and fermentation products of Fusobacterium prausnitzii, and a proposal to reclassify it as Faecalibacterium prausnitzii gen. nov., comb. nov. Int J Syst Evol Microbiol. 2005;52:2141–6.

70. Sakamoto M, Benno Y. Reclassification of Bacteroides distasonis, Bacteroides goldsteinii and Bacteroides merdae as Parabacteroides distasonis gen. nov., comb. nov., Parabacteroides goldsteinii comb. nov and Parabacteroides merdae comb. nov. Int J Syst Evol Microbiol. 2006;56:1599–605.

71. Hayashi H, Shibata K, Sakamoto M, Tomita S, Benno Y. Prevotella copri sp. nov. and Prevotella stercorea sp. nov., isolated from human faeces. Int J Syst Evol Microbiol. 2007;57:941–6.

72. Duncan SH, Hold GL, Barcenilla A, Stewart CS, Flint HJ. Roseburia intestinalis sp. nov., a novel saccharolytic, butyrate-producing bacterium from human faeces. Int J Syst Evol Microbiol. 2002;52:1615–20.

73. Moore WEC, Cato EP, Holdeman LV Ruminococcus bromii sp. n. and Emendation of the Description of Ruminococcus Sijpestein. 1972;22:19–21.

## Acknowledgements

We thank Lieve Vanmellaert for supplying the E. coli strains. This project was supported by funding from the Research Foundation—Flanders (grant no. G0I0918N) and from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program under grant agreement no. 801747.

## Author information

Authors

### Contributions

KF designed the study, CV and AB performed experiments and CJ analyzed flow cytometry data. VP and GH gave advice on flow cytometry and cultivation of gut bacteria, respectively. CV and KF wrote the manuscript, and all authors discussed the results.

### Corresponding author

Correspondence to Karoline Faust.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

van de Velde, C.C., Joseph, C., Biclot, A. et al. Fast quantification of gut bacterial species in cocultures using flow cytometry and supervised classification. ISME COMMUN. 2, 40 (2022). https://doi.org/10.1038/s43705-022-00123-6

• Revised:

• Accepted:

• Published:

• DOI: https://doi.org/10.1038/s43705-022-00123-6