The correlation between expression profiles measured in single cells and in traditional bulk samples

Dzamba, David; Valihrach, Lukas; Kubista, Mikael; Anderova, Miroslava

doi:10.1038/srep37022

Download PDF

Article
Open access
Published: 16 November 2016

The correlation between expression profiles measured in single cells and in traditional bulk samples

David Dzamba^1,2^na1,
Lukas Valihrach³^na1,
Mikael Kubista³^na1 &
…
Miroslava Anderova^1,2^na1

Scientific Reports volume 6, Article number: 37022 (2016) Cite this article

3215 Accesses
7 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Reverse transcription quantitative PCR (RT-qPCR) is already an established tool for mRNA detection and quantification. Since recently, this technique has been successfully employed for gene expression analyses, and also in individual cells (single cell RT-qPCR). Although the advantages of single cell measurements have been proven several times, a study correlating the expression measured on single cells, and in bulk samples consisting of a large number of cells, has been missing. Here, we collected a large data set to explore the relation between gene expression measured in single cells and in bulk samples, reflected by qPCR Cq values. We measured the expression of 95 genes in 12 bulk samples, each containing thousands of astrocytes, and also in 693 individual astrocytes. Combining the data, we described the relation between Cq values measured in bulk samples with either the percentage of the single cells that express the given genes, or the average expression of the genes across the single cells. We show that data obtained with single cell RT-qPCR are fully consistent with measurements in bulk samples. Our results further provide a base for quality control in single cell expression profiling, and bring new insights into the biological process of cellular expression.

Bayesian inference of gene expression states from single-cell RNA-seq data

Article 29 April 2021

Jérémie Breda, Mihaela Zavolan & Erik van Nimwegen

Scaling up reproducible research for single-cell transcriptomics using MetaNeighbor

Article 07 July 2021

Stephan Fischer, Megan Crow, … Jesse Gillis

Accurate estimation of cell composition in bulk expression through robust integration of single-cell information

Article Open access 24 April 2020

Brandon Jew, Marcus Alvarez, … Eran Halperin

Introduction

Single cell analysis methods have been recently shown to gain an increasing importance when biological complexity is studied¹. The unique position of these techniques was highlighted when the single cell RNA/DNA sequencing was awarded as the method of the year 2013 in Nature Methods². Although, there is no doubt that single cell sequencing is revolutionizing our capacity to understand and describe biological diversity³, there are still some limits preventing its wide exploitation⁴. In comparison to that, the reverse transcription quantitative polymerase chain reaction, RT-qPCR, is a routine and cost-effective method for precise and accurate mRNA analysis^5,6. For its broad application and high level of standardization, RT-qPCR is described as a “gold” standard method for mRNA quantification, often used to validate the results achieved by other techniques⁷. The advantages of RT-qPCR are its high sensitivity, specificity and excellent reproducibility.

Traditionally, gene expression has been studied in samples that may be composed of many different cell types. The measured expression is then a combined response of all the cells which may mask the possible response of a particular cell type. Specific fluorescent labeling allows cells of a certain type to be purified and analyzed. If analyzed in bulk, the measured results will still be the average of all the cells present⁸ and variations due to fluctuations, local environment, stimuli and other effects may go unnoticed. To study these variations, single cell expression profiling was introduced⁹. Single cell RT-qPCR allows the identification and characterization of rare cells, exploration of population heterogeneity and the finding of correlations between gene expressions on the cellular level^10,11,12. Expression profiling of single cells revealed that gene expression occurs in burst^13,14 resulting in the existence of small population of cells having high expression and large population expressing very small number of transcripts. When the histogram showing the distribution of transcripts of a certain gene is transformed into the log-scale, it can be fitted by Gaussian curve describing the normal distribution⁹.

Expression profiling in single cells by RT-qPCR are challenging measurements¹⁵, and concerns about the conclusions made have occasionally been raised. To address some of these concerns, we have performed an extensive comparison of gene expression measured in single astrocytes during ageing, with the gene expression measured in classical bulk samples containing thousands of astrocytes. We show that the single cell data, when collected and pre-processed appropriately, are fully consistent with measurements in bulk samples. This means that the manipulation of single cells, which involves collection of cells by fluorescence-activated cell sorter (FACS), the RT-qPCR preamplification workflow, data analysis, data pretreatment and handling of missing data, have a negligible influence on the gene expression in the individual cells. Based on this comparison, we design a quality control assessment and validation scheme of the measured data, which provides a new valuable dimension to the interpretation of single cell expression data.

Results and Discussion

In this study we measured the expression of 95 genes in cortical astrocytes isolated from 1-, 3–4-, 10- and 20–23-month-old GFAP/EGFP mice (three at each time point: 1 M.1-3, 3–4 M.1-3, 10 M.1-3 and 20–23 M.1-3), in which the expression of enhanced green fluorescent protein (EGFP) is controlled by the human promoter for glial fibrillary acidic protein (GFAP)¹⁶. From each mouse, single astrocytes and bulk astrocyte samples were collected and their expression profiles were measured. In total, the final data set contained over 66 000 data points (12 bulk samples + 693 single cells multiplied by 95 genes). For the list of measured genes and assay primer sequences, see Supplementary Table S1.

The relation between single cell and bulk gene expression data

The aim of this study was to describe the relation between gene expression measured in single cells and in bulk samples, using RT-qPCR. We modelled this relation using two different approaches. In both we considered only the data obtained with intron spanning primers, which suppresses background due to the genomic DNA present. In the first approach, for each gene we compared the expression in bulk samples expressed in Cq values, with the fraction of single cells that have detectable expression of these genes. This relation is shown in Fig. 1A, where each data point represents one out of the 70 genes measured in one out of the twelve mice (70 × 12 = 840 in total). The data were fitted with the sigmoidal function:

where x represents the bulk Cq value, y represents the fraction of genes expressing single cells, Cq50 is the bulk Cq value that corresponds to gene expression in 50% of the cells, and the slope is the derivative of the sigmoidal curve at Cq50. For the astrocytes in our experiment Cq50 was 14.85 cycles and the slope was −1.12 percent per cycle.

In the second approach we compared the bulk Cq values with the average of the single cell gene expression, expressed as 28 – Cq (as 28 cycles is considered the detection limit corresponding to a single template molecule, or less for samples analyzed on the Biomark platform; negative values were replaced with 0). To reduce bias introduced when calculating average Cq values for the transcripts in the single cells due to missing data, these were replaced by the largest Cq measured for the particular gene in a single cell + 2¹⁵. This relation is expected to be linear and indeed it is for the high and medium expressed genes (Fig. 1B). Transcripts from very low expressed genes were not detected in any of the single cells, even though they were found to be expressed in bulk samples. This is due to the relatively small number of single cells characterized in each measurement (tens of single cells) compared to thousands of cells in the bulk samples. Consequently, low level transcripts may then go undetected in single cells, because transcripts are not present in any of the cells analyzed. Noteworthy, the expression profiles of single cells and bulk samples in Fig. 1A,B were obtained using different RNA extraction protocols (the direct lysis for single cells vs. column-based extraction protocol for bulk samples). However, as was shown these protocols differ only in extraction efficiency, keeping the expression profiles correlated¹⁷. Therefore the only effect of different extraction protocols is a scale factor, which could offset the data but does not affect conclusions.

The observed relations show that single cell profiling data are fully consistent with those measured in bulk samples. Therefore, the additional information obtained from the single cell profiling about the heterogeneity among the cells should be biologically relevant. The only limiting aspect of single cell profiling is that very low expression, that is detectable in bulk samples, may go unnoticed when profiling a limited number of single cells. This is, however, not an artifact; rather it is a consequence of the small number of cells typically analyzed in single cell profiling studies, related to the highly skewed distribution of transcripts among individual cells due to transcriptional bursting, i.e. the large population of cells without or with very low transcription activity vs. a few cells exhibiting high expression¹⁸, and inherent limited sensitivity of the single cell RT-qPCR^19,20.

Requirements for high quality single cell RT-qPCR results

Even when analyzing single cells, the genomic DNA (gDNA), that is usually present in only two copies, can give rise to false positive results and compromise quantification. Conventional samples can be treated with DNase to remove gDNA²¹, but this is hard to include in a single cell workflow where the volume should be kept low, RNA concentration high, and washing steps avoided. The gDNA background can be measured and corrected by performing RT-minus controls, but this is not possible in single cell measurements because of the very small amount of material available. The general recommendation is to design the assays with the forward and reverse primers spanning an intron or to cross intron/exon junctions, which effectively eliminates amplification of the genomic copy²². However, not every gene has an intron and some have too short introns for this strategy to be efficient. In addition, many genes have pseudogenes that lack introns and are amplified²³.

Some of the assays we designed had primer sets that do not span introns and are expected to amplify any gDNA present. These assays are indicated with red crosses in Fig. 1A. Assays without intron spanning primes are clearly off the fitted models. If considered, the fraction of positive cells would be seriously overestimated based on comparison to the bulk samples that were treated with DNase (Fig. 1A). This comparison is therefore the most effective method to assess the quality of single cell profiling data. Another quality control test in single cell RT-qPCR is to perform classical RT minus control reactions of the entire cell. Assays that produce positive signals evidently amplify gDNA and give rise to outliers in the average single-cell expression vs. bulk Cq value graph (Fig. 1B). These assays should be omitted from analysis.

The sigmoidal fit enables normalization of the bulk Cq values similar to reference genes

To test the robustness of the sigmoidal fit of the percentage of positive single cells to the bulk Cq values, the data from each of the 12 mice were analyzed separately (data from mice 20–23 M.1-3 are shown in Fig. 2A). This time the Cq values of the bulk samples were not normalized to the expression of reference genes, which spreads the data along the x-axis. Interestingly, differences between the calculated Cq50 values corresponded with differences between Cq values of reference genes in these samples, which were used for normalization (see Supplementary Text S2). The normalization of bulk data performed using reference genes was therefore very similar to the normalization which would be performed according to the differences in calculated Cq50 values (for comparison of normalization shifts see Fig. 2B). However, this could only be achieved with the gene sets which also included highly expressed genes, since these are needed in order to perform the sigmoidal fit correctly. Nonetheless, matching of the normalization shifts calculated from expression of reference genes, and from Cq50 values, serve as an indirect validation of sigmoidal fitting.

Note also that normalization to the Cq50 value does not require any reference genes to be known and can therefore be used to normalize bulk samples in situations when genes with stable expression cannot be identified, for example, when comparing different cell or tissue types. Another possible application could be the normalization of single cell RNA-Seq data, when commonly used methods failed due to the large proportion of zero values²⁴. In such cases, an artificial bulk sample can be created by summing reads for each gene across pools of cells. Data in the log scale are plotted against the percentage of positive cells and fitted by the sigmoidal functions. Their parameters can be consequently used for normalization. However, the validity of this approach needs to be confirmed by other studies.

The distribution of transcripts across cells

The correlation between the expression of a particular gene measured in single cells, and its expression in corresponding bulk samples at different time points, can be shown in the plot of the fraction of positive single cells vs. bulk Cq values (Fig. 3). The fraction of positive cells and the Cq at each time point is indicated, including the standard error of the mean (SEM) with error bars. The sigmoidal fit of the fraction of positive cells vs. bulk Cq for all the genes is also shown. From this plot we can identify genes expressed at low level in a larger number of cells (positioned right from the sigmoidal fit, e.g. Fig. 3A) and genes expressed at high level in a smaller number of cells (positioned left from the sigmoidal fit, e.g. Fig. 3B,C), from cells with normal distribution of transcripts across cells as represented by the sigmoidal fit. This phenomenon cannot be seen when analyzing bulk samples, since only information about average expression can be extracted. The two-dimensional type of analysis also provides additional ways to identify significances that would go unnoticed in either bulk-based (p < 0.05 for 10 M vs. 20–23 M in Fig. 3A) or single cell (p < 0.05 for 1 M vs. 10 M in Fig. 3B) experiments only.

During the ageing process the fraction of cells expressing Grin3a decreases from about 12% at 1 M to 2% at 20–23 M, while Cq for the bulk samples increases from 18 to 19.2 cycles (Fig. 3A). Similar trend is seen for the highly expressed Gria2 gene (Fig. 3B). Hence, for Grin3a and Gria2, the lower overall expression with ageing observed for the bulk sample is due to reduction of the number of cells expressing these genes. Another example of the importance of the position of measured data relative to the sigmoidal fit is Pcna, for which is the situation opposite (Fig. 3C). There is hardly any change in its expression during ageing as judged from the bulk Cq, and it is expressed in about 4% of the astrocytes independent of age (i.e. 1–2 cells per animal). The average number of Pcna transcripts in these cells must be much higher compared to other genes with similar expression, since the Pcna data are offset by almost 2 Cq values to lower values relative to the sigmoid fit. The total amount of Pcna transcripts in the 4% of the astrocytes that express Pcna, corresponds to the amount of transcripts typically found for other genes when expressed in 25% of the cells. This can be rationalized as the Pcna gene codes the proliferating cell nuclear antigen, which has high expression but only in actively proliferating cells²⁵. Gene expression of Pcna in a very small fraction of astrocytes has been reported previously²⁶.

Figure 3D,E show examples of genes with low expression, where single cell data in several cases fail to detect expression, while in the bulk samples expression of these genes is clearly seen to decrease during ageing. This provides evidence of the much higher sensitivity of bulk measurements compared to single cell profiling, which is particularly relevant for low expressed genes.

The heterogeneity of GFAP/EGFP glial cells during aging

To explore the heterogeneity of GFAP/EGFP glial cells during aging, the complete data set (all single cells 1–23 M) was subjected to multivariate analysis. The principal component analysis (PCA) divided the cells into two groups, G1 and G2 (Fig. 4A), based on the difference in the gene expression - significant differences were observed in the expression of Gria2, Gjb6, Aldh1l1 and Grm3 genes (Fig. 4B). Intriguingly, the division into two clusters was influenced by the age of the animals the cells were isolated from – 16.3% and 50.7% of cells in G1 group in 1 M and 20–23 M cells, respectively (Fig. 4C). A more detailed analysis divided by the age groups is provided in the Supplementary Text S3 and the results show that the division of the cells into groups G1 and G2 (and corresponding gene expression profiles) were stable in all age groups. This analysis thus shows that GFAP/EGFP glial cells do not form a homogeneous glial cell group but part of these cells shows higher expression of genes coding glutamate receptors GluA2 and mGluR3 (Gria2 and Grm3), gap beta-6 junction protein (Gjb6) and 10-formyltetrahydrofolate dehydrogenase (Aldh1l1) and these cells are more numerous in younger animals.

The largest impact on the division into G1 and G2 groups had Gria2 gene (Fig. 4B), as this gene was highly expressed in G2 group and its expression in G1 group was at the limit of detection. Gria2 gene is a gene coding GluA2 subunit of AMPA receptors, which are crucial receptors in glutamate signaling pathway. Presence of this subunit markedly diminishes Ca²⁺ permeability of AMPA receptors and for example cerebellar Bergmann glial cells were shown to lack this subunit^27,28. In fact, this subunit is present in the most of the GFAP/EGFP glial cells and unlike other glutamate receptor subunits, its expression doesn’t even significantly change after middle cerebral artery occlusion (MCAO) model of ischemic injury²⁹. In the mouse cortical region it is present also in neurons, oligodendrocyte progenitor cells (NG2-glia) and in newly formed oligodendrocytes²⁶. The fact that there are two distinct subpopulations of GFAP/EGFP glial cells, which differ mainly in the presence or absence of Gria2 gene expression, raises a question of functional consequences for these cells. From all the cells we analyzed (693 cells), 68% (471 cells) of them expressed at least one AMPA receptor subunit (Gria1–4) and the vast majority of these cells (96.5%, 455 cells) expressed Gria2 gene. We can thus assume that in general, cells which did not express Gria2 gene didn’t possess AMPA receptors and were thus not able to fully participate in glutamate signaling. Even though glutamate signaling in glial cells is still not completely elucidated, we hypothesize, that GFAP/EGFP glial cells, which possess AMPA receptors, express also Gria2 gene to ensure limited Ca²⁺ influx and providing thus protection against glutamate excitotoxicity. Nonetheless, the validity of this conclusion needs to be confirmed by functional studies.

Conclusions

The study has shown that single cell expression profiles measured with RT-qPCR, are fully consistent with the expression measurements in bulk samples, when the background due to gDNA is considered. Low level transcripts may go undetected in single cell studies, because of the small number of cells typically analyzed in an experiment, such that those transcripts are not present in any of the cells analyzed. Reliable measurement of the expression of such low expressed genes requires bulk samples. Finally, the combination of a single cell and bulk approach offers the possibility to validate the measured data and provides valuable biological insight.

Methods

Sample collection

All experiments were performed on cells from acutely isolated brains of GFAP/EGFP transgenic mice [line designation TgN(GFAPEGFP)]¹⁶. All procedures involving the use of laboratory animals were performed in accordance with the European Communities Council Directive 24 November 1986 (86/609/EEC) and animal care guidelines approved by the Institute of Experimental Medicine, Academy of Sciences of the Czech Republic (Animal Care Committee on April 17, 2009; approval number 036/2012).

The mice were anaesthetized, and cerebral cortices were removed and used for preparation of cell suspension using a papain dissociation kit (Worthington, NJ, USA). The astrocytes were collected using FACS (BD Influx, CA, USA), based on their EGFP fluorescence. Initially, cells were sorted into 96-well plates, each well containing 5 μl of nuclease-free water with bovine serum albumin (BSA, 1 mg/ml, Thermo Fisher Scientific, MA, USA). One cell was sorted into each well. The collection medium prevented unspecific RNA binding to plastic surface, enhanced reverse transcription efficiency and supported RNA stability¹⁷. Noteworthy, direct cell lysis with BSA was also proved to be superior to standard column based extraction methods¹⁷. After collecting 2–3 plates of single cells, remaining cells were collected as a bulk sample into an Eppendorf tube containing 500 μl of RLT-buffer with β-mercaptoethanol (RNeasy Micro Kit, Qiagen, Germany). Samples were immediately frozen to −80 °C and stored until analysis. The complete procedure has been published previously³⁰. The detailed description can be found as Supplementary Text S4.

The cells were collected from a total of twelve 1–23-month-old GFAP/EGFP mice, with three at each time-point: 1-month-old (1 M.1-3), 3-4-month-old (3–4 M.1-3), 10-month-old (10 M.1-3) and 20–23-month-old mice (20–23 M.1-3). The number of cells collected and analyzed from each mouse is shown in Table 1.

Table 1 Number of cells analyzed in each mouse (1 M.1–23 M.3).

Full size table

RT-qPCR and data analysis

RNA from the bulk samples was extracted using RNeasy Micro Kit (Qiagen) according to the manufacturer’s instructions, which included DNase treatment. Four μl of RNA were reverse transcribed in two 10-μl reactions into cDNA using the standard protocol of SuperScript III Reverse Transcriptase (Life Technologies, CA, USA). cDNA was diluted 2-times and 4 μl was used for pre-amplification. Pre-amplification was performed in two 40-μl reaction mixes, each containing 48 pairs of primers (for sequences and distribution into the two mixes, see Supplementary Table S1). The two pools of pre-amplified cDNA were mixed, diluted 5-times, and 2 μl was used for qPCR performed in the Biomark high throughput qPCR instrument (Fluidigm, CA, USA). The expression of 95 genes was measured in each sample.

RNA extraction from single cells was not needed, because the RNA was already released during the collection stage (astrocytes are lysed by the osmotic pressure created in 5 μl nuclease free water, supplemented with 1 mg/ml BSA). RNA was reverse transcribed into cDNA. Four μl of non-diluted cDNA was pre-amplified using the same experimental set-up as for the bulk samples. Two pools of pre-amplified cDNA were mixed, diluted 2.5-times, and analyzed in the Biomark high throughput qPCR instrument. Detailed protocols describing reverse transcription, pre-amplification and qPCR have been published previously³⁰. In addition, all protocols and information can be found as Supplementary Text S4.

RT-qPCR data were processed and analyzed with GenEx software (Ver. 6.0.1.612, MultiD). Reactions generating melting curves with deviating melting temperature (Tm) or aberrant melt profiles were considered negative. Samples showing no sign of any of the tested genes were excluded from analysis. Bulk data from different mice were normalized by the average of the expression in logarithmic scale of the most stable reference genes³¹, as identified by the NormFinder algorithm³² (see Supplementary Text S2). The normalization was performed by a positive or negative shift of Cq values, according to the differences observed in reference genes, so the average of all Cq values did not change. The normalized data are therefore presented as Cq values and not as ΔCq.

Multivariate analysis were performed in GenEx software (Ver. 6.0.1.612, MultiD). All missing data, for each gene separately, were replaced with the highest Cq +2. The Cq data were, for each gene separately, converted into relative quantities expressed relative to the sample with the lowest expression (maximum Cq) and transformed into a logarithmic scale with base 2. Data was further mean-centered and analyzed by PCA, dendograms and Kohonen self-organizing maps (SOM).

Statistics

The fitting of the sigmoidal function was performed utilizing the Total Least Squares method, and linear fit was performed using Deming regression. Significant differences between data obtained from mice of different ages were identified by Multivariate Analysis of Variance (MANOVA) using R software²⁹, Mann-Whitney test using GenEx software (Ver. 6.0.1.612, MultiD) and Chi-squared test using SAS software (Ver. 9.2).

Additional Information

How to cite this article: Dzamba, D. et al. The correlation between expression profiles measured in single cells and in traditional bulk samples. Sci. Rep. 6, 37022; doi: 10.1038/srep37022 (2016).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Souza, N. de. Single-cell methods. Nat. Methods 9, 35 (2012).
Article Google Scholar
Method of the Year 2013. Nat. Methods 11, 2014 (2014).
Macaulay, I. C. & Voet, T. Single Cell Genomics: Advances and Future Perspectives. PLoS Genet. 10 (2014).
Saliba, A. E., Westermann, A. J., Gorski, S. A. & Vogel, J. Single-cell RNA-seq: Advances and future challenges. Nucleic Acids Res. 42, 8845–8860 (2014).
Article CAS Google Scholar
Wong, M. L. & Medrano, J. F. Real-time PCR for mRNA quantification. Biotechniques 39, 75–85 (2005).
Article CAS Google Scholar
Costa, C., Giménez-Capitán, A., Karachaliou, N. & Rosell, R. Comprehensive molecular screening: from the RT-PCR to the RNA-seq. Transl. lung cancer Res. 2, 87–91 (2013).
CAS PubMed PubMed Central Google Scholar
Nolan, T., Hands, R. E. & Bustin, S. a. Quantification of mRNA using real-time RT-PCR. Nat. Protoc. 1, 1559–1582 (2006).
Article CAS Google Scholar
Wills, Q. F. et al. Single-cell gene expression analysis reveals genetic associations masked in whole-tissue experiments. Nat. Biotechnol. 31, 748–752 (2013).
Article CAS Google Scholar
Bengtsson, M., Ståhlberg, A., Rorsman, P. & Kubista, M. Gene expression profiling in single cells from the pancreatic islets of Langerhans reveals lognormal distribution of mRNA levels. Genome Res. 15, 1388–1392 (2005).
Article CAS Google Scholar
Ståhlberg, A. et al. Defining cell populations with single-cell gene expression profiling: correlations and identification of astrocyte subpopulations. Nucleic Acids Res. 39, e24 (2011).
Article Google Scholar
Ståhlberg, A., Rusnakova, V. & Kubista, M. The added value of single-cell gene expression profiling. Brief. Funct. Genomics 12, 81–89 (2013).
Article Google Scholar
Ståhlberg, A. & Kubista, M. The workflow of single-cell expression profiling using quantitative real-time PCR. Expert Rev. Mol. Diagn. 14, 323–331 (2014).
Article Google Scholar
Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic gene expression in a single cell. Science 297, 1183–1186 (2002).
Article CAS ADS Google Scholar
Cai, L., Friedman, N. & Xie, X. S. Stochastic protein expression in individual cells at the single molecule level. Nature 440, 358–362 (2006).
Article CAS ADS Google Scholar
Ståhlberg, A., Rusnakova, V., Forootan, A., Anderova, M. & Kubista, M. RT-qPCR work-flow for single-cell data analysis. Methods 59, 80–88 (2013).
Article Google Scholar
Nolte, C. et al. GFAP promoter-controlled EGFP-expressing transgenic mice: a tool to visualize astrocytes and astrogliosis in living brain tissue. Glia 33, 72–86 (2001).
Article CAS Google Scholar
Svec, D. et al. Direct cell lysis for single-cell gene expression profiling. Front. Oncol. 3, 274 (2013).
Article Google Scholar
Bengtsson, M., Hemberg, M., Rorsman, P. & Ståhlberg, A. Quantification of mRNA in single cells and modelling of RT-qPCR induced noise. BMC Mol. Biol. 9, 63 (2008).
Article Google Scholar
Levesque-Sergerie, J.-P., Duquette, M., Thibault, C., Delbecchi, L. & Bissonnette, N. Detection limits of several commercial reverse transcriptase enzymes: impact on the low- and high-abundance transcript levels assessed by quantitative RT-PCR. BMC Mol. Biol. 8, 93 (2007).
Article Google Scholar
Okello, J. B. A. et al. Quantitative assessment of the sensitivity of various commercial reverse transcriptases based on armored HIV RNA. PLoS One 5, (2010).
Derveaux, S., Vandesompele, J. & Hellemans, J. How to do successful gene expression analysis using real-time PCR. Methods 50, 227–230 (2010).
Article CAS Google Scholar
Ståhlberg, A. & Bengtsson, M. Single-cell gene expression profiling using reverse transcription quantitative real-time PCR. Methods 50, 282–288 (2010).
Article Google Scholar
Pink, R. C. et al. Pseudogenes: Pseudo-functional or key regulators in health and disease? RNA 17, 792–798 (2011).
Article CAS Google Scholar
L. Lun, A. T., Bach, K. & Marioni, J. C. Pooling across cells to normalize single-cell RNA sequencing data with many zero counts. Genome Biol. 17, 75 (2016).
Article Google Scholar
Kumar, D., Minocha, N., Rajanala, K. & Saha, S. The distribution pattern of proliferating cell nuclear antigen in the nuclei of Leishmania donovani. Microbiology 155, 3748–3757 (2009).
Article CAS Google Scholar
Zhang, Y. et al. An RNA-Sequencing Transcriptome and Splicing Database of Glia, Neurons, and Vascular Cells of the Cerebral Cortex. J. Neurosci. 34, 11929–11947 (2014).
Article CAS Google Scholar
Hollmann, M., Hartley, M. & Heinemann, S. Ca2+ permeability of KA-AMPA–gated glutamate receptor channels depends on subunit composition. Science 252, 851–853 (1991).
Article CAS ADS Google Scholar
Burnashev, N. et al. Calcium-permeable AMPA-kainate receptors in fusiform cerebellar glial cells. Science 256, 1566–1570 (1992).
Article CAS ADS Google Scholar
Dzamba, D. et al. Quantitative Analysis of Glutamate Receptors in Glial Cells from the Cortex of GFAP/EGFP Mice Following Ischemic Injury: Focus on NMDA Receptors. Cell. Mol. Neurobiol, doi: 10.1007/s10571-015-0212-8 (2015)
Rusnakova, V. et al. Heterogeneity of astrocytes: from development to injury - single cell gene expression. PLoS One 8, e69734 (2013).
Article CAS ADS Google Scholar
Vandesompele, J. et al. Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol. 3, RESEARCH0034 (2002).
Andersen, C. L., Ledet-Jensen, J. & Orntoft, T. Normalization of real-time quantitative RT-PCR data: a mode-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets. Cancer Res. 64, 5245–5250 (2004).
Article CAS Google Scholar

Download references

Acknowledgements

This study was supported by the Grant Agency of the Czech Republic (GACR P303/12/0855; P303/13/02154 S; P303/16/10214 S); by CZ.1.05/1.1.00/02.0109 BIOCEV provided by ERDF and MEYS; and by the project OPVK - Biotechnological expert in structural biology and gene expression (Reg. n.: CZ.1.07/2.3.00/30.0045). We would also like to thank Lukas Adam for help with data fitting.

Author information

Present address: Miroslava Anderova, Department of Cellular Neurophysiology, Institute of Experimental Medicine, AS CR, Videnska 1083, Prague 4, Czech Republic.

Authors and Affiliations

Department of Cellular Neurophysiology, Institute of Experimental Medicine, Academy of Sciences of the Czech Republic, Prague, Czech Republic
David Dzamba & Miroslava Anderova
2nd Faculty of Medicine, Charles University, Prague, Czech Republic
David Dzamba & Miroslava Anderova
Laboratory of Gene Expression, Institute of Biotechnology, Academy of Sciences of the Czech Republic, BIOCEV, Vestec, Czech Republic
Lukas Valihrach & Mikael Kubista

Authors

David Dzamba
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Valihrach
View author publications
You can also search for this author in PubMed Google Scholar
Mikael Kubista
View author publications
You can also search for this author in PubMed Google Scholar
Miroslava Anderova
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.D. participated in the preparation of samples for the RT-qPCR experiments, analyzed the data, performed the statistical analyses, prepared the figures and participated in writing of the manuscript. L.V. performed the RT-qPCR experiments and participated in writing of the manuscript. M.K. was involved in data analyses and participated in writing of the manuscript. M.A. was involved in designing of the experiments and participated in writing of the manuscript. All authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Dzamba, D., Valihrach, L., Kubista, M. et al. The correlation between expression profiles measured in single cells and in traditional bulk samples. Sci Rep 6, 37022 (2016). https://doi.org/10.1038/srep37022

Download citation

Received: 09 June 2016
Accepted: 24 October 2016
Published: 16 November 2016
DOI: https://doi.org/10.1038/srep37022

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.