Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data

Everaert, Celine; Luypaert, Manuel; Maag, Jesper L. V.; Cheng, Quek Xiu; Dinger, Marcel E.; Hellemans, Jan; Mestdagh, Pieter

doi:10.1038/s41598-017-01617-3

Download PDF

Article
Open access
Published: 08 May 2017

Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data

Celine Everaert^1,2,3,
Manuel Luypaert⁴,
Jesper L. V. Maag ORCID: orcid.org/0000-0002-2578-8872⁵,
Quek Xiu Cheng⁵,
Marcel E. Dinger ORCID: orcid.org/0000-0003-4423-934X⁵,
Jan Hellemans⁴ &
…
Pieter Mestdagh^1,2,3

Scientific Reports volume 7, Article number: 1559 (2017) Cite this article

26k Accesses
209 Citations
81 Altmetric
Metrics details

Subjects

Abstract

RNA-sequencing has become the gold standard for whole-transcriptome gene expression quantification. Multiple algorithms have been developed to derive gene counts from sequencing reads. While a number of benchmarking studies have been conducted, the question remains how individual methods perform at accurately quantifying gene expression levels from RNA-sequencing reads. We performed an independent benchmarking study using RNA-sequencing data from the well established MAQCA and MAQCB reference samples. RNA-sequencing reads were processed using five workflows (Tophat-HTSeq, Tophat-Cufflinks, STAR-HTSeq, Kallisto and Salmon) and resulting gene expression measurements were compared to expression data generated by wet-lab validated qPCR assays for all protein coding genes. All methods showed high gene expression correlations with qPCR data. When comparing gene expression fold changes between MAQCA and MAQCB samples, about 85% of the genes showed consistent results between RNA-sequencing and qPCR data. Of note, each method revealed a small but specific gene set with inconsistent expression measurements. A significant proportion of these method-specific inconsistent genes were reproducibly identified in independent datasets. These genes were typically smaller, had fewer exons, and were lower expressed compared to genes with consistent expression measurements. We propose that careful validation is warranted when evaluating RNA-seq based expression profiles for this specific gene set.

Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures

Article 02 October 2023

Xueyi Dong, Mei R. M. Du, … Matthew E. Ritchie

QuantSeq. 3′ Sequencing combined with Salmon provides a fast, reliable approach for high throughput RNA expression analysis

Article Open access 11 December 2019

Susan M. Corley, Niamh M. Troy, … Marc R. Wilkins

Variability in estimated gene expression among commonly used RNA-seq pipelines

Article Open access 17 February 2020

Sonali Arora, Siobhan S. Pattwell, … Hamid Bolouri

Introduction

Due to the drop in cost of massively parallel sequencing, RNA-sequencing (RNA-seq) has become a viable alternative to gene expression microarrays¹. Nowadays, RNA-seq is generally considered the gold standard for whole transcriptome gene expression quantification, not only in research but also for clinical applications. Compared to microarrays, RNA-seq has several major advantages. First, no prior knowledge about the content of the transcriptome is required, providing an unbiased view on the ensemble of transcripts in a sample and the possibility of evaluating allelic expression. Second, RNA-seq enables a much more detailed analysis of alternative splicing events. While certain microarray platforms can be used to study alternative splicing², this is typically limited to known isoforms and occurs at much lower resolution. Finally, RNA-seq gene expression measurements tend to cover a much broader dynamic range and can be more sensitive compared to microarrays^{3, 4}. Nevertheless, the field of RNA-seq still faces many challenges, especially in terms of data processing and analyses. In contrast to the microarray field, where data processing converged over the years into a well-defined set of broadly accepted workflows, the number of RNA-seq data processing workflows is still increasing, with none accepted as the standard so far. RNA-seq data processing workflows typically come in two different flavours. First, there are methods that align reads directly to a reference genome, followed by quantification of mapped reads (e.g. Tophat-Cufflinks⁵, Tophat-HTSeq^{6, 7} and STAR-HTSeq^{7, 8}). Secondly, there are the so-called pseudoalignment methods (e.g. Salmon⁹ and Kallisto¹⁰) that break up reads into k-mers before assigning them to transcripts. This results in a substantial gain in speed compared to the alignment based workflows. The workflows also differ in how they estimate expression abundance, with some enabling quantification on transcript level (i.e. Cufflinks, Salmon and Kallisto) while others are restricted to gene level quantification.

Studies benchmarking RNA-seq processing workflows typically rely on simulated RNA-seq datasets or RT-qPCR data for just a few hundred genes^10,11,12. Often, these studies focus their analysis on evaluating absolute quantification performance (i.e. gene expression correlation between RNA-seq and RT-qPCR data) without assessing relative quantification performance (i.e. differential gene expression correlation). Still, the latter is what most RNA-seq studies are aiming for. Recently, Teng and colleagues developed a series of performance parameters to evaluate RNA-seq quantification workflows¹³. Using both matching microarray data and simulated RNA-seq data, they concluded that the performance of the various workflows was comparable but poor.

Here, we compared RNA-sequencing data, processed using five workflows with expression data generated by wet-lab validated qPCR assays for 18 080 protein-coding genes. We decided to include workflows representative for the two major methodologies available today (i.e. pseudoalligment and alignment-based methods). For the alignment based methodologies, frequently used pipelines like Star/Tophat-HTSeq and Tophat-Cufflinks were selected whereas for the pseudo-alignment algorithms we included Salmon and Kallisto. The samples that were applied for this study are the well-characterized MAQC-I RNA-samples MAQCA (Universal Human Reference RNA, pool of 10 cell lines) and MAQCB (Human Brain Reference RNA)¹⁴. RT-qPCR is still considered the method of choice for validation of gene expression data obtained by high-throughput profiling platforms. We therefore reasoned that a transcriptome-wide RT-qPCR dataset would serve as a solid benchmark to assess the accuracy of the selected RNA-seq processing workflows. In addition, we provide an analysis framework that can be applied to other workflows not included in this study. While this is not the first study to compare RNA-seq data with transcriptome-wide qPCR data, the analyses presented here are more comprehensive compared to other studies.

Results

Aligning qPCR and RNA-seq datasets

Every assay included in the whole-transcriptome qPCR dataset detects a specific subset of transcripts that contribute proportionally to the gene-level Cq-value. In order to apply these as a benchmark for RNA-seq based gene expression values, we aligned transcripts detected by qPCR with transcripts considered for RNA-seq based gene expression quantification. For the transcript based workflows (Cufflinks, Kallisto and Salmon), we calculated the gene level TPM values by aggregating transcript-level TPM-values of those transcripts detected by the respective qPCR assays. For Tophat-HTSeq and Star-HTSeq, gene level counts were converted to gene-level TPM values. First, genes were filtered based on a minimal expression of 0.1 TPM in all samples and replicates, to avoid the bias for low expressed genes. This resulted in the selection of 13 045 and 13 309 genes for RNA-seq dataset 1 and 2 respectively. The mean expression across replicates was calculated and used for further analysis.

Expression correlation

To evaluate concordance in gene expression intensities between RNA-seq and qPCR, we first calculated expression correlation between normalized RT-qPCR Cq-values and log transformed RNA-seq expression values. Overall, high expression correlations were observed between RNA-seq and qPCR expression intensities for all workflows (Pearson correlation, Salmon R² = 0.845, Kallisto R² = 0.839, Tophat-Cufflinks R² = 0.798, Tophat-HTSeq R² = 0.827, Star-HTseq R² = 0.821) (Fig. 1, Supplemental Fig. 1a). Comparing expression values between Tophat-HTSeq and Star-HTSeq revealed almost identical results (R² = 0.994, Supplemental Fig. 1b) suggesting little impact of the mapping algorithm on quantification. We therefore decided to only consider Tophat-HTSeq for further analysis. In order to further study discrepancies in gene expression correlation, we first transformed TPM and normalized Cq-values to gene expression ranks (Supplemental Figs 1c and 2) and calculated the difference in rank between RNA-seq and qPCR. Outlier genes were defined as genes with an absolute rank difference of more than 5000 (further referred to as rank outlier genes) (Fig. 2A). The average number of rank outlier genes ranged from 407 (Salmon) to 591 (Tophat-HTSeq) and the majority of these had higher expression ranks in RNA-seq data (i.e. higher expressed in RNA-seq data), irrespective of the workflow. Rank outlier genes for MAQCA significantly overlapped with rank outlier genes for MAQCB for each of the workflows (Fig. 2B, Fisher Exact test, p < 1.10⁻¹⁰). Also between workflows, a significant overlap was observed (Fig. 2C and Supplemental Fig. 3, Super Exact Test, p values < 1.10⁻¹⁰). These observations were confirmed in both datasets (Supplemental Figs 4–6) and point to systematic discrepancies between quantification technologies (i.e. qPCR and RNA-seq) rather than workflows. Still, a number of workflow-specific rank outlier genes were identified (Fig. 2B). The rank outlier genes are characterized by a significantly lower RT-qPCR expression value (Fig. 2D, Kolmogorov-Smirnov, p < 1.10⁻¹⁰), explaining at least part of the observed rank difference. Similar results were obtained in the second dataset (Supplemental Fig. 6).

Fold change correlation

As RNA-sequencing and qPCR produce relative gene expression measures, comparing gene expression differences between samples is the most relevant approach to benchmark RNA-seq quantification workflows. To this end, we calculated gene expression fold changes between MAQCA and MAQCB and evaluated fold change correlations between RNA-seq and qPCR. High fold change correlations were observed for each workflow (Fig. 3 and Supplemental Fig. 1d, Pearson, Salmon R² = 0.929, Kallisto R² = 0.930, Tophat-Cufflinks R² = 0.927, Tophat-HTSeq R² = 0.934, Star-HTseq R² = 0.933) suggesting an overall high concordance between RNA-seq and qPCR with nearly identical performance for the individual workflows. As for the expression ranks, the fold changes obtained with Tophat-HTSeq and Star-HTSeq were highly identical (Supplemental Fig. 2f, R² = 0.996), suggesting that the mapping algorithm does not effect fold change calculations between samples.

To quantify potential discrepancies between RNA-seq and qPCR, genes were divided into four groups based on their differential expression (log fold change > 1) between MAQCA and MAQCB (Fig. 4A). The first two groups consist of genes for which both methods agree on the differential expression status (i.e. differentially expressed or not differentially expressed). These genes are further referred to as concordant genes. The third and fourth group consist of genes for which both methods disagree on the differential expression status (i.e. differentially expressed by only one method or differentially expressed by both methods but with opposite direction). These genes are collectively referred to as non-concordant genes. The fraction of non-concordant genes ranged from 15.1% (Tophat-HTSeq) to 19.4% (Salmon) and was consistently lower for the alignment-based algorithms compared to the pseudoaligners (Fig. 4B). While the non-concordant fraction appears large, it mainly consists of genes for which the difference in log fold change between methods (ΔFC) is relatively low. For instance, over 66% of all genes in the non-concordant fraction have a ΔFC < 1 and 93% have a ∆FC < 2, irrespective of the workflow (Supplemental Fig. 7). We therefore defined a fifth group of genes with ΔFC > 2. These genes represent between 7.1% (Tophat-HTSeq) and 8% (Tophat-Cufflinks) of the entire non-concordant fraction (Fig. 4B) and, together with the genes that have differential expression going in opposite directions, we considered as truly deviating between RNA-seq and qPCR. When evaluating the expression levels of the various fractions of non-concordant genes, it’s clear that the non-concordant genes with ΔFC > 2 and non-concordant opposite direction genes are primarily expressed at low levels (i.e. first expression quartile, Fig. 4B and Supplemental Fig. 8). In contrast, non-concordant genes with ΔFC < 2 are equally distributed across expression quartiles (Fig. 4B). An overview of all non-concordant genes is available in Supplemental Table 2.

To evaluate the extent to which the non-concordant genes are workflow-specific, we assessed the overlap of non-concordant genes between workflows (Fig. 5A and Supplemental Fig. 9). While a significant number of genes are shared between all workflows, several genes were identified that are specific to one workflow or a group of workflow (i.e. alignment based and pseudoaligners). Whereas the former points to systematic discrepancies between quantification technologies (i.e. qPCR and RNA-seq), the latter points to differences between individual workflows or groups of workflows. The number of workflow-specific, non-concordant genes with ΔFC > 2 ranged from 5 (Kallisto) to 55 (Tophat-HTSeq). These are genes where the workflow fails to reproduce the differential expression (observed by qPCR and all other workflows) or genes for which the workflow observes differential expression that is not confirmed by qPCR or any of the other workflows. Examples of workflow-specific non-concordant genes with ΔFC > 2 are shown in Fig. 5B. LRRC74B and HNRNPA1L2 are differentially expressed according to Salmon and Tophat-HTSeq respectively, but are non-differential according to the other workflows and RT-qPCR. Conversely, AUNIP and MYBPC2 are non-differential according to Tophat-Cufflinks and Kallisto respectively, but differential according to RT-qPCR and the other workflows. When grouping workflows, we identified 70 non-concordant genes with ΔFC > 2 specific for pseudoalignment algorithms and 62 non-concordant genes with ΔFC > 2 specific for mapping algorithms. Similar results were obtained in the second dataset (Supplemental Figs 10–12).

To verify whether these genes were consistent between independent RNA-seq datasets, we compared results between dataset 1 and 2. Workflow-specific genes were found to be significantly overlapping between both datasets (Fig. 5C). This was especially the case for Tophat-Cufflinks and Tophat-HTSeq specific genes. Also genes specific for pseudoalignment algorithms and mapping algorithms were significantly overlapping between dataset 1 and 2 (Fig. 5B). These results suggest that each workflow (or group of workflows) consistently fails to accurately quantify a small subset of genes, at least in the samples considered for this study.

Features of non-concordant genes

In order to evaluate why accurate quantification of specific genes failed, we computed various features including GC-content, gene length, number of exons, and number of paralogs. These features were determined for concordant and non-concordant genes and compared between both groups (Fig. 6). Non-concordant genes specific for pseudoalignment algorithms and mapping algorithms were significantly smaller (Wilcoxon: p < 0.001, Kolmogorov-Smirnov: p < 0.001) and had fewer exons (Wilcoxon: p < 0.003, Kolmogorov-Smirnov: p < 0.001) compared to concordant genes. No significant difference in GC-content or number of paralogs was observed. Besides evaluating gene characteristics, we also assessed the number of poor quality reads (below Q20) and multi-mapping reads. The number of poor quality and multi-mapping reads was higher for non-concordant compared to concordant genes. This was observed for both pseudoalignment (Chi-square: p < 2.2e-16; relative risk poor quality = 1.12, multi-mapping = 1.071) and mapping workflows (Chi-square: p < 2.2e-16; relative risk poor quality = 1.073, multi-mapping = 1.075).

Discussion

Based on a unique dataset of RT-qPCR expression measurements for 18 080 protein-coding genes, we evaluated the performance of five RNA-seq processing workflows, including both alignment based and pseudoalignment algorithms. Of note, RNA-seq workflows not included in this study may perform differently than those selected here. We decided to run each workflow using the default analysis parameters as we reasoned that this is likely what most users do. Nevertheless, adjusting or fine-tuning these parameters might further improve performance of individual algorithms. Algorithm performance may also depend on the RNA-seq library prep method. Here, we used stranded polyA+ libraries sequenced in paired-end mode. Performance may differ when evaluating unstranded libraries, total RNA libraries or single end reads. Moreover, the annotation of the reference transcriptome could also influence quantification results. RT-qPCR assays may for instance also detect transcripts not included in the reference annotation and hence not taken into account by the RNA-seq processing workflows. This could result in an underestimation of the TPM values with respect to Cq-values obtained by qPCR. However, the expression correlation plots indicate that more genes show the opposite pattern and have a higher expression when quantified by RNA-seq as compared to RT-qPCR (Fig. 1). This may, in part, be explained by differences in amplification efficiency. Another possible explanation is that for this benchmark a transcriptome, filtered for transcripts detected by the qPCR assays, was used. Reads mapping to shared exons from transcripts not detected by the qPCR assay are therefore expected to increasing the quantification values for the RNA-seq workflows. Using a pre-filtered transcriptome indeed results in higher gene-level TPM-values for a small subset of genes compared to a non-filtered transcriptome, where gene-level TPM-values were generated by summing transcript-level TPM-values of transcripts detected by the qPCR assays (Supplemental Fig. 13). Fold changes between samples were largely unaffected. Taken together, the use of an extensive or non-filtered annotation will result in more reliable quantification. For the HTSeq workflow, post-quantification filtering is not possible, resulting in a lower correlation with RT-qPCR data. Of note, this phenomenon is due to the transcript specificity of the RT-qPCR assay designs and not to the quantification workflow itself. Another caveat of using a filtered transcriptome is that increased TPM-values of some genes will result in decreased TPM-values of others given the relative nature of this measure. However, this should not affect any of the analysis where differences between samples are compared.

For the comparison between RNA-seq and RT-qPCR, we focussed our analysis on differential gene expression correlations as these are conceptually more relevant and more closely resemble the main application of RNA-seq. We deliberately avoided introducing differential gene expression algorithms like DESeq¹⁵, edgeR¹⁶ or LimmaVoom¹⁷ as these may further influence the results and prevent us from assessing workflow differences at the level of gene expression quantification. Instead, differential gene expression was assessed by means of fold change correlations directly derived from TPM values. From these analyses, we concluded that the choice of mapper hardly affects results and that, in general, there is a high concordance between RT-qPCR and RNA-seq for each of the RNA-seq processing workflows. This is exemplified by the high number of genes (80–85%) for which a concordant (differential or non-differential) gene expression was observed. These conclusions are in contrast to those published by Teng et al. who reported a poor performance of RNA-seq processing algorithms when evaluating differential gene expression¹³. This may be due to the fact that conclusions in this study were partially based on simulated data. Performance was indeed higher when, in the same study, microarray data was used to benchmark the RNA-seq results.

As the non-concordant genes in our study were mostly borderline, we defined a set of severely non-concordant genes for which fold changes differed substantially between RNA-seq and RT-qPCR. These genes represented on average 1.8% of the total number of genes considered (n = 13 045) and were reproducibly identified between datasets. This implicates that both alignment and pseudoalignment algorithms have problems with a limited but specific set of genes. These genes were typically lower expressed, smaller and had fewer exons, confirming findings from a recent study reporting on problematic genes in RNA-seq data¹⁸. In addition, the reads mapping to non-concordant genes had lower quality and mapped more often to multiple regions. Although the effects of the individual features (i.e. transcript length, number of exons and read quality) are small, combinations of these features may better explain the non-concordance of individual genes. However, additional features that were not assessed here may also contribute. Whether the same genes will pose problems in samples other than those assessed in this study requires further examination. Finally, we cannot exclude the possibility that the 18080 protein-coding genes considered here have features that favour accurate quantification by RNA-seq, compared to genes not included in this study. For these genes, primer design is likely to be hampered by specificity issues. Such genes would also be more challenging to analyse using RNA-seq. Therefore, it remains to be determined to what extent our findings can be extrapolated to all genes (i.e. protein coding genes not included in the study and long non-coding RNAs).

Conclusion

All workflows show a good concordance with RT-qPCR expression measurements and no workflow outperforms the others. Of note, each workflow revealed a small but specific set of genes with inconsistent expression measurements, reproducibly identified in independent datasets. These genes were typically smaller, had fewer exons and were lower expressed compared to genes with consistent expression measurements. Careful validation is warranted when evaluating RNA-seq based expression profiles for this specific set of genes.

Methods

Samples

For this benchmark we used the well-characterized MAQC-I RNA-samples MAQCA (Universal Human Reference RNA, Agilent Technologies,) and MAQCB (Human Brain Reference RNA, Thermo Fisher Scientific)¹⁴. For both samples, RNA-sequencing was performed.

RT-qPCR

RT-qPCR data for 18080 protein-coding genes were generated in the context of the Sequencing Quality Control study (SEQC) (17) using PrimePCR assays (BioRad) (Supplemental Table 1). In order to define the ensemble of transcripts amplified by every individual qPCR assay, assays were re-mapped on the reference transcriptome (ensembl v75). Genes with a Cq-value between 11 and 32 were considered for further analysis. Cq-values were normalized using the global mean normalization strategy¹⁹.

RNA-Seq

For the first RNA-seq dataset (GSE83402), we generated replicate libraries for MAQCA and MAQCB using the stranded TruSeq mRNA library prep kit (Illumina) with 100 ng input RNA according to the manufacturer’s instructions. Libraries were sequenced on a NextSeq 500 (Illumina), generating paired-end 75 bp reads, with a mean of 50 M reads per sample. A second, independent RNA-seq dataset for MAQCA and MAQCB was obtained from the the SEQC study (GSE47792)²⁰. Two replicates for MAQCA (ILM_BGI_A_1 and ILM_BGI_A_2) and MAQCB (ILM_BGI_B_1 and ILM_BGI_B_2), sequenced at the Beijing Genomics Institute with a mean of 73 M reads, were selected.

RNA-seq data processing

Fastq files were processed with five popular workflows (Tophat-HTSeq, Tophat-Cufflinks, STAR-HTSeq, Kallisto and Salmon) using the most recent versions of the software available at the time of analysis (Bowtie2 v2.1.0, Tophat v2.0.10, Cufflinks v2.1.1, HTSeq v0.5.4, Kallisto v0.42.1 and Salmon v0.6.0). For every workflow, default analysis settings and parameters were used. The same reference transcriptome was used for all workflows (Ensembl GRCh37, release 75). For Tophat-Cufflinks and Tophat-HTSeq, the transcriptome was filtered for transcripts detected by the RT-qPCR assays prior to running the Cufflinks and HT-seq algorithms. For Salmon and Kallisto the quantification was performed on the full transcriptome and gene-level TPM-values were calculated by summing transcript-level TPM values of those transcripts detected by the RT-qPCR assays. Tophat mapped on average 77.2% of the reads. For Tophat-Cufflinks and Tophat-HTSeq, the FPKM values were converted to TPM²¹. To get the TPM values for Tophat-HTSeq and Star-HTSeq, we took into account the length of the longest transcript. Calculating TPM-values using the median or minimum transcript length did not change fold-change correlation, however for the absolute values a higher correlation was obtained by using the length of the longest transcript (Supplemental Fig. 14). To define a TPM cutoff, we applied a measure published previously in the miRNA Quality Control Study²², relying on single positive reduction in replicate experiments. We defined this cut-off for both datasets, for both samples and for all workflows (Supplemental Fig. 15). Based on these values, one general cut-off was defined. As genes were also filtered based on the qPCR expression data, we decided to set this cut-off just below the lowest cut-off that was calculated, at 0.1 TPM. The cut-off was applied as such that only those genes were retained with expression above 0.1 TPM for all workflows and samples. The fastq files of the first dataset and output of the different workflows are available trough GEO (GSE83402).

$${TPM}=\,(\frac{{FPK}{{M}}_{i}}{{\sum }_{j}{FPK}{{M}}_{j}})\cdot {10}^{6}$$

(1)

Statistics

For the statistical analysis, R (version 3.2.2) was used. Expression correlation was calculated using either Pearson or Spearman. To test for significant overlap of individual elements in the Venn diagrams, either the Fisher Exact test, for 2 sets, of the Super Exact test²³, for multiple sets, was used. To test differences between sets of genes, the non-parametric Wilcoxon signed-rank test and the Kolmogorov-Smirnov test were used.

References

Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, 621–628, doi:10.1038/nmeth.1226 (2008).
Article CAS PubMed Google Scholar
Pan, Q. et al. Revealing global regulatory features of mammalian alternative splicing using a quantitative microarray platform. Mol. Cell 16, 929–941, doi:10.1016/j.molcel.2004.12.004 (2004).
Article CAS PubMed Google Scholar
Casneuf, T., Van de Peer, Y. & Huber, W. In situ analysis of cross-hybridisation on microarrays and the inference of expression correlation. BMC Bioinformatics 8, 461, doi:10.1186/1471-2105-8-461 (2007).
Article PubMed PubMed Central Google Scholar
Okoniewski, M. J. & Miller, C. J. Hybridization interactions between probesets in short oligo microarrays lead to spurious correlations. BMC Bioinformatics 7, 276, doi:10.1186/1471-2105-7-276 (2006).
Article PubMed PubMed Central Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578, doi:10.1038/nprot.2012.016 (2012).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinforma. Oxf. Engl. 25, 1105–1111, doi:10.1093/bioinformatics/btp120 (2009).
Article CAS Google Scholar
Anders, S., Pyl, P. T. & Huber, W. HTSeq–a Python framework to work with high-throughput sequencing data. Bioinforma. Oxf. Engl. 31, 166–169, doi:10.1093/bioinformatics/btu638 (2015).
Article CAS Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinforma. Oxf. Engl. 29, 15–21 (2013).
Article CAS Google Scholar
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 417–419 (2017).
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527, doi:10.1038/nbt.3519 (2016).
Article CAS PubMed Google Scholar
Chandramohan, R., Wu, P.-Y., Phan, J. H. & Wang, M. D. Benchmarking RNA-Seq quantification tools. Conf. Proc. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. IEEE Eng. Med. Biol. Soc. Annu. Conf. 2013, 647–650 (2013).
Google Scholar
Patro, R., Mount, S. M. & Kingsford, C. Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms. Nat. Biotechnol. 32, 462–464, doi:10.1038/nbt.2862 (2014).
Article CAS PubMed PubMed Central Google Scholar
Teng, M. et al. A benchmark for RNA-seq quantification pipelines. Genome Biol. 17, doi:10.1186/s13059-016-1060-7 (2016).
MAQC Consortium. et al. The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. Nat. Biotechnol. 24, 1151–1161, doi:10.1038/nbt1239 (2006).
Article PubMed Central Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550, doi:10.1186/s13059-014-0550-8 (2014).
Article PubMed PubMed Central Google Scholar
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinforma. Oxf. Engl. 26, 139–140, doi:10.1093/bioinformatics/btp616 (2010).
Article CAS Google Scholar
Ritchie, M. E. et al. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. gkv007. 43, e47–e47, doi:10.1093/nar/gkv007 (2015).
Robert, C. & Watson, M. Errors in RNA-Seq quantification affect genes of relevance to human disease. Genome Biol. 16, doi:10.1186/s13059-015-0734-x (2015).
Mestdagh, P. et al. A novel and universal method for microRNA RT-qPCR data normalization. Genome Biol. 10, R64, doi:10.1186/gb-2009-10-6-r64 (2009).
Article PubMed PubMed Central Google Scholar
SEQC/MAQC-III Consortium. A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium. Nat. Biotechnol. 32, 903–914, doi:10.1038/nbt.2957 (2014).
Article Google Scholar
Wagner, G. P., Kin, K. & Lynch, V. J. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci. Theor. Den Biowissenschaften 131, 281–285, doi:10.1007/s12064-012-0162-3 (2012).
Article CAS Google Scholar
Mestdagh, P. et al. Evaluation of quantitative miRNA expression platforms in the microRNA quality control (miRQC) study. Nat. Methods 11, 809–815, doi:10.1038/nmeth.3014 (2014).
Article CAS PubMed Google Scholar
Wang, M., Zhao, Y. & Zhang, B. Efficient Test and Visualization of Multi-Set Intersections. Sci. Rep. 5, 16923, doi:10.1038/srep16923 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

C.E. and P.M. are supported by the Fund for Scientific Research Flanders (FWO).

Author information

Authors and Affiliations

Center for Medical Genetics, Ghent University, Ghent, Belgium
Celine Everaert & Pieter Mestdagh
Cancer Research Institute Ghent, Ghent University, Ghent, Belgium
Celine Everaert & Pieter Mestdagh
Bioinformatics Institute Ghent N2N, Ghent University, Ghent, Belgium
Celine Everaert & Pieter Mestdagh
Biogazelle, Ghent, Belgium
Manuel Luypaert & Jan Hellemans
Kinghorn Cancer Center, Sydney, Australia
Jesper L. V. Maag, Quek Xiu Cheng & Marcel E. Dinger

Authors

Celine Everaert
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Luypaert
View author publications
You can also search for this author in PubMed Google Scholar
Jesper L. V. Maag
View author publications
You can also search for this author in PubMed Google Scholar
Quek Xiu Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Marcel E. Dinger
View author publications
You can also search for this author in PubMed Google Scholar
Jan Hellemans
View author publications
You can also search for this author in PubMed Google Scholar
Pieter Mestdagh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.E. and P.M. analysed the data and wrote the manuscript. M.L. and J.H. updated RT-qPCR assay annotations and provided the data. J.L.V.M., Q.X.C. and M.E.D. performed STAR-HTSeq analysis.

Corresponding author

Correspondence to Pieter Mestdagh.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplemental Figures

Supplemental Table1

Supplemental Table2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Everaert, C., Luypaert, M., Maag, J.L.V. et al. Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data. Sci Rep 7, 1559 (2017). https://doi.org/10.1038/s41598-017-01617-3

Download citation

Received: 18 July 2016
Accepted: 03 April 2017
Published: 08 May 2017
DOI: https://doi.org/10.1038/s41598-017-01617-3

This article is cited by

Differentially expressed transcripts of Tetracapsuloides bryosalmonae (Cnidaria) between carrier and dead-end hosts involved in key biological processes: novel insights from a coupled approach of FACS and RNA sequencing
- Saloni Shivam
- Reinhard Ertl
- Gokhlesh Kumar
Veterinary Research (2023)
Molecular evaluation of the metabolism of estrogenic di(2-ethylhexyl) phthalate in Mycolicibacterium sp.
- Mousumi Bhattacharyya
- Rinita Dhar
- Tapan K. Dutta
Microbial Cell Factories (2023)
A novel Chr1-miR-200 driven whole transcriptome signature shapes tumor immune microenvironment and predicts relapse in early-stage lung adenocarcinoma
- Simon Garinet
- Audrey Didelot
- Hélène Blons
Journal of Translational Medicine (2023)
Multi-omics strategies uncover the molecular mechanisms of nitrogen, phosphorus and potassium deficiency responses in Brassica napus
- Ying Fu
- Annaliese S. Mason
- Huasheng Yu
Cellular & Molecular Biology Letters (2023)
Genome-wide identification and bioinformatics analysis of the WD40 transcription factor family and candidate gene screening for anthocyanin biosynthesis in Rhododendron simsii
- Cheng Wang
- Yafang Tang
- Ang Lyu
BMC Genomics (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.