Abstract
*Motivation:* High-throughput nucleotide sequencing provides quantitative readouts in assays for RNA expression (RNA-Seq), protein-DNA binding (ChIP-Seq) or cell counting (barcode sequencing). Statistical inference of differential signal in such data requires estimation of their variability throughout the dynamic range. When the number of replicates is small, error modelling is needed to achieve statistical power.
Results:
We propose an error model that uses the negative binomial distribution, with variance and mean linked by local regression, to model the null distribution of the count data. The method controls type-I error and provides good detection power. *Availability:* A free open-source R software package, DESeq, is available from the Bioconductor project and from "http://www-huber.embl.de/users/anders/DESeq":http://www-huber.embl.de/users/anders/DESeq.
Similar content being viewed by others
Article PDF
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Anders, S., Huber, W. Differential expression analysis for sequence count data. Nat Prec (2010). https://doi.org/10.1038/npre.2010.4282.2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/npre.2010.4282.2
Keywords
This article is cited by
-
Molecular and metabolic insights into purplish leaf coloration through the investigation of two mulberry (Morus alba) genotypes
BMC Plant Biology (2024)
-
The sexual identity of neighboring plants influences root-associated bacteria under drought
Plant and Soil (2024)
-
Transcriptomic Response of the Atlantic Surfclam (Spisula solidissima) to Acute Heat Stress
Marine Biotechnology (2024)
-
Full-length transcriptome profiling for fruit development in Diospyros oleifera using nanopore sequencing
BMC Genomic Data (2023)
-
Transcriptomic profiling of Parkinson's disease brains reveals disease stage specific gene expression changes
Acta Neuropathologica (2023)