Review abstract


Nature Methods 6, S22 - S32 (2009)
Published online: 15 October 2009 | doi:10.1038/nmeth.1371

Computation for ChIP-seq and RNA-seq studies

Shirley Pepke1, Barbara Wold2 & Ali Mortazavi2


Genome-wide measurements of protein-DNA interactions and transcriptomes are increasingly done by deep DNA sequencing methods (ChIP-seq and RNA-seq). The power and richness of these counting-based measurements comes at the cost of routinely handling tens to hundreds of millions of reads. Whereas early adopters necessarily developed their own custom computer code to analyze the first ChIP-seq and RNA-seq datasets, a new generation of more sophisticated algorithms and software tools are emerging to assist in the analysis phase of these projects. Here we describe the multilayered analyses of ChIP-seq and RNA-seq datasets, discuss the software packages currently available to perform tasks at each layer and describe some upcoming challenges and features for future analysis tools. We also discuss how software choices and uses are affected by specific aspects of the underlying biology and data structure, including genome size, positional clustering of transcription factor binding sites, transcript discovery and expression quantification.

Top
  1. Center for Advanced Computing Research and
  2. Division of Biology, California Institute of Technology, Pasadena, California, USA.

Correspondence to: Ali Mortazavi2 e-mail: alim@caltech.edu




Extra navigation

Apply for your free subscription to
Nature Methods

Subscribe

Open Innovation Challenges

naturejobs

ADVERTISEMENT