How does DNA sequence motif discovery work?

D'haeseleer, Patrik

doi:10.1038/nbt0806-959

Primer
Published: 01 August 2006

Nature Biotechnology volume 24, pages 959–961 (2006)

How can we computationally extract an unknown motif from a set of target sequences? What are the principles behind the major motif discovery algorithms? Which of these should we use, and how do we know we've found a 'real' motif?

**Figure 1: Starting from a single site, expectation maximization algorithms such as MEME⁴ alternate between assigning sites to a motif (left) and updating the motif model (right).**
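
The alternation described in the Figure 1 caption can be illustrated with a minimal sketch. This is not MEME's implementation: it assumes exactly one site per sequence, a uniform background, a fixed motif width equal to the seed, and a pseudocount of 0.5. The function names (`init_pwm`, `e_step`, `m_step`, `run_em`, `consensus`) are hypothetical and chosen only for this illustration.

```python
ALPHABET = "ACGT"

def init_pwm(seed, pseudo=0.5):
    """Build a starting motif model (position weight matrix) from one site."""
    pwm = []
    for base in seed:
        col = {b: pseudo for b in ALPHABET}
        col[base] += 1.0
        total = sum(col.values())
        pwm.append({b: col[b] / total for b in ALPHABET})
    return pwm

def e_step(seqs, pwm, background=0.25):
    """Assign sites to the motif: for each sequence, compute the posterior
    probability that the site starts at each position (assumed: one site
    per sequence, uniform background)."""
    w = len(pwm)
    posteriors = []
    for s in seqs:
        scores = []
        for i in range(len(s) - w + 1):
            ratio = 1.0  # likelihood ratio of motif model vs. background
            for j in range(w):
                ratio *= pwm[j][s[i + j]] / background
            scores.append(ratio)
        total = sum(scores)
        posteriors.append([x / total for x in scores])
    return posteriors

def m_step(seqs, posteriors, w, pseudo=0.5):
    """Update the motif model from the posterior-weighted site assignments."""
    counts = [{b: pseudo for b in ALPHABET} for _ in range(w)]
    for s, post in zip(seqs, posteriors):
        for i, p in enumerate(post):
            for j in range(w):
                counts[j][s[i + j]] += p
    pwm = []
    for col in counts:
        total = sum(col.values())
        pwm.append({b: col[b] / total for b in ALPHABET})
    return pwm

def run_em(seqs, seed, n_iter=20):
    """Alternate site assignment (E-step) and model update (M-step)."""
    pwm = init_pwm(seed)
    post = e_step(seqs, pwm)
    for _ in range(n_iter):
        pwm = m_step(seqs, post, len(seed))
        post = e_step(seqs, pwm)
    return pwm, post

def consensus(pwm):
    """Most probable base at each motif position."""
    return "".join(max(col, key=col.get) for col in pwm)
```

Calling `run_em(seqs, seed)` with a list of DNA strings and one candidate site as the seed returns the final model and the per-sequence site posteriors. A fixed iteration count stands in for a convergence test, and a real tool would also try many seeds and score the resulting motifs for significance.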

References

1. D'haeseleer, P. What are DNA sequence motifs? Nat. Biotechnol. 24, 423–425 (2006).
2. Sinha, S. & Tompa, M. YMF: a program for discovery of novel transcription factor binding sites by statistical overrepresentation. Nucleic Acids Res. 31, 3586–3588 (2003).
3. Pavesi, G. et al. Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes. Nucleic Acids Res. 32 (Web Server Issue), W199–W203 (2004).
4. Bailey, T.L. & Elkan, C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2, 28–36 (1994).
5. Tompa, M. et al. Assessing computational tools for the discovery of transcription factor binding sites. Nat. Biotechnol. 23, 137–144 (2005).
6. Li, N. & Tompa, M. Analysis of computational approaches for motif discovery. Alg. Mol. Biol. 1, 8 (2006).
7. Hu, J., Li, B. & Kihara, D. Limitations and potentials of current motif discovery algorithms. Nucleic Acids Res. 33, 4899–4913 (2005).
8. Thijs, G. et al. A Gibbs sampling method to detect overrepresented motifs in the upstream regions of coexpressed genes. J. Comp. Biol. 9, 447–464 (2002).
9. Huber, B.R. & Bulyk, M.L. Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data. BMC Bioinformatics 7, 229 (2006).
10. Hughes, J.D. et al. Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. J. Mol. Biol. 296, 1205–1214 (2000).
11. McGuire, A.M., Hughes, J.D. & Church, G.M. Conservation of DNA regulatory motifs and discovery of new motifs in microbial genomes. Genome Res. 10, 744–757 (2000).
12. Huang, H.-D. et al. Identifying transcriptional regulatory sites in the human genome using an integrated system. Nucleic Acids Res. 32, 1948–1956 (2004).

Author information

Authors and Affiliations

Microbial Systems Division, Biosciences Directorate, Lawrence Livermore National Laboratory, 7000 East Ave., PO Box 808, L-448, Livermore, 94551, California, USA
Patrik D'haeseleer

About this article

Cite this article

D'haeseleer, P. How does DNA sequence motif discovery work? Nat Biotechnol 24, 959–961 (2006). https://doi.org/10.1038/nbt0806-959

Issue Date: 01 August 2006
DOI: https://doi.org/10.1038/nbt0806-959

This article is cited by

biomapp::chip: large-scale motif analysis
- Jader M. Caldonazzo Garbelini
- Danilo S. Sanches
- Aurora T. Ramirez Pozo
BMC Bioinformatics (2024)
Sequence motif finder using memetic algorithm
- Jader M. Caldonazzo Garbelini
- André Y. Kashiwabara
- Danilo S. Sanches
BMC Bioinformatics (2018)
DiNAMO: highly sensitive DNA motif discovery in high-throughput sequencing data
- Chadi Saad
- Laurent Noé
- Martin Figeac
BMC Bioinformatics (2018)
SamSelect: a sample sequence selection algorithm for quorum planted motif search on large DNA datasets
- Qiang Yu
- Dingbang Wei
- Hongwei Huo
BMC Bioinformatics (2018)
A systematic approach to RNA-associated motif discovery
- Tian Gao
- Jiang Shu
- Juan Cui
BMC Genomics (2018)
