Top-down standards will not serve systems biology

Quackenbush, John; Stoeckert, Christian; Ball, Catherine; Brazma, Alvis; Gentleman, Robert; Huber, Wolfgang; Irizarry, Rafael; Salit, Marc; Sherlock, Gavin; Spellman, Paul; Winegarden, Neil

doi:10.1038/440024a

Download PDF

Correspondence
Published: 01 March 2006

Top-down standards will not serve systems biology

John Quackenbush¹,
Christian Stoeckert²,
Catherine Ball³,
Alvis Brazma⁴,
Robert Gentleman⁵,
Wolfgang Huber⁶,
Rafael Irizarry⁷,
Marc Salit⁸,
Gavin Sherlock⁹,
Paul Spellman¹⁰ &
…
Neil Winegarden¹¹

Nature volume 440, page 24 (2006)Cite this article

864 Accesses
15 Citations
Metrics details

Sir

Marvin Cassman and his colleagues, in their Commentary “Barriers to progress in systems biology” (Nature 438, 1079; 2005), discuss the development of standards in systems-biology research. We agree with the need for well-curated databases, software systems that can work together to analyse such data and integrated models that can deliver the fruits of systems-based research to laboratory biologists. But we have concerns about the proposed solution, which is presented as a ‘top-down’ approach that ignores many existing and emerging standards. It seems based on false assumptions about the research community and ignores the community it is intended to serve.

Cassman and his colleagues argue that standards are needed because much software developed in research settings is not reusable by other groups of working biologists, who are not appropriately trained. But the community has many excellent quantitative scientists and software developers — and with the advent of genomics, an increasing number of physicists, mathematicians, statisticians, computer scientists and engineers have joined the ranks of biologists.

It is not a lack of training that influences software design, but the realities of developing software in a research environment where developing a professional software system is not the primary goal. As fields mature and the methodologies used to generate the data become well known and established, it is both appropriate and valuable to have standardized, easy-to-use software. But standardized approaches are not always appropriate for developing software to support new research using novel methodologies in exciting new ways.

Our collective experience, gained through the Microarray Gene Expression Data Society and the BioConductor project, clearly demonstrates that flexible systems are needed and that most initial efforts are neither well documented nor widely used. But that is not a bad thing — as science charts a particular path, the appropriate tools, if given room to evolve, do emerge and rise to the top, becoming better documented and more robust.

Even with the relatively straightforward task of assembling and annotating genome-sequencing data, computationally elegant solutions to software interoperability (such as the common object request broker architecture, or CORBA) were ultimately abandoned in favour of FASTA-formatted sequence data and tab-delimited output from various analytical tools strung together using Perl. It wasn't elegant or pretty, but it delivered what was needed in a way that sophisticated users at various locations could replicate and adapt to suit their needs. When combined with well-engineered databases and websites to provide access, the genome projects also delivered the fruits of their work to the broader community in a form that has been extremely useful and continues to evolve. Engineering this ahead of time, particularly when the field and the tools were evolving so rapidly, quite simply would have failed.

We believe that the centralized approach proposed by Cassman and colleagues would not fare well compared with more democratic, community-based approaches that understand and include research-driven development efforts. Creating a rigid standard before a field has matured can result in a failed and unused standard, in the best of circumstances, and, in the worst, can have the effect of stifling innovation.

Author information

Authors and Affiliations

Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard School of Public Health, 44 Binney Street, Boston, 02115, Massachusetts, USA
John Quackenbush
University of Pennsylvania, Philadelphia
Christian Stoeckert
Stanford University, Stanford, California
Catherine Ball
European Bioinformatics Institute, EMBL, Cambridge
Alvis Brazma
Fred Hutchinson Cancer Research Center, Seattle
Robert Gentleman
European Bioinformatics Institute, EMBL, Cambridge
Wolfgang Huber
Johns Hopkins School of Public Health, Baltimore
Rafael Irizarry
US National Institute of Standards and Technology,
Marc Salit
Stanford University, Stanford, California
Gavin Sherlock
Lawrence Berkeley Laboratory, California
Paul Spellman
University Health Network, Toronto, Canada
Neil Winegarden

Authors

John Quackenbush
View author publications
You can also search for this author in PubMed Google Scholar
Christian Stoeckert
View author publications
You can also search for this author in PubMed Google Scholar
Catherine Ball
View author publications
You can also search for this author in PubMed Google Scholar
Alvis Brazma
View author publications
You can also search for this author in PubMed Google Scholar
Robert Gentleman
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Huber
View author publications
You can also search for this author in PubMed Google Scholar
Rafael Irizarry
View author publications
You can also search for this author in PubMed Google Scholar
Marc Salit
View author publications
You can also search for this author in PubMed Google Scholar
Gavin Sherlock
View author publications
You can also search for this author in PubMed Google Scholar
Paul Spellman
View author publications
You can also search for this author in PubMed Google Scholar
Neil Winegarden
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Quackenbush, J., Stoeckert, C., Ball, C. et al. Top-down standards will not serve systems biology. Nature 440, 24 (2006). https://doi.org/10.1038/440024a

Download citation

Published: 01 March 2006
Issue Date: 02 March 2006
DOI: https://doi.org/10.1038/440024a

This article is cited by

A minimum information standard for reproducing bench-scale bacterial cell growth and productivity
- Ariel Hecht
- James Filliben
- Marc Salit
Communications Biology (2018)
LabKey Server: An open source platform for scientific data integration, analysis and collaboration
- Elizabeth K Nelson
- Britt Piehler
- Mark Igra
BMC Bioinformatics (2011)
Systems biology driven software design for the research enterprise
- John Boyle
- Christopher Cavnor
- Ilya Shmulevich
BMC Bioinformatics (2008)
Beyond standardization: dynamic software infrastructures for systems biology
- Morris A. Swertz
- Ritsert C. Jansen
Nature Reviews Genetics (2007)
High-throughput electronic biology: mining information for drug discovery
- William Loging
- Lee Harland
- Bryn Williams-Jones
Nature Reviews Drug Discovery (2007)

Top-down standards will not serve systems biology

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

This article is cited by

A minimum information standard for reproducing bench-scale bacterial cell growth and productivity

LabKey Server: An open source platform for scientific data integration, analysis and collaboration

Systems biology driven software design for the research enterprise

Beyond standardization: dynamic software infrastructures for systems biology

High-throughput electronic biology: mining information for drug discovery

Search

Quick links

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

A minimum information standard for reproducing bench-scale bacterial cell growth and productivity

LabKey Server: An open source platform for scientific data integration, analysis and collaboration

Systems biology driven software design for the research enterprise

Beyond standardization: dynamic software infrastructures for systems biology

High-throughput electronic biology: mining information for drug discovery

Search

Quick links