Using a data triangle to understand molecular nutrition

Evelo, Chris

doi:10.1038/npre.2011.5689.1

Presentation
Open access
Published: 16 February 2011

International Conference on Nutrigenomics 2010 / 10th International Conference on Mechanisms of Antimutagenesis and Anticarcinogenesis 2010

Using a data triangle to understand molecular nutrition

Chris Evelo¹

Nature Precedings (2011)Cite this article

238 Accesses
Metrics details

Abstract

Until recently nutrigenomics was mainly about transcriptomics related data. That already confronted us with overwhelming analytical problems. We learned to mathematically and statistically treat genome wide expression studies and studies directed to gene expression regulation. Nutrigenomics researchers had to become bilingual speaking: English and R1 and learned to think about co-expression, clusters and false discovery rates. The latter in fact proofed to be a trap. Removing all the false positives made us loose the information we were really interested in. To understand the results of our genomics experiments we often had to confront what we were measuring with what we already knew. After all false positives are not likely to all be related to the same meaningful biological process. That asked for the development of new analytical tools like Cytoscape for network analysis and PathVisio for pathway analysis. More importantly we had to structure what we know. Text mining and data mining helped us to do that, but what was really needed was mobilization of all the knowledge that is present in the heads of the scientific community. WikiPathways was our contribution to the rapidly emerging field of community curation. Thus we started to become able to integrate different types of technologies that span the full gene expression pipeline and to understand that in the biological context.Today the story repeats itself. Genome wide genetics is becoming real. We can do Genome Wide Association Studies and soon we can sequence individual genomes in relation to food intake and phenotypic responses. And then what? How can we deal with that new avalanche of data? The oversampling problems will be a few orders of magnitude larger; after all there can be hundreds of SNPs in every gene. There will just be too many to understand which SNPs are important from the data alone. We will again have to relate them to the biological processes. But is that enough? I think not. We will only understand the outcome of those large scale genetics studies if we not only attribute the SNPs to genes and thereby to pathways. We will also have to consider the actual sequences and see what the functional effect is that the SNP causes. Is it likely to influence transcription factor binding, miRNA effects, or protein-protein interactions? This calls for new types of data integration, for which we already have the tools. And it calls for new creative ways to do that. What we really need is teams of creative minds. Some new initiatives seem to show that these are already being formed.1: http://www.r-project.org

Article PDF

Author information

Authors and Affiliations

Department of Bioinformatics - BiGCaT, Maastricht University https://www.nature.com/nature
Chris Evelo

Authors

Chris Evelo
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Creative Commons Attribution 3.0 License.

Reprints and permissions

About this article

Cite this article

Evelo, C. Using a data triangle to understand molecular nutrition. Nat Prec (2011). https://doi.org/10.1038/npre.2011.5689.1

Download citation

Received: 16 February 2011
Accepted: 16 February 2011
Published: 16 February 2011
DOI: https://doi.org/10.1038/npre.2011.5689.1

Using a data triangle to understand molecular nutrition

Abstract

Similar content being viewed by others

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

An open source knowledge graph ecosystem for the life sciences

Genome-wide association studies

Article PDF

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Search

Quick links

Abstract

Similar content being viewed by others

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

An open source knowledge graph ecosystem for the life sciences

Genome-wide association studies

Article PDF

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links