Data mining

Definition

Data mining is the process of extracting potentially useful information from data sets. It uses a suite of methods to organise, examine and combine large data sets, including machine learning, visualisation methods and statistical analyses. Data mining is used in computational biology and bioinformatics to detect trends or patterns without knowledge of the meaning of the data.

Latest Research and Reviews

News and Comment

  • Comments and Opinion |

    • Yasset Perez-Riverol
    • , Mingze Bai
    • , Felipe da Veiga Leprevost
    • , Silvano Squizzato
    • , Young Mi Park
    • , Kenneth Haug
    • , Adam J Carroll
    • , Dylan Spalding
    • , Justin Paschall
    • , Mingxun Wang
    • , Noemi del-Toro
    • , Tobias Ternent
    • , Peng Zhang
    • , Nicola Buso
    • , Nuno Bandeira
    • , Eric W Deutsch
    • , David S Campbell
    • , Ronald C Beavis
    • , Reza M Salek
    • , Ugis Sarkans
    • , Robert Petryszak
    • , Maria Keays
    • , Eoin Fahy
    • , Manish Sud
    • , Shankar Subramaniam
    • , Ariana Barbera
    • , Rafael C Jiménez
    • , Alexey I Nesvizhskii
    • , Susanna-Assunta Sansone
    • , Christoph Steinbeck
    • , Rodrigo Lopez
    • , Juan A Vizcaíno
    • , Peipei Ping
    •  & Henning Hermjakob
    Nature Biotechnology 35, 406–409
  • Comments and Opinion |

    • John Vivian
    • , Arjun Arkal Rao
    • , Frank Austin Nothaft
    • , Christopher Ketchum
    • , Joel Armstrong
    • , Adam Novak
    • , Jacob Pfeil
    • , Jake Narkizian
    • , Alden D Deran
    • , Audrey Musselman-Brown
    • , Hannes Schmidt
    • , Peter Amstutz
    • , Brian Craft
    • , Mary Goldman
    • , Kate Rosenbloom
    • , Melissa Cline
    • , Brian O'Connor
    • , Megan Hanna
    • , Chet Birger
    • , W James Kent
    • , David A Patterson
    • , Anthony D Joseph
    • , Jingchun Zhu
    • , Sasha Zaranek
    • , Gad Getz
    • , David Haussler
    •  & Benedict Paten
    Nature Biotechnology 35, 314–316
  • Editorial |

    A recent recommendation that a large number of professional data stewards be trained and employed in all data-rich research projects raises the exciting prospect they will conduct research on data-intensive research itself. It also focuses us on questions about the role of all scientists in data quality and accessibility as well as how best to measure the value of good data stewardship to science and society.

  • Editorial |

    The FAIR data principles are simple guidelines for ensuring that machines can find and use data, supporting data reuse by individuals. More—and better—research can be generated by designing data and algorithms to be findable, accessible, interoperable and reusable, together with the tools and workflows that led to these data.