Data mining in biotechnology

Persidis, Aris

doi:10.1038/72722

Industry Trends
Published: February 2000

Data mining in biotechnology

Aris Persidis¹

Nature Biotechnology volume 18, pages 237–238 (2000)Cite this article

311 Accesses
6 Citations
Metrics details

Access through your institution

Buy or subscribe

Data mining has been defined as “the nontrivial extraction of implicit, previously unknown, and potentially useful information from data”.¹ In areas other than the life sciences and healthcare, data mining is a huge industry, with more than a hundred companies providing a vast array of software products and services to clients that obtain, generate, and rely on large quantities of data. The industries that rely daily on data mining for a number of their functions include marketing, manufacturing, database providers, government, the travel industry, banking and the financial industry, telecommunications, and engineering, among others. The common theme is that these industries all have truly massive amounts of information—about their operations and also about their clients—collected in a variety of ways. In order to maximize the usefulness of this information, they rely on software that helps glean specific patterns and trends from the data, in addition to making predictions and offering simulations of future events.

It should come as no surprise that the biopharmaceutical industry is increasingly beginning to employ a variety of data-mining methodologies to help it deal with the enormous amounts of biological information of various forms that the industry collects. Ranging from annotated databases of disease profiles and molecular pathways to sequences, structure–activity relationships (SAR), chemical structures of combinatorial libraries of compounds, individual and population clinical trial results, the biopharmaceutical industry is inundated with information, and data mining is the centerpiece of advanced methodologies to help the industry deal with this information overload².

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

References

Frawley, W. et al. AI Magazine, Fall 1992, 213– 228.
Persidis, A. Nat. Biotechnol. 17, 1239 (1999 ).
Article CAS Google Scholar
Persidis, A. Nat. Biotechnol. 17, 828–830 (1999).
Article CAS Google Scholar
Information on general data-mining companies can be found at http://www.kdnuggets.com.

Download references

Author information

Authors and Affiliations

managing director of RHeoGene, 706 Forest Street, Charlottesville, 22903, VA
Aris Persidis

Authors

Aris Persidis
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Persidis, A. Data mining in biotechnology. Nat Biotechnol 18, 237–238 (2000). https://doi.org/10.1038/72722

Download citation

Issue Date: February 2000
DOI: https://doi.org/10.1038/72722

Data mining in biotechnology

Access options

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Search

Quick links

Access options

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links