Abstract
Data integration plays an increasingly important role in bringing together the large amounts of diverse information spread across disparate resources and presenting a comprehensive overview of these data to the scientific community. The UniProt Knowledgebase (UniProtKB) acts as a central hub of protein knowledge by providing a unified view of protein sequence and functional information. Manual and automatic annotation procedures are used to add data directly to the database while extensive cross-referencing to more than 120 external databases provides access to additional relevant information in more specialised data collections. UniProtKB also integrates data such as protein sequences, protein-protein interactions, Gene Ontology terms and official gene nomenclature from a range of resources. All information in UniProtKB is attributed to its original source, allowing users to trace the provenance of all data. In addition, UniProtKB data is made freely available in a range of formats to facilitate integration with other databases and the UniProt Consortium is committed to using and promoting common data exchange formats and technologies. This approach ensures that information is captured in the most appropriate resource for subsequent integration with other databases and also ensures maximum curation efficiency by preventing duplication of efforts across multiple resources. How UniProt achieves this data capture and integration will be presented. The UniProt resource is available at "www.uniprot.org":http://www.uniprot.org.
Similar content being viewed by others
Article PDF
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Magrane, M., Consortium, U. UniProt Knowledgebase: a hub of integrated data. Nat Prec (2010). https://doi.org/10.1038/npre.2010.5092.1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/npre.2010.5092.1
Keywords
This article is cited by
-
Identification of potential drug targets and inhibitor of the pathogenic bacteria Shigella flexneri 2a through the subtractive genomic approach
In Silico Pharmacology (2018)
-
Development of novel genic microsatellite markers from transcriptome sequencing in sugar maple (Acer saccharum Marsh.)
BMC Research Notes (2017)
-
Boronic Acid-Modified Magnetic Fe3O4@mTiO2 Microspheres for Highly Sensitive and Selective Enrichment of N-Glycopeptides in Amniotic Fluid
Scientific Reports (2017)
-
Conserved-residue mutations in Wzy affect O-antigen polymerization and Wzz-mediated chain-length regulation in Pseudomonas aeruginosa PAO1
Scientific Reports (2013)