The CALBC RDF Triple store: retrieval over large literature content

Croset, Samuel

doi:10.1038/npre.2010.5383.1

Download PDF

Presentation
Open access
Published: 13 December 2010

SWAT4LS 2010

The CALBC RDF Triple store: retrieval over large literature content

Samuel Croset¹

Nature Precedings (2010)Cite this article

46 Accesses
4 Citations
Metrics details

Abstract

Integration of the scientific literature into a biomedical research infrastructure requires the processing of the literature, identification of the contained named entities (NEs) and concepts, and to represent the content in a standardised way.The CALBC project partners (PPs) have produced a large-scale annotated biomedical corpus with four different semantic groups through the harmonisation of annotations from automatic text mining solutions (Silver Standard Corpus, SSC). The four semantic groups were chemical entities and drugs (CHED), genes and proteins (PRGE), diseases and disorders (DISO) and species (SPE). The content of the SSC has been fully integrated into RDF Triple Store (4,568,678 triples) and has been aligned with content from the GeneAtlas (182,840 triples), UniProtKb (12,552,239 triples for human) and the lexical resource LexEBI (BioLexicon). RDF Triple Store enables querying the scientific literature and bioinformatics resources at the same time for evidence of genetic causes, such as drug targets and disease involvement.

Article PDF

Author information

Authors and Affiliations

European Bioinformatics Institute (EBI) https://www.nature.com/nature
Samuel Croset

Authors

Samuel Croset
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Samuel Croset.

Rights and permissions

Creative Commons Attribution 3.0 License.

Reprints and permissions

About this article

Cite this article

Croset, S. The CALBC RDF Triple store: retrieval over large literature content. Nat Prec (2010). https://doi.org/10.1038/npre.2010.5383.1

Download citation

Received: 12 December 2010
Accepted: 13 December 2010
Published: 13 December 2010
DOI: https://doi.org/10.1038/npre.2010.5383.1

The CALBC RDF Triple store: retrieval over large literature content

Abstract

Similar content being viewed by others

An empirical meta-analysis of the life sciences linked open data on the web

A novel machine learning framework for automated biomedical relation extraction from large-scale literature repositories

Semantic wikis as flexible database interfaces for biomedical applications

Article PDF

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Search

Quick links

Abstract

Similar content being viewed by others

An empirical meta-analysis of the life sciences linked open data on the web

A novel machine learning framework for automated biomedical relation extraction from large-scale literature repositories

Semantic wikis as flexible database interfaces for biomedical applications

Article PDF

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links