We developed the OMArk software package for evaluating protein-coding gene annotation quality. In addition to assessing the completeness of a proteome, OMArk estimates the overall quality of the gene set’s content, a feature that will help to improve public protein sequence data.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
References
Blaxter, M. et al. Why sequence all eukaryotes? Proc. Natl Acad. Sci. USA 119, e2115636118 (2022). An article that presents the reasoning behind the ongoing massive sequencing of all eukaryotic genomes.
Salzberg, S. L. Next-generation genome annotation: we still struggle to get it right. Genome Biol. 20, 92 (2019). A piece summarizing the challenges of gene annotation.
Altenhoff, A. M. et al. OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA ecosystem. Nucleic Acids Res. https://doi.org/10.1093/nar/gkad1020 (2023). The latest paper presenting the OMA orthology database, which OMArk uses as reference.
Rossier, V., Vesztrocy, A. W., Robinson-Rechavi, M. & Dessimoz, C. OMAmer: tree-driven and alignment-free protein assignment to subfamilies outperforms closest sequence approaches. Bioinformatics https://doi.org/10.1093/bioinformatics/btab219 (2021). This paper describes OMAmer, the first step in the workflow of OMArk.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This is a summary of: Nevers, Y. et al. Quality assessment of gene repertoire annotations with OMArk. Nat. Biotechnol. https://doi.org/10.1038/s41587-024-02147-w (2024).
Rights and permissions
About this article
Cite this article
OMArk, a tool for gene annotation quality control, reveals erroneous gene inference. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02155-w
Published:
DOI: https://doi.org/10.1038/s41587-024-02155-w