Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Briefing
  • Published:

OMArk, a tool for gene annotation quality control, reveals erroneous gene inference

We developed the OMArk software package for evaluating protein-coding gene annotation quality. In addition to assessing the completeness of a proteome, OMArk estimates the overall quality of the gene set’s content, a feature that will help to improve public protein sequence data.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: OMArk concept and output graph.

References

  1. Blaxter, M. et al. Why sequence all eukaryotes? Proc. Natl Acad. Sci. USA 119, e2115636118 (2022). An article that presents the reasoning behind the ongoing massive sequencing of all eukaryotic genomes.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Salzberg, S. L. Next-generation genome annotation: we still struggle to get it right. Genome Biol. 20, 92 (2019). A piece summarizing the challenges of gene annotation.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Altenhoff, A. M. et al. OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA ecosystem. Nucleic Acids Res. https://doi.org/10.1093/nar/gkad1020 (2023). The latest paper presenting the OMA orthology database, which OMArk uses as reference.

    Article  PubMed Central  Google Scholar 

  4. Rossier, V., Vesztrocy, A. W., Robinson-Rechavi, M. & Dessimoz, C. OMAmer: tree-driven and alignment-free protein assignment to subfamilies outperforms closest sequence approaches. Bioinformatics https://doi.org/10.1093/bioinformatics/btab219 (2021). This paper describes OMAmer, the first step in the workflow of OMArk.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Nevers, Y. et al. Quality assessment of gene repertoire annotations with OMArk. Nat. Biotechnol. https://doi.org/10.1038/s41587-024-02147-w (2024).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

OMArk, a tool for gene annotation quality control, reveals erroneous gene inference. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02155-w

Download citation

  • Published:

  • DOI: https://doi.org/10.1038/s41587-024-02155-w

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing