Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Using metric learning to identify the lab-of-origin of engineered DNA

Determining the origin of engineered DNA can help to foster responsible innovation within the biotechnology community. A convolutional neural network approach that learns distances between engineered DNA sequences and various labs that could have created them is used to accurately predict the lab-of-origin.

This is a preview of subscription content, access via your institution

Access options

Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.

Fig. 1: Identifying the lab-of-origin of DNA sequences.


  1. Nielsen, A. A. & Voigt, C. A. Deep learning to predict the lab-of-origin of engineered DNA. Nat. Commun. 9, 3135 (2018). The first study to predict the lab of origin of engineered DNA using deep learning.

    Article  Google Scholar 

  2. Alley, E. C. et al. A machine learning toolkit for genetic engineering attribution to facilitate biosecurity. Nat. Commun. 11, 6293 (2020). An article that presents a recurrent neural network approach to predicting the lab-of-origin.

    Article  Google Scholar 

  3. Wang, Q., Kille, B., Liu, T. R., Elworth, R. A. L. & Treangen, T. J. Plasmidhawk improves lab of origin prediction of engineered plasmids using sequence alignment. Nat. Commun. 12, 1167 (2021). An article that presents a pan-genome method for lab-of-origin prediction and previous state-of-the-art.

    Article  Google Scholar 

  4. Hoffer, E. & Ailon, N. Deep metric learning using triplet network. In Similarity-Based Pattern Recognition. SIMBAD 2015. Lecture Notes in Computer Science Vol. 9370 (eds Feragen, A. et al.) 84–92 (Springer, 2015). The first paper to propose deep metric learning using the triplet network model.

  5. Fei-Fei, L., Fergus, R. & Perona, P. One-shot learning of object categories. IEEE Trans. Pattern Anal. Mach. Intell. 28, 594–611 (2006). An article that presents the possibility of training deep algorithms with few samples (few-shot) or one sample (one-shot).

    Article  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Soares, I. M. et al. Improving lab-of-origin prediction of genetically engineered plasmids via deep metric learning. Nat. Comput. Sci. (2022).

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Using metric learning to identify the lab-of-origin of engineered DNA. Nat Comput Sci 2, 296–297 (2022).

Download citation

  • Published:

  • Issue Date:

  • DOI:


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing