MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads

Xiao, Chuan-Le; Chen, Ying; Xie, Shang-Qian; Chen, Kai-Ning; Wang, Yan; Han, Yue; Luo, Feng; Xie, Zhi

doi:10.1038/nmeth.4432

Brief Communication
Published: 18 September 2017

MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads

Nature Methods volume 14, pages 1072–1074 (2017)Cite this article

9930 Accesses
250 Citations
28 Altmetric
Metrics details

Subjects

Abstract

We present a tool that combines fast mapping, error correction, and de novo assembly (MECAT; accessible at https://github.com/xiaochuanle/MECAT) for processing single-molecule sequencing (SMS) reads. MECAT's computing efficiency is superior to that of current tools, while the results MECAT produces are comparable or improved. MECAT enables reference mapping or de novo assembly of large genomes using SMS reads on a single computer.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Principle and property of DDF scoring algorithm in MECAT alignment.**

Scalable long read self-correction and assembly polishing with multiple sequence alignment

Article Open access 12 January 2021

Illumina reads correction: evaluation and improvements

Article Open access 26 January 2024

Efficient hybrid de novo assembly of human genomes with WENGAN

Article Open access 14 December 2020

Accession codes

Primary accessions

Sequence Read Archive

SRX1424851

References

Schadt, E.E., Turner, S. & Kasarskis, A. Hum. Mol. Genet. 19, R227–R240 (2010).
Article CAS Google Scholar
Eid, J. et al. Science 323, 133–138 (2009).
Article CAS Google Scholar
Chin, C.S. et al. Nat. Methods 10, 563–569 (2013).
Article CAS Google Scholar
Jain, M. et al. Nat. Methods 12, 351–356 (2015).
Article CAS Google Scholar
Sović, I. et al. Nat. Commun. 7, 11307 (2016).
Article Google Scholar
Loman, N.J., Quick, J. & Simpson, J.T. Nat. Methods 12, 733–735 (2015).
Article CAS Google Scholar
Koren, S. et al. Nat. Biotechnol. 30, 693–700 (2012).
Article CAS Google Scholar
Seo, J.S. et al. Nature 538, 243–247 (2016).
Article CAS Google Scholar
Shi, L. et al. Nat. Commun. 7, 12065 (2016).
Article CAS Google Scholar
Gordon, D. et al. Science 352, aae0344 (2016).
Article Google Scholar
Berlin, K. et al. Nat. Biotechnol. 33, 623–630 (2015).
Article CAS Google Scholar
Chin, C.S. et al. Nat. Methods 13, 1050–1054 (2016).
Article CAS Google Scholar
Koch, P., Platzer, M. & Downie, B.R. Nucleic Acids Res. 42, e80 (2014).
Article CAS Google Scholar
Koren, S., Walenz, B.P., Berlin, K., Miller, J.R. & Phillippy, A.M. Genome Res. 27, 722–736 (2017).
Article CAS Google Scholar
Chaisson, M.J. & Tesler, G. BMC Bioinformatics 13, 238 (2012).
Myers, E.W. Algorithmica 1, 251–266 (1986).
Article Google Scholar
Myers, G. Algorithms in Bioinformatics, 52–67 (2014).
Li, H. Preprint at https://arxiv.org/abs/1303.3997 (2013).
Langmead, B. & Salzberg, S.L. Nat. Methods 9, 357–359 (2012).
Article CAS Google Scholar

Download references

Acknowledgements

We thank D.P. Wang for supplying the Chinese human data set. We thank the NCBI assembly group for the Han-1 Chinese annotation. This work was collectively supported by the National Natural Science Foundation of China (31471232, 31471789 and 31600667), the Fundamental Research Funds for the Central Universities (15ykjc23d), the Guangdong Natural Science Foundation (2015A030313127), the Joint Research Fund for the Overseas Natural Science of China (3030901001222), infrastructure support from Center for Precision Medicine (Sun Yat-sen University), China Postdoctoral Science Foundation (2017M612798), and the National Institute of Food and Agriculture (NIFA), USA (2017-70016-26051).

Author information

Chuan-Le Xiao and Ying Chen: These authors contributed equally to this work.

Authors and Affiliations

State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
Chuan-Le Xiao, Shang-Qian Xie, Kai-Ning Chen, Yan Wang, Yue Han & Zhi Xie
School of Data and Computer Science, and Guangdong Provincial Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou, China
Chuan-Le Xiao & Ying Chen
Southern Regional Collaborative Innovation Center for Grain and Oil Crops in China, Hunan Agricultural University, Hunan, China
Chuan-Le Xiao
College of Plant Protection, Hunan Agricultural University, Changsha, China
Chuan-Le Xiao
Institute of Tropical Agriculture and Forestry, Hainan University, Haikou, China
Shang-Qian Xie
CookGene Bio-technology Co., Ltd., Guangzhou, China
Yue Han
School of Computing, Clemson University, Clemson, South Carolina, USA
Feng Luo

Authors

Chuan-Le Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Ying Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shang-Qian Xie
View author publications
You can also search for this author in PubMed Google Scholar
Kai-Ning Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yue Han
View author publications
You can also search for this author in PubMed Google Scholar
Feng Luo
View author publications
You can also search for this author in PubMed Google Scholar
Zhi Xie
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.-L.X. conceived and designed this project. Y.C. and C.-L.X. implemented the algorithms. S.-Q.X., C.L.-X., and Y.C. performed the test experiments. K.-N.C., Y.W., and Y.H. coordinated the data release and assisted with executing the pipeline. F.L. provided theoretical analysis of the algorithms. C.-L.X., F.L., Z.X., Y.C., and S.-Q.X. wrote the manuscript. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Chuan-Le Xiao, Feng Luo or Zhi Xie.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Integrated supplementary information

Supplementary Figure 1 The numbers of aligned reads generated by MECAT, BWA and BLASR using five PacBio datasets and three Nanopore datasets.

*denotes the Nanopore dataset

Supplementary Figure 2 Workflow of the MECAT consensus algorithm.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–2, Supplementary Notes 1–11 and Supplementary Tables 1, 2, 3, 5 and 6.

Life Sciences Reporting Summary

Supplementary Table 4

The read coverage of human reference genome alignment by BLASR and MECAT around regions with large structural variants

Supplementary Table 7

Comparison of Read Coverage of Reference Genome Alignment at Large Structural Variants

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xiao, CL., Chen, Y., Xie, SQ. et al. MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads. Nat Methods 14, 1072–1074 (2017). https://doi.org/10.1038/nmeth.4432

Download citation

Received: 20 April 2017
Accepted: 10 August 2017
Published: 18 September 2017
Issue Date: 01 November 2017
DOI: https://doi.org/10.1038/nmeth.4432

This article is cited by

High-quality genome resource of Lasiodiplodia pseudotheobromae associated with die-back on Eucalyptus trees
- LinQin Lu
- GuoQing Li
- FeiFei Liu
BMC Genomic Data (2024)
A high heterozygosity genome assembly of Aedes albopictus enables the discovery of the association of PGANT3 with blood-feeding behavior
- Yuhua Deng
- Shuyi Ren
- Chuanle Xiao
BMC Genomics (2024)
Genome assembly in the telomere-to-telomere era
- Heng Li
- Richard Durbin
Nature Reviews Genetics (2024)
Technology-enabled great leap in deciphering plant genomes
- Lingjuan Xie
- Xiaojiao Gong
- Longjiang Fan
Nature Plants (2024)
De novo diploid genome assembly using long noisy reads
- Fan Nie
- Peng Ni
- Jianxin Wang
Nature Communications (2024)

MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads

Subjects

Abstract

Access options

Similar content being viewed by others

Scalable long read self-correction and assembly polishing with multiple sequence alignment

Illumina reads correction: evaluation and improvements

Efficient hybrid de novo assembly of human genomes with WENGAN

Accession codes

Primary accessions

Sequence Read Archive

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Integrated supplementary information

Supplementary Figure 1 The numbers of aligned reads generated by MECAT, BWA and BLASR using five PacBio datasets and three Nanopore datasets.

Supplementary Figure 2 Workflow of the MECAT consensus algorithm.

Supplementary information

Supplementary Text and Figures

Life Sciences Reporting Summary

Supplementary Table 4

Supplementary Table 7

Rights and permissions

About this article

Cite this article

This article is cited by

High-quality genome resource of Lasiodiplodia pseudotheobromae associated with die-back on Eucalyptus trees

A high heterozygosity genome assembly of Aedes albopictus enables the discovery of the association of PGANT3 with blood-feeding behavior

Genome assembly in the telomere-to-telomere era

Technology-enabled great leap in deciphering plant genomes

De novo diploid genome assembly using long noisy reads

Search

Quick links

Subjects

Abstract

Access options

Similar content being viewed by others

Accession codes

Primary accessions

Sequence Read Archive

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Integrated supplementary information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links