gencore: an efficient tool to generate consensus reads for error suppressing and duplicate removing of NGS data
2018
Summary: This paper presents gencore, an efficient tool for eliminating errors and duplicates in next-generation sequencing (NGS) data. The tool clusters mapped sequencing reads and merges each cluster into a single consensus read. If the data contains unique molecular identifiers (UMIs), gencore uses them to identify reads derived from the same original DNA fragment. Compared with the conventional tool Picard, gencore greatly reduces mapping mismatches in the output data, which are mostly caused by errors. This error suppression makes gencore well suited to detecting ultra-low-frequency mutations in deep sequencing data. gencore is also about 3X faster than Picard and uses much less memory.
Availability and Implementation: gencore is an open-source tool written in C++. It is hosted on GitHub: https://github.com/OpenGene/gencore
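The clustering and consensus idea described in the summary can be illustrated with a minimal Python sketch. This is not gencore's implementation (which is written in C++ and whose details are not given here). The `Read` record, the grouping key of chromosome, start, end, and UMI, the per-column majority vote, and the use of `N` for ties are all assumptions made for illustration, and the sketch assumes every read in a cluster has the same length.

```python
from collections import Counter, defaultdict
from typing import NamedTuple


class Read(NamedTuple):
    """Hypothetical, simplified mapped read (not gencore's data model)."""
    chrom: str
    start: int
    end: int
    umi: str
    seq: str


def cluster_reads(reads):
    """Group reads that share mapping coordinates and UMI (assumed key)."""
    clusters = defaultdict(list)
    for read in reads:
        clusters[(read.chrom, read.start, read.end, read.umi)].append(read)
    return clusters


def consensus(seqs):
    """Majority vote per column; a tie for the top base yields 'N'."""
    bases = []
    for column in zip(*seqs):  # assumes equal-length sequences
        top = Counter(column).most_common(2)
        if len(top) > 1 and top[0][1] == top[1][1]:
            bases.append("N")
        else:
            bases.append(top[0][0])
    return "".join(bases)


def consensus_reads(reads):
    """Return one consensus sequence per cluster, keyed by the cluster key."""
    return {
        key: consensus([read.seq for read in group])
        for key, group in cluster_reads(reads).items()
    }
```

In this sketch, three reads sharing the same coordinates and UMI, one of which carries a single mismatching base, collapse into one consensus read in which the mismatch is outvoted. A read with a different UMI at the same coordinates stays in its own cluster.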