PlumX Metrics

NGmerge: Merging paired-end reads via novel empirically-derived models of sequencing errors

BMC Bioinformatics, ISSN: 1471-2105, Vol: 19, Issue: 1, Page: 536 (2018)
  • Citations: 134
  • Usage: 0
  • Captures: 170
  • Mentions: 1
  • Social Media: 0

Article Description

Background: Advances in Illumina DNA sequencing technology have produced longer paired-end reads that increasingly have sequence overlaps. These reads can be merged into a single read that spans the full length of the original DNA fragment, allowing for error correction and accurate determination of read coverage. Extant merging programs utilize simplistic or unverified models for the selection of bases and quality scores for the overlapping region of merged reads.

Results: We first examined the baseline quality score–error rate relationship using sequence reads derived from PhiX. In contrast to numerous published reports, we found that the quality scores produced by Illumina were not substantially inflated above the theoretical values, once the reference genome was corrected for unreported sequence variants. The PhiX reads were then used to create empirical models of sequencing errors in overlapping regions of paired-end reads, and these models were incorporated into a novel merging program, NGmerge. We demonstrate that NGmerge corrects errors and ambiguous bases better than other merging programs, and that it assigns quality scores for merged bases that accurately reflect the error rates. Our results also show that, contrary to published analyses, the sequencing errors of paired-end reads are not independent.

Conclusions: We provide a free and open-source program, NGmerge, that performs better than existing read merging programs. NGmerge is available on GitHub (https://github.com/harvardinformatics/NGmerge) under the MIT License; it is written in C and supported on Linux.
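
The comparison between reported quality scores and "theoretical values" rests on the standard Phred definition, Q = -10 log10(p), where p is the probability that a base call is wrong. The following is a minimal sketch of that relationship, not code from NGmerge: the function names are hypothetical, and the counts in the usage note are made up for illustration only.

```python
import math


def phred_to_error_prob(q: float) -> float:
    """Theoretical error probability implied by a Phred quality score."""
    return 10 ** (-q / 10)


def empirical_quality(mismatches: int, total: int) -> float:
    """Phred-scaled quality implied by an observed mismatch rate.

    `mismatches` and `total` are hypothetical counts of bases that
    disagree with a trusted reference, and of all bases examined,
    among base calls that share one reported quality score.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if mismatches == 0:
        # No observed errors: the empirical quality is unbounded.
        return float("inf")
    return -10 * math.log10(mismatches / total)
```

For example, a reported score of 30 implies a theoretical error probability of about 0.001. If bases reported at that score showed 1 mismatch per 1,000 against the reference, `empirical_quality(1, 1000)` would come out at about 30, meaning the reported score is not inflated. Undetected variants in the reference would be counted as mismatches and push the empirical value below the reported one, which is why the abstract stresses correcting the reference genome first.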
