Deakin University
Browse

Accuracy, speed and error tolerance of short DNA sequence aligners

Download (437.42 kB)
journal contribution
posted on 2016-05-16, 00:00 authored by Mark ZiemannMark Ziemann
Aligning short DNA sequence reads to the genome is an early step in the processing of many types of genomics data, and impacts on the fidelity of downstream results. In this work, the accuracy, speed and tolerance to errors are evaluated in read of varied length for six commonly used mapping tools; BWA aln, BWA mem, Bowtie2, Soap2, Subread and STAR. The accuracy evaluation using Illumina-like simulated reads showed that accuracy varies by read length, but overall BWA aln was most accurate, followed by BWA mem and Bowtie2. BWA mem was most accurate with Ion Torrent-like read sets. STAR was at least 5 fold faster than Bowtie2 or BWA mem. BWA mem tolerated the highest density of mismatches and indels compared to other mappers. These data provide important accuracy and speed benchmarks for commonly used mapping software.

History

Journal

bioRxiv

Location

Cold Spring Harbor, N.Y,

Open access

  • Yes

Language

eng

Publication classification

X Not reportable, X Not reportable

Copyright notice

2016, The Author

Publisher

Cold Spring Harbor Laboratory Press

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC