Deakin University
Browse

File(s) under permanent embargo

TRA: Tandem repeat assembler for next generation sequences

conference contribution
posted on 2017-01-30, 00:00 authored by Yongqing Jiang, J Lu, Jingyu HouJingyu Hou, Wanlei Zhou
© 2017 ACM. Eukaryotic genomes contain high volumes of intronic and intergenic regions in which repetitive sequences are abundant. These repetitive sequences represent challenges in genomic assignment of short read sequences generated through next generation sequencing and are often excluded in analysis losing invaluable genomic information. Here we present a method, known as TRA (Tandem Repeat Assembler), for the assembly of repetitive sequences by constructing contigs directly from paired-end reads. Using an experimentally acquired data set for human chromosome 14, tandem repeats >200 bp were assembled. Alignment of the contigs to the human genome reference (GRCh38) revealed that 84.3% of tandem repetitive regions were correctly covered. For tandem repeats, this method outperformed state-of-the-art assemblers by generating correct N50 of contigs up to 512 bp.

History

Event

Australasian Computer Science Week. Multiconference (2017 : Geelng, Victoria)

Pagination

1 - 6

Publisher

ACM

Location

Geelong, Victoria

Place of publication

New York, N.Y.

Start date

2017-01-30

End date

2017-02-03

ISBN-13

9781450347686

Language

eng

Publication classification

E Conference publication; E1 Full written paper - refereed

Copyright notice

2017, ACM

Title of proceedings

ACSW '17 Proceedings of the Australasian Computer Science Week Multiconference