Filtering duplicate reads from 454 pyrosequencing data | Zendy

Susanne Balzer | Zendy; Ketil Malde | Zendy; Markus A. Grohme | Zendy; Inge Jonassen | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Filtering duplicate reads from 454 pyrosequencing data

Author(s) -

Susanne Balzer,

Ketil Malde,

Markus A. Grohme,

Inge Jonassen

Publication year - 2013

Publication title -

bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/btt047

Subject(s) - pyrosequencing , metagenomics , sanger sequencing , computer science , dna sequencing , biology , computational biology , sequence assembly , data mining , genetics , gene , gene expression , transcriptome

Throughout the recent years, 454 pyrosequencing has emerged as an efficient alternative to traditional Sanger sequencing and is widely used in both de novo whole-genome sequencing and metagenomics. Especially the latter application is extremely sensitive to sequencing errors and artificially duplicated reads. Both are common in 454 pyrosequencing and can create a strong bias in the estimation of diversity and composition of a sample. To date, there are several tools that aim to remove both sequencing noise and duplicates. Nevertheless, duplicate removal is often based on nucleotide sequences rather than on the underlying flow values, which contain additional information.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research