z-logo
open-access-imgOpen Access
RePS: A Sequence Assembler That Masks Exact Repeats Identified from the Shotgun Data
Author(s) -
Jun Wang,
Gane KaShu Wong,
Peixiang Ni,
Yujun Han,
Xiangang Huang,
Jianguo Zhang,
Chen Ye,
Yong Zhang,
Jianfei Hu,
Kunlin Zhang,
Xin Xu,
Lijuan Cong,
Hong Lü,
Xide Ren,
Xiaoyu Ren,
Jun He,
Lin Tao,
Douglas A. Passey,
Jian Wang,
Huanming Yang,
Jun Yu,
Songgang Li
Publication year - 2002
Publication title -
genome research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.556
H-Index - 297
eISSN - 1549-5469
pISSN - 1088-9051
DOI - 10.1101/gr.165102
Subject(s) - biology , contig , shotgun , sequence assembly , sequence (biology) , hybrid genome assembly , shotgun sequencing , construct (python library) , computational biology , clone (java method) , genetics , software , computer science , programming language , genome , gene , gene expression , transcriptome
We describe a sequence assembler, RePS (repeat-masked Phrap with scaffolding), that explicitly identifies exact 20mer repeats from the shotgun data and removes them prior to the assembly. The established software Phrap is used to compute meaningful error probabilities for each base. Clone-end-pairing information is used to construct scaffolds that order and orient the contigs. We show with real data for human and rice that reasonable assemblies are possible even at coverages of only 4× to 6×, despite having up to 42.2% in exact repeats. [The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: P. Green and A.F. Smit.]

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom