z-logo
open-access-imgOpen Access
Complete Sequence and Gene Organization of the Genome of a Hyper-thermophilic Archaebacterium, Pyrococcus horikoshii OT3
Author(s) -
Yutaka Kawarabayasi,
Mituhiro Sawada,
Hiroshi Horikawa,
Yuji Haikawa,
Yumi Hino,
Saori Yamamoto,
Mitsuo Sekine,
Sin-ichi Baba,
Hiroki Kosugi,
Akira Hosoyama,
Yoshimi Nagai,
Mari Sakai,
Kyozo Ogura,
Rie Otsuka,
Hiromi Nakazawa,
Minako Takamiya,
Yuhko Ohfuku,
Tomomichi Funahashi,
Toshihiro Tanaka,
Yutaka Kudoh,
Jun Yamazaki,
Norihiro Kushida,
Akio Oguchi,
Kenichi Aoki,
Takio Yoshizawa,
Yoshinobu Nakamura,
Frank T. Robb,
Koki Horikoshi,
Yaeko Masuchi,
Hiroaki Shizuya,
Hisasi Kikuchi
Publication year - 1998
Publication title -
dna research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.647
H-Index - 98
eISSN - 1756-1663
pISSN - 1340-2838
DOI - 10.1093/dnares/5.2.55
Subject(s) - orfs , biology , genetics , pyrococcus horikoshii , genome , gene , contig , fosmid , open reading frame , peptide sequence , biochemistry , enzyme
The complete sequence of the genome of a hyper-thermophilic archaebacterium, Pyrococcus horikoshii OT3, has been determined by assembling the sequences of the physical map-based contigs of fosmid clones and of long polymerase chain reaction (PCR) products which were used for gap-filling. The entire length of the genome was 1,738,505 bp. The authenticity of the entire genome sequence was supported by restriction analysis of long PCR products, which were directly amplified from the genomic DNA. As the potential protein-coding regions, a total of 2061 open reading frames (ORFs) were assigned, and by similarity search against public databases, 406 (19.7%) were related to genes with putative function and 453 (22.0%) to the sequences registered but with unknown function. The remaining 1202 ORFs (58.3%) did not show any significant similarity to the sequences in the databases. Sequence comparison among the assigned ORFs in the genome provided evidence that a considerable number of ORFs were generated by sequence duplication. By similarity search, 11 ORFs were assumed to contain the intein elements. The RNA genes identified were a single 16S-23S rRNA operon, two 5S rRNA genes and 46 tRNA genes including two with the intron structure. All the assigned ORFs and RNA coding regions occupied 91.25% of the whole genome. The data presented in this paper are available on the internet at http:@www.nite.go.jp.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here