Massive Sequence Comparisons as a Help in Annotating Genomic Sequences | Zendy

Alexandra Louis | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Massive Sequence Comparisons as a Help in Annotating Genomic Sequences

Author(s) -

Alexandra Louis

Publication year - 2001

Publication title -

genome research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.556

H-Index - 297

eISSN - 1549-5469

pISSN - 1088-9051

DOI - 10.1101/gr.gr-1776r

Subject(s) - biology , sequence (biology) , genetics , computational biology , intron , dna sequencing , exon , domain (mathematical analysis) , dna , sequence alignment , gene , peptide sequence , mathematical analysis , mathematics

An all-by-all comparison of all the publicly available protein sequences from plants has been performed, followed by a clusterization process. Within each of the 1064 resulting clusters-containing sequences that are orthologous as well as paralogous-the sequences have been submitted to a pyramidal classification and their domains delineated by an automated procedure à la. This process provides a means for easily checking for any apparent inconsistency in a cluster, for example, whether one sequence is shorter or longer than the others, one domain is missing, etc. In such cases, the alignment of the DNA sequence of the gene with that of a close homologous protein often reveals (in 10% of the clusters) probable sequencing errors (leading to frameshifts) or probable wrong intron/exon predictions. The composition of the clusters, their pyramidal classifications, and domain decomposition, as well as our comments when appropriate, are available from http://chlora.infobiogen.fr:1234/PHYTOPROT.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research