The SUPERFAMILY database in 2004: additions and improvements | Zendy

Martin Madera | Zendy; Christine Vogel | Zendy; Sarah Kummerfeld | Zendy; Cyrus Chothia | Zendy; Julian Gough | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

The SUPERFAMILY database in 2004: additions and improvements

Author(s) -

Martin Madera,

Christine Vogel,

Sarah Kummerfeld,

Cyrus Chothia,

Julian Gough

Publication year - 2003

Publication title -

nucleic acids research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.008

H-Index - 537

eISSN - 1362-4954

pISSN - 0305-1048

DOI - 10.1093/nar/gkh117

Subject(s) - uniprot , biology , genome , identifier , sequence database , structural classification of proteins database , computational biology , sequence alignment , protein data bank (rcsb pdb) , ensembl , protein sequencing , genomics , genetics , computer science , protein structure , gene , peptide sequence , biochemistry , programming language

The SUPERFAMILY database provides structural assignments to protein sequences and a framework for analysis of the results. At the core of the database is a library of profile Hidden Markov Models that represent all proteins of known structure. The library is based on the SCOP classification of proteins: each model corresponds to a SCOP domain and aims to represent an entire superfamily. We have applied the library to predicted proteins from all completely sequenced genomes (currently 154), the Swiss-Prot and TrEMBL databases and other sequence collections. Close to 60% of all proteins have at least one match, and one half of all residues are covered by assignments. All models and full results are available for download and online browsing at http://supfam.org. Users can study the distribution of their superfamily of interest across all completely sequenced genomes, investigate with which other superfamilies it combines and retrieve proteins in which it occurs. Alternatively, concentrating on a particular genome as a whole, it is possible first, to find out its superfamily composition, and secondly, to compare it with that of other genomes to detect superfamilies that are over- or under-represented. In addition, the webserver provides the following standard services: sequence search; keyword search for genomes, superfamilies and sequence identifiers; and multiple alignment of genomic, PDB and custom sequences.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research