Gene3D: Multi-domain annotations for protein sequence and comparative genome analysis
Author(s) -
Jonathan Lees,
David Lee,
Romain A. Studer,
Natalie L. Dawson,
Ian Sillitoe,
Sayoni Das,
Corin Yeats,
Benoît H. Dessailly,
Robert Rentzsch,
Christine Orengo
Publication year - 2013
Publication title -
nucleic acids research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.008
H-Index - 537
eISSN - 1362-4954
pISSN - 0305-1048
DOI - 10.1093/nar/gkt1205
Subject(s) - ensembl , uniprot , genome , biology , pipeline (software) , computational biology , protein domain , domain (mathematical analysis) , sequence database , refseq , sequence alignment , homology (biology) , protein sequencing , protein structure database , genomics , genetics , computer science , gene , peptide sequence , mathematical analysis , mathematics , programming language
Gene3D (http://gene3d.biochem.ucl.ac.uk) is a database of protein domain structure annotations for protein sequences. Domains are predicted using a library of profile HMMs from 2738 CATH superfamilies. Gene3D assigns domain annotations to Ensembl and UniProt sequence sets including >6000 cellular genomes and >20 million unique protein sequences. This represents an increase of 45% in the number of protein sequences since our last publication. Thanks to improvements in the underlying data and pipeline, we see large increases in the domain coverage of sequences. We have expanded this coverage by integrating Pfam and SUPERFAMILY domain annotations, and we now resolve domain overlaps to provide highly comprehensive composite multi-domain architectures. To make these data more accessible for comparative genome analyses, we have developed novel search algorithms for searching genomes to identify related multi-domain architectures. In addition to providing domain family annotations, we have now developed a pipeline for 3D homology modelling of domains in Gene3D. This has been applied to the human genome and will be rolled out to other major organisms over the next year.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom