CATH: expanding the horizons of structure-based functional annotations for genome sequences
Author(s) -
Ian Sillitoe,
Natalie L. Dawson,
Tony E. Lewis,
Sayoni Das,
Jonathan Lees,
Paul Ashford,
Tolulope Adeyelu,
Harry Scholes,
Ilya S. Senatorov,
Andra Bujan,
Fatima Ceballos Rodriguez-Conde,
Benjamin Dowling,
Janet M. Thornton,
Christine Orengo
Publication year - 2018
Publication title -
nucleic acids research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.008
H-Index - 537
eISSN - 1362-4954
pISSN - 0305-1048
DOI - 10.1093/nar/gky1097
Subject(s) - biology , snapshot (computer storage) , domain (mathematical analysis) , context (archaeology) , sequence (biology) , computational biology , cluster analysis , structural classification of proteins database , computer science , data mining , bioinformatics , database , protein structure , genetics , artificial intelligence , mathematical analysis , paleontology , biochemistry , mathematics
This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) provides a huge update in the coverage of structural data. This release increases the number of fully- classified domains by over 40% (from 308 999 to 434 857 structural domains), corresponding to an almost two- fold increase in sequence data (from 53 million to over 95 million predicted domains) organised into 6119 superfamilies. The coverage of high-resolution, protein PDB chains that contain at least one assigned CATH domain is now 90.2% (increased from 82.3% in the previous release). A number of highly requested features have also been implemented in our web pages: allowing the user to view an alignment between their query sequence and a representative FunFam structure and providing tools that make it easier to view the full structural context (multi-domain architecture) of domains and chains.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom