Skip to main content

CATH / Gene3D v4.2

Licensed according to this deed.

Published on

Abstract

95 million protein domains classified into 6,119 superfamilies. CATH is a classification of protein structures downloaded from the Protein Data Bank. We group protein domains into superfamilies when there is sufficient evidence they have diverged from a common ancestor. Gene3D uses the information in CATH to predict the locations of structural domains on millions of protein sequences available in public databases. This allows us to include additional annotations to the CATH-Gene3D database such as functional information and active site residues.

Citation

CATH: an expanded resource to predict protein function through structure and sequence. Dawson NL, Lewis TE, Das S, Lees JG, Lee D, Ashford P, Orengo CA, Sillitoe I. Nucleic Acids Res. 2017 Jan doi: 10.1093/nar/gkw1098 Gene3D: Extensive prediction of globular domains in proteins. Lewis TE, Sillitoe I, Dawson N, Lam SD, Clarke T, Orengo CA, Lees JG. Nucleic Acids Res. 2018 Jan doi: 10.1093/nar/gkx1069

Email

http://www.cathdb.info/support/contact

Site

Sponsors

Users

eabeysin

Cite this work

Researchers should cite this work as follows:

  • (2024), "CATH / Gene3D v4.2," https://sciencegateways.org/resources/cathgene3dv42.

    BibTex | EndNote