CATH / Gene3D v4.2
Licensed according to this deed.
Category
Published on
Abstract
95 million protein domains classified into 6,119 superfamilies. CATH is a classification of protein structures downloaded from the Protein Data Bank. We group protein domains into superfamilies when there is sufficient evidence they have diverged from a common ancestor. Gene3D uses the information in CATH to predict the locations of structural domains on millions of protein sequences available in public databases. This allows us to include additional annotations to the CATH-Gene3D database such as functional information and active site residues.