scispace - formally typeset
Open AccessJournal ArticleDOI

The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics

Reads0
Chats0
TLDR
The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates and has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation.
Abstract
The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates. As of September 2008, the database describes the present knowledge on 113 glycoside hydrolase, 91 glycosyltransferase, 19 polysaccharide lyase, 15 carbohydrate esterase and 52 carbohydrate-binding module families. These families are created based on experimentally characterized proteins and are populated by sequences from public databases with significant similarity. Protein biochemical information is continuously curated based on the available literature and structural information. Over 6400 proteins have assigned EC numbers and 700 proteins have a PDB structure. The classification (i) reflects the structural features of these enzymes better than their sole substrate specificity, (ii) helps to reveal the evolutionary relationships between these enzymes and (iii) provides a convenient framework to understand mechanistic properties. This resource has been available for over 10 years to the scientific community, contributing to information dissemination and providing a transversal nomenclature to glycobiologists. More recently, this resource has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation. The CAZy resource resides at URL: http://www.cazy.org/.

read more

Citations
More filters
Journal ArticleDOI

The Pfam protein families database

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Journal ArticleDOI

The carbohydrate-active enzymes database (CAZy) in 2013

TL;DR: The changes that have occurred in CAZy during the past 5 years are outlined and a novel effort to display the resolution and the carbohydrate ligands in crystallographic complexes of CAZymes is presented.
Journal ArticleDOI

The Effect of Diet on the Human Gut Microbiome: A Metagenomic Analysis in Humanized Gnotobiotic Mice

TL;DR: A translational medicine pipeline is described where human gut microbial communities and diets are re-created in gnotobiotic mice and the impact on microbe and host is defined using metagenomics, creating a well-defined, representative animal model of the human gut ecosystem.
Journal ArticleDOI

Genome sequencing and analysis of the model grass Brachypodium distachyon

John P. Vogel, +136 more
- 11 Feb 2010 - 
TL;DR: The high-quality genome sequence will help Brachypodium reach its potential as an important model system for developing new energy and food crops and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat.
References
More filters
Journal ArticleDOI

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI

The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)

Gerald A. Tuskan, +115 more
- 15 Sep 2006 - 
TL;DR: The draft genome of the black cottonwood tree, Populus trichocarpa, has been reported in this paper, with more than 45,000 putative protein-coding genes identified.
Journal ArticleDOI

A classification of glycosyl hydrolases based on amino acid sequence similarities.

TL;DR: With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised.
Journal ArticleDOI

New families in the classification of glycosyl hydrolases based on amino acid sequence similarities.

TL;DR: On the basis of a comparison of 482 sequences corresponding to 52 EC entries, 45 families, out of which 22 are polyspecific, can now be defined and has been implemented in the SWISS-PROT protein sequence data bank.
Related Papers (5)
Trending Questions (3)
How many sequences is clasiffied in cazy database?

The CAZy database classifies over 6400 proteins with assigned EC numbers and 700 proteins with a PDB structure, specializing in enzymes related to complex carbohydrates and glycoconjugates.

Does cazy can help gain an understanding of the enzymatic machinery of many species?

Yes, the CAZy database can help gain an understanding of the enzymatic machinery of many species by providing information on enzymes involved in building and breaking down complex carbohydrates and glycoconjugates.

What emzyems break down carbohydrates?

The paper mentions that the Carbohydrate-Active Enzyme (CAZy) database describes the present knowledge on enzymes that build and breakdown complex carbohydrates and glycoconjugates.