scispace - formally typeset
Open AccessJournal ArticleDOI

The protein common interface database (ProtCID)—a comprehensive database of interactions of homologous proteins in multiple crystal forms

Qifang Xu, +1 more
- 01 Jan 2011 - 
- Vol. 39, pp 761-770
Reads0
Chats0
TLDR
By examining interfaces that are shared by many homologous proteins in different CFs, it is found that the PDB and the Protein Interfaces, Surfaces, and Assemblies (PISA) are not always consistent in their annotations of biological assemblies in a homologueous family.
Abstract
The protein common interface database (ProtCID) is a database that contains clusters of similar homodimeric and heterodimeric interfaces observed in multiple crystal forms (CFs). Such interfaces, especially of homologous but non-identical proteins, have been associated with biologically relevant interactions. In ProtCID, protein chains in the protein data bank (PDB) are grouped based on their PFAM domain architectures. For a single PFAM architecture, all the dimers present in each CF are constructed and compared with those in other CFs that contain the same domain architecture. Interfaces occurring in two or more CFs comprise an interface cluster in the database. The same process is used to compare heterodimers of chains with different domain architectures. By examining interfaces that are shared by many homologous proteins in different CFs, we find that the PDB and the Protein Interfaces, Surfaces, and Assemblies (PISA) are not always consistent in their annotations of biological assemblies in a homologous family. Our data therefore provide an independent check on publicly available annotations of the structures of biological interactions for PDB entries. Common interfaces may also be useful in studies of protein evolution. Coordinates for all interfaces in a cluster are downloadable for further analysis. ProtCiD is available at http://dunbrack2.fccc.edu/protcid.

read more

Citations
More filters
Journal ArticleDOI

J. Appl. Cryst.の発刊に際して

良二 上田
Journal ArticleDOI

Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology

TL;DR: A description of protein-protein interface conservation as a function of evolutionary distance to reduce the noise in deep multiple sequence alignments and a distance measure to structurally compare homologous multimeric protein complexes are defined.
Journal ArticleDOI

Predicting Protein-Protein Interactions from the Molecular to the Proteome Level

TL;DR: This review aims to provide a background on PPIs and their types and list the state-of-the-art methods, servers, databases, and tools for protein-protein interaction prediction.
Journal ArticleDOI

iPfam: a database of protein family and domain interactions found in the Protein Data Bank

TL;DR: The iPfam domain-domain interaction data was integrated within the Pfam database and website, but it has now been migrated to a separate database, allowing for independent development, improving data access and giving clearer separation between the protein family and interactions datasets.
Journal ArticleDOI

Protein interface classification by evolutionary analysis

TL;DR: A protein-protein interface classifier that relies on evolutionary data to detect the biological character of interfaces and it is demonstrated that a more conservative selection of homolog sequences - with relatively high sequence identities to the query - is able to produce a clearer signal than previous attempts.
References
More filters
Journal ArticleDOI

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI

The Protein Data Bank

TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.
Journal ArticleDOI

The Pfam protein families database

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Journal ArticleDOI

Database resources of the National Center for Biotechnology Information

TL;DR: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
Journal ArticleDOI

Inference of macromolecular assemblies from crystalline state.

TL;DR: A new method, based on chemical thermodynamics, is developed for automatic detection of macromolecular assemblies in the Protein Data Bank (PDB) entries that are the results of X-ray diffraction experiments, as found, biological units may be recovered at 80-90% success rate, which makesX-ray crystallography an important source of experimental data on macromolescular complexes and protein-protein interactions.
Related Papers (5)
Trending Questions (1)
What database can search dimer protein?

The database that can search dimer proteins is the Protein Common Interface Database (ProtCID).