The Universal Protein Resource (UniProt)
Amos Marc Bairoch,Rolf Apweiler,Cathy H. Wu,Winona C. Barker,Brigitte Boeckmann,Serenella Ferro,Elisabeth Gasteiger,Hongzhan Huang,Rodrigo Lopez,Michele Magrane,Maria Jesus Martin,Darren A. Natale,Claire O'Donovan,Nicole Redaschi,Lai-Su L. Yeh +14 more
Reads0
Chats0
TLDR
During 2004, tens of thousands of Knowledgebase records got manually annotated or updated; the UniProt keyword list got augmented by additional keywords; the documentation of the keywords and are continuously overhauling and standardizing the annotation of post-translational modifications.Abstract:
The Universal Protein Resource (UniProt) provides the scientific community with a single, centralized, authoritative resource for protein sequences and functional information. Formed by uniting the Swiss-Prot, TrEMBL and PIR protein database activities, the UniProt consortium produces three layers of protein sequence databases: the UniProt Archive (UniParc), the UniProt Knowledgebase (UniProt) and the UniProt Reference (UniRef) databases. The UniProt Knowledgebase is a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase with extensive cross-references. This centrepiece consists of two sections: UniProt/Swiss-Prot, with fully, manually curated entries; and UniProt/TrEMBL, enriched with automated classification and annotation. During 2004, tens of thousands of Knowledgebase records got manually annotated or updated; we introduced a new comment line topic: TOXIC DOSE to store information on the acute toxicity of a toxin; the UniProt keyword list got augmented by additional keywords; we improved the documentation of the keywords and are continuously overhauling and standardizing the annotation of post-translational modifications. Furthermore, we introduced a new documentation file of the strains and their synonyms. Many new database cross-references were introduced and we started to make use of Digital Object Identifiers. We also achieved in collaboration with the Macromolecular Structure Database group at EBI an improved integration with structural databases by residue level mapping of sequences from the Protein Data Bank entries onto corresponding UniProt entries. For convenient sequence searches we provide the UniRef non-redundant sequence databases. The comprehensive UniParc database stores the complete body of publicly available protein sequence data. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). New releases are published every two weeks.read more
Citations
More filters
Journal ArticleDOI
Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists
TL;DR: The survey will help tool designers/developers and experienced end users understand the underlying algorithms and pertinent details of particular tool categories/tools, enabling them to make the best choices for their particular research interests.
Journal ArticleDOI
The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling
TL;DR: The SWISS-MODEL workspace is a web-based integrated service dedicated to protein structure homology modelling that assists and guides the user in building protein homology models at different levels of complexity.
Journal ArticleDOI
I-TASSER: a unified platform for automated protein structure and function prediction
TL;DR: The iterative threading assembly refinement (I-TASSER) server is an integrated platform for automated protein structure and function prediction based on the sequence- to-structure-to-function paradigm.
Journal ArticleDOI
Biopython: freely available Python tools for computational molecular biology and bioinformatics
Peter J. A. Cock,Tiago Antao,Jeffrey T. Chang,Brad Chapman,Cymon J. Cox,Andrew Dalke,Iddo Friedberg,Thomas Hamelryck,Frank Kauff,Bartosz Wilczyński,Michiel J. L. de Hoon +10 more
TL;DR: Biopython includes modules for reading and writing different sequence file formats and multiple sequence alignments, dealing with 3D macro molecular structures, interacting with common tools such as BLAST, ClustalW and EMBOSS, accessing key online databases, as well as providing numerical methods for statistical learning.
Journal ArticleDOI
Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions
Chunaram Choudhary,Chanchal Kumar,Florian Gnad,Michael L. Nielsen,Michael Rehman,Tobias C. Walther,Jesper V. Olsen,Matthias Mann +7 more
TL;DR: A proteomic-scale analysis of protein acetylation suggests that it is an important biological regulatory mechanism and the regulatory scope of lysine acetylations is broad and comparable with that of other major posttranslational modifications.
References
More filters
Journal ArticleDOI
The Pfam protein families database
Marco Punta,Penny Coggill,Ruth Y. Eberhardt,Jaina Mistry,John Tate,Chris Boursnell,Ningze Pang,Kristoffer Forslund,Goran Ceric,Jody Clements,Andreas Heger,Liisa Holm,Erik L. L. Sonnhammer,Sean R. Eddy,Alex Bateman,Robert D. Finn +15 more
TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Journal ArticleDOI
UniProt: the Universal Protein knowledgebase
Rolf Apweiler,Amos Marc Bairoch,Cathy H. Wu,Winona C. Barker,Brigitte Boeckmann,Serenella Ferro,Elisabeth Gasteiger,Hongzhan Huang,Rodrigo Lopez,Michele Magrane,Maria Jesus Martin,Darren A. Natale,Claire O'Donovan,Nicole Redaschi,Lai-Su L. Yeh +14 more
TL;DR: The Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt), which is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces.
Journal ArticleDOI
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
Brigitte Boeckmann,Amos Marc Bairoch,Rolf Apweiler,Marie-Claude Blatter,Anne Estreicher,Elisabeth Gasteiger,Maria Jesus Martin,Karine Michoud,Claire O'Donovan,Isabelle Phan,Sandrine Pilbout,Michel Schneider +11 more
TL;DR: The SWISS-PROT protein knowledgebase connects amino acid sequences with the current knowledge in the Life Sciences by providing an interdisciplinary overview of relevant information by bringing together experimental results, computed features and sometimes even contradictory conclusions.
Journal ArticleDOI
The Ensembl genome database project
Tim Hubbard,Daniel Barker,Ewan Birney,Graham Cameron,Yuan Chen,Louise Clark,Tony Cox,James Cuff,Val Curwen,Thomas A. Down,Richard Durbin,Eduardo Eyras,James G. R. Gilbert,Martin Hammond,Lukasz Huminiecki,Arek Kasprzyk,Heikki Lehväslaiho,Philip Lijnzaad,Craig Melsopp,Emmanuel Mongin,Roger Pettett,Matthew Pocock,Simon C. Potter,Alistair G. Rust,Esther Schmidt,Stephen M. J. Searle,Guy Slater,James Smith,William Spooner,Arne Stabenau,Jim Stalker,Elia Stupka,Abel Ureta-Vidal,Imre Vastrik,Michele Clamp +34 more
TL;DR: The Ensembl database project provides a bioinformatics framework to organise biology around the sequences of large genomes and is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources.
Journal ArticleDOI
Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure.
TL;DR: A new procedure is described for detecting and correcting those errors that arise at the model-building stage of the procedure and a good procedure for creating HMMs for sequences of proteins of known structure are determined.
Related Papers (5)
Gene Ontology: tool for the unification of biology
M Ashburner,Catherine A. Ball,Judith A. Blake,David Botstein,Heather Butler,J. M. Cherry,Allan Peter Davis,Kara Dolinski,Selina S. Dwight,J.T. Eppig,Midori A. Harris,David P. Hill,Laurie Issel-Tarver,Andrew Kasarskis,Suzanna E. Lewis,John C. Matese,Joel E. Richardson,M. Ringwald,Gerald M. Rubin,Gavin Sherlock +19 more