ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins.

doi:10.1093/NAR/GKM290

Home
/
Papers
/
ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins.

Journal Article•DOI•

ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins.

Markus Wiederstein¹, Manfred J. Sippl¹•Institutions (1)

University of Salzburg¹

01 Jul 2007-Nucleic Acids Research (Oxford University Press)-Vol. 35, pp 407-410

TL;DR: The quality scores of a protein are displayed in the context of all known protein structures and problematic parts of a structure are shown and highlighted in a 3D molecule viewer in the ProSA-web service.

read less

Abstract: A major problem in structural biology is the recognition of errors in experimental and theoretical models of protein structures. The ProSA program (Protein Structure Analysis) is an established tool which has a large user base and is frequently employed in the refinement and validation of experimental protein structures and in structure prediction and modeling. The analysis of protein structures is generally a difficult and cumbersome exercise. The new service presented here is a straightforward and easy to use extension of the classic ProSA program which exploits the advantages of interactive web-based applications for the display of scores and energy plots that highlight potential problems spotted in protein structures. In particular, the quality scores of a protein are displayed in the context of all known protein structures and problematic parts of a structure are shown and highlighted in a 3D molecule viewer. The service specifically addresses the needs encountered in the validation of protein structures obtained from X-ray analysis, NMR spectroscopy and theoretical calculations. ProSA-web is accessible at https://prosa.services.came.sbg.ac.at.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Comparative Protein Structure Modeling Using MODELLER

[...]

Narayanan Eswar¹, Ben Webb¹, Marc A. Marti-Renom, Mallur S. Madhusudhan¹, David Eramian¹, Min-Yi Shen¹, Ursula Pieper¹, Andrej Sali¹ - Show less +4 more•Institutions (1)

University of California, San Francisco¹

01 Nov 2007-Current protocols in protein science

TL;DR: This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications.

...read moreread less

Abstract: Functional characterization of a protein sequence is a common goal in biology, and is usually facilitated by having an accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described.

...read moreread less

3,495 citations

Journal Article•DOI•

Comparative protein structure modeling using Modeller.

[...]

Narayanan Eswar¹, Ben Webb¹, Marc A. Marti-Renom¹, Mallur S. Madhusudhan¹, David Eramian¹, Min-Yi Shen¹, Ursula Pieper¹, Andrej Sali¹ - Show less +4 more•Institutions (1)

University of California, San Francisco¹

01 Sep 2006-Current protocols in human genetics

TL;DR: This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications.

...read moreread less

Abstract: Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described.

...read moreread less

3,006 citations

Journal Article•DOI•

Toward the estimation of the absolute quality of individual protein structure models

[...]

Pascal Benkert¹, Marco Biasini², Torsten Schwede²•Institutions (2)

University of Basel¹, Swiss Institute of Bioinformatics²

01 Feb 2011-Bioinformatics

TL;DR: The ability of the newly introduced QMEAN Z-score to detect experimentally solved protein structures containing significant errors, as well as to evaluate theoretical protein models is demonstrated.

...read moreread less

Abstract: Motivation: Quality assessment of protein structures is an important part of experimental structure validation and plays a crucial role in protein structure prediction, where the predicted models may contain substantial errors. Most current scoring functions are primarily designed to rank alternative models of the same sequence supporting model selection, whereas the prediction of the absolute quality of an individual protein model has received little attention in the field. However, reliable absolute quality estimates are crucial to assess the suitability of a model for specific biomedical applications. Results: In this work, we present a new absolute measure for the quality of protein models, which provides an estimate of the ‘degree of nativeness’ of the structural features observed in a model and describes the likelihood that a given model is of comparable quality to experimental structures. Model quality estimates based on the QMEAN scoring function were normalized with respect to the number of interactions. The resulting scoring function is independent of the size of the protein and may therefore be used to assess both monomers and entire oligomeric assemblies. Model quality scores for individual models are then expressed as ‘Z-scores’ in comparison to scores obtained for high-resolution crystal structures. We demonstrate the ability of the newly introduced QMEAN Z-score to detect experimentally solved protein structures containing significant errors, as well as to evaluate theoretical protein models. In a comprehensive QMEAN Z-score analysis of all experimental structures in the PDB, membrane proteins accumulate on one side of the score spectrum and thermostable proteins on the other. Proteins from the thermophilic organism Thermatoga maritima received significantly higher QMEAN Z-scores in a pairwise comparison with their homologous mesophilic counterparts, underlining the significance of the QMEAN Z-score as an estimate of protein stability. Availability: The Z-score calculation has been integrated in the QMEAN server available at: http://swissmodel.expasy.org/qmean. Contact: torsten.schwede@unibas.ch Supplementary information:Supplementary data are available at Bioinformatics online.

...read moreread less

1,844 citations

Cites background from "ProSA-web: interactive web service ..."

...The performance of QMEAN with respect to other stateof-the-art methods such as ProSA (Sippl, 1993) and DFIRE (Zhou and Zhou, 2002) has also been recently assessed in an independent study (Rykunov and Fiser, 2010)....
[...]
...In contrast to QMEAN, the ProSA Z-score shows a clear correlation with protein size which limits its application as an absolute quality measure....
[...]
...The prediction of absolute model quality has rarely been addressed in the literature: the pioneering tool ProSA (Sippl, 1993) has primarily been developed to evaluate experimental structures and estimates the statistical significance of a structure by comparing its knowledge-based score to random structures with the same sequence....
[...]
...The ProSA (Wiederstein and Sippl, 2007) analysis of the two structure can be found in Supplementary Figure S6....
[...]
...The ProSA Z-score can hardly be used as a measure of absolute model quality as it is highly dependent on the protein size (i.e. the energy gap between the native fold and random decoy structures increases with protein size)....
[...]

Journal Article•DOI•

SHIFTX2: significantly improved protein chemical shift prediction

[...]

Beomsoo Han¹, Yifeng Liu¹, Simon W. Ginzinger², David S. Wishart¹, David S. Wishart³ - Show less +1 more•Institutions (3)

University of Alberta¹, University of Salzburg², National Institute for Nanotechnology³

30 Mar 2011-Journal of Biomolecular NMR

TL;DR: A new computer program, called SHIFTX2, is described which is capable of rapidly and accurately calculating diamagnetic 1H, 13C and 15N chemical shifts from protein coordinate data and will open the door to many long-anticipated applications of chemical shift prediction to protein structure determination, refinement and validation.

...read moreread less

Abstract: A new computer program, called SHIFTX2, is described which is capable of rapidly and accurately calculating diamagnetic 1H, 13C and 15N chemical shifts from protein coordinate data. Compared to its predecessor (SHIFTX) and to other existing protein chemical shift prediction programs, SHIFTX2 is substantially more accurate (up to 26% better by correlation coefficient with an RMS error that is up to 3.3× smaller) than the next best performing program. It also provides significantly more coverage (up to 10% more), is significantly faster (up to 8.5×) and capable of calculating a wider variety of backbone and side chain chemical shifts (up to 6×) than many other shift predictors. In particular, SHIFTX2 is able to attain correlation coefficients between experimentally observed and predicted backbone chemical shifts of 0.9800 (15N), 0.9959 (13Cα), 0.9992 (13Cβ), 0.9676 (13C′), 0.9714 (1HN), 0.9744 (1Hα) and RMS errors of 1.1169, 0.4412, 0.5163, 0.5330, 0.1711, and 0.1231 ppm, respectively. The correlation between SHIFTX2’s predicted and observed side chain chemical shifts is 0.9787 (13C) and 0.9482 (1H) with RMS errors of 0.9754 and 0.1723 ppm, respectively. SHIFTX2 is able to achieve such a high level of accuracy by using a large, high quality database of training proteins (>190), by utilizing advanced machine learning techniques, by incorporating many more features (χ2 and χ3 angles, solvent accessibility, H-bond geometry, pH, temperature), and by combining sequence-based with structure-based chemical shift prediction techniques. With this substantial improvement in accuracy we believe that SHIFTX2 will open the door to many long-anticipated applications of chemical shift prediction to protein structure determination, refinement and validation. SHIFTX2 is available both as a standalone program and as a web server (http://www.shiftx2.ca).

...read moreread less

578 citations

Cites methods from "ProSA-web: interactive web service ..."

...This collection of *250 high resolution X-ray structures was then analyzed for structural defects using a number of structure validation programs including VADAR (Willard et al. 2003), PROSA (Wiederstein and Sippl 2007), and WHAT_CHECK (Hooft et al. 1996)....
[...]
...2003), PROSA (Wiederstein and Sippl 2007), and WHAT_CHECK (Hooft et al....
[...]

Journal Article•DOI•

Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses

[...]

Simon Roux¹, Jennifer R. Brum¹, Bas E. Dutilh², Bas E. Dutilh³, Bas E. Dutilh⁴, Shinichi Sunagawa⁵, Melissa B. Duhaime⁶, Alexander Loy, Bonnie T. Poulos⁷, Natalie Solonenko¹, Elena Lara⁸, Elena Lara⁹, Julie Poulain¹⁰, Stephane Pesant, Stefanie Kandels-Lewis, Céline Dimier¹¹, Céline Dimier¹², Céline Dimier¹³, Marc Picheral¹³, Marc Picheral¹², Sarah Searson¹², Sarah Searson¹³, Corinne Cruaud¹⁴, Adriana Alberti¹⁴, Carlos M. Duarte⁹, Carlos M. Duarte¹⁵, Josep M. Gasol⁹, Dolors Vaqué⁹, Peer Bork¹⁶, Silvia G. Acinas⁹, Patrick Wincker¹⁴, Patrick Wincker¹³, Mathew B. Sullivan¹ - Show less +29 more•Institutions (16)

Ohio State University¹, Radboud University Nijmegen², Federal University of Rio de Janeiro³, Utrecht University⁴, ETH Zurich⁵, University of Michigan⁶, University of Arizona⁷, National Research Council⁸, Spanish National Research Council⁹, University of Bremen¹⁰, École Normale Supérieure¹¹, University of Paris¹², Centre national de la recherche scientifique¹³, Université Paris-Saclay¹⁴, King Abdullah University of Science and Technology¹⁵, Max Delbrück Center for Molecular Medicine¹⁶

29 Sep 2016-Nature

TL;DR: A global map of abundant, double-stranded DNA viruses complete with genomic and ecological contexts is presented to present a necessary foundation for the meaningful integration of viruses into ecosystem models where they act as key players in nutrient cycling and trophic networks.

...read moreread less

Abstract: Ocean microbes drive biogeochemical cycling on a global scale. However, this cycling is constrained by viruses that affect community composition, metabolic activity, and evolutionary trajectories. Owing to challenges with the sampling and cultivation of viruses, genome-level viral diversity remains poorly described and grossly understudied, with less than 1% of observed surface-ocean viruses known. Here we assemble complete genomes and large genomic fragments from both surface- and deep-ocean viruses sampled during the Tara Oceans and Malaspina research expeditions, and analyse the resulting 'global ocean virome' dataset to present a global map of abundant, double-stranded DNA viruses complete with genomic and ecological contexts. A total of 15,222 epipelagic and mesopelagic viral populations were identified, comprising 867 viral clusters (defined as approximately genus-level groups). This roughly triples the number of known ocean viral populations and doubles the number of candidate bacterial and archaeal virus genera, providing a near-complete sampling of epipelagic communities at both the population and viral-cluster level. We found that 38 of the 867 viral clusters were locally or globally abundant, together accounting for nearly half of the viral populations in any global ocean virome sample. While two-thirds of these clusters represent newly described viruses lacking any cultivated representative, most could be computationally linked to dominant, ecologically relevant microbial hosts. Moreover, we identified 243 viral-encoded auxiliary metabolic genes, of which only 95 were previously known. Deeper analyses of four of these auxiliary metabolic genes (dsrC, soxYZ, P-II (also known as glnB) and amoC) revealed that abundant viruses may directly manipulate sulfur and nitrogen cycling throughout the epipelagic ocean. This viral catalog and functional analyses provide a necessary foundation for the meaningful integration of viruses into ecosystem models where they act as key players in nutrient cycling and trophic networks.

...read moreread less

557 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

The Protein Data Bank

[...]

Helen M. Berman¹, John D. Westbrook, Zukang Feng, Gary L. Gilliland, Talapady N. Bhat, Helge Weissig, Ilya N. Shindyalov, Philip E. Bourne - Show less +4 more•Institutions (1)

Rutgers University¹

01 Jan 2000-Nucleic Acids Research

TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.

...read moreread less

Abstract: The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

...read moreread less

34,239 citations

Journal Article•DOI•

Recognition of errors in three‐dimensional structures of proteins

[...]

Manfred J. Sippl¹•Institutions (1)

University of Salzburg¹

01 Dec 1993-Proteins

TL;DR: Techniques based on knowledge based mean fields which can be used to judge the quality of protein folds are presented, used to identify misfolded structures as well as faulty parts of structural models.

...read moreread less

Abstract: A major problem in the determination of the three-dimensional structure of proteins concerns the quality of the structural models obtained from the interpretation of experimental data. New developments in X-ray crystallography and nuclear magnetic resonance spectroscopy have accelerated the process of structure determination and the biological community is confronted with a steadily increasing number of experimentally determined protein folds. However, in the recent past several experimentally determined protein structures have been proven to contain major errors, indicating that in some cases the interpretation of experimental data is difficult and may yield incorrect models. Such problems can be avoided when computational methods are employed which complement experimental structure determinations. A prerequisite of such computational tools is that they are independent of the parameters obtained from a particular experiment. In addition such techniques are able to support and accelerate experimental structure determinations. Here we present techniques based on knowledge based mean fields which can be used to judge the quality of protein folds. The methods can be used to identify misfolded structures as well as faulty parts of structural models. The techniques are even applicable in cases where only the C alpha trace of a protein conformation is available. The capabilities of the technique are demonstrated using correct and incorrect protein folds.

...read moreread less

1,980 citations

Journal Article•DOI•

Structure of a bacterial multidrug ABC transporter

[...]

Roger J. P. Dawson¹, Kaspar P. Locher¹•Institutions (1)

ETH Zurich¹

14 Sep 2006-Nature

TL;DR: The observed, outward-facing conformation reflects the ATP-bound state, with the two nucleotide-binding domains in close contact and the two transmembrane domains forming a central cavity—presumably the drug translocation pathway—that is shielded from the inner leaflet of the lipid bilayer and from the cytoplasm, but exposed to the outer leaflet and the extracellular space.

...read moreread less

Abstract: Multidrug transporters of the ABC family facilitate the export of diverse cytotoxic drugs across cell membranes. This is clinically relevant, as tumour cells may become resistant to agents used in chemotherapy. To understand the molecular basis of this process, we have determined the 3.0 A crystal structure of a bacterial ABC transporter (Sav1866) from Staphylococcus aureus. The homodimeric protein consists of 12 transmembrane helices in an arrangement that is consistent with cross-linking studies and electron microscopic imaging of the human multidrug resistance protein MDR1, but critically different from that reported for the bacterial lipid flippase MsbA. The observed, outward-facing conformation reflects the ATP-bound state, with the two nucleotide-binding domains in close contact and the two transmembrane domains forming a central cavity—presumably the drug translocation pathway—that is shielded from the inner leaflet of the lipid bilayer and from the cytoplasm, but exposed to the outer leaflet and the extracellular space. Multidrug efflux transporters cause serious problems in cancer chemotherapy and in the treatment of bacterial infections. A puzzling aspect of their biology is how a single transporter can recognize and transport such a wide variety of structurally dissimilar compounds. The publication of the crystal structures of two quite different multidrug efflux transporters will help to solve the mystery. In the first study, the structure of AcrB — a multidrug efflux transporter from E. coli — was determined. Its three constituent subunits were captured at different steps in the transport cycle: prior to substrate binding, substrate-bound, and post-extrusion. The voluminous multidrug binding pocket handles multiple substrates via multi-site binding. The second study determined the structure of an ATP-driven multidrug transporter from S. aureus. The clinical relevance of this 'ABC' family of transporters derives from the fact that they catalyse the extrusion of various cytotoxic compounds used in cancer therapy. The structure, with the transporter in the outward-facing conformation, is a useful model of human homologues and may initiate the rational design of drugs aimed at interfering with the extrusion of agents used in chemotherapy.

...read moreread less

1,244 citations

Journal Article•DOI•

Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins.

[...]

Manfred J. Sippl¹•Institutions (1)

University of Salzburg¹

20 Jun 1990-Journal of Molecular Biology

TL;DR: A prototype of a new approach to the folding problem of polypeptide chains based on the analysis of known protein structures, which derives the energy potentials for the atomic interactions of all amino acid residue pairs as a function of the distance between the involved atoms is presented.

...read moreread less

1,086 citations

Journal Article•DOI•

Structure of MsbA from E. coli: A Homolog of the Multidrug Resistance ATP Binding Cassette (ABC) Transporters

[...]

Geoffrey Chang¹, Christopher B. Roth¹•Institutions (1)

Scripps Research Institute¹

07 Sep 2001-Science

TL;DR: The structure of MsbA can serve as a model for the MDR-ABC transporters that confer multidrug resistance to cancer cells and infectious microorganisms.

...read moreread less

Abstract: Multidrug resistance (MDR) is a serious medical problem and presents a major challenge to the treatment of disease and the development of novel therapeutics. ABC transporters that are associated with multidrug resistance (MDR-ABC transporters) translocate hydrophobic drugs and lipids from the inner to the outer leaflet of the cell membrane. To better elucidate the structural basis for the “flip-flop” mechanism of substrate movement across the lipid bilayer, we have determined the structure of the lipid flippase MsbA from Escherichia coli by x-ray crystallography to a resolution of 4.5 angstroms. MsbA is organized as a homodimer with each subunit containing six transmembrane α-helices and a nucleotide-binding domain. The asymmetric distribution of charged residues lining a central chamber suggests a general mechanism for the translocation of substrate by MsbA and other MDR-ABC transporters. The structure of MsbA can serve as a model for the MDR-ABC transporters that confer multidrug resistance to cancer cells and infectious microorganisms.

...read moreread less

643 citations