Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function
Summary (1 min read)
Introduction
- Automated docking is widely used for the prediction of biomo- lecular complexes in structure/function analysis and in molecular design.
- Dozens of effective methods are available, incorporating different trade-offs in molecular representation, energy evalua- tion, and conformational sampling to provide predictions with a reasonable computational effort.
- In their hands, AutoDock3 has proven to be effective in roughly half of the complexes that the authors have studied.
- The remain- ing half show significant motion of the receptor upon binding, and thus have required a more sophisticated model of motion in the receptor, typically performed outside of AutoDock3.
- This capability also pro- vides an effective method for analysis of covalently attached ligands.
Results and Discussion
- The authors first test of AutoDock4 is a redocking experiment using a set of 188 diverse protein-ligand complexes.
- In 100 of 188 complexes, the docked conformation with lowest energy was within 3.5 Å RMSD of the crystallographic conformation.
- (C) Cross docking with ARG8 treated as flexible in the protease.
- Roughly 2/3 of the small inhibitors were docked successfully, and the mid-size ones were very successful.
- The block at lower right shows docking of cyclic urea inhibitors with protease structures without the structural water.
Conclusions
- Dependence on grid-based energy evaluation is a major limita- tion of AutoDock4.
- It is required to allow rapid evaluation of binding energies during the docking simulation, but it places a severe restriction on the representation of the target macromole- cule: all of the atoms included in the grid must be treated as rigid.
- The off-grid modeling of specific sidechains is a method for incorporating limited flexibility within this paradigm, and the results presented here show that it will be effective in some cases.
- Adding flexibility presents several problems: (1) the calculation of the receptor energy is more computationally intensive since flexible regions must be evaluated by a full pair- wise energy evaluation, and (2) the conformational space is larger, and hence, there is more potential for false positives.
Did you find this useful? Give us your feedback
Citations
20,059 citations
15,616 citations
6,828 citations
4,666 citations
4,543 citations
Additional excerpts
...…re®nement/model-building programs [X-PLOR (BruÈ nger, 1988), CNS (BruÈ nger et al., 1998), REFMAC5 (Murshudov et al., 1997), SHELX (Sheldrick & Schneider, 1997) and O (Jones et al., 1991)] as well as docking programs [AutoDock 2.4/3.0 (Morris et al., 1996, 1998), Hex (Ritchie & Kemp, 2000)]....
[...]
References
41,772 citations
35,161 citations
32,573 citations
25,256 citations
14,725 citations
Related Papers (5)
Frequently Asked Questions (12)
Q2. How many atoms were used to determine the similarity of the docked conformations?
The user-defined root-meanŽ .square positional deviation rmsd tolerance was used to determine if two docked conformations were similar enough to be included in the same cluster, and symmetrically related atoms in the ligand were considered.
Q3. How many test systems were used to test the docking procedure?
Six of the seven test systems used to test the docking procedure, which were originally used to test AUTODOCK, version 2.4,7 were also in the training set of 30 protein]ligand complexes; therefore, to validate the chosen coefficients, linear regression was repeated for the set of 24 protein]ligand complexes, excluding the 6 overlapping test systems.
Q4. How many retries were used to generate a low energy random initial state?
The maximum initial energy allowed was 0.0 kcal moly1, and the maximum number of retries was 1000, used to generate a low energy random initial state to begin each simulated annealing docking.
Q5. What is the atomic solvation parameter for a given atom?
The solvation parameter for a given atom (S, used in the equation above) is calculated as:Si ¼ ðASPi þ QASP jqijÞ (4)where qi is the atomic charge and ASP and QASP are the atomic solvation parameters derived here.
Q6. What was the charge assignment for ter-minal phosphate groups?
Charges on ter-minal phosphate groups were assigned improperly, with a totalcharge of 0.5, so the remaining 0.5 charge was split manually between the four surrounding oxygen atoms.
Q7. What is the term for the loss of torsional entropy upon binding?
The term for the loss of torsional entropy upon binding (DSconf) is directly proportional to the number of rotatable bonds in the molecule (Ntors):Sconf ¼ WconfNtors (3)The number of rotatable bonds include all torsional degreesof freedom, including rotation of polar hydrogen atoms onhydroxyl groups and the like.
Q8. How many complexes were used to calibrate the free energy function of autodock?
Thirty protein]ligand complexes with published binding constants were used in the calibraŽ .tion of AUTODOCK’s free energy function Table The author, and were chosen from the set of 45 used by Bohm,54¨
Q9. What are the first two terms for the bound and unbound states of the ligand?
The first two terms are intramolecular energies for thebound and unbound states of the ligand, and the following twoterms are intramolecular energies for the bound and unboundstates of the protein.
Q10. What is the rmsd of the lowest energy found by any search method?
The crystallographic rmsd of the lowest energy Ž .found by any search method for each of the ˚protein]ligand test systems were all within 1.14 A, or less, of the crystal structure.
Q11. What is the reason for the large discrepancy?
This large discrepancy may be due to neglect of the conformational rearrangements of streptavidin upon binding biotin, which are neglected in the docking simulation and binding free energy calculation.
Q12. How many ligands were not predicted correctly by AutoDock 4?
The remaining 28 com-plexes were not predicted correctly by AutoDock 4, most casesdue to the fact that they were very large ligands with greaterthan 15 degrees of torsional freedom (see Fig. 5).