scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Development and testing of a general amber force field.

15 Jul 2004-Journal of Computational Chemistry (John Wiley and Sons Inc.)-Vol. 25, Iss: 9, pp 1157-1174
TL;DR: A general Amber force field for organic molecules is described, designed to be compatible with existing Amber force fields for proteins and nucleic acids, and has parameters for most organic and pharmaceutical molecules that are composed of H, C, N, O, S, P, and halogens.
Abstract: We describe here a general Amber force field (GAFF) for organic molecules. GAFF is designed to be compatible with existing Amber force fields for proteins and nucleic acids, and has parameters for most organic and pharmaceutical molecules that are composed of H, C, N, O, S, P, and halogens. It uses a simple functional form and a limited number of atom types, but incorporates both empirical and heuristic models to estimate force constants and partial atomic charges. The performance of GAFF in test cases is encouraging. In test I, 74 crystallographic structures were compared to GAFF minimized structures, with a root-mean-square displacement of 0.26 A, which is comparable to that of the Tripos 5.2 force field (0.25 A) and better than those of MMFF 94 and CHARMm (0.47 and 0.44 A, respectively). In test II, gas phase minimizations were performed on 22 nucleic acid base pairs, and the minimized structures and intermolecular energies were compared to MP2/6-31G* results. The RMS of displacements and relative energies were 0.25 A and 1.2 kcal/mol, respectively. These data are comparable to results from Parm99/RESP (0.16 A and 1.18 kcal/mol, respectively), which were parameterized to these base pairs. Test III looked at the relative energies of 71 conformational pairs that were used in development of the Parm99 force field. The RMS error in relative energies (compared to experiment) is about 0.5 kcal/mol. GAFF can be applied to wide range of molecules in an automatic fashion, making it suitable for rational drug design and database searching.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: The development, current features, and some directions for future development of the Amber package of computer programs, which contains a group of programs embodying a number of powerful tools of modern computational chemistry, focused on molecular dynamics and free energy calculations of proteins, nucleic acids, and carbohydrates.
Abstract: We describe the development, current features, and some directions for future development of the Amber package of computer programs. This package evolved from a program that was constructed in the late 1970s to do Assisted Model Building with Energy Refinement, and now contains a group of programs embodying a number of powerful tools of modern computational chemistry, focused on molecular dynamics and free energy calculations of proteins, nucleic acids, and carbohydrates.

7,672 citations

Journal ArticleDOI
TL;DR: The implementation of Open Babel is detailed, key advances in the 2.3 release are described, and a variety of uses are outlined both in terms of software products and scientific research, including applications far beyond simple format interconversion.
Abstract: A frequent problem in computational modeling is the interconversion of chemical structures between different formats. While standard interchange formats exist (for example, Chemical Markup Language) and de facto standards have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chemistry data, differences in the data stored by different formats (0D versus 3D, for example), and competition between software along with a lack of vendor-neutral formats. We discuss, for the first time, Open Babel, an open-source chemical toolbox that speaks the many languages of chemical data. Open Babel version 2.3 interconverts over 110 formats. The need to represent such a wide variety of chemical and molecular data requires a library that implements a wide range of cheminformatics algorithms, from partial charge assignment and aromaticity detection, to bond order perception and canonicalization. We detail the implementation of Open Babel, describe key advances in the 2.3 release, and outline a variety of uses both in terms of software products and scientific research, including applications far beyond simple format interconversion. Open Babel presents a solution to the proliferation of multiple chemical file formats. In addition, it provides a variety of useful utilities from conformer searching and 2D depiction, to filtering, batch conversion, and substructure and similarity searching. For developers, it can be used as a programming library to handle chemical data in areas such as organic chemistry, drug design, materials science, and computational chemistry. It is freely available under an open-source license from http://openbabel.org .

6,040 citations


Cites methods from "Development and testing of a genera..."

  • ...Recently, code implementing the GAFF force field [34,35] was also contributed and released as part of version 2....

    [...]

Journal ArticleDOI
TL;DR: A range of new simulation algorithms and features developed during the past 4 years are presented, leading up to the GROMACS 4.5 software package, which provides extremely high performance and cost efficiency for high-throughput as well as massively parallel simulations.
Abstract: Motivation: Molecular simulation has historically been a low-throughput technique, but faster computers and increasing amounts of genomic and structural data are changing this by enabling large-scale automated simulation of, for instance, many conformers or mutants of biomolecules with or without a range of ligands. At the same time, advances in performance and scaling now make it possible to model complex biomolecular interaction and function in a manner directly testable by experiment. These applications share a need for fast and efficient software that can be deployed on massive scale in clusters, web servers, distributed computing or cloud resources. Results: Here, we present a range of new simulation algorithms and features developed during the past 4 years, leading up to the GROMACS 4.5 software package. The software now automatically handles wide classes of biomolecules, such as proteins, nucleic acids and lipids, and comes with all commonly used force fields for these molecules built-in. GROMACS supports several implicit solvent models, as well as new free-energy algorithms, and the software now uses multithreading for efficient parallelization even on low-end systems, including windows-based workstations. Together with hand-tuned assembly kernels and state-of-the-art parallelization, this provides extremely high performance and cost efficiency for high-throughput as well as massively parallel simulations. Availability: GROMACS is an open source and free software available from http://www.gromacs.org. Contact: erik.lindahl@scilifelab.se Supplementary information:Supplementary data are available at Bioinformatics online.

6,029 citations

Journal ArticleDOI
TL;DR: An extension of the CHARMM force field to drug‐like molecules is presented, making it possible to perform “all‐CHARMM” simulations on drug‐target interactions thereby extending the utility ofCHARMM force fields to medicinally relevant systems.
Abstract: The widely used CHARMM additive all-atom force field includes parameters for proteins, nucleic acids, lipids, and carbohydrates. In the present article, an extension of the CHARMM force field to drug-like molecules is presented. The resulting CHARMM General Force Field (CGenFF) covers a wide range of chemical groups present in biomolecules and drug-like molecules, including a large number of heterocyclic scaffolds. The parametrization philosophy behind the force field focuses on quality at the expense of transferability, with the implementation concentrating on an extensible force field. Statistics related to the quality of the parametrization with a focus on experimental validation are presented. Additionally, the parametrization procedure, described fully in the present article in the context of the model systems, pyrrolidine, and 3-phenoxymethylpyrrolidine will allow users to readily extend the force field to chemical groups that are not explicitly covered in the force field as well as add functional groups to and link together molecules already available in the force field. CGenFF thus makes it possible to perform "all-CHARMM" simulations on drug-target interactions thereby extending the utility of CHARMM force fields to medicinally relevant systems.

4,553 citations

Journal ArticleDOI
TL;DR: Anautomatic algorithm of perceiving atom types that are defined in a description table, and an automatic algorithm of assigning bond types just based on atomic connectivity are presented.
Abstract: In molecular mechanics (MM) studies, atom types and/or bond types of molecules are needed to determine prior to energy calculations. We present here an automatic algorithm of perceiving atom types that are defined in a description table, and an automatic algorithm of assigning bond types just based on atomic connectivity. The algorithms have been implemented in a new module of the AMBER packages. This auxiliary module, antechamber (roughly meaning "before AMBER"), can be applied to generate necessary inputs of leap-the AMBER program to generate topologies for minimization, molecular dynamics, etc., for most organic molecules. The algorithms behind the manipulations may be useful for other molecular mechanical packages as well as applications that need to designate atom types and bond types.

4,124 citations

References
More filters
Journal ArticleDOI
TL;DR: Weiner et al. as mentioned in this paper derived a new molecular mechanical force field for simulating the structures, conformational energies, and interaction energies of proteins, nucleic acids, and many related organic molecules in condensed phases.
Abstract: We present the derivation of a new molecular mechanical force field for simulating the structures, conformational energies, and interaction energies of proteins, nucleic acids, and many related organic molecules in condensed phases. This effective two-body force field is the successor to the Weiner et al. force field and was developed with some of the same philosophies, such as the use of a simple diagonal potential function and electrostatic potential fit atom centered charges. The need for a 10-12 function for representing hydrogen bonds is no longer necessary due to the improved performance of the new charge model and new van der Waals parameters. These new charges are determined using a 6-31G* basis set and restrained electrostatic potential (RESP) fitting and have been shown to reproduce interaction energies, free energies of solvation, and conformational energies of simple small molecules to a good degree of accuracy. Furthermore, the new RESP charges exhibit less variability as a function of the molecular conformation used in the charge determination. The new van der Waals parameters have been derived from liquid simulations and include hydrogen parameters which take into account the effects of any geminal electronegative atoms. The bonded parameters developed by Weiner et al. were modified as necessary to reproduce experimental vibrational frequencies and structures. Most of the simple dihedral parameters have been retained from Weiner et al., but a complex set of 4 and yj parameters which do a good job of reproducing the energies of the low-energy conformations of glycyl and alanyl dipeptides has been developed for the peptide backbone.

12,660 citations

Journal ArticleDOI
TL;DR: In this paper, the authors present an approach to generate electrostatic potential (ESP) derived charges for molecules, which optimally reproduce the intermolecular interaction properties of molecules with a simple two-body additive potential, provided that a suitably accurate level of quantum mechanical calculation is used to derive the ESP around the molecule.
Abstract: We present a new approach to generating electrostatic potential (ESP) derived charges for molecules. The major strength of electrostatic potential derived charges is that they optimally reproduce the intermolecular interaction properties of molecules with a simple two-body additive potential, provided, of course, that a suitably accurate level of quantum mechanical calculation is used to derive the ESP around the molecule. Previously, the major weaknesses of these charges have been that they were not easily transferable between common functional groups in related molecules, they have often been conformationally dependent, and the large charges that frequently occur can be problematic for simulating intramolecular interactions

6,266 citations

Journal ArticleDOI
Thomas A. Halgren1
TL;DR: The first published version of the Merck molecular force field (MMFF) is MMFF94 as mentioned in this paper, which is based on the OPLS force field and has been applied to condensed-phase processes.
Abstract: This article introduces MMFF94, the initial published version of the Merck molecular force field (MMFF). It describes the objectives set for MMFF, the form it takes, and the range of systems to which it applies. This study also outlines the methodology employed in parameterizing MMFF94 and summarizes its performance in reproducing computational and experimental data. Though similar to MM3 in some respects, MMFF94 differs in ways intended to facilitate application to condensed-phase processes in molecular-dynamics simulations. Indeed, MMFF94 seeks to achieve MM3-like accuracy for small molecules in a combined “organic/protein” force field that is equally applicable to proteins and other systems of biological significance. A second distinguishing feature is that the core portion of MMFF94 has primarily been derived from high-quality computational data—ca. 500 molecular structures optimized at the HF/6-31G* level, 475 structures optimized at the MP2/6-31G* level, 380 MP2/6-31G* structures evaluated at a defined approximation to the MP4SDQ/TZP level, and 1450 structures partly derived from MP2/6-31G* geometries and evaluated at the MP2/TZP level. A third distinguishing feature is that MMFF94 has been parameterized for a wide variety of chemical systems of interest to organic and medicial chemists, including many that feature frequently occurring combinations of functional groups for which little, if any, useful experimental data are available. The methodology used in parameterizing MMFF94 represents a fourth distinguishing feature. Rather than using the common “functional group” approach, nearly all MMFF parameters have been determined in a mutually consistent fashion from the full set of available computational data. MMFF94 reproduces the computational data used in its parameterization very well. In addition, MMFF94 reproduces experimental bond lengths (0.014 A root mean square [rms]), bond angles (1.2° rms), vibrational frequencies (61 cm−1 rms), conformational energies (0.38 kcal/mol/rms), and rotational barriers (0.39 kcal/mol rms) very nearly as well as does MM3 for comparable systems. MMFF94 also describes intermolecular interactions in hydrogen-bonded systems in a way that closely parallels that given by the highly regarded OPLS force field. © 1996 John Wiley & Sons, Inc.

4,353 citations

Journal ArticleDOI
TL;DR: In this paper, a force field for simulation of nucleic acids and proteins is presented, which is based on the ECEPP, UNECEPP, and EPEN energy refinement software.
Abstract: We present the development of a force field for simulation of nucleic acids and proteins. Our approach began by obtaining equilibrium bond lengths and angles from microwave, neutron diffraction, and prior molecular mechanical calculations, torsional constants from microwave, NMR, and molecular mechanical studies, nonbonded parameters from crystal packing calculations, and atomic charges from the fit of a partial charge model to electrostatic potentials calculated by ab initio quantum mechanical theory. The parameters were then refined with molecular mechanical studies on the structures and energies of model compounds. For nucleic acids, we focused on methyl ethyl ether, tetrahydrofuran, deoxyadenosine, dimethyl phosphate, 9-methylguanine-l-methylcytosine hydrogen-bonded complex, 9-methyladenine-l-methylthymine hydrogen-bonded complex, and 1,3-dimethyluracil base-stacked dimer. Bond, angle, torsional, nonbonded, and hydrogen-bond parameters were varied to optimize the agreement between calculated and experimental values for sugar pucker energies and structures, vibrational frequencies of dimethyl phosphate and tetrahydrofuran, and energies for base pairing and base stacking. For proteins, we focused on 4>,'lt maps of glycyl and alanyl dipeptides, hydrogen-bonding interactions involving the various protein polar groups, and energy refinement calculations on insulin. Unlike the models for hydrogen bonding involving nitrogen and oxygen electron donors, an adequate description of sulfur hydrogen bonding required explicit inclusion of lone pairs. There are two fundamental problems in simulating the struc­ tural and energetic properties of molecules: the first is how to choose an analytical been placed E(R) which correctly describes the energy of the system in terms of its 3N degrees of freedom. The second is how the simulation can search or span conforma­ tional space (R) in order to answer questions posed by the scientist interested in the properties of the system. For complex systems, solution to the first problem are an es­ sential first step in attacking the second problem, and thus, considerable effort has been placed in developing analytical functions that are simple enough to allow one to simulate the properties of complex molecules yet accurate enough to obtain meaningful estimates for structures and energies. In the case of the structures and thermodynamic stabilities of saturated hydrocarbons in inert solvents or the gas phase, the first problem has been essentially solved by molecular mechanics ap­ proaches of Allinger, I Ermer and Lifson,2 and their co-workers. However, for polar and ionic molecules in condensed phases, unsolved questions remain as to the best form of the analytical function E(R). In the area of proteins and peptides, seminal work has come from the Scheraga 3 and Lifson 4 schools. The Scheraga group has used both crystal packing (intermolecular) and con­ formational properties of peptides to arrive at force fields ECEPP, UNECEPP, and EPEN for modeling structural and thermodynamic properties of peptides and proteins. Levitt, using the energy refinement software developed in the Lifson group, has proposed a force field for proteins based on calculations on lysozyme,S and Gelin and Karplus have adapted this software along with many parameters from the Scheraga studies to do molecular dynamics

4,340 citations

Journal ArticleDOI
TL;DR: In this paper, the authors present conformational energies for a molecular mechanical model (Parm99) developed for organic and biological molecules using the restrained electrostatic potential (RESP) approach to derive the partial charges.
Abstract: In this study, we present conformational energies for a molecular mechanical model (Parm99) developed for organic and biological molecules using the restrained electrostatic potential (RESP) approach to derive the partial charges. This approach uses the simple "generic" force field model (Parm94), and attempts to add a minimal number of extra Fourier components to the torsional energies, but doing so only when there is a physical justification. The results are quite encouraging, not only for the 34-molecule set that has been studied by both the highest level ab initio model (GVB/LMP2) and experiment, but also for the 55-molecule set for which high-quality experimental data are available. Considering the 55 molecules studied by all the force field models for which there are experimental data, the average absolute errors (AAEs) are 0.28 (this model), 0.52 (MM3), 0.57 (CHARMm (MSI)), and 0.43 kcal/mol (MMFF). For the 34-molecule set, the AAEs of this model versus experiment and ab initio are 0.28 and 0.27 kcal/mol, respectively. This is a lower error than found with MM3 and CHARMm, and is comparable to that found with MMFF (0.31 and 0.22 kcal/mol). We also present two examples of how well the torsional parameters are transferred from the training set to the test set. The absolute errors of molecules in the test set are only slightly larger than in the training set (differences of <0.1 kcal/mol). Therefore, it can be concluded that a simple "generic" force field with a limited number of specific torsional parameters can describe intra- and intermolecular interactions, although all comparison molecules were selected from our 82-compound training set. We also show how this effective two-body

3,748 citations