scispace - formally typeset
Search or ask a question

Showing papers in "arXiv: Chemical Physics in 2021"


Journal ArticleDOI
TL;DR: In this paper, the authors provide a review of the applications of computational chemistry and machine learning in molecular and materials modeling, retrosyntheses, catalysis, and drug design.
Abstract: Machine learning models are poised to make a transformative impact on chemical sciences by dramatically accelerating computational algorithms and amplifying insights available from computational chemistry methods. However, achieving this requires a confluence and coaction of expertise in computer science and physical sciences. This review is written for new and experienced researchers working at the intersection of both fields. We first provide concise tutorials of computational chemistry and machine learning methods, showing how insights involving both can be achieved. We then follow with a critical review of noteworthy applications that demonstrate how computational chemistry and machine learning can be used together to provide insightful (and useful) predictions in molecular and materials modeling, retrosyntheses, catalysis, and drug design.

155 citations


Journal ArticleDOI
TL;DR: This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chemical descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks and the ideas that lead to computationally efficient and universally applicable models.
Abstract: The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chemistry and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chemical descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks, and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their physical chemistry and their mathematical description, provides examples of recent applications to a diverse set of chemical and materials science problems, and outlines the open questions and the most promising research directions in the field.

150 citations


Posted Content
TL;DR: SpookyNet as discussed by the authors introduces a deep neural network for constructing ML-FFs with explicit treatment of electronic degrees of freedom and quantum nonlocality, which can generalize across chemical and conformational space and leverage the learned chemical insights.
Abstract: Machine-learned force fields (ML-FFs) combine the accuracy of ab initio methods with the efficiency of conventional force fields. However, current ML-FFs typically ignore electronic degrees of freedom, such as the total charge or spin state, and assume chemical locality, which is problematic when molecules have inconsistent electronic states, or when nonlocal effects play a significant role. This work introduces SpookyNet, a deep neural network for constructing ML-FFs with explicit treatment of electronic degrees of freedom and quantum nonlocality. Chemically meaningful inductive biases and analytical corrections built into the network architecture allow it to properly model physical limits. SpookyNet improves upon the current state-of-the-art (or achieves similar performance) on popular quantum chemistry data sets. Notably, it is able to generalize across chemical and conformational space and can leverage the learned chemical insights, e.g. by predicting unknown spin states, thus helping to close a further important remaining gap for today's machine learning models in quantum chemistry.

63 citations


Journal ArticleDOI
TL;DR: Machine learning (ML) methods are being used in almost every conceivable area of electronic structure theory and molecular simulation as discussed by the authors and have become firmly established in the construction of high-dimensional interatomic potentials.
Abstract: Machine learning (ML) methods are being used in almost every conceivable area of electronic structure theory and molecular simulation. In particular, ML has become firmly established in the construction of high-dimensional interatomic potentials. Not a day goes by without another proof of principle being published on how ML methods can represent and predict quantum mechanical properties - be they observable, such as molecular polarizabilities, or not, such as atomic charges. As ML is becoming pervasive in electronic structure theory and molecular simulation, we provide an overview of how atomistic computational modeling is being transformed by the incorporation of ML approaches. From the perspective of the practitioner in the field, we assess how common workflows to predict structure, dynamics, and spectroscopy are affected by ML. Finally, we discuss how a tighter and lasting integration of ML methods with computational chemistry and materials science can be achieved and what it will mean for research practice, software development, and postgraduate training.

59 citations


Posted Content
TL;DR: Implicit solvation is an effective, highly coarse-grained approach in atomic-scale simulations to account for a surrounding liquid electrolyte on the level of a continuous polarizable medium.
Abstract: Implicit solvation is an effective, highly coarse-grained approach in atomic-scale simulations to account for a surrounding liquid electrolyte on the level of a continuous polarizable medium. Originating in molecular chemistry with finite solutes, implicit solvation techniques are now increasingly used in the context of first-principles modeling of electrochemistry and electrocatalysis at extended (often metallic) electrodes. The prevalent ansatz to model the latter electrodes and the reactive surface chemistry at them through slabs in periodic boundary condition supercells brings its specific challenges. Foremost this concerns the diffculty to describe the entire double layer forming at the electrified solid-liquid interface (SLI) within supercell sizes tractable by commonly employed density-functional theory (DFT). We review liquid solvation methodology from this specific application angle, highlighting in particular its use in the widespread {\em ab initio} thermodynamics approach to surface catalysis. Notably, implicit solvation can be employed to mimic a polarization of the electrode's electronic density under the applied potential and the concomitant capacitive charging of the entire double layer beyond the limitations of the employed DFT supercell. Most critical for continuing advances of this effective methodology for the SLI context is the lack of pertinent (experimental or high-level theoretical) reference data needed for parametrization.

44 citations


Journal ArticleDOI
TL;DR: In this article, the authors introduce a reformulation of the conventional distributed memory ab initio DMRG algorithm that connects it to the conceptually simpler and advantageous sum of sub-Hamiltonians approach.
Abstract: There has been recent interest in the deployment of ab initio density matrix renormalization group computations on high performance computing platforms. Here, we introduce a reformulation of the conventional distributed memory ab initio DMRG algorithm that connects it to the conceptually simpler and advantageous sum of sub-Hamiltonians approach. Starting from this framework, we further explore a hierarchy of parallelism strategies, that includes (i) parallelism over the sum of sub-Hamiltonians, (ii) parallelism over sites, (iii) parallelism over normal and complementary operators, (iv) parallelism over symmetry sectors, and (v) parallelism over dense matrix multiplications. We describe how to reduce processor load imbalance and the communication cost of the algorithm to achieve higher efficiencies. We illustrate the performance of our new open-source implementation on a recent benchmark ground-state calculation of benzene in an orbital space of 108 orbitals and 30 electrons, with a bond dimension of up to 6000, and a model of the FeMo cofactor with 76 orbitals and 113 electrons. The observed parallel scaling from 448 to 2800 CPU cores is nearly ideal.

35 citations


Posted Content
TL;DR: NewmanNet as mentioned in this paper takes inspiration from Newton's equations of motion to learn interatomic potentials and forces and achieves state-of-the-art performance on energy and force prediction.
Abstract: We report a new deep learning message passing network that takes inspiration from Newton's equations of motion to learn interatomic potentials and forces. With the advantage of directional information from trainable latent force vectors, and physics-infused operators that are inspired by the Newtonian physics, the entire model remains rotationally equivariant, and many-body interactions are inferred by more interpretable physical features. We test NewtonNet on the prediction of several reactive and non-reactive high quality ab initio data sets including single small molecule dynamics, a large set of chemically diverse molecules, and methane and hydrogen combustion reactions, achieving state-of-the-art test performance on energies and forces with far greater data and computational efficiency than other deep learning models.

26 citations


Journal ArticleDOI
TL;DR: In this paper, the photoinduced excited-state dynamics of the keto and enol forms of cytosine is investigated using ab initio surface hopping in order to understand the outcome of molecular beam femtosecond pump-probe photoionization spectroscopy experiments.
Abstract: The photoinduced excited-state dynamics of the keto and enol forms of cytosine is investigated using ab initio surface hopping in order to understand the outcome of molecular beam femtosecond pump-probe photoionization spectroscopy experiments. Both singlet and triplet states are included in the dynamics. The results show that triplet states play a significant role in the relaxation of the keto tautomer, while they are less important in the enol tautomer. In both forms, the T$_1$ state minimum is found too low in energy to be detected in standard photoionization spectroscopy experiments and therefore experimental decay times should arise from a simultaneous relaxation to the ground state and additional intersystem crossing followed by internal conversion to the T$_1$ state. In agreement with available experimental lifetimes, we observe three decay constants of 7 fs, 270 fs and 1900 fs - the first two coming from the keto tautomer and the longer one from the enol tautomer. Deactivation of the enol form is due to internal conversion to the ground state via two S$_1$/S$_0$ conical intersections of ethylenic type.

26 citations


Posted Content
TL;DR: In this paper, the authors theoretically demonstrate that chemical reaction rate constant can be significantly suppressed by coupling molecular vibrations with an optical cavity, exhibiting both the collective coupling effect and the cavity-frequency modification of the rate constant.
Abstract: We theoretically demonstrate that chemical reaction rate constant can be significantly suppressed by coupling molecular vibrations with an optical cavity, exhibiting both the collective coupling effect and the cavity-frequency modification of the rate constant. When a reaction coordinate is strongly coupled to the solvent molecules, the reaction rate constant is reduced due to the dynamical caging effect. We demonstrate that collectively coupling the solvent to the cavity can further enhance this dynamical caging effect, leading to additional suppression of the chemical kinetics. This effect is further amplified when cavity loss is considered.

23 citations


Journal ArticleDOI
TL;DR: In this paper, a machine learning directed, multiobjective optimization workflow for force field parameterization is presented, which evaluates millions of prospective force field parameters while requiring only a small fraction of them to be tested with molecular simulations.
Abstract: Accurate force fields are necessary for predictive molecular simulations. However, developing force fields that accurately reproduce experimental properties is challenging. Here, we present a machine learning directed, multiobjective optimization workflow for force field parameterization that evaluates millions of prospective force field parameter sets while requiring only a small fraction of them to be tested with molecular simulations. We demonstrate the generality of the approach and identify multiple low-error parameter sets for two distinct test cases: simulations of hydrofluorocarbon (HFC) vapor-liquid equilibrium (VLE) and an ammonium perchlorate (AP) crystal phase. We discuss the challenges and implications of our force field optimization workflow.

20 citations


Posted ContentDOI
TL;DR: The open-access QMugs (Quantum-Mechanical Properties of Drug-like Molecules) dataset is intended to facilitate the development of models that learn from molecular data on different levels of theory while also providing insight into the corresponding relationships between molecular structure and biological activity.
Abstract: Machine learning approaches in drug discovery, as well as in other areas of the chemical sciences, benefit from curated datasets of physical molecular properties. However, there is a lack of sufficiently large data collections that include first-principle quantum chemical information on bioactive molecules, such as single-point electronic properties, quantum mechanical wave functions and density-functional theory (DFT) matrices. The open-access QMugs (Quantum-Mechanical Properties of Drug-like Molecules) dataset fills this void. The QMugs collection comprises quantum mechanical properties of more than 665k biologically and pharmacologically relevant molecules extracted from the ChEMBL database, totaling $\sim$2M conformers. QMugs contains optimized molecular geometries and thermodynamic data obtained via the semi-empirical method GFN2-xTB. Atomic and molecular properties (e.g., partial charges, energies, and rotational constants) are provided on both the GFN2-xTB and on the DFT ($\omega$B97X-D/def2-SVP) levels of theory. QMugs also comprises the respective quantum mechanical wave functions, including DFT density and orbital matrices, totaling over 7 terabytes of uncompressed data. This dataset is intended to facilitate the development of models that learn from molecular data on different levels of theory while also providing insight into the corresponding relationships between molecular structure and biological activity.

Journal ArticleDOI
TL;DR: In this article, a trajectory-based quantum-classical approach has been proposed to deal with nonadiabatic electronic processes, including spin-orbit coupling and the non-perturbative effect of an external time-dependent field.
Abstract: The exact factorization of the time-dependent electron-nuclear wavefunction has been employed successfully in the field of quantum molecular dynamics simulations for interpreting and simulating light-induced ultrafast processes. In this work, we summarize the major developments leading to the formulation of a trajectory-based approach, derived from the exact factorization equations, capable of dealing with nonadiabatic electronic processes, and including spin-orbit coupling and the non-perturbative effect of an external time-dependent field. This trajectory-based quantum-classical approach has been dubbed coupled-trajectory mixed quantum-classical (CT-MQC) algorithm, whose performance is tested here to study the photo-dissociation dynamics of IBr.

Posted Content
TL;DR: In this paper, the authors demonstrate that dissolution of spin-polarized pentacene-doped naphthalene crystals enables transfer of polarization to target molecules via intermolecular cross relaxation at room temperature and moderate magnetic fields.
Abstract: Nuclear spin hyperpolarization provides a promising route to overcome the challenges imposed by the limited sensitivity of nuclear magnetic resonance. Here we demonstrate that dissolution of spin-polarized pentacene-doped naphthalene crystals enables transfer of polarization to target molecules via intermolecular cross relaxation at room temperature and moderate magnetic fields (1.45$\,$T). This makes it possible to exploit the high spin polarization of optically polarized crystals while mitigating the challenges of its transfer to external nuclei, particularly of the large distances and prohibitively weak coupling between source and target nuclei across solid-solid or solid-liquid interfaces. With this method, here we inject the highly polarized mixture into a benchtop NMR spectrometer and observe the polarization dynamics for target $^1$H nuclei. Although the spectra are radiation damped due to the high naphthalene magnetization, we describe a procedure to process the data in order to obtain more conventional NMR spectra, and extract the target nuclei polarization. With the entire process occurring on a timescale of one minute, we observe NMR signals enhanced by factors between -200 and -1730 at 1.45$\,$T for a range of small molecules.

Journal ArticleDOI
TL;DR: Librascal as mentioned in this paper is based on the kernel ridge regression model, commonly used with SOAP features, and shows how to further reduce the total computational cost by up to a factor of 4 or 5 without affecting the model's symmetry properties and without significantly impacting its accuracy.
Abstract: Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space Recently, it has become clear that many of the most effective representations share a fundamental formal connection: that they can all be expressed as a discretization of N-body correlation functions of the local atom density, suggesting the opportunity of standardizing and, more importantly, optimizing the calculation of such representations We present an implementation, named librascal, whose modular design lends itself both to developing refinements to the density-based formalism and to rapid prototyping for new developments of rotationally equivariant atomistic representations As an example, we discuss SOAP features, perhaps the most widely used member of this family of representations, to show how the expansion of the local density can be optimized for any choice of radial basis set We discuss the representation in the context of a kernel ridge regression model, commonly used with SOAP features, and analyze how the computational effort scales for each of the individual steps of the calculation By applying data reduction techniques in feature space, we show how to further reduce the total computational cost by at up to a factor of 4 or 5 without affecting the model's symmetry properties and without significantly impacting its accuracy

Posted Content
TL;DR: In this paper, a family of structural descriptors that generalize the very successful atom-centered density correlation features to the N-centers case is presented, which can be applied to efficiently learn the matrix elements of the (effective) single-particle Hamiltonian written in an atom centered orbital basis, and lay the foundations for symmetry-adapted machine learning models of new classes of properties of molecules and materials.
Abstract: Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations, that are then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that can be decomposed into atomic contributions. Many quantities that are relevant for quantum mechanical calculations, however -- most notably the Hamiltonian matrix when written in an atomic-orbital basis -- are not associated with a single center, but with two (or more) atoms in the structure. We discuss a family of structural descriptors that generalize the very successful atom-centered density correlation features to the N-centers case, and show in particular how this construction can be applied to efficiently learn the matrix elements of the (effective) single-particle Hamiltonian written in an atom-centered orbital basis. These N-centers features are fully equivariant -- not only in terms of translations and rotations, but also in terms of permutations of the indices associated with the atoms -- and lay the foundations for symmetry-adapted machine-learning models of new classes of properties of molecules and materials.

Posted Content
TL;DR: In this paper, a model for the charging of electrical double layers inside a cylindrical pore for arbitrary pore size was developed, and the effect of electrode-pore-size distribution over their energy storage properties remains unclear.
Abstract: Porous electrodes are found in energy storage devices such as supercapacitors and pseudocapacitors. However, the effect of electrode-pore-size distribution over their energy storage properties remains unclear. Here, we develop a model for the charging of electrical double layers inside a cylindrical pore for arbitrary pore size. We assume small applied potentials and perform a regular perturbation analysis to predict the evolution of electrical potential and ion concentrations in both the radial and axial directions. We validate our perturbation model with direct numerical simulations of the Poisson-Nernst-Planck equations, and obtain quantitative agreement between the two approaches for small and moderate potentials. Our analysis yields two main characteristic features of arbitrary pore size: i) a monotonic decrease of the charging timescale with an increase in relative pore size (pore size relative to Debye length); ii) large potential changes for overlapping double layers in a thin transition region, which we approximate mathematically by a jump discontinuity. We quantify the contributions of electromigration and charge diffusion fluxes which provide mechanistic insights into the dependence of charging timescale and capacitance on pore size. We develop a modified transmission circuit model that captures the effect of arbitrary pore size and demonstrate that a time-dependent transition-region resistor needs to be included in the circuit. We also derive phenomenological expressions for average effective capacitance and charging timescale as a function of pore-size distribution. We show that the capacitance and charging timescale increase with smaller average pore sizes and with smaller polydispersity, resulting in a gain of energy density at a constant power density. Overall, our results advance the mechanistic understanding of electrical-double-layer charging.

Posted Content
TL;DR: In this paper, a high-yield surfactant-free synthesis of spiky hollow Au-Ag nanostars (SHAANs) is reported, where each SHAAN is composed of more than 50 spikes attached to a hollow ca. 150 nm diameter cubic core.
Abstract: Spiky/hollow metal nanoparticles have applications across a broad range of fields. However, current bottom-up methods to produce spiky/hollow metal nanoparticles rely heavily on the use of strongly adsorbing surfactant molecules, which is undesirable since these passivate the product particle surfaces. Here we report a high-yield surfactant-free synthesis of spiky hollow Au-Ag nanostars (SHAANs). Each SHAAN is composed of more than 50 spikes attached to a hollow ca. 150 nm diameter cubic core, which makes SHAANs highly plasmonically and catalytically active. Moreover, the surfaces of SHAANs are chemically exposed which gives them significantly enhanced functionality compared to their surfactant-capped counterparts, as demonstrated in surface-enhanced Raman spectroscopy (SERS) and catalysis. The chemical accessibility of the pristine SHAANs also allows the use of hydroxyethyl cellulose as a weakly-bound stabilizing agent. This produces colloidal SHAANs which remain stable for more than 1 month while retaining the functionalities of the pristine particles and allow even single-particle SERS to be realized.

Posted ContentDOI
TL;DR: In this paper, the effect of vibrational polariton condensation on the kinetics of electron transfer processes was investigated, and it was shown that the condensate changes the reaction yield significantly due to additional channels with reduced activation barriers resulting from the large accumulation of energy in the lower polariton and the many modes available for energy redistribution during the reaction.
Abstract: When molecular transitions strongly couple to photon modes, they form hybrid light-matter modes called polaritons. Collective vibrational strong coupling is a promising avenue for control of chemistry, but this can be deterred by the large number of quasi-degenerate dark modes. The macroscopic occupation of a single polariton mode by excitations, as observed in Bose-Einstein condensation, offers promise for overcoming this issue. Here we theoretically investigate the effect of vibrational polariton condensation on the kinetics of electron transfer processes. Compared with excitation with infrared laser sources, the condensate changes the reaction yield significantly due to additional channels with reduced activation barriers resulting from the large accumulation of energy in the lower polariton, and the many modes available for energy redistribution during the reaction. Our results offer tantalizing opportunities to use condensates for driving chemical reactions, kinetically bypassing usual constraints of fast intramolecular vibrational redistribution in condensed phase.

Journal ArticleDOI
TL;DR: In this paper, a variety of homologous carbon chains (HCnH, HCnN, CnS, CO, OCnO) are found to exhibit an appealing even-odd effect, where chains containing a number of carbon atoms of a certain parity possess singlet ground states while members of opposite parity have triplet ground states.
Abstract: A variety of homologous carbon chains (HCnH, HCnN, CnS, CnO, and OCnO) are found to exhibit an appealing even-odd effect. Chains containing a number of carbon atoms of a certain parity possess singlet ground states, while members of opposite parity have triplet ground states. From a general perspective, it is important that this even-odd effect confounds straightforward chemical intuition. Whether the most stable form is a triplet or a singlet is neither simply related to the fact that the species in question is a normal (closed-shell, nonradical) molecule nor a (di)radical or to the (e.g., cumulene-type) C-C bond succession across the chain. From a computational perspective, the present results are important also because they demonstrate that electron correlations in carbon-based chains are extremely strong. Whether the gold-standard CCSD(T) (coupled-cluster expansions with single and double excitations and triple excitations corrections) framework suffices to describe such strongly correlated systems remains an open question that calls for further clarification. Most importantly for astrochemistry, the present results may explain why certain members are not astronomically observed although larger members of the same homologous series are detected; the missing species are exactly those for which the present calculations predict triplet ground states.

Journal ArticleDOI
TL;DR: In this article, the structure of the energy landscape of variational coupled-cluster (VCC) was explored and compared with the traditional version (TCC) in the case of paired double excitations (pCCD).
Abstract: In single-reference coupled-cluster (CC) methods, one has to solve a set of non-linear polynomial equations in order to determine the so-called amplitudes which are then used to compute the energy and other properties. Although it is of common practice to converge to the (lowest-energy) ground-state solution, it is also possible, thanks to tailored algorithms, to access higher-energy roots of these equations which may or may not correspond to genuine excited states. Here, we explore the structure of the energy landscape of variational CC (VCC) and we compare it with its (projected) traditional version (TCC) in the case where the excitation operator is restricted to paired double excitations (pCCD). By investigating two model systems (the symmetric stretching of the linear \ce{H4} molecule and the continuous deformation of the square \ce{H4} molecule into a rectangular arrangement) in the presence of weak and strong correlations, the performance of VpCCD and TpCCD are gauged against their configuration interaction (CI) equivalent, known as doubly-occupied CI (DOCI), for reference Slater determinants made of ground- or excited-state Hartree-Fock orbitals or state-specific orbitals optimized directly at the VpCCD level. The influence of spatial symmetry breaking is also investigated.

Posted Content
TL;DR: In this article, the concept of static chemical shift from conventional XPS was extended by the excited-state chemical shift (ESCS), which is connected to the charge in the framework of a potential model.
Abstract: The conversion of photon energy into other energetic forms in molecules is accompanied by charge moving on ultrafast timescales. We directly observe the charge motion at a specific site in an electronically excited molecule using time-resolved x-ray photoelectron spectroscopy (TR-XPS). We extend the concept of static chemical shift from conventional XPS by the excited-state chemical shift (ESCS), which is connected to the charge in the framework of a potential model. This allows us to invert TR-XPS spectra to the dynamic charge at a specific atom. We demonstrate the power of TR-XPS by using sulphur 2p-core-electron-emission probing to study the UV-excited dynamics of 2-thiouracil. The new method allows us to discover that a major part of the population relaxes to the molecular ground state within 220-250 fs. In addition, a 250-fs oscillation, visible in the kinetic energy of the TR-XPS, reveals a coherent exchange of population among electronic states.

Posted ContentDOI
TL;DR: In this paper, a shallow neural network encoding the desired state of the system with the amplitude computed by sampling the Gibbs- Boltzmann distribution using a quantum circuit and the phase information obtained classically from the non-linear activation of a separate set of neurons is presented.
Abstract: Quantum machine learning algorithms have emerged to be a promising alternative to their classical counterparts as they leverage the power of quantum computers. Such algorithms have been developed to solve problems like electronic structure calculations of molecular systems and spin models in magnetic systems. However the discussion in all these recipes focus specifically on targeting the ground state. Herein we demonstrate a quantum algorithm that can filter any energy eigenstate of the system based on either symmetry properties or on a predefined choice of the user. The work horse of our technique is a shallow neural network encoding the desired state of the system with the amplitude computed by sampling the Gibbs- Boltzmann distribution using a quantum circuit and the phase information obtained classically from the non-linear activation of a separate set of neurons. We show that the resource requirements of our algorithm is strictly quadratic. To demonstrate its efficacy, we use state-filtration in monolayer transition metal-dichalcogenides which are hitherto unexplored in any flavor of quantum simulations. We implement our algorithm not only on quantum simulators but also on actual IBM-Q quantum devices and show good agreement with the results procured from conventional electronic structure calculations. We thus expect our protocol to provide a new alternative in exploring band-structures of exquisite materials to usual electronic structure methods or machine learning techniques which are implementable solely on a classical computer

Posted Content
TL;DR: In this article, a non-resonant second harmonic generation phase and amplitude measurements obtained from the silica:water interface at varying pH and 05 M ionic strength point to the existence of a nonlinear susceptibility term, which is associated with a 90 deg phase shift.
Abstract: Non-resonant second harmonic generation phase and amplitude measurements obtained from the silica:water interface at varying pH and 05 M ionic strength point to the existence of a nonlinear susceptibility term, which we call chi(3)X, that is associated with a 90 deg phase shift Including this contribution in a model for the total effective second-order nonlinear susceptibility produces reasonable point estimates for interfacial potentials and second-order nonlinear susceptibilities when chi(3)Xis about 15 times chi(3)water A model without this term and containing only traditional chi(2) and chi(3) terms cannot recapitulate the experimental data The new model also provides a demonstrated utility for distinguishing apparent differences in the second-order nonlinear susceptibility when the electrolyte is NaCl vs MgSO4, pointing to the possibility of using HD-SHG to investigate ion-specificity in interfacial processes

Journal Article
TL;DR: A framework that unifies sequence- and graph-based methods as energy-based models (EBMs) with different energy functions as well as a novel dual variant within the framework that performs consistent training over Bayesian forward- and backward-prediction by constraining the agreement between the two directions is proposed.
Abstract: Retrosynthesis—the process of identifying a set of reactants to synthesize a target molecule—is of vital importance to material design and drug discovery. Existing machine learning approaches based on language models and graph neural networks have achieved encouraging results. However, the inner connections of these models are rarely discussed, and rigorous evaluations of these models are largely in need. In this paper, we propose a framework that unifies sequence- and graph-based methods as energy-based models (EBMs) with different energy functions. This unified point of view establishes connections between different models and identifies the differences between them, thereby promoting the understanding of model design. We also provide a comprehensive assessment of performance to the community. Moreover, we present a novel “dual” variant within the framework that performs consistent training over Bayesian forward- and backward-prediction by constraining the agreement between the two directions. This model improves the state of the art for template-free approaches where the reaction type is unknown and known.

Journal ArticleDOI
TL;DR: In this article, a range-separated Gaussian density fitting (RSGDF) method was proposed to scale sublinearly to linearly with the number of points for small to medium-sized $k$-point meshes that are commonly used in periodic calculations with electron correlation.
Abstract: We present an efficient implementation of periodic Gaussian density fitting (GDF) using the Coulomb metric The three-center integrals are divided into two parts by range-separating the Coulomb kernel, with the short-range part evaluated in real space and the long-range part in reciprocal space With a few algorithmic optimizations, we show that this new method -- which we call range-separated GDF (RSGDF) -- scales sublinearly to linearly with the number of $k$-points for small to medium-sized $k$-point meshes that are commonly used in periodic calculations with electron correlation Numerical results on a few three-dimensional solids show about $10$-fold speedups over the previously developed GDF with little precision loss The error introduced by RSGDF is about $10^{-5}~E_{\textrm{h}}$ in the converged Hartree-Fock energy with default auxiliary basis sets and can be systematically reduced by increasing the size of the auxiliary basis with little extra work [The article has been accepted by The Journal of Chemical Physics]

Journal ArticleDOI
TL;DR: In this article, a model consisting of a rovibrational Hamiltonian with the dipole and quadrupole moments of water interacting with the crystal field was used to fit the infrared absorption spectra.
Abstract: Infrared absorption spectroscopy study of endohedral water molecule in a solid mixture of H$_2$O@C$_{60}$ and C$_{60}$ was carried out at liquid helium temperature. From the evolution of the spectra during the ortho-para conversion process, the spectral lines were identified as para- and ortho-water transitions. Eight vibrational transitions with rotational side peaks were observed in the mid-infrared: $\omega_1$, $\omega_2$, $\omega_3$, $2\omega_1$, $2\omega_2$, $\omega_1 +\omega_3$, $\omega_2 +\omega_3$, and $2\omega_2+\omega_3$. The vibrational frequencies $\omega_2$ and 2$\omega_2$ are lower by 1.6\% and the rest by 2.4\%, as compared to free \water/. A model consisting of a rovibrational Hamiltonian with the dipole and quadrupole moments of water interacting with the crystal field was used to fit the infrared absorption spectra. The electric quadrupole interaction with the crystal field lifts the degeneracy of the rotational levels. The finite amplitudes of the pure $v_1$ and $v_2$ vibrational transitions are consistent with the interaction of the water molecule dipole moment with a lattice-induced electric field. The permanent dipole moment of encapsulated \water/ is found to be $0.5\pm 0.1$ D as determined from the far-infrared rotational line intensities. The translational mode of the quantized center of mass motion of \water/ in the molecular cage of C$_{60}$ was observed at 110cm$^{-1}$ (13.6meV).

Posted Content
TL;DR: In this paper, the authors investigated the molecular diffusivity of reactants, catalyst and product of a model reaction, the copper-catalyzed azide-alkyne cycloaddition click reaction, and developed new NMR diffusion approaches that allow the probing of reaction-induced diffusion enhancement in nano-sized molecular systems with higher precision than the state of the art.
Abstract: Micrometer-sized objects are widely known to exhibit chemically-driven motility in systems away from equilibrium. Experimental observation of reaction-induced motility or enhancement in diffusivity at the much shorter length scale of small molecules is however still a matter of debate. Here, we investigate the molecular diffusivity of reactants, catalyst and product of a model reaction, the copper-catalyzed azide-alkyne cycloaddition click reaction, and develop new NMR diffusion approaches that allow the probing of reaction-induced diffusion enhancement in nano-sized molecular systems with higher precision than the state of the art. Following two different approaches that enable the accounting of time-dependent concentration changes during NMR experiments, we closely monitored the diffusion coefficient of reaction components during the reaction. The reaction components showed distinct changes in the diffusivity: while the two reactants underwent a time-dependent decrease in their diffusivity, the diffusion coefficient of the product gradually increased and the catalyst showed only slight diffusion enhancement within the range expected for reaction-induced sample heating. The decrease in diffusion coefficient of the alkyne, one of the two reactants of click reaction, was not reproduced during its copper coordination when the second reactant, azide, was absent. Our results do not support the catalysis-induced diffusion enhancement of the components of click reaction and, instead, point to the role of a relatively large intermediate species within the reaction cycle with diffusivity lower than both the reactants and product molecule.

Posted Content
TL;DR: In this paper, a two-dimensional electronic-vibrational spectroscopic study of the photosystem II reaction center (PSII-RC) was performed and it was shown that the mixed exciton-charge transfer state, previously proposed to be responsible for the far-red light operation of photosynthesis, is characterized by the Chl$(D1}+$Phe$$ +$D1+$D2$ +D1)-radical pair and can be directly prepared upon photoexcitation.
Abstract: Photosystem II is crucial for life on Earth as it provides oxygen as a result of photoinduced electron transfer and water splitting reactions. The excited state dynamics of the photosystem II-reaction center (PSII-RC) has been a matter of vivid debate because the absorption spectra of the embedded chromophores significantly overlap and hence it is extremely difficult to distinguish transients. Here, we report the two-dimensional electronic-vibrational spectroscopic study of the PSII-RC. The simultaneous resolution along both the visible excitation and infrared detection axis is crucial in allowing for the character of the excitonic states and interplay between them to be clearly distinguished. In particular, this work demonstrates that the mixed exciton-charge transfer state, previously proposed to be responsible for the far-red light operation of photosynthesis, is characterized by the Chl$_{\rm D1}^+$Phe$^-$ radical pair and can be directly prepared upon photoexcitation. Further, we find that the initial electron acceptor in the PSII-RC is Phe, rather than P$_{\rm D1}$, regardless of excitation wavelength.

Posted Content
TL;DR: In this paper, the authors review the fundamental challenges the approximate functionals have in describing double-excitations and charge-transfer excitations, which are two of the most common impediments for the theory to be applied in a black box way.
Abstract: Time-dependent density functional theory has emerged as a method of choice for calculations of spectra and response properties in physics, chemistry, and biology, with its system-size scaling enabling computations on systems much larger than possible otherwise. While increasingly complex and interesting systems have been successfully tackled with relatively simple functional approximations, there has also been increasing awareness that these functionals tend to fail for certain classes of approximations. I review the fundamental challenges the approximate functionals have in describing double-excitations and charge-transfer excitations, which are two of the most common impediments for the theory to be applied in a black box way. At the same time, I describe the progress made in recent decades in developing functional approximations that give useful predictions for these excitations.

Journal ArticleDOI
TL;DR: The theory behind computation of absolute binding free energies using explicit-solvent molecular simulations is well-established, yet somewhat complex, with counter-intuitive aspects This leads to frequent frustration, common misconceptions, and sometimes, erroneous numerical treatment as mentioned in this paper.
Abstract: The theory behind computation of absolute binding free energies using explicit-solvent molecular simulations is well-established, yet somewhat complex, with counter-intuitive aspects This leads to frequent frustration, common misconceptions, and sometimes, erroneous numerical treatment To improve this, we present the main practically relevant segments of the theory with constant reference to physical intuition We pinpoint the role of the implicit or explicit definition of the bound state (or the binding site), to make a robust link between an experimental measurement and a computational result We clarify the role of symmetry, and discuss cases where symmetry number corrections have been misinterpreted In particular, we argue that symmetry corrections as classically presented are a source of confusion, and could be advantageously replaced by restraint free energy contributions We establish that contrary to a common intuition, partial or missing sampling of some modes of symmetric bound states does not affect the calculated decoupling free energies Finally, we review these questions and pitfalls in the context of a few common practical situations: binding to a symmetric receptor (equivalent binding sites), binding of a symmetric ligand (equivalent poses), and formation of a symmetric complex, in the case of homodimerization