scispace - formally typeset
Search or ask a question

Showing papers by "Francesca Grisoni published in 2021"


Journal ArticleDOI
TL;DR: In this paper, the current status of AI in chemoinformatics is reviewed, including quantitative structure-activity/property relationship and structure-based modeling, de novo molecular design, and chemical synthesis prediction.
Abstract: Introduction: Artificial intelligence (AI) has inspired computer-aided drug discovery. The widespread adoption of machine learning, in particular deep learning, in multiple scientific disciplines, and the advances in computing hardware and software, among other factors, continue to fuel this development. Much of the initial skepticism regarding applications of AI in pharmaceutical discovery has started to vanish, consequently benefitting medicinal chemistry.Areas covered: The current status of AI in chemoinformatics is reviewed. The topics discussed herein include quantitative structure-activity/property relationship and structure-based modeling, de novo molecular design, and chemical synthesis prediction. Advantages and limitations of current deep learning applications are highlighted, together with a perspective on next-generation AI for drug discovery.Expert opinion: Deep learning-based approaches have only begun to address some fundamental problems in drug discovery. Certain methodological advances, such as message-passing models, spatial-symmetry-preserving networks, hybrid de novo design, and other innovative machine learning paradigms, will likely become commonplace and help address some of the most challenging questions. Open data sharing and model development will play a central role in the advancement of drug discovery with AI.

90 citations


Journal ArticleDOI
TL;DR: The results support the suitability of the proposed design-make-test-analyze framework as a blueprint for automated drug design with artificial intelligence and miniaturized bench-top synthesis.
Abstract: Automating the molecular design-make-test-analyze cycle accelerates hit and lead finding for drug discovery. Using deep learning for molecular design and a microfluidics platform for on-chip chemical synthesis, liver X receptor (LXR) agonists were generated from scratch. The computational pipeline was tuned to explore the chemical space of known LXRα agonists and generate novel molecular candidates. To ensure compatibility with automated on-chip synthesis, the chemical space was confined to the virtual products obtainable from 17 one-step reactions. Twenty-five de novo designs were successfully synthesized in flow. In vitro screening of the crude reaction products revealed 17 (68%) hits, with up to 60-fold LXR activation. The batch resynthesis, purification, and retesting of 14 of these compounds confirmed that 12 of them were potent LXR agonists. These results support the suitability of the proposed design-make-test-analyze framework as a blueprint for automated drug design with artificial intelligence and miniaturized bench-top synthesis.

47 citations


Journal ArticleDOI
Kamel Mansouri1, Agnes L. Karmaus, Jeremy M. Fitzpatrick1, Grace Patlewicz1, Prachi Pradeep1, Prachi Pradeep2, Domenico Alberga3, Nathalie Alépée4, Timothy E. H. Allen5, D Allen, Vinicius M. Alves6, Vinicius M. Alves7, Carolina Horta Andrade6, Tyler R. Auernhammer8, Davide Ballabio9, Shannon M. Bell, Emilio Benfenati10, Sudin Bhattacharya11, Joyce V. Bastos6, Stephen Boyd11, James B. Brown12, Stephen J. Capuzzi7, Yaroslav Chushak13, Heather L. Ciallella14, Alex M. Clark, Viviana Consonni9, Pankaj R. Daga15, Sean Ekins, Sherif Farag7, Maxim V. Fedorov16, Denis Fourches17, Domenico Gadaleta10, Feng Gao11, Jeffery M. Gearhart13, Garett Goh18, Jonathan M. Goodman5, Francesca Grisoni9, Christopher M. Grulke1, Thomas Hartung19, Matthew J. Hirn11, Pavel Karpov, Alexandru Korotcov, Giovanna J. Lavado10, Michael S. Lawless15, Xinhao Li17, Thomas Luechtefeld19, F. Lunghini20, Giuseppe Felice Mangiatordi3, Gilles Marcou20, Dan Marsh19, Todd M. Martin21, Andrea Mauri, Eugene N. Muratov7, Eugene N. Muratov6, Glenn J. Myatt, Dac-Trung Nguyen22, Orazio Nicolotti3, Paritosh Pande18, Amanda K. Parks8, Tyler Peryea22, Ahsan Habib Polash12, Robert Rallo18, Alessandra Roncaglioni10, Craig Rowlands19, Patricia Ruiz23, Daniel P. Russo14, Ahmed Sayed, Risa Sayre2, Risa Sayre1, Timothy Sheils22, Charles Siegel18, Arthur C. Silva6, Anton Simeonov22, Sergey Sosnin16, Noel Southall22, Judy Strickland, Yun Tang24, Brian J. Teppen11, Igor V. Tetko, Dennis G. Thomas18, Valery Tkachenko, R Todeschini9, Cosimo Toma10, Ignacio J. Tripodi25, Daniela Trisciuzzi3, Alexander Tropsha7, Alexandre Varnek20, Kristijan Vukovic10, Zhongyu Wang26, Liguo Wang26, Katrina M. Waters18, Andrew J. Wedlake5, Sanjeeva J. Wijeyesakere8, Daniel M. Wilson8, Zijun Xiao26, Hongbin Yang24, Gergely Zahoranszky-Kohalmi22, Alexey V. Zakharov22, Fagen F. Zhang8, Zhen Zhang27, Tongan Zhao22, Hao Zhu14, Kimberley M. Zorn, Warren Casey1, Nicole Kleinstreuer1 
TL;DR: In this paper, the authors proposed a method to assess tens of thousands of chemical substances that need to be assessed for their potential toxicity, which serves as the basis for regulatory testing.
Abstract: Background: Humans are exposed to tens of thousands of chemical substances that need to be assessed for their potential toxicity. Acute systemic toxicity testing serves as the basis for regulatory ...

38 citations


Journal ArticleDOI
TL;DR: This paper leveraged the probabilities learned by chemical language models with the beam search algorithm as a model-intrinsic technique for automated molecule design and scoring and yielded novel inverse agonists of retinoic acid receptor-related orphan receptors (RORs).
Abstract: Chemical language models enable de novo drug design without the requirement for explicit molecular construction rules. While such models have been applied to generate novel compounds with desired bioactivity, the actual prioritization and selection of the most promising computational designs remains challenging. Herein, we leveraged the probabilities learnt by chemical language models with the beam search algorithm as a model-intrinsic technique for automated molecule design and scoring. Prospective application of this method yielded novel inverse agonists of retinoic acid receptor-related orphan receptors (RORs). Each design was synthesizable in three reaction steps and presented low-micromolar to nanomolar potency towards RORγ. This model-intrinsic sampling technique eliminates the strict need for external compound scoring functions, thereby further extending the applicability of generative artificial intelligence to data-driven drug discovery.

28 citations



Posted Content
TL;DR: Geometric deep learning (GDL) has emerged as a recent paradigm in artificial intelligence as discussed by the authors and has shown particular promise in molecular modeling applications, in which various molecular representations with different symmetry properties and levels of abstraction exist.
Abstract: Geometric deep learning (GDL), which is based on neural network architectures that incorporate and process symmetry information, has emerged as a recent paradigm in artificial intelligence. GDL bears particular promise in molecular modeling applications, in which various molecular representations with different symmetry properties and levels of abstraction exist. This review provides a structured and harmonized overview of molecular GDL, highlighting its applications in drug discovery, chemical synthesis prediction, and quantum chemistry. Emphasis is placed on the relevance of the learned molecular features and their complementarity to well-established molecular descriptors. This review provides an overview of current challenges and opportunities, and presents a forecast of the future of GDL for molecular sciences.

4 citations


Book ChapterDOI
TL;DR: The Weighted Holistic Atom Localization and Entity Shape (WHALES) descriptors were originally designed for scaffold hopping from natural products to synthetic molecules as mentioned in this paper, which capture molecular shape and partial charges simultaneously.
Abstract: Molecular descriptors encode a variety of molecular representations for computer-assisted drug discovery. Here, we focus on the Weighted Holistic Atom Localization and Entity Shape (WHALES) descriptors, which were originally designed for scaffold hopping from natural products to synthetic molecules. WHALES descriptors capture molecular shape and partial charges simultaneously. We introduce the key aspects of the WHALES concept and provide a step-by-step guide on how to use these descriptors for virtual compound screening and scaffold hopping. The results presented can be reproduced by using the code freely available from URL: github.com/ETHmodlab/scaffold_hopping_whales .

4 citations


Posted ContentDOI
04 Mar 2021-ChemRxiv
TL;DR: This work leveraged the probabilities learnt by chemical language models with the beam search algorithm as a model-intrinsic technique for automated molecule design and scoring and yielded three novel inverse agonists of retinoic acid receptor-related orphan receptors.
Abstract: Chemical language models enable de novo drug design without the requirement for explicit molecular construction rules. While such models have been applied to generate novel compounds with desired bioactivity, the actual prioritization and selection of the most promising computational designs remains challenging. In this work, we leveraged the probabilities learnt by chemical language models with the beam search algorithm as a model-intrinsic technique for automated molecule design and scoring. Prospective application of this method yielded three novel inverse agonists of retinoic acid receptor-related orphan receptors (RORs). Each design was synthesizable in three reaction steps and presented low-micromolar to nanomolar potency towards RORg. This model-intrinsic sampling technique eliminates the strict need for external compound scoring functions, thereby further extending the applicability of generative artificial intelligence to data-driven drug discovery.

2 citations


Posted ContentDOI
11 Aug 2021-ChemRxiv
TL;DR: Previously unknown off-target interactions for the two JAK inhibitors were identified and proposed pharmacological off- target effects include attenuation of pulmonary vascular remodeling, modulation of Hepatitis C viral response and hypomagnesemia.
Abstract: Objectives: There are no clear on-target mechanisms that explain the increased risk for thrombosis and viral infection or reactivation associated with Janus kinase (JAK) inhibitors. We aimed to identify and validate off-target binding effects of the JAK inhibitors baricitinib and tofacitinib using computational and experimental methods. Methods: Potential biological targets of baricitinib and tofacitinib were predicted using two established computational methods. Targets related to thrombosis or viral infection/reactivation were experimentally validated using biochemical and cell-based in vitro assays. Results: Overall, 98 targets were predicted by the computational methods (baricitinib n=41; tofacitinib n=58), of which 17 drug-target pairs were related to thrombosis (n=10) or viral infection/reactivation (n=7), and 11 were commercially available for in vitro analysis. Inhibitory activity was identified in vitro for four drug-target pairs – two related to thrombosis in the micromolar range (phosphodiesterase 10A [baricitinib], transient receptor potential cation channel subfamily M subtype 6 [tofacitinib]) and two related to viral infection/reactivation in the nanomolar range (Serine/threonine protein kinase N2 [baricitinib, tofacitinib]). Conclusions: Previously unknown off-target interactions for the two JAK inhibitors were identified. The proposed pharmacological off-target effects include attenuation of pulmonary vascular remodeling, modulation of Hepatitis C viral response and hypomagnesemia. Off-target effects related to an increased risk of thrombosis or viral infection/reactivation for baricitinib and tofacitinib were not identified. Further clinical and experimental research is required to explain the observed thrombosis and viral infection/reactivation risk.

1 citations



Posted ContentDOI
04 Oct 2021-ChemRxiv
TL;DR: It is shown that “hybrid” CLMs can additionally leverage the bioactivity information available for the training compounds to positively advocate hybrid CLMs for virtual compound screening and activity-focused molecular design in low-data situations.
Abstract: Generative chemical language models (CLMs) can be used for de novo molecular structure generation. These CLMs learn from the structural information of known molecules to generate new ones. In this paper, we show that “hybrid” CLMs can additionally leverage the bioactivity information available for the training compounds. To computationally design ligands of phosphoinositide 3-kinase gamma (PI3Kγ), we created a large collection of virtual molecules with a generative CLM. This primary virtual compound library was further refined using a CLM-based classifier for bioactivity prediction. This second hybrid CLM was pretrained with patented molecular structures and fine-tuned with known PI3Kγ binders and non-binders by transfer learning. Several of the computer-generated molecular designs were commercially available, which allowed for fast prescreening and preliminary experimental validation. A new PI3Kγ ligand with sub-micromolar activity was identified. The results positively advocate hybrid CLMs for virtual compound screening and activity-focused molecular design in low-data situations.

Posted ContentDOI
27 Oct 2021-ChemRxiv
TL;DR: This paper applied the perplexity metric to determine the degree to which the molecules generated by a chemical language model match the desired design objectives, which allowed identifying and ranking the most promising molecular designs based on the probabilities learned by the CLM.
Abstract: Chemical language models (CLMs) can be employed to design molecules with desired properties. CLMs generate new chemical structures in the form of textual representations, such as the simplified molecular input line entry systems (SMILES) strings, in a rule-free manner. However, the quality of these de novo generated molecules is difficult to assess a priori. In this study, we apply the perplexity metric to determine the degree to which the molecules generated by a CLM match the desired design objectives. This model-intrinsic score allows identifying and ranking the most promising molecular designs based on the probabilities learned by the CLM. Using perplexity to compare “greedy” (beam search) with “explorative” (multinomial sampling) methods for SMILES generation, certain advantages of multinomial sampling become apparent. Additionally, perplexity scoring is performed to identify undesired model biases introduced during model training and allows the development of a new ranking system to remove those undesired biases.