scispace - formally typeset
Search or ask a question
Author

Suzanna C. Ward

Bio: Suzanna C. Ward is an academic researcher from University of Cambridge. The author has contributed to research in topics: Computer science & Medicine. The author has an hindex of 4, co-authored 6 publications receiving 5313 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: The creation, maintenance, information content and availability of the Cambridge Structural Database (CSD), the world’s repository of small molecule crystal structures, are described.
Abstract: The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal–organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface.

6,313 citations

Journal ArticleDOI
TL;DR: The generation and characterization of the most complete collection of metal–organic frameworks (MOFs) maintained and updated, for the first time, by the Cambridge Crystallographic Data Centre (CCDC).
Abstract: We report the generation and characterization of the most complete collection of metal–organic frameworks (MOFs) maintained and updated, for the first time, by the Cambridge Crystallographic Data Centre (CCDC). To set up this subset, we asked the question “what is a MOF?” and implemented a number of “look-for-MOF” criteria embedded within a bespoke Cambridge Structural Database (CSD) Python API workflow to identify and extract information on 69 666 MOF materials. The CSD MOF subset is updated regularly with subsequent MOF additions to the CSD, bringing a unique record for all researchers working in the area of porous materials around the world, whether to perform high-throughput computational screening for materials discovery or to have a global view over the existing structures in a single resource. Using this resource, we then developed and used an array of computational tools to remove residual solvent molecules from the framework pores of all the MOFs identified and went on to analyze geometrical and ...

634 citations

Journal ArticleDOI
TL;DR: The proceedings and conclusions from the first Worldwide PDB/Cambridge Crystallographic Data Center/Drug Design Data Resource/D3R Ligand Validation Workshop are described, with consensus recommendations on best practices developed in response to each of these questions provided.

58 citations

Journal ArticleDOI
07 Apr 2021
TL;DR: The CSD MOF Collection as mentioned in this paper is a dataset of >10,000 3D porous computation-ready MOFs, derived from our automatically updated CSD-MOF subset.
Abstract: The computational study of metal-organic frameworks (MOFs) relies heavily on the availability, quality, and simulation-readiness of MOF structural data We announce here the release of the freely accessible and regularly updated “CSD MOF Collection,” a dataset of >10,000 3D porous computation-ready MOFs, derived from our automatically updated CSD MOF subset

12 citations

Journal ArticleDOI
TL;DR: This study shows the adsorption of this highly unreactive element within an organic or metal-organic environment and reveals the first crystallographic observation of an interaction between neon and a transition metal.

11 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: An overview of Mercury 4.0, an analysis, design and prediction platform that acts as a hub for the entire Cambridge Structural Database software suite, is presented.
Abstract: The program Mercury, developed at the Cambridge Crystallographic Data Centre, was originally designed primarily as a crystal structure visualization tool. Over the years the fields and scientific communities of chemical crystallography and crystal engineering have developed to require more advanced structural analysis software. Mercury has evolved alongside these scientific communities and is now a powerful analysis, design and prediction platform which goes a lot further than simple structure visualization.

2,075 citations

Journal ArticleDOI
08 Aug 2019
TL;DR: A comprehensive overview and analysis of the most recent research in machine learning principles, algorithms, descriptors, and databases in materials science, and proposes solutions and future research paths for various challenges in computational materials science.
Abstract: One of the most exciting tools that have entered the material science toolbox in recent years is machine learning. This collection of statistical methods has already proved to be capable of considerably speeding up both fundamental and applied research. At present, we are witnessing an explosion of works that develop and apply machine learning to solid-state systems. We provide a comprehensive overview and analysis of the most recent research in this topic. As a starting point, we introduce machine learning principles, algorithms, descriptors, and databases in materials science. We continue with the description of different machine learning approaches for the discovery of stable materials and the prediction of their crystal structure. Then we discuss research in numerous quantitative structure–property relationships and various approaches for the replacement of first-principle methods by machine learning. We review how active learning and surrogate-based optimization can be applied to improve the rational design process and related examples of applications. Two major questions are always the interpretability of and the physical understanding gained from machine learning models. We consider therefore the different facets of interpretability and their importance in materials science. Finally, we propose solutions and future research paths for various challenges in computational materials science.

1,301 citations

Journal ArticleDOI
TL;DR: A large scale benchmark for molecular machine learning consisting of multiple public datasets, metrics, featurizations and learning algorithms.
Abstract: Molecular machine learning has been maturing rapidly over the last few years. Improved methods and the presence of larger datasets have enabled machine learning algorithms to make increasingly accurate predictions about molecular properties. However, algorithmic progress has been limited due to the lack of a standard benchmark to compare the efficacy of proposed methods; most new algorithms are benchmarked on different datasets making it challenging to gauge the quality of proposed methods. This work introduces MoleculeNet, a large scale benchmark for molecular machine learning. MoleculeNet curates multiple public datasets, establishes metrics for evaluation, and offers high quality open-source implementations of multiple previously proposed molecular featurization and learning algorithms (released as part of the DeepChem open source library). MoleculeNet benchmarks demonstrate that learnable representations are powerful tools for molecular machine learning and broadly offer the best performance. However, this result comes with caveats. Learnable representations still struggle to deal with complex tasks under data scarcity and highly imbalanced classification. For quantum mechanical and biophysical datasets, the use of physics-aware featurizations can be more important than choice of particular learning algorithm.

1,277 citations

Journal ArticleDOI
TL;DR: This review highlights the research aimed at the implementation of MOFs as an integral part of solid-state microelectronics and discusses the fundamental and applied aspects of this two-pronged approach.
Abstract: Metal-organic frameworks (MOFs) are typically highlighted for their potential application in gas storage, separations and catalysis. In contrast, the unique prospects these porous and crystalline materials offer for application in electronic devices, although actively developed, are often underexposed. This review highlights the research aimed at the implementation of MOFs as an integral part of solid-state microelectronics. Manufacturing these devices will critically depend on the compatibility of MOFs with existing fabrication protocols and predominant standards. Therefore, it is important to focus in parallel on a fundamental understanding of the distinguishing properties of MOFs and eliminating fabrication-related obstacles for integration. The latter implies a shift from the microcrystalline powder synthesis in chemistry labs, towards film deposition and processing in a cleanroom environment. Both the fundamental and applied aspects of this two-pronged approach are discussed. Critical directions for future research are proposed in an updated high-level roadmap to stimulate the next steps towards MOF-based microelectronics within the community.

908 citations

Journal ArticleDOI
01 Sep 2017-IUCrJ
TL;DR: The accurate and efficient CE-B3LYP and CE-HF model energies for intermolecular interactions in molecular crystals are extended to a broad range of crystals by calibration against density functional results for molecule/ion pairs extracted from 171 crystal structures.

704 citations