scispace - formally typeset
Search or ask a question

Showing papers by "Mark S. Hybertsen published in 2023"


Journal ArticleDOI
TL;DR: In this paper , a semi-supervised machine learning method for the discovery of structure-spectrum relationships is developed and demonstrated using the specific example of interpreting X-ray absorption near-edge structure (XANES) spectra.
Abstract: A new semi-supervised machine learning method for the discovery of structure-spectrum relationships is developed and demonstrated using the specific example of interpreting X-ray absorption near-edge structure (XANES) spectra. This method constructs a one-to-one mapping between individual structure descriptors and spectral trends. Specifically, an adversarial autoencoder is augmented with a novel rank constraint (RankAAE). The RankAAE methodology produces a continuous and interpretable latent space, where each dimension can track an individual structure descriptor. As a part of this process, the model provides a robust and quantitative measure of the structure-spectrum relationship by decoupling intertwined spectral contributions from multiple structural characteristics. This makes it ideal for spectral interpretation and the discovery of new descriptors. The capability of this procedure is showcased by considering five local structure descriptors and a database of over fifty thousand simulated XANES spectra across eight first-row transition metal oxide families. The resulting structure-spectrum relationships not only reproduce known trends in the literature, but also reveal unintuitive ones that are visually indiscernible in large data sets. The results suggest that the RankAAE methodology has great potential to assist researchers to interpret complex scientific data, test physical hypotheses, and reveal new patterns that extend scientific insight.

2 citations


30 Mar 2023
TL;DR: In this paper , the convergence performance of first-principle simulated X-ray absorption spectroscopy (XAS) simulations is evaluated using the Ti-O-10 dataset and three state-of-the-art codes: xSpectra, ocean and exciting.
Abstract: X-ray absorption spectroscopy (XAS) is an element-specific materials characterization technique that is sensitive to structural and electronic properties. First-principles simulated XAS has been widely used as a powerful tool to interpret experimental spectra and draw physical insights. Recently, there has also been growing interest in building computational XAS databases to enable data analytics and machine learning applications. However, there are non-trivial differences among commonly used XAS simulation codes, both in underlying theoretical formalism and in technical implementation. Reliable and reproducible computational XAS databases require systematic benchmark studies. In this work, we benchmarked Ti K-edge XAS simulations of ten representative Ti-O binary compounds, which we refer to as the Ti-O-10 dataset, using three state-of-the-art codes: xspectra, ocean and exciting. We systematically studied the convergence behavior with respect to the input parameters and developed a workflow to automate and standardize the calculations to ensure converged spectra. Our benchmark comparison shows: (1) the two Bethe-Salpeter equation (BSE) codes (ocean and exciting) have excellent agreement in the energy range studied (up to 35 eV above the onset) with an average Spearman's rank correlation score of 0.998; (2) good agreement is obtained between the core-hole potential code (xspectra) and BSE codes (ocean and exciting) with an average Spearman's rank correlation score of 0.990. Our benchmark study provides important standards for first-principles XAS simulations with broad impact in data-driven XAS analysis.