Journal Article

Machine learning for data-driven discovery in solid Earth geoscience

TLDR
Solid Earth geoscience is a field with very large sets of observations, which are ideal for analysis with machine-learning methods. This article reviews how these methods can be applied to solid Earth datasets.
Abstract
BACKGROUND The solid Earth, oceans, and atmosphere together form a complex interacting geosystem. Processes relevant to understanding Earth’s geosystem behavior range in spatial scale from the atomic to the planetary, and in temporal scale from milliseconds to billions of years. Physical, chemical, and biological processes interact and have substantial influence on this complex geosystem, and humans interact with it in ways that are increasingly consequential to the future of both the natural world and civilization as the finiteness of Earth becomes increasingly apparent and limits on available energy, mineral resources, and fresh water increasingly affect the human condition. Earth is subject to a variety of geohazards that are poorly understood, yet increasingly impactful as our exposure grows through increasing urbanization, particularly in hazard-prone areas. We have a fundamental need to develop the best possible predictive understanding of how the geosystem works, and that understanding must be informed by both the present and the deep past. This understanding will come through the analysis of increasingly large geo-datasets and from computationally intensive simulations, often connected through inverse problems. Geoscientists are faced with the challenge of extracting as much useful information as possible and gaining new insights from these data, simulations, and the interplay between the two. Techniques from the rapidly evolving field of machine learning (ML) will play a key role in this effort.

ADVANCES The confluence of ultrafast computers with large memory, rapid progress in ML algorithms, and the ready availability of large datasets places geoscience at the threshold of dramatic progress. We anticipate that this progress will come from the application of ML across three categories of research effort: (i) automation to perform a complex prediction task that cannot easily be described by a set of explicit commands; (ii) modeling and inverse problems to create a representation that approximates numerical simulations or captures relationships; and (iii) discovery to reveal new and often unanticipated patterns, structures, or relationships. Examples of automation include geologic mapping using remote-sensing data, characterizing the topology of fracture systems to model subsurface transport, and classifying volcanic ash particles to infer eruptive mechanism. Examples of modeling include approximating the viscoelastic response for complex rheology, determining wave speed models directly from tomographic data, and classifying diverse seismic events. Examples of discovery include predicting laboratory slip events using observations of acoustic emissions, detecting weak earthquake signals using similarity search, and determining the connectivity of subsurface reservoirs using groundwater tracer observations.

OUTLOOK The use of ML in solid Earth geosciences is growing rapidly, but is still in its early stages and making uneven progress. Much remains to be done with existing datasets from long-standing data sources, which in many cases are largely unexplored. Newer, unconventional data sources such as light detection and ranging (LiDAR), fiber-optic sensing, and crowd-sourced measurements may demand new approaches through both the volume and the character of information that they present. Practical steps could accelerate and broaden the use of ML in the geosciences. Wider adoption of open-science principles such as open source code, open data, and open access will better position the solid Earth community to take advantage of rapid developments in ML and artificial intelligence. Benchmark datasets and challenge problems have played an important role in driving progress in artificial intelligence research by enabling rigorous performance comparison and could play a similar role in the geosciences. Testing on high-quality datasets produces better models, and benchmark datasets make these data widely available to the research community. They also help recruit expertise from allied disciplines. Close collaboration between geoscientists and ML researchers will aid in making quick progress in ML geoscience applications. Extracting maximum value from geoscientific data will require new approaches for combining data-driven methods, physical modeling, and algorithms capable of learning with limited, weak, or biased labels. Funding opportunities that target the intersection of these disciplines, as well as a greater component of data science and ML education in the geosciences, could help bring this effort to fruition.

The list of author affiliations is available in the full article online.
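
One of the discovery examples named in the abstract, detecting earthquake signals by similarity search, can be illustrated with a toy sketch. The code below is an assumption-laden illustration, not the method of any study covered by the review: it uses plain template matching by normalized cross-correlation on synthetic data, and the function names (`normalized_cross_correlation`, `pick_detections`), the random waveform, the noise level, and the 0.6 threshold are all hypothetical choices made for this example.

```python
import numpy as np


def normalized_cross_correlation(trace, template):
    """Correlation coefficient between `template` and every same-length window of `trace`."""
    n = len(template)
    t = (template - template.mean()) / template.std()
    # One row per lag; this is a read-only view, so no data is copied here.
    windows = np.lib.stride_tricks.sliding_window_view(trace, n)
    w_mean = windows.mean(axis=1, keepdims=True)
    w_std = windows.std(axis=1, keepdims=True)
    w_std[w_std == 0] = np.inf  # perfectly flat windows get zero correlation
    return ((windows - w_mean) / w_std) @ t / n


def pick_detections(cc, threshold, min_separation):
    """Greedily pick peaks above `threshold`, at least `min_separation` samples apart."""
    cc = cc.copy()
    onsets = []
    while cc.max() > threshold:
        peak = int(cc.argmax())
        onsets.append(peak)
        # Suppress the neighbourhood so side lobes of this peak are not re-detected.
        cc[max(0, peak - min_separation): peak + min_separation + 1] = -1.0
    return sorted(onsets)


# Synthetic stand-in for continuous data (assumption: white Gaussian noise).
rng = np.random.default_rng(0)
n_template = 100
template = rng.normal(size=n_template) * np.hanning(n_template)
trace = rng.normal(size=5000)
true_onsets = [1200, 3700]
for onset in true_onsets:
    # Two repeats of the same waveform, a few times the noise standard deviation.
    trace[onset:onset + n_template] += 3.0 * template

cc = normalized_cross_correlation(trace, template)
detections = pick_detections(cc, threshold=0.6, min_separation=n_template)
print("detected onsets:", detections)
```

Template matching needs a known waveform in advance. Searching for unknown repeating signals across long continuous records generally calls for more scalable similarity-search techniques, which this sketch does not attempt.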


Citations

Machine learning with Python

TL;DR: This presentation is a case study from the travel and holiday industry that describes the effectiveness of various techniques, as well as the performance of Python-based libraries such as the Python Data Analysis Library (Pandas) and Scikit-learn (built on NumPy, SciPy and matplotlib).
Journal Article

A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics

TL;DR: It is found that honoring the physics leads to improved robustness: when trained only on a few parameters, the PINN model can accurately predict the solution for a wide range of parameters new to the network—thus pointing to an important application of this framework to sensitivity analysis and surrogate modeling.
Posted Content

Integrating Physics-Based Modeling with Machine Learning: A Survey

TL;DR: This survey provides an overview of techniques for integrating machine learning with physics-based modeling, and of the classes of methodologies used to construct physics-guided machine learning models and hybrid physics-machine learning frameworks, from a machine learning standpoint.
Journal Article

Enforcing Analytic Constraints in Neural Networks Emulating Physical Systems.

TL;DR: This work introduces a systematic way of enforcing nonlinear analytic constraints in neural networks via constraints in the architecture or the loss function, which reduces errors in the subsets of the outputs most impacted by the constraints.