Posted Content

Integrating Physics-Based Modeling with Machine Learning: A Survey

01 Jan 2020
TL;DR: Provides a structured and comprehensive overview of techniques to integrate machine learning with physics-based modeling, and organizes, from a machine learning standpoint, the classes of methodologies used to construct physics-guided machine learning models and hybrid physics-machine learning frameworks.
Abstract: In this manuscript, we provide a structured and comprehensive overview of techniques to integrate machine learning with physics-based modeling. First, we provide a summary of application areas for which these approaches have been applied. Then, we describe classes of methodologies used to construct physics-guided machine learning models and hybrid physics-machine learning frameworks from a machine learning standpoint. With this foundation, we then provide a systematic organization of these existing techniques and discuss ideas for future research.
Citations
Journal ArticleDOI
TL;DR: Shows that the full set of hydromagnetic equations admits five more integrals, besides the energy integral, if dissipative processes are absent; an earlier integral of this kind made it possible to formulate a variational principle for force-free magnetic fields.
Abstract: $I_1 = \int_V \mathbf{A} \cdot \mathbf{H} \, dV$ (1), where A represents the magnetic vector potential, is an integral of the hydromagnetic equations. This $I_1$-integral made it possible to formulate a variational principle for the force-free magnetic fields. The integral expresses the fact that motions cannot transform a given field into an entirely arbitrary different field, if the conductivity of the medium is considered infinite. In this paper we shall show that the full set of hydromagnetic equations admits five more integrals, besides the energy integral, if dissipative processes are absent. These integrals, as we shall presently verify, are $I_2 = \int_V \mathbf{H} \cdot \mathbf{v} \, dV$ (2), …
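For context, the conservation property this abstract appeals to can be stated compactly. The following is the standard ideal-MHD statement (our reconstruction for the reader, not text quoted from the paper):

```latex
% Woltjer-type invariance: with infinite conductivity (no dissipation),
% the integral of A . H over the plasma volume is conserved, which is
% why motions cannot turn a given field into an arbitrary other field.
\frac{\mathrm{d}I_1}{\mathrm{d}t}
  = \frac{\mathrm{d}}{\mathrm{d}t}\int_V \mathbf{A}\cdot\mathbf{H}\,\mathrm{d}V
  = 0 \qquad \text{when dissipative processes are absent.}
```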

1,858 citations

Posted Content
TL;DR: From smart grids to disaster management, identifies high-impact problems where machine learning, in collaboration with other fields, can fill existing gaps and join the global effort against climate change.
Abstract: Climate change is one of the greatest challenges facing humanity, and we, as machine learning experts, may wonder how we can help. Here we describe how machine learning can be a powerful tool in reducing greenhouse gas emissions and helping society adapt to a changing climate. From smart grids to disaster management, we identify high impact problems where existing gaps can be filled by machine learning, in collaboration with other fields. Our recommendations encompass exciting research questions as well as promising business opportunities. We call on the machine learning community to join the global effort against climate change.

441 citations

Journal Article
TL;DR: This work introduces a scheme for molecular simulations, the deep potential molecular dynamics (DPMD) method, based on a many-body potential and interatomic forces generated by a carefully crafted deep neural network trained with ab initio data.
Abstract: We introduce a scheme for molecular simulations, the deep potential molecular dynamics (DPMD) method, based on a many-body potential and interatomic forces generated by a carefully crafted deep neural network trained with ab initio data. The neural network model preserves all the natural symmetries in the problem. It is first-principles based in the sense that there are no ad hoc components aside from the network model. We show that the proposed scheme provides an efficient and accurate protocol in a variety of systems, including bulk materials and molecules. In all these cases, DPMD gives results that are essentially indistinguishable from the original data, at a cost that scales linearly with system size.
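To make the scheme concrete, here is a toy PyTorch sketch of a DPMD-style potential (our illustration under simplifying assumptions, not the DeePMD code): a per-atom subnetwork maps symmetry-invariant descriptors to atomic energies, the total energy is their sum (hence linear scaling with system size), and interatomic forces come from automatic differentiation, mirroring training against ab initio energies and forces.

```python
import torch
import torch.nn as nn

class ToyPotential(nn.Module):
    def __init__(self, hidden=64):
        super().__init__()
        # Per-atom subnetwork mapping a local descriptor to an atomic
        # energy; summing atomic energies keeps cost linear in system size.
        self.net = nn.Sequential(
            nn.Linear(1, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
            nn.Linear(hidden, 1),
        )

    def forward(self, positions):
        # positions: (N, 3). Pairwise distances are invariant to
        # translation and rotation; averaging over neighbors adds
        # permutation invariance (a crude stand-in for DeePMD descriptors).
        diff = positions.unsqueeze(0) - positions.unsqueeze(1)
        dist = diff.norm(dim=-1)
        n = positions.shape[0]
        mask = ~torch.eye(n, dtype=torch.bool)
        # Simple per-atom descriptor: mean inverse distance to neighbors.
        desc = (1.0 / dist[mask].view(n, n - 1)).mean(dim=1, keepdim=True)
        return self.net(desc).sum()

model = ToyPotential()
pos = torch.randn(8, 3, requires_grad=True)  # 8 atoms at random positions
energy = model(pos)
# Forces are the negative gradient of the energy, as in ab initio MD.
forces = -torch.autograd.grad(energy, pos)[0]
print(energy.item(), forces.shape)  # scalar energy, (8, 3) forces
```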

254 citations

Journal ArticleDOI
TL;DR: This work introduces a systematic way of enforcing nonlinear analytic constraints in neural networks via constraints in the architecture or the loss function, which reduces errors in the subsets of the outputs most impacted by the constraints.
Abstract: Neural networks can emulate nonlinear physical systems with high accuracy, yet they may produce physically inconsistent results when violating fundamental constraints. Here, we introduce a systematic way of enforcing nonlinear analytic constraints in neural networks via constraints in the architecture or the loss function. Applied to convective processes for climate modeling, architectural constraints enforce conservation laws to within machine precision without degrading performance. Enforcing constraints also reduces errors in the subsets of the outputs most impacted by the constraints.
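The two strategies named in the abstract, a soft penalty in the loss versus a hard architectural projection, can be sketched in a few lines. The linear constraint A y = b below is a hypothetical stand-in for the paper's conservation laws:

```python
import torch
import torch.nn as nn

A = torch.tensor([[1.0, 1.0, 1.0]])  # e.g., the outputs must sum to b
b = torch.tensor([1.0])

net = nn.Sequential(nn.Linear(4, 32), nn.Tanh(), nn.Linear(32, 3))
x = torch.randn(5, 4)

# (1) Loss-function constraint: penalize violations softly.
y = net(x)
violation = (y @ A.T - b).pow(2).mean()
loss = violation  # + task loss, weighted against the penalty

# (2) Architectural constraint: project raw outputs onto the
# constraint surface, so A @ y = b holds up to float precision.
def project(y_raw):
    # Orthogonal projection onto {y : A y = b}.
    correction = (y_raw @ A.T - b) @ torch.linalg.pinv(A).T
    return y_raw - correction

y_hard = project(net(x))
print((y_hard @ A.T - b).abs().max())  # ~0 up to machine precision
```

The architectural route is exact by construction, which matches the abstract's "to within machine precision"; the loss route only discourages violations and must be weighted against the task loss.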

187 citations

Journal ArticleDOI
TL;DR: In this paper, the authors survey systematic approaches to incorporating physics and domain knowledge into ML models and distill these approaches into broad categories, and show how these approaches have been used successfully for emulating, downscaling, and forecasting weather and climate processes.
Abstract: Machine learning (ML) provides novel and powerful ways of accurately and efficiently recognizing complex patterns, emulating nonlinear dynamics, and predicting the spatio-temporal evolution of weather and climate processes. Off-the-shelf ML models, however, do not necessarily obey the fundamental governing laws of physical systems, nor do they generalize well to scenarios on which they have not been trained. We survey systematic approaches to incorporating physics and domain knowledge into ML models and distill these approaches into broad categories. Through 10 case studies, we show how these approaches have been used successfully for emulating, downscaling, and forecasting weather and climate processes. The accomplishments of these studies include greater physical consistency, reduced training time, improved data efficiency, and better generalization. Finally, we synthesize the lessons learned and identify scientific, diagnostic, computational, and resource challenges for developing truly robust and reliable physics-informed ML models for weather and climate processes. This article is part of the theme issue 'Machine learning for weather and climate modelling'.

119 citations

References
More filters
Proceedings ArticleDOI
27 Jun 2016
TL;DR: In this paper, the authors propose a residual learning framework to ease the training of networks substantially deeper than those used previously; an ensemble of these residual networks won 1st place in the ILSVRC 2015 classification task.
Abstract: Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. We explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. We provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset we evaluate residual nets with a depth of up to 152 layers—8× deeper than VGG nets [40] but still having lower complexity. An ensemble of these residual nets achieves 3.57% error on the ImageNet test set. This result won the 1st place on the ILSVRC 2015 classification task. We also present analysis on CIFAR-10 with 100 and 1000 layers. The depth of representations is of central importance for many visual recognition tasks. Solely due to our extremely deep representations, we obtain a 28% relative improvement on the COCO object detection dataset. Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.
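The reformulation the abstract describes is small in code. A minimal PyTorch sketch (fully connected rather than convolutional, for brevity):

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, dim):
        super().__init__()
        # The block learns a residual F(x); the layer then outputs
        # F(x) + x instead of an unreferenced mapping H(x).
        self.f = nn.Sequential(
            nn.Linear(dim, dim), nn.ReLU(),
            nn.Linear(dim, dim),
        )

    def forward(self, x):
        # Identity shortcut: gradients flow through the skip connection
        # unattenuated, which is what eases training at large depth.
        return torch.relu(self.f(x) + x)

x = torch.randn(2, 16)
deep = nn.Sequential(*[ResidualBlock(16) for _ in range(50)])
print(deep(x).shape)  # (2, 16), stable even with many stacked blocks
```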

123,388 citations

Book
01 Jan 1983

34,729 citations


Additional excerpts

  • ...In addition to improved generalizability to new data. Physics-based loss function terms have also been used in the discovery of governing equations. Loiseau et al. [155] use constrained least squares [96] to incorporate energy-preserving nonlinearities or to enforce symmetries in the identified equations in the equation learning process described....

    [...]
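The constrained least-squares step mentioned in this excerpt can be written down directly. In the sketch below the candidate-term library, the constraint matrix C, and the target d are hypothetical toy values; solving a KKT system for an equality-constrained fit is the general technique:

```python
import numpy as np

rng = np.random.default_rng(0)
Theta = rng.normal(size=(100, 5))  # library of candidate terms
y = Theta @ np.array([1.0, 0.0, -2.0, 0.0, 0.5]) + 0.01 * rng.normal(size=100)

# One linear constraint on the coefficients, e.g. w[1] + w[3] = 0
# (the kind of relation that makes identified nonlinearities
# energy-preserving or symmetric in Galerkin-type models).
C = np.array([[0.0, 1.0, 0.0, 1.0, 0.0]])
d = np.array([0.0])

# KKT system for: minimize ||Theta w - y||^2  subject to  C w = d.
n, m = Theta.shape[1], C.shape[0]
K = np.block([[Theta.T @ Theta, C.T],
              [C, np.zeros((m, m))]])
rhs = np.concatenate([Theta.T @ y, d])
w = np.linalg.solve(K, rhs)[:n]
print(w, C @ w)  # constraint holds exactly, up to float precision
```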

Book
23 Nov 2005
TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and deals with the supervised learning problem for both regression and classification.
Abstract: Gaussian processes (GPs) provide a principled, practical, probabilistic approach to learning in kernel machines. GPs have received increased attention in the machine-learning community over the past decade, and this book provides a long-needed systematic and unified treatment of theoretical and practical aspects of GPs in machine learning. The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics. The book deals with the supervised-learning problem for both regression and classification, and includes detailed algorithms. A wide variety of covariance (kernel) functions are presented and their properties discussed. Model selection is discussed both from a Bayesian and a classical perspective. Many connections to other well-known techniques from machine learning and statistics are discussed, including support-vector machines, neural networks, splines, regularization networks, relevance vector machines and others. Theoretical issues including learning curves and the PAC-Bayesian framework are treated, and several approximation methods for learning with large datasets are discussed. The book contains illustrative examples and exercises, and code and datasets are available on the Web. Appendixes provide mathematical background and a discussion of Gaussian Markov processes.

11,357 citations


"Integrating Physics-Based Modeling ..." refers methods in this paper

  • ...In GPR, first a Gaussian process prior must be assumed in the form of a mean function and a matrix-valued kernel or covariance function....

    [...]

  • ...GPR has several benefits, including working well on small amounts of data and enabling uncertainty measurements on predictions....

    [...]

  • ...Gaussian process regression (GPR) [265] is a nonparametric, Bayesian approach to regression that is increasingly being used in ML applications....

    [...]

  • ...One way to incorporate physical knowledge in GPR is to encode differential equations into the kernel [242]....

    [...]

  • ...More recently, Glielmo et al. [94] propose a vectorial GPR that encodes physical knowledge in the matrix-valued kernel function....

    [...]
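The excerpts above outline the GPR recipe: assume a prior through a mean function and covariance kernel, then condition on data to obtain predictions with uncertainty. A minimal NumPy sketch of the standard posterior equations (zero mean function, RBF kernel, toy 1-D data):

```python
import numpy as np

def rbf(a, b, length=1.0):
    # Squared-exponential covariance function k(x, x').
    return np.exp(-0.5 * (a[:, None] - b[None, :]) ** 2 / length**2)

X = np.array([-2.0, -1.0, 0.5, 2.0])  # training inputs
y = np.sin(X)                          # training targets
Xs = np.linspace(-3, 3, 7)             # test inputs
noise = 1e-4                           # observation-noise variance

K = rbf(X, X) + noise * np.eye(len(X))
Ks = rbf(X, Xs)
Kss = rbf(Xs, Xs)

# GP posterior mean and covariance at the test points.
alpha = np.linalg.solve(K, y)
mean = Ks.T @ alpha
cov = Kss - Ks.T @ np.linalg.solve(K, Ks)
std = np.sqrt(np.clip(np.diag(cov), 0.0, None))  # predictive uncertainty
print(mean.round(3), std.round(3))
```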

Journal ArticleDOI
TL;DR: In this article, the authors introduce physics-informed neural networks, which are trained to solve supervised learning tasks while respecting any given laws of physics described by general nonlinear partial differential equations.

5,448 citations


"Integrating Physics-Based Modeling ..." refers methods in this paper

  • ...Solve PDEs [147] [65] [140] [9] [215] [42] [264] [106] [235] [107] [131] [209] [234] [279] [281] [60] [177] [208] [231] [301] [71] [89] [129] [268] [194] [176] [16] [48] [218] [44] [172] [19] [51] [76] [278] [206] [166]...

    [...]

  • ...[208], this concept is developed and shown to create data-efficient spatiotemporal function approximators to both solve and find parameters of basic PDEs like Burgers Equation or Schrodinger Equation....

    [...]
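A schematic physics-informed network for the viscous Burgers equation u_t + u*u_x = nu*u_xx illustrates the mechanism these excerpts describe. This is a hedged sketch of the general idea, not the authors' exact architecture or training setup:

```python
import torch
import torch.nn as nn

# Network maps (t, x) to the solution u(t, x).
net = nn.Sequential(nn.Linear(2, 32), nn.Tanh(),
                    nn.Linear(32, 32), nn.Tanh(),
                    nn.Linear(32, 1))
nu = 0.01  # viscosity

def pde_residual(t, x):
    # Derivatives of the network output via automatic differentiation.
    u = net(torch.stack([t, x], dim=1)).squeeze(1)
    u_t, u_x = torch.autograd.grad(u.sum(), (t, x), create_graph=True)
    u_xx = torch.autograd.grad(u_x.sum(), x, create_graph=True)[0]
    return u_t + u * u_x - nu * u_xx

# Collocation points where the PDE is enforced (no labels needed here).
t = torch.rand(256, requires_grad=True)
x = torch.rand(256, requires_grad=True) * 2 - 1
physics_loss = pde_residual(t, x).pow(2).mean()
# Total loss = data misfit on observed points + physics_loss;
# minimizing it yields a data-efficient approximate PDE solver.
print(physics_loss.item())
```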

Posted Content
TL;DR: This paper proposes WaveNet, a deep neural network for generating raw audio waveforms that is fully probabilistic and autoregressive, with the predictive distribution for each audio sample conditioned on all previous ones.
Abstract: This paper introduces WaveNet, a deep neural network for generating raw audio waveforms. The model is fully probabilistic and autoregressive, with the predictive distribution for each audio sample conditioned on all previous ones; nonetheless we show that it can be efficiently trained on data with tens of thousands of samples per second of audio. When applied to text-to-speech, it yields state-of-the-art performance, with human listeners rating it as significantly more natural sounding than the best parametric and concatenative systems for both English and Mandarin. A single WaveNet can capture the characteristics of many different speakers with equal fidelity, and can switch between them by conditioning on the speaker identity. When trained to model music, we find that it generates novel and often highly realistic musical fragments. We also show that it can be employed as a discriminative model, returning promising results for phoneme recognition.
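The autoregressive, causal structure described in the abstract is usually realized with dilated causal convolutions. The toy PyTorch sketch below keeps only that skeleton, omitting WaveNet's gated activations, residual/skip connections, and categorical output distribution:

```python
import torch
import torch.nn as nn

class CausalConv(nn.Module):
    def __init__(self, channels, dilation):
        super().__init__()
        self.pad = dilation  # left-pad so output at time t sees only <= t
        self.conv = nn.Conv1d(channels, channels, kernel_size=2,
                              dilation=dilation)

    def forward(self, x):
        # Pad on the left only, preserving causality and sequence length.
        return torch.relu(self.conv(nn.functional.pad(x, (self.pad, 0))))

# Stacking dilations 1, 2, 4, ... grows the receptive field
# exponentially, which is how WaveNet spans thousands of samples.
stack = nn.Sequential(*[CausalConv(16, 2 ** i) for i in range(6)])
audio = torch.randn(1, 16, 1000)  # (batch, channels, time)
print(stack(audio).shape)         # (1, 16, 1000): causal, same length
```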

4,002 citations