scispace - formally typeset
Search or ask a question

Showing papers by "Stanley Osher published in 2020"


Posted Content
TL;DR: It is expected that this work will pave the way for the determination of the 3D structure of a wide range of amorphous solids, which could transform the fundamental understanding of non-crystalline materials and related phenomena.
Abstract: Amorphous solids such as glass are ubiquitous in our daily life and have found broad applications ranging from window glass and solar cells to telecommunications and transformer cores. However, due to the lack of long-range order, the three-dimensional (3D) atomic structure of amorphous solids have thus far defied any direct experimental determination. Here, using a multi-component metallic glass as a model, we advance atomic electron tomography to determine its 3D atomic positions and chemical species with a precision of 21 picometer. We quantify the short-range order (SRO) and medium-range order (MRO) of the 3D atomic arrangement. We find that although the 3D atomic packing of the SRO is geometrically disordered, some SROs connect with each other to form crystal-like networks and give rise to MROs, which exhibit translational but no orientational order. We identify four crystal-like MROs - face-centred cubic, hexagonal close-packed, body-centered cubic and simple cubic - coexisting in the sample, which significantly deviate from the ideal crystal structures. We quantify the size, shape, volume, and structural distortion of these MROs with unprecedented detail. Looking forward, we anticipate this experiment will open the door to determining the 3D atomic coordinates of various amorphous solids, whose impact on non-crystalline solids may be comparable to the first 3D crystal structure solved by x-ray crystallography over a century ago.

100 citations


Posted Content
TL;DR: APAC-Net, an alternating population and agent control neural network for solving stochastic mean field games (MFGs), is presented, which takes advantage of the underlying variational primal-dual structure that MFGs exhibit and phrase it as a convex-concave saddle point problem.
Abstract: We present APAC-Net, an alternating population and agent control neural network for solving stochastic mean field games (MFGs). Our algorithm is geared toward high-dimensional instances of MFGs that are beyond reach with existing solution methods. We achieve this in two steps. First, we take advantage of the underlying variational primal-dual structure that MFGs exhibit and phrase it as a convex-concave saddle point problem. Second, we parameterize the value and density functions by two neural networks, respectively. By phrasing the problem in this manner, solving the MFG can be interpreted as a special case of training a generative adversarial network (GAN). We show the potential of our method on up to 100-dimensional MFG problems.

31 citations


Journal ArticleDOI
TL;DR: Using simulated attosecond pulses, this work simultaneously reconstructs the spectrum, 17 probes, and 17 spectral images of extended objects from a set of ptychographic diffraction patterns and believes this work clears the way to an unexplored domain ofAttosecond imaging science, which could have a far-reaching impact across different disciplines.
Abstract: Attosecond science has been transforming our understanding of electron dynamics in atoms, molecules, and solids. However, to date almost all of the attoscience experiments have been based on spectroscopic measurements because attosecond pulses have intrinsically very broad spectra due to the uncertainty principle and are incompatible with conventional imaging systems. Here we report an important advance towards achieving attosecond coherent diffractive imaging. Using simulated attosecond pulses, we simultaneously reconstruct the spectrum, 17 probes, and 17 spectral images of extended objects from a set of ptychographic diffraction patterns. We further confirm the principle and feasibility of this method by successfully performing a ptychographic coherent diffractive imaging experiment using a light-emitting diode with a broad spectrum. We believe this work clears the way to an unexplored domain of attosecond imaging science, which could have a far-reaching impact across different disciplines.

24 citations


Posted Content
TL;DR: A mean-field game model is introduced in controlling the propagation of epidemics on a spatial domain and the control variable, the spatial velocity, is first introduced for the classical disease models, such as the SIR model.
Abstract: The coronavirus disease 2019 (COVID-19) pandemic is changing and impacting lives on a global scale. In this paper, we introduce a mean-field game model in controlling the propagation of epidemics on a spatial domain. The control variable, the spatial velocity, is first introduced for the classical disease models, such as the SIR model. For this proposed model, we provide fast numerical algorithms based on proximal primal-dual methods. Numerical experiments demonstrate that the proposed model illustrates how to separate infected patients in a spatial domain effectively.

20 citations


Posted Content
TL;DR: A neural network approach for solving high-dimensional optimal control problems with obstacle and collision avoidance that fuses the Pontryagin Maximum Principle and Hamilton-Jacobi-Bellman approaches and parameterizes the value function with a neural network.
Abstract: We propose a neural network approach for solving high-dimensional optimal control problems. In particular, we focus on multi-agent control problems with obstacle and collision avoidance. These problems immediately become high-dimensional, even for moderate phase-space dimensions per agent. Our approach fuses the Pontryagin Maximum Principle and Hamilton-Jacobi-Bellman (HJB) approaches and parameterizes the value function with a neural network. Our approach yields controls in a feedback form for quick calculation and robustness to moderate disturbances to the system. We train our model using the objective function and optimality conditions of the control problem. Therefore, our training algorithm neither involves a data generation phase nor solutions from another algorithm. Our model uses empirically effective HJB penalizers for efficient training. By training on a distribution of initial states, we ensure the controls' optimality is achieved on a large portion of the state-space. Our approach is grid-free and scales efficiently to dimensions where grids become impractical or infeasible. We demonstrate our approach's effectiveness on a 150-dimensional multi-agent problem with obstacles.

16 citations


Book ChapterDOI
19 Jul 2020
TL;DR: In this article, the authors focus on a co-design of efficient DNN compression algorithms and sparse neural architectures for robust and accurate deep learning, and leverage the relaxed augmented Lagrangian based algorithms to prune the weights of adversarially trained DNNs.
Abstract: Deep neural nets (DNNs) compression is crucial for adaptation to mobile devices. Though many successful algorithms exist to compress naturally trained DNNs, developing efficient and stable compression algorithms for robustly trained DNNs remains widely open. In this paper, we focus on a co-design of efficient DNN compression algorithms and sparse neural architectures for robust and accurate deep learning. Such a co-design enables us to advance the goal of accommodating both sparsity and robustness. With this objective in mind, we leverage the relaxed augmented Lagrangian based algorithms to prune the weights of adversarially trained DNNs, at both structured and unstructured levels. Using a Feynman-Kac formalism principled robust and sparse DNNs, we can at least double the channel sparsity of the adversarially trained ResNet20 for CIFAR10 classification, meanwhile, improve the natural accuracy by 8.69% and the robust accuracy under the benchmark 20 iterations of IFGSM attack by 5.42%.

13 citations


Posted Content
TL;DR: A utility enhancement scheme based on Laplacian smoothing for differentially-private federated learning (DP-Fed-LS), where the parameter aggregation with injected Gaussian noise is improved in statistical precision.
Abstract: Federated learning aims to protect data privacy by collaboratively learning a model without sharing private data among users. However, an adversary may still be able to infer the private training data by attacking the released model. Differential privacy(DP) provides a statistical guarantee against such attacks, at a privacy of possibly degenerating the accuracy or utility of the trained models. In this paper, we apply a utility enhancement scheme based on Laplacian smoothing for differentially-private federated learning (DP-Fed-LS), where the parameter aggregation with injected Gaussian noise is improved in statistical precision. We provide tight closed-form privacy bounds for both uniform and Poisson subsampling and derive corresponding DP guarantees for differential private federated learning, with or without Laplacian smoothing. Experiments over MNIST, SVHN and Shakespeare datasets show that the proposed method can improve model accuracy with DP-guarantee under both subsampling mechanisms.

13 citations


Posted Content
TL;DR: A mean-field game model is introduced in controlling the propagation of epidemics on a spatial domain and the control variable, the spatial velocity, is first introduced for the classical disease models, such as the SIR model.
Abstract: The coronavirus disease 2019 (COVID-19) pandemic is changing and impacting lives on a global scale. In this paper, we introduce a mean-field game model in controlling the propagation of epidemics on a spatial domain. The control variable, the spatial velocity, is first introduced for the classical disease models, such as the SIR model. For this proposed model, we provide fast numerical algorithms based on proximal primal-dual methods. Numerical experiments demonstrate that the proposed model illustrates how to separate infected patients in a spatial domain effectively.

11 citations


Proceedings ArticleDOI
01 Dec 2020
TL;DR: In this paper, the velocity control problem for massive numbers of rotary-wing UAVs is formulated as a Schrodinger bridge problem and solved with the Gprox primal dual hybrid gradient method.
Abstract: When a disaster happens in a metropolitan area, wireless communication systems in the area are highly affected, degrading the efficiency of the search and rescue (SAR) mission. An emergency wireless network must be deployed quickly and efficiently to preserve human lives. Teams of low-altitude rotarywing unmanned aerial vehicles (UAVs) are useful as on-demand temporal wireless networks because they are generally faster to deploy, flexible to reconfigure, and able to provide good communication services with short line-of-sight links. However, rotary-wing UAVs’ limited on-board batteries require that they need to recharge and reconFigure frequently during a mission. Therefore, we formulate the velocity control problem for massive numbers of rotary-wing UAVs as a Schrodinger bridge problem which can describe the frequent reconfiguration of UAVs. Then we transform it into a mean field game and solve it with the Gprox primal dual hybrid gradient (PDHG) method. Finally, we show the efficiency of our algorithm and analyze the influence of wind dynamics with numerical results.

7 citations


Posted Content
TL;DR: This paper proposes mean-field game inverse-problem models to reconstruct the ground metrics and interaction kernels in the running costs, and numerically demonstrates that the model is both efficient and robust to noise.
Abstract: Mean-field games arise in various fields including economics, engineering, and machine learning. They study strategic decision making in large populations where the individuals interact via certain mean-field quantities. The ground metrics and running costs of the games are of essential importance but are often unknown or only partially known. In this paper, we propose mean-field game inverse-problem models to reconstruct the ground metrics and interaction kernels in the running costs. The observations are the macro motions, to be specific, the density distribution, and the velocity field of the agents. They can be corrupted by noise to some extent. Our models are PDE constrained optimization problems, which are solvable by first-order primal-dual methods. Besides, we apply Bregman iterations to find the optimal model parameters. We numerically demonstrate that our model is both efficient and robust to noise.

7 citations


Posted Content
TL;DR: Using a Feynman-Kac formalism principled robust and sparse DNNs, this paper can at least double the channel sparsity of the adversarially trained ResNet20 for CIFAR10 classification and improve the natural accuracy and robust accuracy under the benchmark iterations of IFGSM attack.
Abstract: Deep neural nets (DNNs) compression is crucial for adaptation to mobile devices. Though many successful algorithms exist to compress naturally trained DNNs, developing efficient and stable compression algorithms for robustly trained DNNs remains widely open. In this paper, we focus on a co-design of efficient DNN compression algorithms and sparse neural architectures for robust and accurate deep learning. Such a co-design enables us to advance the goal of accommodating both sparsity and robustness. With this objective in mind, we leverage the relaxed augmented Lagrangian based algorithms to prune the weights of adversarially trained DNNs, at both structured and unstructured levels. Using a Feynman-Kac formalism principled robust and sparse DNNs, we can at least double the channel sparsity of the adversarially trained ResNet20 for CIFAR10 classification, meanwhile, improve the natural accuracy by $8.69$\% and the robust accuracy under the benchmark $20$ iterations of IFGSM attack by $5.42$\%. The code is available at \url{this https URL}.

Posted Content
TL;DR: A novel framework to model and solve mean-field game systems with nonlocal interactions that yields a computationally efficient saddle-point reformulation of the original problem that is amenable to state-of-the-art convex optimization methods such as the primal-dual hybrid gradient method (PDHG).
Abstract: We introduce a novel framework to model and solve mean-field game systems with nonlocal interactions. Our approach relies on kernel-based representations of mean-field interactions and feature-space expansions in the spirit of kernel methods in machine learning. We demonstrate the flexibility of our approach by modeling various interaction scenarios between agents. Additionally, our method yields a computationally efficient saddle-point reformulation of the original problem that is amenable to state-of-the-art convex optimization methods such as the primal-dual hybrid gradient method (PDHG). We also discuss potential applications of our methods to multi-agent trajectory planning problems.

Journal ArticleDOI
13 Jul 2020
TL;DR: Empirical adversarial risk minimization is a widely used mathematical framework to robustly train deep neural nets that are resistant to adversarial attacks.
Abstract: Empirical adversarial risk minimization is a widely used mathematical framework to robustly train deep neural nets that are resistant to adversarial attacks. However, both natural and robust accura...

Posted Content
TL;DR: Li et al. as discussed by the authors investigated a utility enhancement scheme based on Laplacian smoothing for differentially private federated learning (DP-Fed-LS), where the parameter aggregation with injected Gaussian noise is improved in statistical precision without losing privacy budget.
Abstract: Federated learning aims to protect data privacy by collaboratively learning a model without sharing private data among users. However, an adversary may still be able to infer the private training data by attacking the released model. Differential privacy provides a statistical protection against such attacks at the price of significantly degrading the accuracy or utility of the trained models. In this paper, we investigate a utility enhancement scheme based on Laplacian smoothing for differentially private federated learning (DP-Fed-LS), where the parameter aggregation with injected Gaussian noise is improved in statistical precision without losing privacy budget. Our key observation is that the aggregated gradients in federated learning often enjoy a type of smoothness, i.e. sparsity in the graph Fourier basis with polynomial decays of Fourier coefficients as frequency grows, which can be exploited by the Laplacian smoothing efficiently. Under a prescribed differential privacy budget, convergence error bounds with tight rates are provided for DP-Fed-LS with uniform subsampling of heterogeneous Non-IID data, revealing possible utility improvement of Laplacian smoothing in effective dimensionality and variance reduction, among others. Experiments over MNIST, SVHN, and Shakespeare datasets show that the proposed method can improve model accuracy with DP-guarantee and membership privacy under both uniform and Poisson subsampling mechanisms.

Posted Content
05 Aug 2020
TL;DR: A new mechanism, called adversarial projection, is presented that projects a given signal onto the intrinsically low dimensional manifold of true data, which can be used for solving inverse problems, which consists of recovering a signal from a collection of noisy measurements.
Abstract: We present a new mechanism, called adversarial projection, that projects a given signal onto the intrinsically low dimensional manifold of true data. This operator can be used for solving inverse problems, which consists of recovering a signal from a collection of noisy measurements. Rather than attempt to encode prior knowledge via an analytic regularizer, we leverage available data to project signals directly onto the (possibly nonlinear) manifold of true data (i.e., regularize via an indicator function of the manifold). Our approach avoids the difficult task of forming a direct representation of the manifold. Instead, we directly learn the projection operator by solving a sequence of unsupervised learning problems, and we prove our method converges in probability to the desired projection. This operator can then be directly incorporated into optimization algorithms in the same manner as Plug-and-Play methods, but now with robust theoretical guarantees. Numerical examples are provided.

Proceedings ArticleDOI
14 Dec 2020
TL;DR: In this article, the authors address the problem of optimal path planning for a Dubins-type nonholonomic vehicle in the presence of obstacles and present a Hamilton-Jacobi formulation of the problem that resolves time-optimal paths and considers the geometry of the vehicle.
Abstract: We address the problem of optimal path planning for a Dubins-type nonholonomic vehicle in the presence of obstacles. Most current approaches are either split hierarchically into global path planning and local collision avoidance, or neglect some of the ambient geometry by assuming the car is a point mass. We present a Hamilton-Jacobi formulation of the problem that resolves time-optimal paths and considers the geometry of the vehicle.

Posted ContentDOI
TL;DR: In this article, the 3D atomic positions of amorphous solids have been determined using atomic electron tomography (AT) for the first time, using a multi-component metallic glass as a proof-of-principle.
Abstract: Amorphous solids such as glass are ubiquitous in our daily life and have found broad applications ranging from window glass and solar cells to telecommunications and transformer cores. However, due to the lack of long-range order, the three-dimensional (3D) atomic structure of amorphous solids have thus far defied any direct experimental determination without model fitting. Here, using a multi-component metallic glass as a proof-of-principle, we advance atomic electron tomography to determine the 3D atomic positions in an amorphous solid for the first time. We quantitatively characterize the short-range order (SRO) and medium-range order (MRO) of the 3D atomic arrangement. We find that although the 3D atomic packing of the SRO is geometrically disordered, some SRO connect with each other to form crystal-like networks and give rise to MRO. We identify four crystal-like MRO networks - face-centred cubic, hexagonal close-packed, body-centered cubic and simple cubic - coexisting in the sample, which show translational but no orientational order. These observations confirm that the 3D atomic structure in some parts of the sample is consistent with the efficient cluster packing model. Looking forward, we anticipate this experiment will open the door to determining the 3D atomic coordinates of various amorphous solids, whose impact on non-crystalline solids may be comparable to the first 3D crystal structure solved by x-ray crystallography over a century ago.

Posted Content
TL;DR: Yuan et al. as discussed by the authors showed that pentagonal bipyramids are the most abundant atomic motifs in monatomic amorphous materials and showed that the majority of these motifs arrange into networks that extend to medium-range scale.
Abstract: Author(s): Yuan, Yakun; Kim, Dennis S; Zhou, Jihan; Chang, Dillan J; Zhu, Fan; Nagaoka, Yasutaka; Yang, Yao; Pham, Minh; Osher, Stanley J; Chen, Ou; Ercius, Peter; Schmid, Andreas K; Miao, Jianwei | Abstract: Liquids and solids are two fundamental states of matter. However, due to the lack of direct experimental determination, our understanding of the 3D atomic structure of liquids and amorphous solids remained speculative. Here we advance atomic electron tomography to determine for the first time the 3D atomic positions in monatomic amorphous materials, including a Ta thin film and two Pd nanoparticles. We observe that pentagonal bipyramids are the most abundant atomic motifs in these amorphous materials. Instead of forming icosahedra, the majority of pentagonal bipyramids arrange into networks that extend to medium-range scale. Molecular dynamic simulations further reveal that pentagonal bipyramid networks are prevalent in monatomic amorphous liquids, which rapidly grow in size and form icosahedra during the quench from the liquid state to glass state. The experimental method and results are expected to advance the study of the amorphous-crystalline phase transition and glass transition at the single-atom level.

Posted Content
TL;DR: This work presents a Hamilton-Jacobi formulation of the problem of optimal path planning for a Dubins-type nonholonomic vehicle in the presence of obstacles that resolves time-optimal paths and considers the geometry of the vehicle.
Abstract: We address the problem of optimal path planning for a simple nonholonomic vehicle in the presence of obstacles. Most current approaches are either split hierarchically into global path planning and local collision avoidance, or neglect some of the ambient geometry by assuming the car is a point mass. We present a Hamilton-Jacobi formulation of the problem that resolves time-optimal paths and considers the geometry of the vehicle.

Posted Content
TL;DR: This paper proposes a hybrid gradient descent to solve the tomography problem by combining Fourier slice theorem and calculus of variations and shows that the state-of-art RESIRE can produce more superior results than previous methods; the reconstructed objects have higher quality and smaller relative errors.
Abstract: Tomography has made a revolutionary impact on diverse fields, ranging from macro-/mesoscopic scale studies in biology, radiology, plasma physics to the characterization of 3D atomic structure in material science. The fundamental of tomography is to reconstruct a 3D object from a set of 2D projections. To solve the tomography problem, many algorithms have been developed. Among them are methods using transformation technique such as computed tomography (CT) based on Radon transform and Generalized Fourier iterative reconstruction (GENFIRE) based on Fourier slice theorem (FST), and direct methods such as Simultaneous Iterative Reconstruction Technique (SIRT) and Simultaneous Algebraic Reconstruction Technique (SART) using gradient descent and algebra technique. In this paper, we propose a hybrid gradient descent to solve the tomography problem by combining Fourier slice theorem and calculus of variations. By using simulated and experimental data, we show that the state-of-art RESIRE can produce more superior results than previous methods; the reconstructed objects have higher quality and smaller relative errors. More importantly, RESIRE can deal with partially blocked projections rigorously where only part of projection information are provided while other methods fail. We anticipate RESIRE will not only improve the reconstruction quality in all existing tomographic applications, but also expand tomography method to a broad class of functional thin films. We expect RESIRE to find a broad applications across diverse disciplines.

Posted Content
TL;DR: In this paper, the Wasserstein-based projection (WP) operator is proposed to approximate the true projection with high probability. But, the projection operator can be replaced by a data-driven denoiser with limited theoretical guarantees.
Abstract: Inverse problems consist of recovering a signal from a collection of noisy measurements. These are typically cast as optimization problems, with classic approaches using a data fidelity term and an analytic regularizer that stabilizes recovery. Recent Plug-and-Play (PnP) works propose replacing the operator for analytic regularization in optimization methods by a data-driven denoiser. These schemes obtain state of the art results, but at the cost of limited theoretical guarantees. To bridge this gap, we present a new algorithm that takes samples from the manifold of true data as input and outputs an approximation of the projection operator onto this manifold. Under standard assumptions, we prove this algorithm generates a learned operator, called Wasserstein-based projection (WP), that approximates the true projection with high probability. Thus, WPs can be inserted into optimization methods in the same manner as PnP, but now with theoretical guarantees. Provided numerical examples show WPs obtain state of the art results for unsupervised PnP signal recovery.

Posted Content
TL;DR: APAC-Net as mentioned in this paper is an alternating population and agent control neural network for solving stochastic mean field games, which is geared toward high-dimensional instances of MFGs that are beyond reach with existing solution methods.
Abstract: We present APAC-Net, an alternating population and agent control neural network for solving stochastic mean field games (MFGs). Our algorithm is geared toward high-dimensional instances of MFGs that are beyond reach with existing solution methods. We achieve this in two steps. First, we take advantage of the underlying variational primal-dual structure that MFGs exhibit and phrase it as a convex-concave saddle point problem. Second, we parameterize the value and density functions by two neural networks, respectively. By phrasing the problem in this manner, solving the MFG can be interpreted as a special case of training a generative adversarial network (GAN). We show the potential of our method on up to 100-dimensional MFG problems.