scispace - formally typeset
Search or ask a question

Showing papers in "arXiv: Computational Physics in 2017"


Journal ArticleDOI
TL;DR: In this article, a variety of modern many-body methods are employed, with exhaustive cross-checks and validation, to reach the continuous space limit and the thermodynamic limit of an infinite chain of hydrogen atoms.
Abstract: We present numerical results for the equation of state of an infinite chain of hydrogen atoms. A variety of modern many-body methods are employed, with exhaustive cross-checks and validation. Approaches for reaching the continuous space limit and the thermodynamic limit are investigated, proposed, and tested. The detailed comparisons provide a benchmark for assessing the current state of the art in many-body computation, and for the development of new methods. The ground-state energy per atom in the linear chain is accurately determined versus bondlength, with a confidence bound given on all uncertainties.

134 citations


Journal ArticleDOI
TL;DR: Deep Potential is able to reproduce the original model, whether empirical or quantum mechanics based, within chemical accuracy, and the computational cost of this new model is not substantially larger than that of empirical force fields.
Abstract: We present a simple, yet general, end-to-end deep neural network representation of the potential energy surface for atomic and molecular systems. This methodology, which we call Deep Potential, is "first-principle" based, in the sense that no ad hoc approximations or empirical fitting functions are required. The neural network structure naturally respects the underlying symmetries of the systems. When tested on a wide variety of examples, Deep Potential is able to reproduce the original model, whether empirical or quantum mechanics based, within chemical accuracy. The computational cost of this new model is not substantially larger than that of empirical force fields. In addition, the method has promising scalability properties. This brings us one step closer to being able to carry out molecular simulations with accuracy comparable to that of quantum mechanics models and computational cost comparable to that of empirical potentials.

128 citations


Journal ArticleDOI
TL;DR: In this article, the authors describe the R&D activities required to prepare for this software upgrade, and present a white paper describing the software upgrade activities required for the HL-LHC.
Abstract: Particle physics has an ambitious and broad experimental programme for the coming decades. This programme requires large investments in detector hardware, either to build new facilities and experiments, or to upgrade existing ones. Similarly, it requires commensurate investment in the R&D of software to acquire, manage, process, and analyse the shear amounts of data to be recorded. In planning for the HL-LHC in particular, it is critical that all of the collaborating stakeholders agree on the software goals and priorities, and that the efforts complement each other. In this spirit, this white paper describes the R&D activities required to prepare for this software upgrade.

123 citations


Journal ArticleDOI
TL;DR: In this article, the application of flat borophene nanomembranes as anode materials for Al, Mg, Na or Li-ion batteries was investigated.
Abstract: Most recent exciting experimental advances introduced buckled and flat borophene nanomembranes as new members to the advancing family of two-dimensional (2D) materials. Borophene, is the boron atom analogue of graphene with interesting properties suitable for a wide variety of applications. In this investigation, we conducted extensive first-principles density functional theory simulations to explore the application of four different flat borophene films as anode materials for Al, Mg, Na or Li-ion batteries. In our modelling, first the strongest binding sites were predicted and next we gradually increased the adatoms coverage until the maximum capacity was reached. Bader charge analysis was employed to evaluate the charge transfer between the adatoms and the borophene films. Nudged elastic band method was also utilized to probe the ions diffusions. We calculated the average atom adsorption energies and open-circuit voltage profiles as a function of adatoms coverage. Our findings propose the flat borophene films as electrically conductive and thermally stable anode materials with ultra high capacities of 2480 mAh/g, 1640 mAh/g and 2040 mAh/g for Mg, Na or Li-ion batteries, respectively, which distinctly outperform not only the buckled borophene but also all other 2D materials. Our study may provide useful viewpoint with respect to the possible application of flat borophene films for the design of high capacity and light weight advanced rechargeable ion batteries.

112 citations


Journal ArticleDOI
TL;DR: In this article, the authors examined and critically assessed an alternative method for predicting low-lying neutral excitations with similar computational cost, the ab initio Bethe-Salpeter equation (BSE) approach, and compared results against high-accuracy wavefunction-based methods.
Abstract: The accurate prediction of singlet and triplet excitation energies is of significant fundamental interest and is critical for many applications. An area of intense research, most calculations of singlet and triplet energies use time-dependent density functional theory (TDDFT) in conjunction with an approximate exchange-correlation functional. In this work, we examine and critically assess an alternative method for predicting low-lying neutral excitations with similar computational cost, the ab initio Bethe-Salpeter equation (BSE) approach, and compare results against high-accuracy wavefunction-based methods. We consider singlet and triplet excitations of 27 prototypical organic molecules, including members of Thiel's set, the acene series, and several aromatic hydrocarbons exhibiting charge-transfer-like excitations. Analogous to its impact in TDDFT, we find that the Tamm-Dancoff approximation (TDA) overcomes triplet instabilities in the BSE approach, improving both triplet and singlet energetics relatively to higher level theories. Finally, we find that BSE-TDA calculations built on good DFT starting points, such as those utilizing optimally-tuned range-separated hybrid functionals, can yield accurate singlet and triplet excitation energies for gas-phase organic molecules.

74 citations


Posted Content
TL;DR: In this article, a deep convolutional neural network was used to predict the distribution of electric potential in 2D or 3D cases, with a significant reduction in CPU time compared with the traditional finite difference methods.
Abstract: In this work, we investigated the feasibility of applying deep learning techniques to solve Poisson's equation. A deep convolutional neural network is set up to predict the distribution of electric potential in 2D or 3D cases. With proper training data generated from a finite difference solver, the strong approximation capability of the deep convolutional neural network allows it to make correct prediction given information of the source and distribution of permittivity. With applications of L2 regularization, numerical experiments show that the predication error of 2D cases can reach below 1.5\% and the predication of 3D cases can reach below 3\%, with a significant reduction in CPU time compared with the traditional solver based on finite difference methods.

58 citations


BookDOI
TL;DR: In this paper, the authors focus on the contraction algorithms of tensor networks and some of the applications to the simulations of quantum many-body systems, including the relation between tensor decompositions and physical problems, and present several paradigm algorithms based on the ideas of the numerical renormalization group and/or boundary states.
Abstract: Tensor network (TN), a young mathematical tool of high vitality and great potential, has been undergoing extremely rapid developments in the last two decades, gaining tremendous success in condensed matter physics, atomic physics, quantum information science, statistical physics, and so on. In this lecture notes, we focus on the contraction algorithms of TN as well as some of the applications to the simulations of quantum many-body systems. Starting from basic concepts and definitions, we first explain the relations between TN and physical problems, including the TN representations of classical partition functions, quantum many-body states (by matrix product state, tree TN, and projected entangled pair state), time evolution simulations, etc. These problems, which are challenging to solve, can be transformed to TN contraction problems. We present then several paradigm algorithms based on the ideas of the numerical renormalization group and/or boundary states, including density matrix renormalization group, time-evolving block decimation, coarse-graining/corner tensor renormalization group, and several distinguished variational algorithms. Finally, we revisit the TN approaches from the perspective of multi-linear algebra (also known as tensor algebra or tensor decompositions) and quantum simulation. Despite the apparent differences in the ideas and strategies of different TN algorithms, we aim at revealing the underlying relations and resemblances in order to present a systematic picture to understand the TN contraction approaches.

56 citations


Journal ArticleDOI
TL;DR: A tutorial on current techniques in machine learning -- a jumping-off point for interested researchers to advance their work on deep neural networks with an emphasis on demystifying deep learning.
Abstract: Machine learning is finding increasingly broad application in the physical sciences. This most often involves building a model relationship between a dependent, measurable output and an associated set of controllable, but complicated, independent inputs. We present a tutorial on current techniques in machine learning -- a jumping-off point for interested researchers to advance their work. We focus on deep neural networks with an emphasis on demystifying deep learning. We begin with background ideas in machine learning and some example applications from current research in plasma physics. We discuss supervised learning techniques for modeling complicated functions, beginning with familiar regression schemes, then advancing to more sophisticated deep learning methods. We also address unsupervised learning and techniques for reducing the dimensionality of input spaces. Along the way, we describe methods for practitioners to help ensure that their models generalize from their training data to as-yet-unseen test data. We describe classes of tasks -- predicting scalars, handling images, fitting time-series -- and prepare the reader to choose an appropriate technique. We finally point out some limitations to modern machine learning and speculate on some ways that practitioners from the physical sciences may be particularly suited to help.

43 citations


Journal ArticleDOI
TL;DR: In this paper, a geometric multigrid technique is introduced into the implicit UGKS, where the prediction step for the equilibrium state and the evolution step for distribution function are both treated with multi-rigid acceleration.
Abstract: The unified gas kinetic scheme (UGKS) is a direct modeling method based on the gas dynamical model on the mesh size and time step scales. With the implementation of particle transport and collision in a time-dependent flux function, the UGKS can recover multiple flow physics from the kinetic particle transport to the hydrodynamic wave propagation. In comparison with direct simulation Monte Carlo (DSMC), the equations-based UGKS can use the implicit techniques in the updates of macroscopic conservative variables and microscopic distribution function. The implicit UGKS significantly increases the convergence speed for steady flow computations, especially in the highly rarefied and near continuum regime. In order to further improve the computational efficiency, for the first time a geometric multigrid technique is introduced into the implicit UGKS, where the prediction step for the equilibrium state and the evolution step for the distribution function are both treated with multigrid acceleration. The multigrid implicit UGKS (MIUGKS) is used in the non-equilibrium flow study, which includes microflow, such as lid-driven cavity flow and the flow passing through a finite-length flat plate, and high speed one, such as supersonic flow over a square cylinder. The MIUGKS shows 5 to 9 times efficiency increase over the previous implicit scheme. For the low speed microflow, the efficiency of MIUGKS is several orders of magnitude higher than the DSMC. Even for the hypersonic flow at Mach number 5 and Knudsen number 0.1, the MIUGKS is still more than 100 times faster than the DSMC method for a convergent steady state solution.

41 citations


Posted Content
TL;DR: Prismatic as discussed by the authors is a CUDA/C++ software package for parallelized image formation in scanning transmission electron microscopy (STEM) using both the plane-wave reciprocal-space interpolated scattering matrix (PRISM) and multislice methods.
Abstract: Simulation of atomic resolution image formation in scanning transmission electron microscopy can require significant computation times using traditional methods. A recently developed method, termed plane-wave reciprocal-space interpolated scattering matrix (PRISM), demonstrates potential for significant acceleration of such simulations with negligible loss of accuracy. Here we present a software package called Prismatic for parallelized simulation of image formation in scanning transmission electron microscopy (STEM) using both the PRISM and multislice methods. By distributing the workload between multiple CUDA-enabled GPUs and multicore processors, accelerations as high as 1000x for PRISM and 30x for multislice are achieved relative to traditional multislice implementations using a single 4-GPU machine. We demonstrate a potentially important application of Prismatic, using it to compute images for atomic electron tomography at sufficient speeds to include in the reconstruction pipeline. Prismatic is freely available both as an open-source CUDA/C++ package with a graphical user interface and as a Python package, PyPrismatic.

41 citations


Journal ArticleDOI
TL;DR: In this article, the intrinsic carrier mobility in semimetals with distorted Dirac cones under both longitudinal and transverse acoustic phonon scattering was investigated and an analytic formula for the carrier mobility was obtained.
Abstract: We have theoretically investigated the intrinsic carrier mobility in semimetals with distorted Dirac cones under both longitudinal and transverse acoustic phonon scattering. An analytic formula for the carrier mobility was obtained. It shows that tilting significantly reduces the mobility. The theory was then applied to 8B-Pmmn borophene and borophane (fully hydrogenated borophene), both of which have tilted Dirac cones. The predicted carrier mobilities in 8B-Pmmn borophene at room temperature are both higher than that in graphene. For borophane, despite its superhigh Fermi velocity, the carrier mobility is lower than that in 8B-Pmmn owing to its smaller elastic constant under shear strain.

Journal ArticleDOI
TL;DR: The feasibility of data based machine learning applied to ultrasound tomography is studied to estimate water-saturated porous material parameters, and a high-order discontinuous Galerkin method is considered, while deep convolutional neural networks are used to solve the parameter estimation problem.
Abstract: We study the feasibility of data based machine learning applied to ultrasound tomography to estimate water-saturated porous material parameters. In this work, the data to train the neural networks is simulated by solving wave propagation in coupled poroviscoelastic-viscoelastic-acoustic media. As the forward model, we consider a high-order discontinuous Galerkin method while deep convolutional neural networks are used to solve the parameter estimation problem. In the numerical experiment, we estimate the material porosity and tortuosity while the remaining parameters which are of less interest are successfully marginalized in the neural networks-based inversion. Computational examples confirms the feasibility and accuracy of this approach.

Posted Content
TL;DR: The ACE-ISDF method is used to geometrically optimize a 1000-atom silicon system with a vacancy defect using the HSE06 functional and computes its electronic structure, finding that that the computed energy gap is much closer to the experimental value compared to that produced by semilocal functionals in the DFT calculations.
Abstract: We present a new efficient way to perform hybrid density functional theory (DFT) based electronic structure calculation. The new method uses an interpolative separable density fitting (ISDF) procedure to construct a set of numerical auxiliary basis vectors and a compact approximation of the matrix consisting of products of occupied orbitals represented in a large basis set such as the planewave basis. Such an approximation allows us to reduce the number of Poisson solves from $\Or(N_{e}^2)$ to $\Or(N_{e})$ when we apply the exchange operator to occupied orbitals in an iterative method for solving the Kohn-Sham equations, where $N_{e}$ is the number of electrons in the system to be studied. We show that the ISDF procedure can be carried out in $\Or(N_{e}^3)$ operations, with a much smaller pre-constant compared to methods used in existing approaches. When combined with the recently developed adaptively compressed exchange (ACE) operator formalism, which reduces the number of times the exchange operator needs to be updated, the resulting ACE-ISDF method significantly reduces the computational cost \REV{associated with the exchange operator} by nearly two orders of magnitude compared to existing approaches for a large silicon system with $1000$ atoms. We demonstrate that the ACE-ISDF method can produce accurate energies and forces for insulating and metallic systems, and that it is possible to obtain converged hybrid functional calculation results for a 1000-atom bulk silicon within 10 minutes on 2000 computational cores. We also show that ACE-ISDF can scale to 8192 computational cores for a 4096-atom bulk silicon system. We use the ACE-ISDF method to geometrically optimize a 1000-atom silicon system with a vacancy defect using the HSE06 functional and computes its electronic structure.

Posted Content
TL;DR: This work explores multi-fidelity strategies to accelerate the estimation of the effect of uncertainties in model inputs, and their performance assessed on an irradiated particle-laden turbulent flow case related to Stanford's PSAAP II particle-based solar energy receiver.
Abstract: The study of complex systems is often based on computationally intensive, high-fidelity, simulations. To build confidence in the prediction accuracy of such simulations, the impact of uncertainties in model inputs on the quantities of interest must be measured. This, however, requires a computational budget that is a possibly large multiple of the cost of a single simulation, and thus may become infeasible for expensive simulation models featuring a large number of uncertain inputs and highly nonlinear behavior. Therefore, this work explores multi-fidelity strategies to accelerate the estimation of the effect of uncertainties. The main idea behind multi-fidelity models is to utilize cheaper, lower-fidelity models - than the intended high-fidelity, expensive model of the problem - to generate a baseline solution that together, with relatively small number of high-fidelity simulations can lead to accurate predictions. The methods are briefly presented, and their performance assessed on an irradiated particle-laden turbulent flow case related to Stanford's PSAAP II particle-based solar energy receiver.

Posted Content
TL;DR: In this paper, an open source large eddy simulation (LES) tool for low-Mach number turbulent combustion using the OpenFOAM framework is developed and disseminated for unstructured grid formulation.
Abstract: Large eddy simulation (LES) has become the de-facto computational tool for modeling complex reacting flows, especially in gas turbine applications. However, readily usable general-purpose LES codes for complex geometries are typically academic or proprietary/commercial in nature. The objective of this work is to develop and disseminate an open source LES tool for low-Mach number turbulent combustion using the OpenFOAM framework. In particular, a collocated-mesh approach suited for unstructured grid formulation is provided. Unlike other fluid dynamics models, LES accuracy is intricately linked to so-called primary and secondary conservation properties of the numerical discretization schemes. This implies that although the solver only evolves equations for mass, momentum, and energy, the implied discrete equation for kinetic energy (square of velocity) should be minimally-dissipative. Here, a specific spatial and temporal discretization is imposed such that this kinetic energy dissipation is minimized. The method is demonstrated using manufactured solutions approach on regular and skewed meshes, a canonical flow problem, and a turbulent sooting flame in a complex domain relevant to gas turbines applications.

Posted Content
Bing He1, Sucui Yang1, Zhangrong Qin1, Binghai Wen1, Chaoying Zhang1 
TL;DR: In this article, a lattice Boltzmann-based binary fluid model for inkjet printing is described, where a time-dependent driving force is applied to actuate the droplet ejection.
Abstract: This paper describes a lattice Boltzmann-based binary fluid model for inkjet printing. In this model, a time-dependent driving force is applied to actuate the droplet ejection. As a result, the actuation can be accurately controlled by adjusting the intensity and duration of the positive and negative forces, as well as the idle time. The present model was verified by reproducing the actual single droplet ejection process captured by fast imaging. This model was subsequently used to investigate droplet formation in piezoelectric inkjet printing. It was determined that wettability of the nozzle inner wall and the surface tension of the ink are vital factors controlling the print quality and speed. Increasing the contact angle of the nozzle inner delays the droplet breakup time and reduces the droplet velocity. In contrast, higher surface tension values promote earlier droplet breakup and faster drop velocity. These results indicate that the hydrophilic modification of the nozzle inner wall and the choice of inks with high surface tensions will improve printing quality.

Posted Content
TL;DR: With its independence from the microscope transfer function, direct recovery of phase contrast, and better scaling of signal-to-noise ratio, low-dose cryo electron ptychography may become a promising alternative to Zernike phase-contrast microscopy.
Abstract: Electron ptychography has seen a recent surge of interest for phase sensitive imaging at atomic or near-atomic resolution. However, applications are so far mainly limited to radiation-hard samples because the required doses are too high for imaging biological samples at high resolution. We propose the use of non-convex, Bayesian optimization to overcome this problem and reduce the dose required for successful reconstruction by two orders of magnitude compared to previous experiments. We suggest to use this method for imaging single biological macromolecules at cryogenic temperatures and demonstrate 2D single-particle reconstructions from simulated data with a resolution of 7.9 A$\,$ at a dose of 20 $e^- / A^2$. When averaging over only 15 low-dose datasets, a resolution of 4 A$\,$ is possible for large macromolecular complexes. With its independence from microscope transfer function, direct recovery of phase contrast and better scaling of signal-to-noise ratio, cryo-electron ptychography may become a promising alternative to Zernike phase-contrast microscopy.

Posted Content
TL;DR: In this article, the average turbulent airflow through an array of fences as a function of the porosity, spacing and height of the fences is calculated using Computational Fluid Dynamics.
Abstract: Sand fences are widely applied to prevent soil erosion by wind in areas affected by desertification. Sand fences also provide a way to reduce the emission rate of dust particles, which is triggered mainly by the impacts of wind-blown sand grains onto the soil and affects the Earth's climate. Many different types of fence have been designed and their effects on the sediment transport dynamics studied since many years. However, the search for the optimal array of fences has remained largely an empirical task. In order to achieve maximal soil protection using the minimal amount of fence material, a quantitative understanding of the flow profile over the relief encompassing the area to be protected including all employed fences is required. Here we use Computational Fluid Dynamics to calculate the average turbulent airflow through an array of fences as a function of the porosity, spacing and height of the fences. Specifically, we investigate the factors controlling the fraction of soil area over which the basal average wind shear velocity drops below the threshold for sand transport when the fences are applied. We introduce a cost function, given by the amount of material necessary to construct the fences. We find that, for typical sand-moving wind velocities, the optimal fence height (which minimizes this cost function) is around $50\,$cm, while using fences of height around $1.25\,$m leads to maximal cost.

Posted Content
TL;DR: A high-performance gas kinetic solver using multi-level parallelization is developed to enable pore-scale simulations of rarefied flows in porous media and can be readily extended to solve other Boltzmann model equations.
Abstract: A high-performance gas kinetic solver using multi-level parallelization is developed to enable pore-scale simulations of rarefied flows in porous media. The Boltzmann model equation is solved by the discrete velocity method with an iterative scheme. The multi-level MPI/OpenMP parallelization is implemented with the aim to efficiently utilise the computational resources to allow direct simulation of rarefied gas flows in porous media based on digital rock images for the first time. The multi-level parallel approach is analyzed in details confirming its better performance than the commonly-used MPI processing alone for an iterative scheme. With high communication efficiency and appropriate load balancing among CPU processes, parallel efficiency of 94% is achieved for 1536 cores in the 2D simulations, and 81% for 12288 cores in the 3D simulations. While decomposition in the spatial space does not affect the simulation results, one additional benefit of this approach is that the number of subdomains can be kept minimal to avoid deterioration of the convergence rate of the iteration process. This multi-level parallel approach can be readily extended to solve other Boltzmann model equations.

Posted Content
TL;DR: In this paper, a two-level Chebyshev polynomial filter based complementary subspace strategy is proposed to compute a set of vectors that span the occupied subspace of the Kohn-Sham Hamiltonian.
Abstract: We describe a novel iterative strategy for Kohn-Sham density functional theory calculations aimed at large systems (> 1000 electrons), applicable to metals and insulators alike. In lieu of explicit diagonalization of the Kohn-Sham Hamiltonian on every self-consistent field (SCF) iteration, we employ a two-level Chebyshev polynomial filter based complementary subspace strategy to: 1) compute a set of vectors that span the occupied subspace of the Hamiltonian; 2) reduce subspace diagonalization to just partially occupied states; and 3) obtain those states in an efficient, scalable manner via an inner Chebyshev-filter iteration. By reducing the necessary computation to just partially occupied states, and obtaining these through an inner Chebyshev iteration, our approach reduces the cost of large metallic calculations significantly, while eliminating subspace diagonalization for insulating systems altogether. We describe the implementation of the method within the framework of the Discontinuous Galerkin (DG) electronic structure method and show that this results in a computational scheme that can effectively tackle bulk and nano systems containing tens of thousands of electrons, with chemical accuracy, within a few minutes or less of wall clock time per SCF iteration on large-scale computing platforms. We anticipate that our method will be instrumental in pushing the envelope of large-scale ab initio molecular dynamics. As a demonstration of this, we simulate a bulk silicon system containing 8,000 atoms at finite temperature, and obtain an average SCF step wall time of 51 seconds on 34,560 processors; thus allowing us to carry out 1.0 ps of ab initio molecular dynamics in approximately 28 hours (of wall time).

Journal ArticleDOI
TL;DR: In this article, the electronic structure and transport properties of 18-valence electron count cobalt based half-Heusler alloys with prime focus on CoVSn, CoNbSn, COAT, CoTaSn and CoMoIn were investigated.
Abstract: In search of new prospects for thermoelectric materials, using ab-initio calculations and semi-classical Boltzmann theory, we have systematically investigated the electronic structure and transport properties of 18-valence electron count cobalt based half-Heusler alloys with prime focus on CoVSn, CoNbSn, CoTaSn, CoMoIn, and CoWIn. The effect of doping on transport properties has been studied under the rigid band approximation. The maximum power factor, S$^2\sigma$, for all systems is obtained on hole doping and is comparable to the existing thermoelectric material CoTiSb. The stability of all the systems is verified by phonon calculations. Based on our calculations, we suggest that CoVSn, CoNbSn, CoTaSn, CoMoIn and CoWIn could be potential candidates for high temperature thermoelectric materials.

Proceedings ArticleDOI
TL;DR: Bayesian optimization with Gaussian processes is employed in order to automatize and speed up the optimization process of the shape of a free-form reflective meta surface such that it diffracts light into a specific diffraction order.
Abstract: Numerical simulation of complex optical structures enables their optimization with respect to specific objectives. Often, optimization is done by multiple successive parameter scans, which are time consuming and computationally expensive. We employ here Bayesian optimization with Gaussian processes in order to automatize and speed up the optimization process. As a toy example, we demonstrate optimization of the shape of a free-form reflective meta surface such that it diffracts light into a specific diffraction order. For this example, we compare the performance of six different Bayesian optimization approaches with various acquisition functions and various kernels of the Gaussian process.

Journal ArticleDOI
TL;DR: In this paper, a spatially monotonicity-preserving (MP) scheme is proposed for Vlasov-Poisson equations with high-order accuracy, which is based on the spatially fifth and seventh-order MP scheme.
Abstract: We develop new numerical schemes for Vlasov--Poisson equations with high-order accuracy. Our methods are based on a spatially monotonicity-preserving (MP) scheme and are modified suitably so that positivity of the distribution function is also preserved. We adopt an efficient semi-Lagrangian time integration scheme that is more accurate and computationally less expensive than the three-stage TVD Runge-Kutta integration. We apply our spatially fifth- and seventh-order schemes to a suite of simulations of collisionless self-gravitating systems and electrostatic plasma simulations, including linear and nonlinear Landau damping in one dimension and Vlasov--Poisson simulations in a six-dimensional phase space. The high-order schemes achieve a significantly improved accuracy in comparison with the third-order positive-flux-conserved scheme adopted in our previous study. With the semi-Lagrangian time integration, the computational cost of our high-order schemes does not significantly increase, but remains roughly the same as that of the third-order scheme. Vlasov--Poisson simulations on $128^3 \times 128^3$ mesh grids have been successfully performed on a massively parallel computer.

Posted Content
TL;DR: In this paper, the authors proposed a unified framework for solving the Wannier localization problem with isolated and entangled eigenvalues, which is robust, efficient, and relies on few tunable parameters.
Abstract: The Wannier localization problem in quantum physics is mathematically analogous to finding a localized representation of a subspace corresponding to a nonlinear eigenvalue problem. While Wannier localization is well understood for insulating materials with isolated eigenvalues, less is known for metallic systems with entangled eigenvalues. Currently, the most widely used method for treating systems with entangled eigenvalues is to first obtain a reduced subspace (often referred to as disentanglement) and then to solve the Wannier localization problem by treating the reduced subspace as an isolated system. This is a multi-objective nonconvex optimization procedure and its solution can depend sensitively on the initial guess. We propose a new method to solve the Wannier localization problem, avoiding the explicit use of an an optimization procedure. Our method is robust, efficient, relies on few tunable parameters, and provides a unified framework for addressing problems with isolated and entangled eigenvalues.

Journal ArticleDOI
TL;DR: Arising from a metadynamics study of the wetting transition of water on a solid substrate, it is found that the influence of the cutoff is unexpectedly strong and can change the character of the soaking transition from continuous to first order by creating artificial metastable wetting states.
Abstract: Non-bonded potentials are included in most force fields and therefore widely used in classical molecular dynamics simulations of materials and interfacial phenomena. It is commonplace to truncate these potentials for computational efficiency based on the assumption that errors are negligible for reasonable cutoffs or compensated for by adjusting other interaction parameters. Arising from a metadynamics study of the wetting transition of water on a solid substrate, we find that the influence of the cutoff is unexpectedly strong and can change the character of the wetting transition from continuous to first order by creating artificial metastable wetting states. Common cutoff corrections such as the use of a force switching function, a shifted potential, or a shifted force do not avoid this. Such a qualitative difference urges caution and suggests that using truncated non-bonded potentials can induce unphysical behavior that cannot be fully accounted for by adjusting other interaction parameters.

Journal ArticleDOI
TL;DR: In this article, the authors studied the effect of cavity collapse in non-ideal explosives as a means of controlling their sensitivity, and the main aim is to understand the origin of localised temperature peaks (hot spots) that play a leading order role at early ignition stages.
Abstract: We study effect of cavity collapse in non-ideal explosives as a means of controlling their sensitivity. The main aim is to understand the origin of localised temperature peaks (hot spots) that play a leading order role at early ignition stages. Thus, we perform 2D and 3D numerical simulations of shock induced single gas-cavity collapse in nitromethane. Ignition is the result of a complex interplay between fluid dynamics and exothermic chemical reaction. In part I of this work we focused on the hydrodynamic effects in the collapse process by switching off the reaction terms in the mathematical model. Here, we reinstate the reactive terms and study the collapse of the cavity in the presence of chemical reactions. We use a multi-phase formulation which overcomes current challenges of cavity collapse modelling in reactive media to obtain oscillation-free temperature fields across material interfaces to allow the use of a temperature-based reaction rate law. The mathematical and physical models are validated against experimental and analytic data. We identify which of the previously-determined (in part I of this work) high-temperature regions lead to ignition and comment on their reactive strength and reaction growth rate. We quantify the sensitisation of nitromethane by the collapse of the cavity by comparing ignition times of neat and single-cavity material; the ignition occurs in less than half the ignition time of the neat material. We compare 2D and 3D simulations to examine the change in topology, temperature and reactive strength of the hot spots by the third dimension. It is apparent that belated ignition times can be avoided by the use of 3D simulations. The effect of the chemical reactions on the topology and strength of the hot spots in the timescales considered is studied by comparing inert and reactive simulations and examine maximum temperature fields and their growth rates.

Journal ArticleDOI
TL;DR: In this paper, a charge-conserving finite element time-domain (FETD) particle-in-cell (PIC) algorithm for the time-dependent Maxwell-Vlasov equations on irregular (unstructured) meshes to the relativistic regime was proposed.
Abstract: In many problems involving particle accelerators and relativistic plasmas, the accurate modeling of relativistic particle motion is essential for accurate physical predictions. Here, we extend a charge-conserving finite element time-domain (FETD) particle-in-cell (PIC) algorithm for the time-dependent Maxwell-Vlasov equations on irregular (unstructured) meshes to the relativistic regime by implementing and comparing three particle pushers: (relativistic) Boris, Vay, and Higuera-Cary. We illustrate the application of the proposed relativistic FETD-PIC algorithm for the analysis of particle cyclotron motion at relativistic speeds, harmonic particle oscillation in the Lorentz-boosted frame, and relativistic Bernstein modes in magnetized charge-neutral (pair) plasmas.

Book ChapterDOI
TL;DR: The opportunities and challenges of massively parallel computing for Monte Carlo simulations in statistical physics, with a focus on the simulation of systems exhibiting phase transitions and critical phenomena, are outlined.
Abstract: Applications that require substantial computational resources today cannot avoid the use of heavily parallel machines. Embracing the opportunities of parallel computing and especially the possibilities provided by a new generation of massively parallel accelerator devices such as GPUs, Intel's Xeon Phi or even FPGAs enables applications and studies that are inaccessible to serial programs. Here we outline the opportunities and challenges of massively parallel computing for Monte Carlo simulations in statistical physics, with a focus on the simulation of systems exhibiting phase transitions and critical phenomena. This covers a range of canonical ensemble Markov chain techniques as well as generalized ensembles such as multicanonical simulations and population annealing. While the examples discussed are for simulations of spin systems, many of the methods are more general and moderate modifications allow them to be applied to other lattice and off-lattice problems including polymers and particle systems. We discuss important algorithmic requirements for such highly parallel simulations, such as the challenges of random-number generation for such cases, and outline a number of general design principles for parallel Monte Carlo codes to perform well.

Posted Content
TL;DR: A light-weight system of bash scripts for efficiently bundling supercomputing tasks into large jobs, so that one can take advantage of incentives or discounts for requesting large allocations.
Abstract: We describe a light-weight system of bash scripts for efficiently bundling supercomputing tasks into large jobs, so that one can take advantage of incentives or discounts for requesting large allocations. The software can backfill computational tasks, avoiding wasted cycles, and can streamline collaboration between different users. It is simple to use, functioning similarly to batch systems like PBS, MOAB, and SLURM.

Journal ArticleDOI
TL;DR: The presented method employs the density approach for topology optimization and uses an adjoint method for the gradient computation in a finite-difference method based on the FFT accelerated computation of the stray-field.
Abstract: We present a finite-difference method for the topology optimization of permanent magnets that is based on the FFT accelerated computation of the stray-field. The presented method employs the density approach for topology optimization and uses an adjoint method for the gradient computation. Comparsion to various state-of-the-art finite-element implementations shows a superior performance and accuracy. Moreover, the presented method is very flexible and easy to implement due to various preexisting FFT stray-field implementations that can be used.