scispace - formally typeset
Search or ask a question

Showing papers in "arXiv: Atmospheric and Oceanic Physics in 2019"


Journal ArticleDOI
TL;DR: This paper found that MLPNN has potential capability for SPEI drought forecasting based on performance measures, and applied and tested the algorithm on monthly time series data of Standardized Precipitation Evapotranspiration Index for seventeen climatological stations located in Northern Area and KPK (Pakistan).
Abstract: These days human beings are facing many environmental challenges due to frequently occurring drought hazards. It may have an effect on the countrys environment, the community, and industries. Several adverse impacts of drought hazard are continued in Pakistan, including other hazards. However, early measurement and detection of drought can provide guidance to water resources management for employing drought mitigation policies. In this paper, we used a multilayer perceptron neural network (MLPNN) algorithm for drought forecasting. We applied and tested MLPNN algorithm on monthly time series data of Standardized Precipitation Evapotranspiration Index (SPEI) for seventeen climatological stations located in Northern Area and KPK (Pakistan). We found that MLPNN has potential capability for SPEI drought forecasting based on performance measures (i.e., Mean Average Error (MAE), the coefficient of correlation R, and Root Mean Square Error (RMSE). Water resources and management planner can take necessary action in advance (e.g., in water scarcity areas) by using MLPNN model as part of their decision making.

70 citations


Journal ArticleDOI
TL;DR: It is proposed that the ultimate objective of using a neural network can also be the interpretation of what the network has learned rather than the output itself, and suggested that combining interpretable neural networks with novel scientific hypotheses will open the door to many new avenues in neural network‐related geoscience research.
Abstract: Neural networks have become increasingly prevalent within the geosciences, although a common limitation of their usage has been a lack of methods to interpret what the networks learn and how they make decisions. As such, neural networks have often been used within the geosciences to most accurately identify a desired output given a set of inputs, with the interpretation of what the network learns used as a secondary metric to ensure the network is making the right decision for the right reason. Neural network interpretation techniques have become more advanced in recent years, however, and we therefore propose that the ultimate objective of using a neural network can also be the interpretation of what the network has learned rather than the output itself. We show that the interpretation of neural networks can enable the discovery of scientifically meaningful connections within geoscientific data. In particular, we use two methods for neural network interpretation called backwards optimization and layerwise relevance propagation, both of which project the decision pathways of a network back onto the original input dimensions. To the best of our knowledge, LRP has not yet been applied to geoscientific research, and we believe it has great potential in this area. We show how these interpretation techniques can be used to reliably infer scientifically meaningful information from neural networks by applying them to common climate patterns. These results suggest that combining interpretable neural networks with novel scientific hypotheses will open the door to many new avenues in neural network-related geoscience research.

69 citations


Posted Content
TL;DR: It is shown that architecture constraints can enforce conservation laws to satisfactory numerical precision, while all constraints help the neural-network better generalize to conditions outside of its training set, such as global warming.
Abstract: Artificial neural-networks have the potential to emulate cloud processes with higher accuracy than the semi-empirical emulators currently used in climate models. However, neural-network models do not intrinsically conserve energy and mass, which is an obstacle to using them for long-term climate predictions. Here, we propose two methods to enforce linear conservation laws in neural-network emulators of physical models: Constraining (1) the loss function or (2) the architecture of the network itself. Applied to the emulation of explicitly-resolved cloud processes in a prototype multi-scale climate model, we show that architecture constraints can enforce conservation laws to satisfactory numerical precision, while all constraints help the neural-network better generalize to conditions outside of its training set, such as global warming.

66 citations


Journal ArticleDOI
TL;DR: In this paper, an NN is coupled to the dynamical core of a GCM with the same 160 km resolution to predict residual heating and moistening averaged over (160 km)^2 grid boxes as a function of the coarse-resolution fields within the same atmospheric column.
Abstract: General circulation models (GCMs) typically have a grid size of 25--200 km. Parametrizations are used to represent diabatic processes such as radiative transfer and cloud microphysics and account for sub-grid-scale motions and variability. Unlike traditional approaches, neural networks (NNs) can readily exploit recent observational datasets and global cloud-system resolving model (CRM) simulations to learn subgrid variability. This article describes an NN parametrization trained by coarse-graining a near-global CRM simulation with a 4~km horizontal grid spacing. The NN predicts the residual heating and moistening averaged over (160 km)^2 grid boxes as a function of the coarse-resolution fields within the same atmospheric column. This NN is coupled to the dynamical core of a GCM with the same 160 km resolution. A recent study described how to train such an NN to be numerically stable when coupled to specified time-evolving advective forcings in a single column model, but feedbacks between NN and GCM components cause spatially-extended simulations to crash within a few days. Analyzing the linearized response of such an NN reveals that it learns to exploit a strong synchrony between precipitation and the atmospheric state above 10 km. Removing these variables from the NN's inputs stabilizes the coupled simulations, which predict the future state more accurately than a coarse-resolution simulation without any parametrizations of sub-grid-scale variability, although the mean state slowly drifts.

66 citations


Journal ArticleDOI
TL;DR: In this paper, a data-driven framework that is based on analog forecasting (prediction using past similar patterns) and employs a novel deep learning pattern-recognition technique (capsule neural networks, CapsNets) and impact-based auto-labeling strategy is introduced.
Abstract: Numerical weather prediction (NWP) models require ever-growing computing time/resources, but still, have difficulties with predicting weather extremes. Here we introduce a data-driven framework that is based on analog forecasting (prediction using past similar patterns) and employs a novel deep learning pattern-recognition technique (capsule neural networks, CapsNets) and impact-based auto-labeling strategy. CapsNets are trained on mid-tropospheric large-scale circulation patterns (Z500) labeled $0-4$ depending on the existence and geographical region of surface temperature extremes over North America several days ahead. The trained networks predict the occurrence/region of cold or heat waves, only using Z500, with accuracies (recalls) of $69\%-45\%$ $(77\%-48\%)$ or $62\%-41\%$ $(73\%-47\%)$ $1-5$ days ahead. CapsNets outperform simpler techniques such as convolutional neural networks and logistic regression. Using both temperature and Z500, accuracies (recalls) with CapsNets increase to $\sim 80\%$ $(88\%)$, showing the promises of multi-modal data-driven frameworks for accurate/fast extreme weather predictions, which can augment NWP efforts in providing early warnings.

52 citations


Posted Content
TL;DR: In this paper, a deep convolutional neural network (CNN) was employed to forecast ozone concentrations over Seoul, South Korea for 2017, using several predictors from the previous day, including the wind fields, temperature, relative humidity, pressure, and precipitation.
Abstract: This study uses a deep learning approach to forecast ozone concentrations over Seoul, South Korea for 2017. We employ a deep convolutional neural network (CNN). We apply this method to predict the hourly ozone concentration on each day for the entire year using several predictors from the previous day, including the wind fields, temperature, relative humidity, pressure, and precipitation, along with in-situ ozone and NO2 concentrations. We refer to a history of all observed parameters between 2014 and 2016 for training the predictive models. Model-measurement comparisons for the 25 monitoring sites for the year 2017 report average indexes of agreement (IOA) of 0.84-0.89 and a Pearson correlation coefficient of 0.74-0.81, indicating reasonable performance for the CNN forecasting model. Although the CNN model successfully captures daily trends as well as yearly high and low variations of the ozone concentrations, it notably underpredicts high ozone peaks during the summer. The forecasting results are generally more accurate for the stations located in the southern regions of the Han River, the result of more stable topographical and meteorological conditions. Furthermore, through two separate daytime and nighttime forecasts, we find that the monthly IOA of the CNN model is 0.05-0.30 higher during the daytime, resulting from the unavailability of some of the input parameters during the nighttime. While the CNN model can predict the next 24 hours of ozone concentrations within less than a minute, we identify several limitations of deep learning models for real-time air quality forecasting for further improvement.

45 citations


Journal ArticleDOI
TL;DR: The authors developed a stochastic parameterization using the generative adversarial network (GAN) machine learning framework, which was trained and evaluated on output from the Lorenz '96 model, which is a common baseline model for evaluating both parameterization and data assimilation techniques.
Abstract: Stochastic parameterizations account for uncertainty in the representation of unresolved sub-grid processes by sampling from the distribution of possible sub-grid forcings. Some existing stochastic parameterizations utilize data-driven approaches to characterize uncertainty, but these approaches require significant structural assumptions that can limit their scalability. Machine learning models, including neural networks, are able to represent a wide range of distributions and build optimized mappings between a large number of inputs and sub-grid forcings. Recent research on machine learning parameterizations has focused only on deterministic parameterizations. In this study, we develop a stochastic parameterization using the generative adversarial network (GAN) machine learning framework. The GAN stochastic parameterization is trained and evaluated on output from the Lorenz '96 model, which is a common baseline model for evaluating both parameterization and data assimilation techniques. We evaluate different ways of characterizing the input noise for the model and perform model runs with the GAN parameterization at weather and climate timescales. Some of the GAN configurations perform better than a baseline bespoke parameterization at both timescales, and the networks closely reproduce the spatio-temporal correlations and regimes of the Lorenz '96 system. We also find that in general those models which produce skillful forecasts are also associated with the best climate simulations.

40 citations


Journal ArticleDOI
TL;DR: In this paper, seven highly recommended soil thermal conductivity schemes are evaluated for their applicability in land surface model (LSMs) and the Balland and Arp [2005] scheme is found to consistently perform best among all the schemes, and thus can be recommended as a superior scheme for land modeling use.
Abstract: Soil thermal conductivity is an important physical parameter in modeling land surface processes. Previous studies on evaluations of parameterization schemes of soil thermal conductivity are mostly based on specific experimental conditions or local soil samples, and their recommendations may not be the optimal schemes for land surface model (LSMs). In this work, seven highly recommended soil thermal conductivity schemes are evaluated for their applicability in LSMs. With the consideration of both scheme estimations and land process simulations by incorporation into the Common Land Model, the Balland and Arp [2005] scheme is found to consistently perform best among all the schemes, and thus can be recommended as a superior scheme for land modeling use. Uncertainty analyses by in-situ simulations demonstrate that, over relatively dry regions, the inter-scheme variations of soil thermal conductivity can lead to significant differences of simulated soil temperature, especially at deep layers, due to changes of downward soil heat conduction and the associated freeze-thaw cycles. However, few effects appear over wet regions, likely due to the high soil heat capacity induced by high soil moisture levels, which increases the heat inertia in soil thermodynamics. Global comparisons show the similar relationships that soil thermal conductivity significantly affects the simulated soil temperature and other related thermal and hydraulic variables over arid and semi-arid regions in mid- and high-latitudes. These results display the role of soil thermal conductivity in LSM, and suggest the importance of the evaluation and further development of thermal conductivity schemes with respect to land modelling applications.

34 citations


Journal ArticleDOI
TL;DR: In this article, high-resolution in-sit measurements of pancake ice drift are presented, from a pair of buoys deployed on floes in the Antarctic marginal ice zone during the winter sea ice expansion, over nine days in which the region was impacted by four polar cyclones.
Abstract: High temporal resolution in--situ measurements of pancake ice drift are presented, from a pair of buoys deployed on floes in the Antarctic marginal ice zone during the winter sea ice expansion, over nine days in which the region was impacted by four polar cyclones. Concomitant measurements of wave-in-ice activity from the buoys is used to infer that pancake ice conditions were maintained over at least the first seven days. Analysis of the data shows: (i)~unprecedentedly fast drift speeds in the Southern Ocean; (ii)~high correlation of drift velocities with the surface wind velocities, indicating absence of internal ice stresses $>$100\,km in from the edge in 100\% remotely sensed ice concentration; and (iii)~presence of a strong inertial signature with a 13\,h period. A Langrangian free drift model is developed, including a term for geostrophic currents that reproduces the 13\,h period signature in the ice motion. The calibrated model is shown to provide accurate predictions of the ice drift for up to 2\,days, and the calibrated parameters provide estimates of wind and ocean drag for pancake floes under storm conditions.

28 citations


Journal ArticleDOI
TL;DR: In this article, a framework for the study of surface ocean inertial particle motion is built from the Maxey-Riley set, which is obtained by vertically averaging each term of the original set, adapted to account for Earth's rotation effects.
Abstract: A framework for the study of surface ocean inertial particle motion is built from the Maxey--Riley set. A new set is obtained by vertically averaging each term of the original set, adapted to account for Earth's rotation effects, across the extent of a sufficiently small spherical particle that floats at an assumed unperturbed air--sea interface with unsteady nonuniform winds and ocean currents above and below, respectively. The inertial particle velocity is shown to exponentially decay in time to a velocity that lies close to an average of seawater and air velocities, weighted by a function of the seawater-to-particle density ratio. Such a weighted average velocity turns out to fortuitously be of the type commonly discussed in the search-and-rescue literature, which alone cannot explain the observed role of anticyclonic mesoscale eddies as traps for marine debris or the formation of great garbage patches in the subtropical gyres, phenomena dominated by finite-size effects. A heuristic extension of the theory is proposed to describe the motion of nonspherical particles by means of a simple shape factor correction, and recommendations are made for incorporating wave-induced Stokes drift, and allowing for inhomogeneities of the carrying fluid density. The new Maxey--Riley set outperforms an ocean adaptation that ignored wind drag effects and the first reported adaption that attempted to incorporate them.

28 citations


Journal ArticleDOI
TL;DR: A framework using machine learning together with physically‐derived models is tested, and it is shown that the approach yields models that are stable and that give both improved skill in initialized predictions and better long‐term climate statistics.
Abstract: Dynamical weather and climate prediction models underpin many studies of the Earth system and hold the promise of being able to make robust projections of future climate change based on physical laws. However, simulations from these models still show many differences compared with observations. Machine learning has been applied to solve certain prediction problems with great success, and recently it's been proposed that this could replace the role of physically-derived dynamical weather and climate models to give better quality simulations. Here, instead, a framework using machine learning together with physically-derived models is tested, in which it is learnt how to correct the errors of the latter from timestep to timestep. This maintains the physical understanding built into the models, whilst allowing performance improvements, and also requires much simpler algorithms and less training data. This is tested in the context of simulating the chaotic Lorenz '96 system, and it is shown that the approach yields models that are stable and that give both improved skill in initialised predictions and better long-term climate statistics. Improvements in long-term statistics are smaller than for single time-step tendencies, however, indicating that it would be valuable to develop methods that target improvements on longer time scales. Future strategies for the development of this approach and possible applications to making progress on important scientific problems are discussed.

Posted Content
TL;DR: An example of how two tools that have recently become accessible to a wide range of researchers, crowd-sourcing and deep learning, can be combined to explore satellite imagery at scale, with a focus on the organization of shallow cumulus convection in the trade wind regions.
Abstract: Humans excel at detecting interesting patterns in images, for example those taken from satellites. This kind of anecdotal evidence can lead to the discovery of new phenomena. However, it is often difficult to gather enough data of subjective features for significant analysis. This paper presents an example of how two tools that have recently become accessible to a wide range of researchers, crowd-sourcing and deep learning, can be combined to explore satellite imagery at scale. In particular, the focus is on the organization of shallow cumulus convection in the trade wind regions. Shallow clouds play a large role in the Earth's radiation balance yet are poorly represented in climate models. For this project four subjective patterns of organization were defined: Sugar, Flower, Fish and Gravel. On cloud labeling days at two institutes, 67 scientists screened 10,000 satellite images on a crowd-sourcing platform and classified almost 50,000 mesoscale cloud clusters. This dataset is then used as a training dataset for deep learning algorithms that make it possible to automate the pattern detection and create global climatologies of the four patterns. Analysis of the geographical distribution and large-scale environmental conditions indicates that the four patterns have some overlap with established modes of organization, such as open and closed cellular convection, but also differ in important ways. The results and dataset from this project suggests promising research questions. Further, this study illustrates that crowd-sourcing and deep learning complement each other well for the exploration of image datasets.

Journal ArticleDOI
TL;DR: A workshop was organized in the Les Houches School of Physics in France in January 2019 with the objective of gathering leading figures in the field to produce a road map for the scientific community as mentioned in this paper.
Abstract: Environmental fluid mechanics underlies a wealth of natural, industrial and, by extension, societal challenges. In the coming decades, as we strive towards a more sustainable planet, there are a wide range of grand challenge problems that need to be tackled, ranging from fundamental advances in understanding and modeling of stratified turbulence and consequent mixing, to applied studies of pollution transport in the ocean, atmosphere and urban environments. A workshop was organized in the Les Houches School of Physics in France in January 2019 with the objective of gathering leading figures in the field to produce a road map for the scientific community. Five subject areas were addressed: multiphase flow, stratified flow, ocean transport, atmospheric and urban transport, and weather and climate prediction. This article summarizes the discussions and outcomes of the meeting, with the intent of providing a resource for the community going forward.

Journal ArticleDOI
TL;DR: In this paper, a non-stationary model has been used to highlight the relationship between these extreme events and nonstationary climate, and applied to a Representative Concentration Pathway 8.5 Climate-Change scenario, for a fetch-limited environment (Catalan Coast).
Abstract: Extreme events, such as wave-storms, need to be characterized for coastal infrastructure design purposes. Such description should contain information on both the univariate behaviour and the joint-dependence of storm-variables. These two aspects have been here addressed through generalized Pareto distributions and hierarchical Archimedean copulas. A non-stationary model has been used to highlight the relationship between these extreme events and non-stationary climate. It has been applied to a Representative Concentration Pathway 8.5 Climate-Change scenario, for a fetch-limited environment (Catalan Coast). In the non-stationary model, all considered variables decrease in time, except for storm-duration at the northern part of the Catalan Coast. The joint distribution of storm variables presents cyclical fluctuations, with a stronger influence of climate dynamics than of climate itself.

Journal ArticleDOI
TL;DR: In this paper, a multi-layered network analysis was developed to detect the interlinks between the geopotential height of the upper air (~5 km) and surface air pollution in both China and the USA.
Abstract: Air pollution is associated with human diseases and has been found to be related to premature mortality. In response, environmental policies have been adopted in many countries, to decrease anthropogenic air pollution for the improvement of long-term air quality, since most air pollutant sources are anthropogenic. However, air pollution fluctuations have been found to strongly depend on the weather dynamics. This raises a fundamental question: What are the significant atmospheric processes that affect the local daily variability of air pollution? For this purpose, we develop here a multi-layered network analysis to detect the interlinks between the geopotential height of the upper air (~5 km) and surface air pollution in both China and the USA. We find that Rossby waves significantly affect air pollution fluctuations through the development of cyclone and anticyclone systems, and further affect the local stability of the air and the winds. The significant impacts of Rossby waves on air pollution are found to underlie most of the daily fluctuations in air pollution. Thus, the impact of Rossby waves on human life is greater than previously assumed. The rapid warming of the Arctic could slow down Rossby waves, thus increasing human health risks. Our method can help to determine the risk assessment of such extreme events and can improve potential predictability.

Journal ArticleDOI
TL;DR: In this paper, a deep learning approach based on big data is proposed to locate broadband acoustic sources using a single hydrophone in ocean waveguides with uncertain bottom parameters, where several 50-layer residual neural networks are used to handle the bottom uncertainty in source localization.
Abstract: A deep learning approach based on big data is proposed to locate broadband acoustic sources using a single hydrophone in ocean waveguides with uncertain bottom parameters. Several 50-layer residual neural networks, trained on a huge number of sound field replicas generated by an acoustic propagation model, are used to handle the bottom uncertainty in source localization. A two-step training strategy is presented to improve the training of the deep models. First, the range is discretized in a coarse (5 km) grid. Subsequently, the source range within the selected interval and source depth are discretized on a finer (0.1 km and 2 m) grid. The deep learning methods were demonstrated for simulated magnitude-only multi-frequency data in uncertain environments. Experimental data from the China Yellow Sea also validated the approach.

Posted Content
TL;DR: Cumulo, a benchmark dataset for training and evaluating global cloud classification models, is introduced, which consists of one year of 1km resolution MODIS hyperspectral imagery merged with pixel-width 'tracks' of CloudSat cloud labels.
Abstract: One of the greatest sources of uncertainty in future climate projections comes from limitations in modelling clouds and in understanding how different cloud types interact with the climate system. A key first step in reducing this uncertainty is to accurately classify cloud types at high spatial and temporal resolution. In this paper, we introduce Cumulo, a benchmark dataset for training and evaluating global cloud classification models. It consists of one year of 1km resolution MODIS hyperspectral imagery merged with pixel-width 'tracks' of CloudSat cloud labels. Bringing these complementary datasets together is a crucial first step, enabling the Machine-Learning community to develop innovative new techniques which could greatly benefit the Climate community. To showcase Cumulo, we provide baseline performance analysis using an invertible flow generative model (IResNet), which further allows us to discover new sub-classes for a given cloud class by exploring the latent space. To compare methods, we introduce a set of evaluation criteria, to identify models that are not only accurate, but also physically-realistic. CUMULO can be download from this https URL .

Journal ArticleDOI
TL;DR: Hidden flow features that attract floating objects provide critical information for hazard responses, such as SAR and oil spill containment, and hence have the potential to save lives and limit environmental disasters.
Abstract: Every year hundreds of people die at sea because of vessel and airplane accidents. A key challenge in reducing the number of these fatalities is to make Search and Rescue (SAR) algorithms more efficient. Here we address this challenge by uncovering hidden TRansient Attracting Profiles (TRAPs) in ocean-surface velocity data. Computable from a single velocity-field snapshot, TRAPs act as short-term attractors for all floating objects. In three different ocean field experiments, we show that TRAPs computed from measured as well as modelled velocities attract deployed drifters and manikins emulating people fallen in the water. TRAPs, which remain hidden to prior flow diagnostics, thus provide critical information for hazard responses, such as SAR and oil spill containment, and hence have the potential to save lives and limit environmental disasters.

Journal ArticleDOI
TL;DR: TrackEddy as discussed by the authors is based on the premise that the sea level signature of a coherent eddy can be approximated as a Gaussian feature, which then allows for the calculation of kinetic energy of the eddy field through the geostrophic approximation.
Abstract: The mesoscale eddy field plays a key role in the mixing and transport of physical and biological properties and redistribute energy budgets in the ocean. Eddy kinetic energy is commonly defined as the kinetic energy of the time-varying component of the velocity field. However, this definition contains all processes that vary in time, including coherent mesoscale eddies, jets, waves, and large-scale motions. The focus of this paper is on the eddy kinetic energy contained in coherent mesoscale eddies. We present a new method to decompose eddy kinetic energy into oceanic processes. The proposed method uses a new eddy-identification algorithm (TrackEddy). This algorithm is based on the premise that the sea level signature of a coherent eddy can be approximated as a Gaussian feature. The eddy Gaussian signature then allows for the calculation of kinetic energy of the eddy field through the geostrophic approximation. TrackEddy has been validated using synthetic sea surface height data, and then used to investigate trends of eddy kinetic energy in the Southern Ocean using Satellite Sea Surface Height anomaly (AVISO+). We detect an increasing trend of eddy kinetic energy associated with mesoscale eddies in the Southern Ocean. This trend is correlated with an increase of the coherent eddy amplitude and the strengthening of wind stress over the last two decades.

Journal ArticleDOI
TL;DR: This work presents a new rare event sampling algorithm called quantile diffusion Monte Carlo (quantile DMC), a simple-to-use algorithm that can sample extreme tail behavior for a wide class of processes that has potential to provide low-variance extreme weather statistics.
Abstract: Extreme mesoscale weather, including tropical cyclones, squall lines, and floods, can be enormously damaging and yet challenging to simulate; hence, there is a pressing need for more efficient simulation strategies. Here we present a new rare event sampling algorithm called Quantile Diffusion Monte Carlo (Quantile DMC). Quantile DMC is a simple-to-use algorithm that can sample extreme tail behavior for a wide class of processes. We demonstrate the advantages of Quantile DMC compared to other sampling methods and discuss practical aspects of implementing Quantile DMC. To test the feasibility of Quantile DMC for extreme mesoscale weather, we sample extremely intense realizations of two historical tropical cyclones, 2010 Hurricane Earl and 2015 Hurricane Joaquin. Our results demonstrate Quantile DMC's potential to provide low-variance extreme weather statistics while highlighting the work that is necessary for Quantile DMC to attain greater efficiency in future applications.

Journal ArticleDOI
TL;DR: In this paper, a comparison of the performance of CDA with the standard grid and spectral nudging techniques for representing long and short-scale features in the downscaled fields using the Weather Research and Forecast (WRF) model is further presented and analyzed.
Abstract: Continuous data assimilation (CDA) is successfully implemented for the first time for efficient dynamical downscaling of a global atmospheric reanalysis. A comparison of the performance of CDA with the standard grid and spectral nudging techniques for representing long- and short-scale features in the downscaled fields using the Weather Research and Forecast (WRF) model is further presented and analyzed. The WRF model is configured at 25km horizontal resolution and is driven by 250km initial and boundary conditions from NCEP/NCAR reanalysis fields. Downscaling experiments are performed over a one-month period in January, 2016. The similarity metric is used to evaluate the performance of the downscaling methods for large and small scales. Similarity results are compared for the outputs of the WRF model with different downscaling techniques, NCEP reanalysis, and Final Analysis. Both spectral nudging and CDA describe better the small-scale features compared to grid nudging. The choice of the wave number is critical in spectral nudging; increasing the number of retained frequencies generally produced better small-scale features, but only up to a certain threshold after which its solution gradually became closer to grid nudging. CDA maintains the balance of the large- and small-scale features similar to that of the best simulation achieved by the best spectral nudging configuration, without the need of a spectral decomposition. The different downscaled atmospheric variables, including rainfall distribution, with CDA is most consistent with the observations. The Brier skill score values further indicate that the added value of CDA is distributed over the entire model domain. The overall results clearly suggest that CDA provides an efficient new approach for dynamical downscaling by maintaining better balance between the global model and the downscaled fields.

Journal ArticleDOI
TL;DR: In this paper, a set of semi-empirical models based on these collected data was created and then used to estimate water broadening and its temperature dependence for all transitions of selected molecules in the HITRAN2016 database.
Abstract: The amount of water vapor in the terrestrial atmosphere is highly variable both spatially and temporally. In the tropics it sometimes constitutes 4-5% of the atmosphere. At the same time collisional broadening of spectral lines by water vapor is much larger than that by nitrogen and oxygen. Therefore, in order to accurately characterize and model spectra of the atmospheres with significant amounts of water vapor, the line-shape parameters for spectral lines broadened by water vapor are required. In this work, the line-broadening coefficients (and their temperature dependence exponents) due to the pressure of water vapor for lines of CO2, N2O, CO, CH4, O2, NH3, and H2S from both experimental and theoretical studies were collected and carefully reviewed. A set of semi-empirical models based on these collected data was created and then used to estimate water broadening and its temperature dependence for all transitions of selected molecules in the HITRAN2016 database.

Posted Content
TL;DR: This work demonstrates how encoder-decoder Convolutional Neural Networks (CNN) can be used to derive total precipitation using geopotential height as the only input, and provides a method to identify the levels of the geopotentials that have a higher influence on precipitation through a variable selection process.
Abstract: Numerical Weather Prediction (NWP) models represent sub-grid processes using parameterizations, which are often complex and a major source of uncertainty in weather forecasting. In this work, we devise a simple machine learning (ML) methodology to learn parameterizations from basic NWP fields. Specifically, we demonstrate how encoder-decoder Convolutional Neural Networks (CNN) can be used to derive total precipitation using geopotential height as the only input. Several popular neural network architectures, from the field of image processing, are considered and a comparison with baseline ML methodologies is provided. We use NWP reanalysis data to train different ML models showing how encoder-decoder CNNs are able to interpret the spatial information contained in the geopotential field to infer total precipitation with a high degree of accuracy. We also provide a method to identify the levels of the geopotential height that have a higher influence on precipitation through a variable selection process. As far as we know, this paper covers the first attempt to model NWP parameterizations using CNN methodologies.

Journal ArticleDOI
TL;DR: In this paper, the authors applied k-means clustering to the first several empirical orthogonal functions (EOFs) of geopotential height data and found that applying a weak persistence constraint within the clustering procedure led to a longer duration of circulation regimes, extending beyond the synoptic timescale without changing their occurrence rates.
Abstract: Atmospheric circulation is often clustered in so-called circulation regimes, which are persistent and recurrent patterns. For the Euro-Atlantic sector in winter, most studies identify four regimes: the Atlantic Ridge, the Scandinavian Blocking and the two phases of the North Atlantic Oscillation. These results are obtained by applying k-means clustering to the first several empirical orthogonal functions (EOFs) of geopotential height data. Studying the observed circulation in reanalysis data, it is found that when the full field data is used for the k-means cluster analysis instead of the EOFs, the optimal number of clusters is no longer four but six. The two extra regimes that are found are the opposites of the Atlantic Ridge and Scandinavian Blocking, meaning they have a low-pressure area roughly where the original regimes have a high-pressure area. This introduces an appealing symmetry in the clustering result. Incorporating a weak persistence constraint in the clustering procedure is found to lead to a longer duration of regimes, extending beyond the synoptic timescale, without changing their occurrence rates. This is in contrast to the commonly-used application of a time-filter to the data before the clustering is executed, which, while increasing the persistence, changes the occurrence rates of the regimes. We conclude that applying a persistence constraint within the clustering procedure is a superior way of stabilizing the clustering results than low-pass filtering the data.

Posted Content
TL;DR: In this article, the authors present an open source instrument that was developed for performing in situ measurements of sea ice dynamics, which includes an ultra-low power unit, a microcontroller-based logger, a small microcomputer for on-board data processing, and an Iridium modem for satellite communications.
Abstract: Sea ice is a major feature of the polar environments. Recent changes in the climate and extent of the sea ice, together with increased economic activity and research interest in these regions, are driving factors for new measurements of sea ice dynamics. Waves in ice are important as they participate in the coupling between the open ocean and the ice-covered regions. Measurements are challenging to perform due to remoteness and harsh environmental conditions. While progress has been made in observing wave propagtion in sea ice using remote methods, these are still relatively new measurements and would benefit from more in situ data for validation. In this article, we present an open source instrument that was developed for performing such measurements. The versatile design includes an ultra-low power unit, a microcontroller-based logger, a small microcomputer for on-board data processing, and an Iridium modem for satellite communications. Virtually any sensor can be used with this design. In the present case, we use an Inertial Motion Unit to record wave motion. High quality results were obtained, which opens new possibilities for in situ measurements in the polar regions. Our instrument can be easily customized to fit many in situ measurement tasks, and we hope that our work will provide a framework for future developments of a variety of such open source instruments.

Journal ArticleDOI
TL;DR: In this paper, the authors presented monthly time series data on precipitation, minimum and maximum temperature for four downscaled global circulation models, and used model output statistics in combination with mechanistic downscaling to calculate mean monthly maximum and minimum temperatures, as well as monthly precipitation at 5 km spatial resolution globally for the years 2006-2100.
Abstract: Predicting future climatic conditions at high spatial resolution is essential for many applications and impact studies in science. Here, we present monthly time series data on precipitation, minimum- and maximum temperature for four downscaled global circulation models. We used model output statistics in combination with mechanistic downscaling (the CHELSa algorithm) to calculate mean monthly maximum and minimum temperatures, as well as monthly precipitation at 5 km spatial resolution globally for the years 2006-2100. We validated the performance of the downscaling algorithm by comparing model output with the observed climate of the historical period 1950-1969.

Journal ArticleDOI
TL;DR: A physically derived global oceanic model of Wood et al. with five boxes is studied, that is calibrated to runs of the FAMOUS coupled atmosphere-ocean general circulation model, and finds that rate-induced thresholds for tipping can appear, even for perturbations that do not cross the bifurcation.
Abstract: The Atlantic Meridional Overturning Circulation (AMOC) transports substantial amounts of heat into the North Atlantic sector, and hence is of very high importance in regional climate projections. The AMOC has been observed to show multi-stability across a range of models of different complexity. The simplest models find a bifurcation associated with the AMOC `on' state losing stability that is a saddle node. Here we study a physically derived global oceanic model of Wood {\em et al} with five boxes, that is calibrated to runs of the FAMOUS coupled atmosphere-ocean general circulation model. We find the loss of stability of the `on' state is due to a subcritical Hopf for parameters from both pre-industrial and doubled CO${}_2$ atmospheres. This loss of stability via subcritical Hopf bifurcation has important consequences for the behaviour of the basin of attraction close to bifurcation. We consider various time-dependent profiles of freshwater forcing to the system, and find that rate-induced thresholds for tipping can appear, even for perturbations that do not cross the bifurcation. Understanding how such state transitions occur is important in determining allowable safe climate change mitigation pathways to avoid collapse of the AMOC.

Posted ContentDOI
TL;DR: Gaussian Process Regression, Nearest-Neighbor, Random Forest and Support Vector Regression were used to estimate the pan evaporation in the meteorological stations of Golestan Province, Iran and indicated that the PE values might be estimated with few easily measured meteorological parameters accurately.
Abstract: Evaporation is one of the main processes in the hydrological cycle, and it is one of the most critical factors in agricultural, hydrological, and meteorological studies. Due to the interactions of multiple climatic factors, the evaporation is a complex and nonlinear phenomenon; therefore, the data-based methods can be used to have precise estimations of it. In this regard, in the present study, Gaussian Process Regression, Nearest-Neighbor, Random Forest and Support Vector Regression were used to estimate the pan evaporation in the meteorological stations of Golestan Province, Iran. For this purpose, meteorological data including PE, temperature, relative humidity, wind speed and sunny hours collected from the Gonbad-e Kavus, Gorgan and Bandar Torkman stations from 2011 through 2017. The accuracy of the studied methods was determined using the statistical indices of Root Mean Squared Error, correlation coefficient and Mean Absolute Error. Furthermore, the Taylor charts utilized for evaluating the accuracy of the mentioned models. We report that GPR for Gonbad-e Kavus Station with input parameters of T, W and S and GPR for Gorgan and Bandar Torkmen stations with input parameters of T, RH, W, and S had the most accurate performances and proposed for precise estimation of PE. Due to the high rate of evaporation in Iran and the lack of measurement instruments, the findings of the current study indicated that the PE values might be estimated with few easily measured meteorological parameters accurately.

Posted Content
TL;DR: The reliability of two neural network interpretation techniques, backward optimization and layerwise relevance propagation, within geoscientific applications are tested by applying them to a commonly studied geophysical phenomenon, the Madden-Julian Oscillation, and it is found that the conventionally defined extended seasons should be shifted later by one month.
Abstract: We test the reliability of two neural network interpretation techniques, backward optimization and layerwise relevance propagation, within geoscientific applications by applying them to a commonly studied geophysical phenomenon, the Madden-Julian Oscillation. The Madden-Julian Oscillation is a multi-scale pattern within the tropical atmosphere that has been extensively studied over the past decades, which makes it an ideal test case to ensure the interpretability methods can recover the current state of knowledge regarding its spatial structure. The neural networks can, indeed, reproduce the current state of knowledge and can also provide new insights into the seasonality of the Madden-Julian Oscillation and its relationships with atmospheric state variables. The neural network identifies the phase of the Madden-Julian Oscillation twice as accurately as linear regression, which means that nonlinearities used by the neural network are important to the structure of the Madden-Julian Oscillation. Interpretations of the neural network show that it accurately captures the spatial structures of the Madden-Julian Oscillation, suggest that the nonlinearities of the Madden-Julian Oscillation are manifested through the uniqueness of each event, and offer physically meaningful insights into its relationship with atmospheric state variables. We also use the interpretations to identify the seasonality of the Madden-Julian Oscillation, and find that the conventionally defined extended seasons should be shifted later by one month. More generally, this study suggests that neural networks can be reliably interpreted for geoscientific applications and may there

Posted ContentDOI
TL;DR: Coupled online learning is proposed as a way to combat issues in Earth System Models based on the same three-step approach and is illustrated using the Lorenz 96 model, where coupled learning is able to recover the "true" parameterizations.
Abstract: Over the last couple of years, machine learning parameterizations have emerged as a potential way to improve the representation of sub-grid processes in Earth System Models (ESMs). So far, all studies were based on the same three-step approach: first a training dataset was created from a high-resolution simulation, then a machine learning algorithm was fitted to this dataset, before the trained algorithm was implemented in the ESM. The resulting online simulations were frequently plagued by instabilities and biases. Here, coupled online learning is proposed as a way to combat these issues. Coupled learning can be seen as a second training stage in which the pretrained machine learning parameterization, specifically a neural network, is run in parallel with a high-resolution simulation. The high-resolution simulation is kept in sync with the neural network-driven ESM through constant nudging. This enables the neural network to learn from the tendencies that the high-resolution simulation would produce if it experienced the states the neural network creates. The concept is illustrated using the Lorenz 96 model, where coupled learning is able to recover the "true" parameterizations. Further, detailed algorithms for the implementation of coupled learning in 3D cloud-resolving models and the super parameterization framework are presented. Finally, outstanding challenges and issues not resolved by this approach are discussed.