Showing papers by "Vipin Kumar published in 2020"

PDF

Open Access

Posted Content•

Integrating Physics-Based Modeling with Machine Learning: A Survey

[...]

Jared Willard, Xiaowei Jia, Shaoming Xu, Michael Steinbach, Vipin Kumar - Show less +1 more

01 Jan 2020

TL;DR: An overview of techniques to integrate machine learning with physics-based modeling and classes of methodologies used to construct physics-guided machine learning models and hybrid physics-machine learning frameworks from a machine learning standpoint is provided.

...read moreread less

Abstract: In this manuscript, we provide a structured and comprehensive overview of techniques to integrate machine learning with physics-based modeling. First, we provide a summary of application areas for which these approaches have been applied. Then, we describe classes of methodologies used to construct physics-guided machine learning models and hybrid physics-machine learning frameworks from a machine learning standpoint. With this foundation, we then provide a systematic organization of these existing techniques and discuss ideas for future research.

...read moreread less

230 citations

Posted Content•

Physics-Guided Machine Learning for Scientific Discovery: An Application in Simulating Lake Temperature Profiles

[...]

Xiaowei Jia¹, Jared Willard², Anuj Karpatne³, Jordan S. Read⁴, Jacob A. Zwart⁵, Michael Steinbach², Vipin Kumar² - Show less +3 more•Institutions (5)

University of Pittsburgh¹, University of Minnesota², Virginia Tech³, Association for Computing Machinery⁴, United States Geological Survey⁵

28 Jan 2020-arXiv: Learning

TL;DR: This article proposed a physics-guided recurrent neural network model (PGRNN) that combines RNNs and physics-based models to leverage their complementary strengths and improves the modeling of physical processes.

...read moreread less

Abstract: Physics-based models of dynamical systems are often used to study engineering and environmental systems. Despite their extensive use, these models have several well-known limitations due to simplified representations of the physical processes being modeled or challenges in selecting appropriate parameters. While-state-of-the-art machine learning models can sometimes outperform physics-based models given ample amount of training data, they can produce results that are physically inconsistent. This paper proposes a physics-guided recurrent neural network model (PGRNN) that combines RNNs and physics-based models to leverage their complementary strengths and improves the modeling of physical processes. Specifically, we show that a PGRNN can improve prediction accuracy over that of physics-based models, while generating outputs consistent with physical laws. An important aspect of our PGRNN approach lies in its ability to incorporate the knowledge encoded in physics-based models. This allows training the PGRNN model using very few true observed data while also ensuring high prediction accuracy. Although we present and evaluate this methodology in the context of modeling the dynamics of temperature in lakes, it is applicable more widely to a range of scientific and engineering disciplines where physics-based (also known as mechanistic) models are used, e.g., climate science, materials science, computational chemistry, and biomedicine.

...read moreread less

88 citations

Posted Content•DOI•

High throughput detection and genetic epidemiology of SARS-CoV-2 using COVIDSeq next generation sequencing

[...]

Rahul C. Bhoyar¹, Abhinav Jain¹, Paras Sehgal¹, Mohit Kumar Divakar¹, Disha Sharma¹, Mohamed Imran¹, Bani Jolly¹, Gyan Ranjan¹, Mercy Rophina¹, Sumit Sharma¹, Sanjay Siwach¹, Kavita Pandhare¹, Swayamprabha Sahoo², Maheswata Sahoo², Ananya Nayak², Jatindra Nath Mohanty², Jayashankar Das², Sudhir Bhandari³, Sandeep Kumar Mathur³, Anshul Kumar³, Rahul Sahlot³, Pallavali Rojarani⁴, Juturu Vijaya Lakshmi⁴, Araveti Surekha⁴, Pulala Chandra Sekhar⁴, Shelly Mahajan, Shet Masih, Pawan K. Singh, Vipin Kumar, Blessy Jose, Vidur Mahajan, Vivek Gupta, Rakesh Gupta, Prabhakar Arumugam¹, Anjali Singh¹, Ananya Nandy¹, P.V. Raghavendran¹, Rakesh Mohan Jha¹, Anupama Kumari¹, Sheetal Gandotra¹, Vivek Rao¹, Mohammed Faruq¹, Sanjeev Kumar¹, G Betsy Reshma¹, G Narendra Varma¹, Shuvra Shekhar Roy¹, Antara Sengupta¹, Sabyasachi Chattopadhyay¹, Khushboo Singhal¹, Shalini Pradhan¹, Diksha Jha¹, Salwa Naushin¹, Saruchi Wadhwa¹, Nishu Tyagi¹, Mukta Poojary¹, Vinod Scaria¹, Sridhar Sivasubbu¹ - Show less +53 more•Institutions (4)

Institute of Genomics and Integrative Biology¹, Siksha O Anusandhan University², Sawai ManSingh Medical College³, Kurnool Medical College⁴

10 Aug 2020-bioRxiv

TL;DR: The present study uses the COVIDSeq protocol, which involves multiplex-PCR, barcoding and sequencing of samples for high-throughput detection and deciphering the genetic epidemiology of SARS-CoV-2, and suggests that CO VIDSeq could be a potential high sensitivity assay for detection of Sars-Cov-2.

...read moreread less

Abstract: The rapid emergence of coronavirus disease 2019 (COVID-19) as a global pandemic affecting millions of individuals globally has necessitated sensitive and high-throughput approaches for the diagnosis, surveillance and for determining the genetic epidemiology of SARS-CoV-2. In the present study, we used the COVIDSeq protocol, which involves multiplex-PCR, barcoding and sequencing of samples for high-throughput detection and deciphering the genetic epidemiology of SARS-CoV-2. We used the approach on 752 clinical samples in duplicates, amounting to a total of 1536 samples which could be sequenced on a single S4 sequencing flow cell on NovaSeq 6000. Our analysis suggests a high concordance between technical duplicates and a high concordance of detection of SARS-CoV-2 between the COVIDSeq as well as RT-PCR approaches. An in-depth analysis revealed a total of six samples in which COVIDSeq detected SARS-CoV-2 in high confidence which were negative in RT-PCR. Additionally, the assay could detect SARS-CoV-2 in 21 samples and 16 samples which were classified inconclusive and pan-sarbeco positive respectively suggesting that COVIDSeq could be used as a confirmatory test. The sequencing approach also enabled insights into the evolution and genetic epidemiology of the SARS-CoV-2 samples. The samples were classified into a total of 3 clades. This study reports two lineages B.1.112 and B.1.99 for the first time in India. This study also revealed 1,143 unique single nucleotide variants and added a total of 73 novel variants identified for the first time. To the best of our knowledge, this is the first report of the COVIDSeq approach for detection and genetic epidemiology of SARS-CoV-2. Our analysis suggests that COVIDSeq could be a potential high sensitivity assay for detection of SARS-CoV-2, with an additional advantage of enabling genetic epidemiology of SARS-CoV-2.

...read moreread less

63 citations

Journal Article•DOI•

Predicting lake surface water phosphorus dynamics using process-guided machine learning

[...]

Paul C. Hanson¹, Aviah B. Stillman¹, Xiaowei Jia², Anuj Karpatne, Hilary A. Dugan¹, Cayelan C. Carey³, Joseph Stachelek⁴, Nicole K. Ward³, Yu Zhang⁵, Yu Zhang⁶, Jordan S. Read⁷, Vipin Kumar - Show less +8 more•Institutions (7)

University of Wisconsin-Madison¹, University of Minnesota², Virginia Tech³, Michigan State University⁴, Pennsylvania State University⁵, Los Alamos National Laboratory⁶, United States Geological Survey⁷

15 Aug 2020-Ecological Modelling

TL;DR: In this paper, a process-guided recurrent neural network (PGRNN) was used to predict epilimnetic phosphorus over a time range of days to decades in Lake Mendota.

...read moreread less

48 citations

Posted Content•

Integrating Scientific Knowledge with Machine Learning for Engineering and Environmental Systems

[...]

Jared Willard, Xiaowei Jia, Shaoming Xu, Michael Steinbach, Vipin Kumar - Show less +1 more

10 Mar 2020-arXiv: Computational Physics

TL;DR: There is a growing consensus that solutions to complex science and engineering problems require novel methodologies that are able to integrate traditional physics-based modeling approaches with state-of-the-art machine learning (ML) techniques as discussed by the authors.

...read moreread less

Abstract: There is a growing consensus that solutions to complex science and engineering problems require novel methodologies that are able to integrate traditional physics-based modeling approaches with state-of-the-art machine learning (ML) techniques. This paper provides a structured overview of such techniques. Application-centric objective areas for which these approaches have been applied are summarized, and then classes of methodologies used to construct physics-guided ML models and hybrid physics-ML frameworks are described. We then provide a taxonomy of these existing techniques, which uncovers knowledge gaps and potential crossovers of methods between disciplines that can serve as ideas for future research.

...read moreread less

38 citations

Journal Article•DOI•

Satellite-based remote sensing data set of global surface water storage change from 1992 to 2018

[...]

Riccardo Tortini¹, Nina V. Noujdina¹, Samantha Yeo¹, Martina Ricko², Charon Birkett², Ankush Khandelwal³, Vipin Kumar³, Miriam E. Marlier¹, Dennis P. Lettenmaier¹ - Show less +5 more•Institutions (3)

University of California, Los Angeles¹, Goddard Space Flight Center², University of Minnesota³

19 May 2020-Earth System Science Data

TL;DR: Khandelwal et al. as discussed by the authors presented a global set of satellite-derived time series of surface water storage changes for lakes and reservoirs for a period that covers the satellite-altimetry era.

...read moreread less

Abstract: . The recent availability of freely and openly available satellite remote sensing products has enabled the implementation of global surface water monitoring at a level not previously possible. Here we present a global set of satellite-derived time series of surface water storage variations for lakes and reservoirs for a period that covers the satellite altimetry era. Our goals are to promote the use of satellite-derived products for the study of large inland water bodies and to set the stage for the expected availability of products from the Surface Water and Ocean Topography (SWOT) mission, which will vastly expand the spatial coverage of such products, expected from 2021 on. Our general strategy is to estimate global surface water storage changes ( ΔV ) in large lakes and reservoirs using a combination of paired water surface elevation (WSE) and water surface area (WSA) extent products. Specifically, we use data produced by multiple satellite altimetry missions (TOPEX/Poseidon, Jason-1, Jason-2, Jason-3, and Envisat) from 1992 on, with surface extent estimated from Terra/Aqua Moderate Resolution Imaging Spectroradiometer (MODIS) from 2000 on. We leverage relationships between elevation and surface area (i.e., hypsometry) to produce estimates of ΔV even during periods when either of the variables was not available. This approach is successful provided that there are strong relationships between the two variables during an overlapping period. Our target is to produce time series of ΔV as well as of WSE and WSA for a set of 347 lakes and reservoirs globally for the 1992–2018 period. The data sets presented and their respective algorithm theoretical basis documents are publicly available and distributed via the Physical Oceanography Distributed Active Archive Center (PO DAAC; https://podaac.jpl.nasa.gov/ , last access: 13 May 2020) of NASA's Jet Propulsion Laboratory. Specifically, the WSE data set is available at https://doi.org/10.5067/UCLRS-GREV2 (Birkett et al., 2019), the WSA data set is available at https://doi.org/10.5067/UCLRS-AREV2 (Khandelwal and Kumar, 2019), and the ΔV data set is available at https://doi.org/10.5067/UCLRS-STOV2 (Tortini et al., 2019). The records we describe represent the most complete global surface water time series available from the launch of TOPEX/Poseidon in 1992 (beginning of the satellite altimetry era) to the near present. The production of long-term, consistent, and calibrated records of surface water cycle variables such as in the data set presented here is of fundamental importance to baseline future SWOT products.

...read moreread less

34 citations

Journal Article•DOI•

Predicting Water Temperature Dynamics of Unmonitored Lakes with Meta Transfer Learning.

[...]

Jared Willard¹, Jordan S. Read², Alison P. Appling², Samantha K. Oliver², Xiaowei Jia³, Vipin Kumar¹ - Show less +2 more•Institutions (3)

University of Minnesota¹, United States Geological Survey², University of Pittsburgh³

10 Nov 2020-arXiv: Learning

TL;DR: This method builds a meta-learning model to predict transfer performance from candidate source models to targets using lake attributes and candidates' past performance, and demonstrates that MTL with meaningful predictor variables and high-quality source models is a promising approach for many kinds of unmonitored systems and environmental variables.

...read moreread less

Abstract: Most environmental data come from a minority of well-observed sites. An ongoing challenge in the environmental sciences is transferring knowledge from monitored sites to unobserved sites. Here, we demonstrate a novel transfer learning framework that accurately predicts temperature in unobserved lakes (targets) by borrowing models from highly observed lakes (sources). This method, Meta Transfer Learning (MTL), builds a meta-learning model to predict transfer performance from candidate source models to targets using lake attributes and candidates' past performance. We constructed source models at 145 well-observed lakes using calibrated process-based modeling (PB) and a recently developed approach called process-guided deep learning (PGDL). We applied MTL to either PB or PGDL source models (PB-MTL or PGDL-MTL, respectively) to predict temperatures in 305 target lakes treated as unobserved in the Upper Midwestern United States. We show significantly improved performance relative to the uncalibrated process-based General Lake Model, where the median RMSE for the target lakes is $2.52^{\circ}C$. PB-MTL yielded a median RMSE of $2.43^{\circ}C$; PGDL-MTL yielded $2.16^{\circ}C$; and a PGDL-MTL ensemble of nine sources per target yielded $1.88^{\circ}C$. For sparsely observed target lakes, PGDL-MTL often outperformed PGDL models trained on the target lakes themselves. Differences in maximum depth between the source and target were consistently the most important predictors. Our approach readily scales to thousands of lakes in the Midwestern United States, demonstrating that MTL with meaningful predictor variables and high-quality source models is a promising approach for many kinds of unmonitored systems and environmental variables.

...read moreread less

31 citations

Posted Content•

Physics Guided Machine Learning Methods for Hydrology.

[...]

Ankush Khandelwal¹, Shaoming Xu, Xiang Li, Xiaowei Jia, Michael Steinbach, Christopher J. Duffy, John Nieber, Vipin Kumar - Show less +4 more•Institutions (1)

University of Minnesota¹

02 Dec 2020-arXiv: Atmospheric and Oceanic Physics

TL;DR: An LSTM based deep learning architecture that is coupled with SWAT, an hydrology model that is in wide use today, is proposed to incorporate the understanding of physical processes and constraints in hydrology into machine learning algorithms, and thus bridge the performance gap while reducing the need for large amounts of data compared to traditional data-driven approaches.

...read moreread less

Abstract: Streamflow prediction is one of the key challenges in the field of hydrology due to the complex interplay between multiple non-linear physical mechanisms behind streamflow generation. While physically-based models are rooted in rich understanding of the physical processes, a significant performance gap still remains which can be potentially addressed by leveraging the recent advances in machine learning. The goal of this work is to incorporate our understanding of physical processes and constraints in hydrology into machine learning algorithms, and thus bridge the performance gap while reducing the need for large amounts of data compared to traditional data-driven approaches. In particular, we propose an LSTM based deep learning architecture that is coupled with SWAT (Soil and Water Assessment Tool), an hydrology model that is in wide use today. The key idea of the approach is to model auxiliary intermediate processes that connect weather drivers to streamflow, rather than directly mapping runoff from weather variables which is what a deep learning architecture without physical insight will do. The efficacy of the approach is being analyzed on several small catchments located in the South Branch of the Root River Watershed in southeast Minnesota. Apart from observation data on runoff, the approach also leverages a 200-year synthetic dataset generated by SWAT to improve the performance while reducing convergence time. In the early phases of this study, simpler versions of the physics guided deep learning architectures are being used to achieve a system understanding of the coupling of physics and machine learning. As more complexity is introduced into the present implementation, the framework will be able to generalize to more sophisticated cases where spatial heterogeneity is present.

...read moreread less

15 citations

Proceedings Article•DOI•

Personalized Image Retrieval with Sparse Graph Representation Learning

[...]

Xiaowei Jia¹, Handong Zhao², Zhe Lin², Ajinkya Gorakhnath Kale², Vipin Kumar¹ - Show less +1 more•Institutions (2)

University of Minnesota¹, Adobe Systems²

23 Aug 2020

TL;DR: This paper develops a novel method CA-GCN for personalized image retrieval in the Adobe Stock image system that leverages user behavior data in a Graph Convolutional Neural Network (GCN) model to learn user and image embeddings simultaneously.

...read moreread less

Abstract: Personalization is essential for enhancing the customer experience in retrieval tasks. In this paper, we develop a novel method CA-GCN for personalized image retrieval in the Adobe Stock image system. The proposed method CA-GCN leverages user behavior data in a Graph Convolutional Neural Network (GCN) model to learn user and image embeddings simultaneously. Standard GCN performs poorly on sparse user-image interaction graphs due to the limited knowledge gain from less representative neighbors. To address this challenge, we propose to augment the sparse user-image interaction data by considering the similarities among images. Specifically, we detect clusters of similar images and introduce a set of hidden super-nodes in the graph to represent clusters. We show that such an augmented graph structure can significantly improve the retrieval performance on real-world data collected from Adobe Stock service. In particular, when testing the proposed method on real users' stock image retrieval sessions, we get promoted average click position from 70 to 51.

...read moreread less

15 citations

Book Chapter•DOI•

Semi-supervised Classification using Attention-based Regularization on Coarse-resolution Data.

[...]

Guruprasad Nayak¹, Rahul Ghosh, Xiaowei Jia¹, Varun Mithal², Vipin Kumar - Show less +1 more•Institutions (2)

University of Minnesota¹, LinkedIn²

01 Jan 2020

TL;DR: In this article, a multi-view learning approach is proposed to exploit the complementarity of features across different views to improve models on both views, where features of instances at different resolutions are used to learn the correspondence between instances across resolutions using an attention mechanism.

...read moreread less

Abstract: Many real-world phenomena are observed at multiple resolutions. Predictive models designed to predict these phenomena typically consider different resolutions separately. This approach might be limiting in applications where predictions are desired at fine resolutions but available training data is scarce. In this paper, we propose classification algorithms that leverage supervision from coarser resolutions to help train models on finer resolutions. The different resolutions are modeled as different views of the data in a multi-view framework that exploits the complementarity of features across different views to improve models on both views. Unlike traditional multi-view learning problems, the key challenge in our case is that there is no one-to-one correspondence between instances across different views in our case, which requires explicit modeling of the correspondence of instances across resolutions. We propose to use the features of instances at different resolutions to learn the correspondence between instances across resolutions using an attention mechanism.Experiments on the real-world application of mapping urban areas using satellite observations and sentiment classification on text data show the effectiveness of the proposed methods.

...read moreread less

13 citations

Journal Article•DOI•

Predicting diabetes clinical outcomes using longitudinal risk factor trajectories

[...]

György J. Simon¹, Kevin A. Peterson¹, M. Regina Castro², Michael Steinbach¹, Vipin Kumar¹, Pedro J. Caraballo² - Show less +2 more•Institutions (2)

University of Minnesota¹, Mayo Clinic²

08 Jan 2020-BMC Medical Informatics and Decision Making

TL;DR: Cumulative exposure measured over different periods showed that even short episodes of hyperglycemia increase the risk of developing diabetes, and the longer an individual maintains glycemic control after a hyperglycemic episode, the lower the subsequent risk of diabetes.

...read moreread less

Abstract: The ubiquity of electronic health records (EHR) offers an opportunity to observe trajectories of laboratory results and vital signs over long periods of time. This study assessed the value of risk factor trajectories available in the electronic health record to predict incident type 2 diabetes. Analysis was based on a large 13-year retrospective cohort of 71,545 adult, non-diabetic patients with baseline in 2005 and median follow-up time of 8 years. The trajectories of fasting plasma glucose, lipids, BMI and blood pressure were computed over three time frames (2000–2001, 2002–2003, 2004) before baseline. A novel method, Cumulative Exposure (CE), was developed and evaluated using Cox proportional hazards regression to assess risk of incident type 2 diabetes. We used the Framingham Diabetes Risk Scoring (FDRS) Model as control. The new model outperformed the FDRS Model (.802 vs .660; p-values <2e-16). Cumulative exposure measured over different periods showed that even short episodes of hyperglycemia increase the risk of developing diabetes. Returning to normoglycemia moderates the risk, but does not fully eliminate it. The longer an individual maintains glycemic control after a hyperglycemic episode, the lower the subsequent risk of diabetes. Incorporating risk factor trajectories substantially increases the ability of clinical decision support risk models to predict onset of type 2 diabetes and provides information about how risk changes over time.

...read moreread less

Posted Content•

Physics-Guided Recurrent Graph Networks for Predicting Flow and Temperature in River Networks

[...]

Xiaowei Jia¹, Jacob A. Zwart², Jeffrey M. Sadler², Alison P. Appling², Samantha K. Oliver², Steven L. Markstrom², Jared Willard², Shaoming Xu³, Michael Steinbach³, Jordan S. Read³, Vipin Kumar³ - Show less +7 more•Institutions (3)

University of Pittsburgh¹, United States Geological Survey², University of Minnesota³

26 Sep 2020-arXiv: Geophysics

TL;DR: A physics-guided machine learning approach that combines advanced machine learning models and physics-based models to improve the prediction of water flow and temperature in river networks is proposed and shown to produce better performance when generalized to different seasons or river segments with different streamflow ranges.

...read moreread less

Abstract: This paper proposes a physics-guided machine learning approach that combines advanced machine learning models and physics-based models to improve the prediction of water flow and temperature in river networks. We first build a recurrent graph network model to capture the interactions among multiple segments in the river network. Then we present a pre-training technique which transfers knowledge from physics-based models to initialize the machine learning model and learn the physics of streamflow and thermodynamics. We also propose a new loss function that balances the performance over different river segments. We demonstrate the effectiveness of the proposed method in predicting temperature and streamflow in a subset of the Delaware River Basin. In particular, we show that the proposed method brings a 33\%/14\% improvement over the state-of-the-art physics-based model and 24\%/14\% over traditional machine learning models (e.g., Long-Short Term Memory Neural Network) in temperature/streamflow prediction using very sparse (0.1\%) observation data for training. The proposed method has also been shown to produce better performance when generalized to different seasons or river segments with different streamflow ranges.

...read moreread less

Proceedings Article•DOI•

Regularized graph convolutional networks for short text classification

[...]

Kshitij Tayal¹, Nikhil Rao², Agarwal Saurabh³, Xiaowei Jia⁴, Karthik Subbian², Vipin Kumar¹ - Show less +2 more•Institutions (4)

University of Minnesota¹, Amazon.com², Microsoft³, University of Pittsburgh⁴

01 Dec 2020

TL;DR: This work proposes a Graph-regularized Graph Convolution Network (GR-GCN), which augments graph convolution networks by incorporating label dependencies in the output space and shows that compared to baseline methods, GR- GCN is more robust to noise in textual features.

...read moreread less

Abstract: Short text classification is a fundamental problem in natural language processing, social network analysis, and e-commerce. The lack of structure in short text sequences limits the success of popular NLP methods based on deep learning. Simpler methods that rely on bag-of-words representations tend to perform on par with complex deep learning methods. To tackle the limitations of textual features in short text, we propose a Graph-regularized Graph Convolution Network (GR-GCN), which augments graph convolution networks by incorporating label dependencies in the output space. Our model achieves state-of-the-art results on both proprietary and external datasets, outperforming several baseline methods by up to 6% . Furthermore, we show that compared to baseline methods, GR-GCN is more robust to noise in textual features.

...read moreread less

Journal Article•DOI•

Mining Novel Multivariate Relationships in Time Series Data Using Correlation Networks

[...]

Saurabh Agrawal¹, Michael Steinbach¹, Daniel Boley¹, Snigdhansu Chatterjee¹, Gowtham Atluri², Stefan Liess¹, Vipin Kumar¹ - Show less +3 more•Institutions (2)

University of Minnesota¹, University of Cincinnati²

01 Sep 2020-IEEE Transactions on Knowledge and Data Engineering

TL;DR: The utility of multipoles is demonstrated in discovering new physical phenomena in two scientific domains: climate science and neuroscience by discovering several multipole relationships that are reproducible in multiple other independent datasets and lead to novel domain insights.

...read moreread less

Abstract: In many domains, there is significant interest in capturing novel relationships between time series that represent activities recorded at different nodes of a highly complex system. In this paper, we introduce multipoles, a novel class of linear relationships between more than two time series. A multipole is a set of time series that have strong linear dependence among themselves, with the requirement that each time series makes a significant contribution to the linear dependence. We demonstrate that most interesting multipoles can be identified as cliques of negative correlations in a correlation network. Such cliques are typically rare in a real-world correlation network, which allows us to find almost all multipoles efficiently using a clique-enumeration approach. Using our proposed framework, we demonstrate the utility of multipoles in discovering new physical phenomena in two scientific domains: climate science and neuroscience. In particular, we discovered several multipole relationships that are reproducible in multiple other independent datasets and lead to novel domain insights.

...read moreread less

Posted Content•

Teaching deep learning causal effects improves predictive performance.

[...]

Jia Li, Xiaowei Jia, Haoyu Yang, Vipin Kumar, Michael Steinbach, György J. Simon - Show less +2 more

11 Nov 2020-arXiv: Learning

TL;DR: This work describes a Causal-Temporal Structure for temporal EHR data and proposes a knowledge-guided neural network methodology to incorporate estimated ITE, and demonstrates on real-world and synthetic data that the proposed methodology can significantly improve the prediction performance of RNN.

...read moreread less

Abstract: Causal inference is a powerful statistical methodology for explanatory analysis and individualized treatment effect (ITE) estimation, a prominent causal inference task that has become a fundamental research problem. ITE estimation, when performed naively, tends to produce biased estimates. To obtain unbiased estimates, counterfactual information is needed, which is not directly observable from data. Based on mature domain knowledge, reliable traditional methods to estimate ITE exist. In recent years, neural networks have been widely used in clinical studies. Specifically, recurrent neural networks (RNN) have been applied to temporal Electronic Health Records (EHR) data analysis. However, RNNs are not guaranteed to automatically discover causal knowledge, correctly estimate counterfactual information, and thus correctly estimate the ITE. This lack of correct ITE estimates can hinder the performance of the model. In this work we study whether RNNs can be guided to correctly incorporate ITE-related knowledge and whether this improves predictive performance. Specifically, we first describe a Causal-Temporal Structure for temporal EHR data; then based on this structure, we estimate sequential ITE along the timeline, using sequential Propensity Score Matching (PSM); and finally, we propose a knowledge-guided neural network methodology to incorporate estimated ITE. We demonstrate on real-world and synthetic data (where the actual ITEs are known) that the proposed methodology can significantly improve the prediction performance of RNN.

...read moreread less

Unlocking Inverse Problems Using Deep Learning: Breaking Symmetries in Phase Retrieval

[...]

Kshitij Tayal, Chieh-Hsin Lai, Raunak Manekar, Zhong Zhuang, Vipin Kumar, Ju Sun - Show less +2 more

23 Oct 2020

TL;DR: It is shown that careful symmetry breaking on the training data can help get rid of the difficulties of tackling inverse problems by the emerging end-to-end deep learning approach and significantly improve learning performance in real data experiments.

...read moreread less

Abstract: In many physical systems, inputs related by intrinsic system symmetries generate the same output. So when inverting such systems, an input is mapped to multiple symmetry-related outputs. This causes fundamental difficulties for tackling these inverse problems by the emerging end-to-end deep learning approach. Taking phase retrieval as an illustrative example, we show that careful symmetry breaking on the training data can help get rid of the difficulties and significantly improve learning performance in real data experiments. We also extract and highlight the underlying mathematical principle of the proposed solution, which is directly applicable to other inverse problems.

...read moreread less

Deep Learning Initialized Phase Retrieval

[...]

Raunak Manekar, Zhong Zhuang, Kshitij Tayal, Vipin Kumar, Ju Sun - Show less +1 more

23 Oct 2020

TL;DR: It is shown that a carefully designed deep learning pipeline can consistently generate reliable initialization, so that the subsequent iterative methods can solve the PR problem and produce high-quality solutions.

...read moreread less

Abstract: Phase retrieval (PR) consists of estimating 2D or 3D objects from their Fourier magnitudes and takes a central place in scientific imaging. At present, most iterative methods for PR work well only when the initialization is close enough to the solution and fail otherwise. But there has been no general way of obtaining desired initialization. In this paper, we show that a carefully designed deep learning pipeline can consistently generate reliable initialization, so that the subsequent iterative methods can solve the PR problem and produce high-quality solutions. Technically, PR is an inverse problem containing three forward symmetries, and naive deployment of end-to-end deep learning for PR yields poor initialization. We explain why the symmetries cause the learning difficulty and propose a novel strategy that substantially improves the estimation. Overall, the proposed method solves PR in regimes not accessible by the previous methods, and our work synergizes deep learning and iterative methods to solve a difficult scientific inverse problem.

...read moreread less

Journal Article•DOI•

Global River Monitoring Using Semantic Fusion Networks

[...]

Zhihao Wei¹, Kebin Jia¹, Xiaowei Jia², Ankush Khandelwal, Vipin Kumar - Show less +1 more•Institutions (2)

Beijing University of Technology¹, University of Minnesota²

12 Aug 2020-Water

TL;DR: A segmentation method for global river monitoring based on semantic clustering and semantic fusion that outperforms several state-of-the-art algorithms, and demonstrates that grouping semantic information helps better segment the RSIR in global scale.

...read moreread less

Abstract: Global river monitoring is an important mission within the remote sensing society. One of the main challenges faced by this mission is generating an accurate water mask from remote sensing images (RSI) of rivers (RSIR), especially on a global scale with various river features. Aiming at better water area classification using semantic information, this paper presents a segmentation method for global river monitoring based on semantic clustering and semantic fusion. Firstly, an encoder–decoder network (AEN)-based architecture is proposed to obtain the semantic features from RSIR. Secondly, a clustering-based semantic fusion method is proposed to divide semantic features of RSIR into groups and train convolutional neural networks (CNN) models corresponding to each group using data augmentation and semi-supervised learning. Thirdly, a semantic distance-based segmentation fusion method is proposed for fusing the CNN models result into final segmentation mask. We built a global river dataset that contains multiple river segments from each continent of the world based on Sentinel-2 satellite imagery. The result shows that the F1-score of the proposed segmentation method is 93.32%, which outperforms several state-of-the-art algorithms, and demonstrates that grouping semantic information helps better segment the RSIR in global scale.

...read moreread less

Posted Content•

Semi-supervised Classification using Attention-based Regularization on Coarse-resolution Data.

[...]

Guruprasad Nayak¹, Rahul Ghosh, Xiaowei Jia¹, Varun Mithal², Vipin Kumar - Show less +1 more•Institutions (2)

University of Minnesota¹, LinkedIn²

03 Jan 2020-arXiv: Learning

TL;DR: In this paper, a multi-view learning approach is proposed to exploit the complementarity of features across different views to improve models on both views, where features of instances at different resolutions are used to learn the correspondence between instances across resolutions using an attention mechanism.

...read moreread less

Proceedings Article•DOI•

Model-agnostic Methods for Text Classification with Inherent Noise

[...]

Kshitij Tayal¹, Rahul Ghosh¹, Vipin Kumar¹•Institutions (1)

University of Minnesota¹

01 Dec 2020

TL;DR: This work evaluates model-agnostic methods to handle inherent noise in large scale text classification that can be easily incorporated into existing machine learning workflows with minimal interruption and describes the learning and application of this approach.

...read moreread less

Abstract: Text classification is a fundamental problem, and recently, deep neural networks (DNN) have shown promising results in many natural language tasks. However, their human-level performance relies on high-quality annotations, which are time-consuming and expensive to collect. As we move towards large inexpensive datasets, the inherent label noise degrades the generalization of DNN. While most machine learning literature focuses on building complex networks to handle noise, in this work, we evaluate model-agnostic methods to handle inherent noise in large scale text classification that can be easily incorporated into existing machine learning workflows with minimal interruption. Specifically, we conduct a point-by-point comparative study between several noise-robust methods on three datasets encompassing three popular classification models. To our knowledge, this is the first time such a comprehensive study in text classification encircling popular models and model-agnostic loss methods has been conducted. In this study, we describe our learning and demonstrate the application of our approach, which outperformed baselines by up to 10 % in classification accuracy while requiring no network modifications.

...read moreread less

Journal Article•DOI•

BHi-Cect: a top-down algorithm for identifying the multi-scale hierarchical structure of chromosomes.

[...]

Vipin Kumar, Simon Leclerc, Yuichi Taniguchi¹•Institutions (1)

National Presto Industries¹

18 Mar 2020-Nucleic Acids Research

TL;DR: The results reveal that the hierarchical structure of chromosome is organized as ‘enclaves’, which are complex interwoven clusters at both local and global scales, and it is shown that the nesting of local clusters within global clusters characterizing enclaves, is associated with the epigenomic activity found on the underlying DNA.

...read moreread less

Abstract: High-throughput chromosome conformation capture (Hi-C) technology enables the investigation of genome-wide interactions among chromosome loci. Current algorithms focus on topologically associating domains (TADs), that are contiguous clusters along the genome coordinate, to describe the hierarchical structure of chromosomes. However, high resolution Hi-C displays a variety of interaction patterns beyond what current TAD detection methods can capture. Here, we present BHi-Cect, a novel top-down algorithm that finds clusters by considering every locus with no assumption of genomic contiguity using spectral clustering. Our results reveal that the hierarchical structure of chromosome is organized as 'enclaves', which are complex interwoven clusters at both local and global scales. We show that the nesting of local clusters within global clusters characterizing enclaves, is associated with the epigenomic activity found on the underlying DNA. Furthermore, we show that the hierarchical nesting that links different enclaves integrates their respective function. BHi-Cect provides means to uncover the general principles guiding chromatin architecture.

...read moreread less

Proceedings Article•DOI•

Learning with Small Data

[...]

Huaxiu Yao¹, Xiaowei Jia², Vipin Kumar², Zhenhui Li¹•Institutions (2)

Pennsylvania State University¹, University of Minnesota²

23 Aug 2020

TL;DR: This tutorial will review the trending state-of-the-art machine learning techniques for learning with small (labeled) data, and focuses more on meta-learning, which improves the model generalization ability and has been proven to be an effective approach recently.

...read moreread less

Abstract: In the era of big data, data-driven methods have become increasingly popular in various applications, such as image recognition, traffic signal control, fake news detection. The superior performance of these data-driven approaches relies on large-scale labeled training data, which are probably inaccessible in real-world applications, i.e., "small (labeled) data" challenge. Examples include predicting emergent events in a city, detecting emerging fake news, and forecasting the progression of conditions for rare diseases. In most scenarios, people care about these small data cases most and thus improving the learning effectiveness of machine learning algorithms with small labeled data has been a popular research topic. In this tutorial, we will review the trending state-of-the-art machine learning techniques for learning with small (labeled) data. These techniques are organized from two aspects: (1) providing a comprehensive review of recent studies about knowledge generalization, transfer, and sharing, where transfer learning, multi-task learning, and meta-learning are discussed. Particularly, we will focus more on meta-learning, which improves the model generalization ability and has been proven to be an effective approach recently; (2) introducing the cutting-edge techniques which focus on incorporating domain knowledge into machine learning models. Different from model-based knowledge transfer techniques, in real-world applications, domain knowledge (e.g., physical laws) provides us with a new angle to deal with the small data challenge. Specifically, domain knowledge can be used to optimize learning strategies and/or guide the model design. In data mining field, we believe that learning with small data is a trending topic with important social impact, which will attract both researchers and practitioners from academia and industry.

...read moreread less

Proceedings Article•

End to End learning for Phase Retrieval

[...]

Raunak Manekar¹, Kshitij Tayal², Vipin Kumar², Ju Sun²•Institutions (2)

Central Electronics Engineering Research Institute¹, University of Minnesota²

01 Jul 2020

TL;DR: In this paper, an end-to-end deep learning approach for phase retrieval was proposed, and the authors highlight a fundamental difficulty for learning that previous work has neglected, likely due to the biased datasets they use for training and evaluation.

...read moreread less

Abstract: We consider the end-to-end deep learning approach for phase retrieval, a central problem in scientific imaging. We highlight a fundamental difficulty for learning that previous work has neglected, likely due to the biased datasets they use for training and evaluation. We propose a simple yet different formulation for PR that seems to overcome the difficulty and return consistently better qualitative results.

...read moreread less

Journal Article•DOI•

Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect Visual Annotations

[...]

Xiaowei Jia¹, Ankush Khandelwal, Kimberly M. Carlson², Kimberly M. Carlson³, James S. Gerber¹, Paul C. West¹, Leah H. Samberg¹, Leah H. Samberg⁴, Vipin Kumar - Show less +5 more•Institutions (4)

University of Minnesota¹, Icahn School of Medicine at Mount Sinai², University of Hawaii at Manoa³, Rainforest Alliance⁴

01 Feb 2020-Remote Sensing

TL;DR: This work proposes an automated approach, which is called Plantation Analysis by Learning from Multiple Classes (PALM), for mapping plantations on an annual basis using satellite remote sensing data and implements the proposed automated approach using MODIS data in Sumatra and Indonesian Borneo.

...read moreread less

Abstract: Expansion of large-scale tree plantations for commodity crop and timber production is a leading cause of tropical deforestation. While automated detection of plantations across large spatial scales and with high temporal resolution is critical to inform policies to reduce deforestation, such mapping is technically challenging. Thus, most available plantation maps rely on visual inspection of imagery, and many of them are limited to small areas for specific years. Here, we present an automated approach, which we call Plantation Analysis by Learning from Multiple Classes (PALM), for mapping plantations on an annual basis using satellite remote sensing data. Due to the heterogeneity of land cover classes, PALM utilizes ensemble learning to simultaneously incorporate training samples from multiple land cover classes over different years. After the ensemble learning, we further improve the performance by post-processing using a Hidden Markov Model. We implement the proposed automated approach using MODIS data in Sumatra and Indonesian Borneo (Kalimantan). To validate the classification, we compare plantations detected using our approach with existing datasets developed through visual interpretation. Based on random sampling and comparison with high-resolution images, the user’s accuracy and producer’s accuracy of our generated map are around 85% and 80% in our study region.

...read moreread less

DOI•

Analysis of tri-cum biserial bulk queue model connected with a common server

[...]

Sachin Kumar Agrawal, Vipin Kumar, Brahma Nand Agrawal

10 Dec 2020

TL;DR: The present paper demonstrates the tri-cum biserial bulk queue model linked with a common server with fixed batch size, postulated to follow the Poisson law, which shows the adequacy of the current arrangement procedure.

...read moreread less

Abstract: The present paper demonstrates the tri-cum biserial bulk queue model linked with a common server with fixed batch size. The development of the model has been done in the steady-state condition. The arrival and servicing patterns of the customers are postulated to follow the Poisson law. Various queuing model performances have been assessed by using the probability generating function technique and other statistical tools. The broad parametric examination has been documented to show the adequacy of the current arrangement procedure.

...read moreread less

Patent•

Opportunistic offlining for faulty devices in datacenters

[...]

Agarwal Saurabh¹, Goh Koon Hui Geoffrey¹, Yaqoob Asad, Zhou Shandan, Subramanian Karthikeyan, Natarajan Gowtham, Vipin Kumar - Show less +3 more•Institutions (1)

Microsoft¹

23 Jan 2020

TL;DR: In this article, the authors consider the potential impact on availability of tenant VMs, unused capacity of the datacenter, a number or ratio of unavailable hosts on the RDU, and other factors may be considered to make a balanced decision.

...read moreread less

Abstract: Embodiments relate to determining whether to take a resource distribution unit (RDU) of a datacenter offline when the RDU becomes faulty. RDUs in a cloud or datacenter supply a resource such as power, network connectivity, and the like to respective sets of hosts that provide computing resources to tenant units such as virtual machines (VMs). When an RDU becomes faulty some of the hosts that it supplies may continue to function and others may become unavailable for various reasons. This can make a decision of whether to take the RDU offline for repair difficult, since in some situations countervailing requirements of the datacenter may be at odds. To decide whether to take an RDU offline, the potential impact on availability of tenant VMs, unused capacity of the datacenter, a number or ratio of unavailable hosts on the RDU, and other factors may be considered to make a balanced decision.

...read moreread less

Proceedings Article•DOI•

Process Guided Deep Learning for Modeling Physical Systems: An Application in Lake Temperature Modeling

[...]

Xiaowei Jia¹, Jared Willard¹, Anuj Karpatne², Jordan S. Read³, Jacob A. Zwart³, Michael Steinbach¹, Vipin Kumar¹ - Show less +3 more•Institutions (3)

University of Minnesota¹, Virginia Tech², United States Geological Survey³

26 Sep 2020

TL;DR: In this article, the authors introduce a new paradigm that combines scientific knowledge within process-based models and machine learning models to advance scientific discovery in many physical systems, such as lake water temperature.

...read moreread less

Abstract: In this paper, we introduce a new paradigm that combines scientific knowledge within process-based models and machine learning models to advance scientific discovery in many physical systems. We will describe how to incorporate physical knowledge in real-world dynamical systems as additional constraints for training machine learning models and how to leverage the hidden knowledge encoded by existing process-based models. We evaluate this approach on modeling lake water temperature and demonstrate its superior performance using limited training data and the improved generalizability to different scenarios.

...read moreread less