Author

Indira Gutierrez-Polo

Bio: Indira Gutierrez-Polo is an academic researcher from the University of Illinois at Urbana–Champaign. The author has contributed to research on the topics of Cyberinfrastructure and Web applications, has an h-index of 2, and has co-authored 2 publications receiving 22 citations.

Papers
Proceedings ArticleDOI
22 Jul 2018
TL;DR: Discusses some of the challenges encountered in designing and developing a system that can be easily adapted to different scientific areas, including support for large amounts of data, horizontal scaling of domain-specific preprocessing algorithms, and the ability to provide new data visualizations in the web browser.
Abstract: Clowder is an open source data management system to support data curation of long-tail data and metadata across multiple research domains and diverse data types. Institutions and labs can install and customize their own instance of the framework on local hardware or on remote cloud computing resources to provide a shared service to distributed communities of researchers. Data can be ingested directly from instruments or manually uploaded by users and then shared with remote collaborators using a web front end. We discuss some of the challenges encountered in designing and developing a system that can be easily adapted to different scientific areas, including digital preservation, geoscience, materials science, medicine, social science, cultural heritage, and the arts. Some of these challenges include support for large amounts of data, horizontal scaling of domain-specific preprocessing algorithms, the ability to provide new data visualizations in the web browser, a comprehensive web service API for automatic data ingestion and curation, a suite of social annotation and metadata management features to support data annotation by communities of users and algorithms, and a web-based front end for interacting with code running on heterogeneous clusters, including HPC resources.

20 citations
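The abstract above highlights a comprehensive web service API for automatic data ingestion and curation. As a rough illustration of that workflow, the Python sketch below creates a dataset, uploads a file, and attaches metadata over HTTP. The instance URL, API key, endpoint paths, and metadata body are assumptions made for illustration, not a verbatim Clowder API reference.

```python
# Hypothetical sketch: pushing an instrument file into a Clowder instance
# through its web service API, then attaching community metadata.
# Endpoint paths, parameter names, and the metadata envelope are assumed
# for illustration based on the paper's description of the API.
import requests

CLOWDER_URL = "https://clowder.example.org"   # placeholder instance
API_KEY = "my-api-key"                        # placeholder credential

def upload_and_annotate(path, dataset_name, metadata):
    params = {"key": API_KEY}
    # 1. Create a dataset to hold the incoming file (assumed endpoint).
    ds = requests.post(f"{CLOWDER_URL}/api/datasets/createempty",
                       params=params, json={"name": dataset_name}).json()
    # 2. Upload the raw file; preprocessing extractors can then run on it
    #    asynchronously on the server side.
    with open(path, "rb") as fh:
        up = requests.post(f"{CLOWDER_URL}/api/uploadToDataset/{ds['id']}",
                           params=params, files={"File": fh}).json()
    # 3. Attach descriptive metadata so collaborators can search and curate it
    #    (simplified body; a real deployment may expect a fuller JSON-LD envelope).
    requests.post(f"{CLOWDER_URL}/api/files/{up['id']}/metadata.jsonld",
                  params=params, json=metadata)
    return ds["id"], up["id"]

if __name__ == "__main__":
    upload_and_annotate("scan_001.tif", "Microscope run 2018-07-22",
                        {"content": {"instrument": "SEM-2", "operator": "igp"}})
```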

Proceedings ArticleDOI
01 Oct 2017
TL;DR: Describes the cyberinfrastructure implemented in Great Lakes Monitoring to address the bottleneck hydrologic scientists face when trying to collect relevant data.
Abstract: When hydrologic scientists look for data to use in models relevant to their research area, they encounter a bottleneck when trying to collect relevant information; sometimes they may not know of all the available sources. Great Lakes Monitoring (GLM) collects data from five different state and federal agencies that take measurements of nutrients, geochemicals, contaminants, and other water quality parameters that directly affect water health in the Great Lakes. In this paper, we describe the cyberinfrastructure that has been implemented to address this issue. The system cleans and organizes the data into a semi-standardized schema, ingests the data into a database, and provides a user interface for data visualization. In addition, trends for selected parameters can be observed on a map, and data can be searched, downloaded in various formats, and compared across different locations within the Great Lakes.

5 citations
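The GLM abstract describes cleaning and organizing heterogeneous agency data into a semi-standardized schema before ingesting it into a database. The sketch below shows one plausible shape of that step in Python; the agency field mappings, parameter names, units, and SQLite schema are invented for illustration and are not taken from the GLM codebase.

```python
# Illustrative sketch (not the GLM implementation): normalizing measurements
# from different agencies into one semi-standardized schema, then ingesting
# them into a database. Field names, units, and schema are hypothetical.
import sqlite3

# Each source agency reports the same kind of observation with its own
# column names; map them onto a shared schema.
FIELD_MAPS = {
    "epa":  {"site": "station_id", "param": "characteristic",  "value": "result_mg_l"},
    "usgs": {"site": "site_no",    "param": "parameter_name",  "value": "value"},
}

def normalize(agency, record):
    m = FIELD_MAPS[agency]
    return {
        "agency": agency,
        "site": record[m["site"]],
        "parameter": record[m["param"]].strip().lower(),  # e.g. "total phosphorus"
        "value": float(record[m["value"]]),
        "unit": "mg/L",
    }

def ingest(rows, db="glm.sqlite"):
    con = sqlite3.connect(db)
    con.execute("""CREATE TABLE IF NOT EXISTS measurements
                   (agency TEXT, site TEXT, parameter TEXT, value REAL, unit TEXT)""")
    con.executemany(
        "INSERT INTO measurements VALUES (:agency, :site, :parameter, :value, :unit)",
        rows)
    con.commit()
    con.close()

# Two records in different source formats end up in one queryable table.
ingest([normalize("epa",  {"station_id": "LK-MI-04", "characteristic": "Total Phosphorus", "result_mg_l": "0.012"}),
        normalize("usgs", {"site_no": "04095090",    "parameter_name": "Nitrate",          "value": "0.31"})])
```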


Cited by
Journal ArticleDOI

08 Dec 2001 - BMJ
TL;DR: There is, I think, something ethereal about i, the square root of minus one, which seemed an odd beast at first: an intruder hovering on the edge of reality.
Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

33,785 citations

Proceedings ArticleDOI
22 Jul 2018
TL;DR: The technical architecture for the TERRA-REF data and computing pipeline provides a suite of components to convert raw imagery to standard formats, geospatially subset data, and identify biophysical and physiological plant features related to crop productivity, resource use, and stress tolerance.
Abstract: The Transportation Energy Resources from Renewable Agriculture Phenotyping Reference Platform (TERRA-REF) provides a data and computation pipeline responsible for collecting, transferring, processing and distributing large volumes of crop sensing and genomic data from genetically informative germplasm sets. The primary source of these data is a field scanner system built over an experimental field at the University of Arizona Maricopa Agricultural Center. The scanner uses several different sensors to observe the field at a dense collection frequency with high resolution. These sensors include RGB stereo, thermal, pulse-amplitude modulated chlorophyll fluorescence, and imaging spectrometer cameras, a 3D laser scanner, and environmental monitors. In addition, data from sensors mounted on tractors and UAVs, from an indoor controlled-environment facility, and from manually collected measurements are integrated into the pipeline. Up to two TB of data per day are collected and transferred to the National Center for Supercomputing Applications at the University of Illinois (NCSA), where they are processed. In this paper we describe the technical architecture for the TERRA-REF data and computing pipeline. This modular and scalable pipeline provides a suite of components to convert raw imagery to standard formats, geospatially subset data, and identify biophysical and physiological plant features related to crop productivity, resource use, and stress tolerance. Derived data products are uploaded to the Clowder content management system and the BETYdb traits and yields database for querying, supporting research at the experimental plot level. All software is open source under a BSD 3-clause or similar license, and the data products are open access (currently for evaluation, with a full release in fall 2019). In addition, we provide computing environments in which users can explore data and develop new tools. The goal of this system is to enable scientists to evaluate and use data, create new algorithms, and advance the science of digital agriculture and crop improvement.

21 citations
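One pipeline capability named in the abstract is geospatially subsetting data down to the experimental plot level before feature extraction. The sketch below illustrates that kind of step using the rasterio library; the choice of library, the file names, and the plot bounds are assumptions for illustration, and the TERRA-REF extractors themselves are not reproduced here.

```python
# Hypothetical sketch of a plot-level geospatial subset: clip a georeferenced
# full-field image down to a single experimental plot. Paths and bounds are
# placeholders; rasterio is used here only as an example toolkit.
import rasterio
from rasterio.windows import from_bounds, transform as window_transform

def clip_to_plot(src_path, dst_path, bounds):
    """bounds = (left, bottom, right, top) in the raster's coordinate system."""
    with rasterio.open(src_path) as src:
        # Translate geographic bounds into a pixel window on the source image.
        window = from_bounds(*bounds, transform=src.transform)
        data = src.read(window=window)
        profile = src.profile.copy()
        # Shrink the output raster to the window and keep it georeferenced.
        profile.update(height=data.shape[1], width=data.shape[2],
                       transform=window_transform(window, src.transform))
    with rasterio.open(dst_path, "w", **profile) as dst:
        dst.write(data)

# Example: cut one plot out of a full-field RGB orthomosaic (made-up bounds).
clip_to_plot("field_rgb_geo.tif", "plot_042_rgb.tif",
             (409200.0, 3660100.0, 409204.0, 3660104.0))
```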

Proceedings ArticleDOI
22 Jul 2018
TL;DR: Discusses some of the challenges encountered in designing and developing a system that can be easily adapted to different scientific areas, including support for large amounts of data, horizontal scaling of domain-specific preprocessing algorithms, and the ability to provide new data visualizations in the web browser.
Abstract: Clowder is an open source data management system to support data curation of long-tail data and metadata across multiple research domains and diverse data types. Institutions and labs can install and customize their own instance of the framework on local hardware or on remote cloud computing resources to provide a shared service to distributed communities of researchers. Data can be ingested directly from instruments or manually uploaded by users and then shared with remote collaborators using a web front end. We discuss some of the challenges encountered in designing and developing a system that can be easily adapted to different scientific areas, including digital preservation, geoscience, materials science, medicine, social science, cultural heritage, and the arts. Some of these challenges include support for large amounts of data, horizontal scaling of domain-specific preprocessing algorithms, the ability to provide new data visualizations in the web browser, a comprehensive web service API for automatic data ingestion and curation, a suite of social annotation and metadata management features to support data annotation by communities of users and algorithms, and a web-based front end for interacting with code running on heterogeneous clusters, including HPC resources.

20 citations

Journal ArticleDOI
TL;DR: Presents a summary of recent developments in HPC and AI and describes specific advances that the authors of this article are spearheading to accelerate and streamline the use of HPC platforms to design and apply accelerated AI algorithms in academia and industry.
Abstract: Significant investments to upgrade and construct large-scale scientific facilities demand commensurate investments in R&D to design algorithms and computing approaches to enable scientific and engineering breakthroughs in the big data era. Innovative Artificial Intelligence (AI) applications have powered transformational solutions for big data challenges in industry and technology that now drive a multi-billion dollar industry, and which play an ever-increasing role in shaping human social patterns. As AI continues to evolve into a computing paradigm endowed with statistical and mathematical rigor, it has become apparent that single-GPU solutions for training, validation, and testing are no longer sufficient for computational grand challenges brought about by scientific facilities that produce data at a rate and volume that outstrip the computing capabilities of available cyberinfrastructure platforms. This realization has been driving the confluence of AI and high performance computing (HPC) to reduce time-to-insight and to enable a systematic study of domain-inspired AI architectures and optimization schemes for data-driven discovery. In this article, we present a summary of recent developments in this field and describe specific advances that the authors of this article are spearheading to accelerate and streamline the use of HPC platforms to design and apply accelerated AI algorithms in academia and industry.

18 citations
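The abstract argues that single-GPU training is no longer sufficient, motivating the convergence of AI and HPC. As a generic, hedged illustration of the kind of multi-GPU data-parallel training this implies, the sketch below uses PyTorch DistributedDataParallel on a single node; it is not the authors' code, and the model and dataset are stand-ins.

```python
# Generic sketch of single-node, multi-GPU data-parallel training with
# PyTorch DistributedDataParallel. The model and dataset are placeholders;
# typically launched with one process per GPU via torchrun.
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, TensorDataset, DistributedSampler

def train():
    # Rank and world size come from the environment set up by the launcher.
    dist.init_process_group("nccl")
    rank = dist.get_rank()  # on a single node this doubles as the GPU index
    torch.cuda.set_device(rank)

    model = DDP(torch.nn.Linear(128, 1).cuda(rank), device_ids=[rank])
    opt = torch.optim.SGD(model.parameters(), lr=1e-3)
    loss_fn = torch.nn.MSELoss()

    # Stand-in dataset; DistributedSampler shards it so each GPU sees a
    # disjoint slice of every epoch.
    data = TensorDataset(torch.randn(4096, 128), torch.randn(4096, 1))
    loader = DataLoader(data, batch_size=64, sampler=DistributedSampler(data))

    for epoch in range(3):
        loader.sampler.set_epoch(epoch)  # reshuffle shards each epoch
        for x, y in loader:
            x, y = x.cuda(rank), y.cuda(rank)
            opt.zero_grad()
            loss_fn(model(x), y).backward()  # gradients are all-reduced across GPUs
            opt.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    train()
```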

Journal ArticleDOI
TL;DR: A summary of recent developments in this field, and specific advances that the authors of this article are spearheading to accelerate and streamline the use of HPC platforms to design and apply accelerated AI algorithms in academia and industry, can be found in this paper.
Abstract: Significant investments to upgrade and construct large-scale scientific facilities demand commensurate investments in R&D to design algorithms and computing approaches to enable scientific and engineering breakthroughs in the big data era. Innovative Artificial Intelligence (AI) applications have powered transformational solutions for big data challenges in industry and technology that now drive a multi-billion dollar industry, and which play an ever-increasing role in shaping human social patterns. As AI continues to evolve into a computing paradigm endowed with statistical and mathematical rigor, it has become apparent that single-GPU solutions for training, validation, and testing are no longer sufficient for computational grand challenges brought about by scientific facilities that produce data at a rate and volume that outstrip the computing capabilities of available cyberinfrastructure platforms. This realization has been driving the confluence of AI and high performance computing (HPC) to reduce time-to-insight and to enable a systematic study of domain-inspired AI architectures and optimization schemes for data-driven discovery. In this article, we present a summary of recent developments in this field and describe specific advances that the authors of this article are spearheading to accelerate and streamline the use of HPC platforms to design and apply accelerated AI algorithms in academia and industry.

14 citations