scispace - formally typeset
Search or ask a question
Author

Jillian Rowe

Bio: Jillian Rowe is an academic researcher from New York University Abu Dhabi. The author has contributed to research in topics: Web application & Type 2 diabetes. The author has an hindex of 8, co-authored 11 publications receiving 746 citations. Previous affiliations of Jillian Rowe include Qatar Airways & Cornell University.

Papers
More filters
Journal ArticleDOI
TL;DR: The present Bioconda, a distribution of bioinformatics software for the lightweight, multi-platform and language-agnostic package manager Conda, improves analysis reproducibility by allowing users to define isolated environments with defined software versions.
Abstract: We present Bioconda (https://bioconda.github.io), a distribution of bioinformatics software for the lightweight, multi-platform and language-agnostic package manager Conda. Currently, Bioconda offers a collection of over 3000 software packages, which is continuously maintained, updated, and extended by a growing global community of more than 200 contributors. Bioconda improves analysis reproducibility by allowing users to define isolated environments with defined software versions, all of which are easily installed and managed without administrative privileges.

699 citations

Journal ArticleDOI
TL;DR: This study suggests that 1,5-AG in saliva can be used in national screening programs for undiagnosed diabetes, which are of particular interest for Middle Eastern countries with young populations and exceptionally high diabetes rates.
Abstract: Context: In most ethnicities at least a quarter of all cases with diabetes is assumed to be undiagnosed. Screening for diabetes using saliva has been suggested as an effective approach to identify affected individuals. Objective: The objective of the study was to identify a noninvasive metabolic marker of type 2 diabetes in saliva. Design and Setting: In a case-control study of type 2 diabetes, we used a clinical metabolomics discovery study to screen for diabetes-relevant metabolic readouts in saliva, using blood and urine as a reference. With a combination of three metabolomics platforms based on nontargeted mass spectrometry, we examined 2178 metabolites in saliva, blood plasma, and urine samples from 188 subjects with type 2 diabetes and 181 controls of Arab and Asian ethnicities. Results: We found a strong association of type 2 diabetes with 1,5-anhydroglucitol (1,5-AG) in saliva (P = 3.6 × 10−13). Levels of 1,5-AG in saliva highly correlated with 1,5-AG levels in blood and inversely correlated with ...

86 citations

Journal ArticleDOI
TL;DR: A broad metabolomics study in a clinical setting that reveals a complex network of biochemical dysregulation involving metabolites from different pathways of diabetes pathology, and provides a reference framework for future diabetes studies with metabolic endpoints is reported.
Abstract: Aims/hypothesis Metabolomics has opened new avenues for studying metabolic alterations in type 2 diabetes. While many urine and blood metabolites have been associated individually with diabetes, a complete systems view analysis of metabolic dysregulations across multiple biofluids and over varying timescales of glycaemic control is still lacking.

77 citations

Journal ArticleDOI
TL;DR: NASQAR (Nucleic acid SeQuence Analysis Resource) as discussed by the authors is a collection of custom and publicly available open-source web applications that make extensive use of a variety of R packages to provide interactive data analysis and visualization.
Abstract: As high-throughput sequencing applications continue to evolve, the rapid growth in quantity and variety of sequence-based data calls for the development of new software libraries and tools for data analysis and visualization Often, effective use of these tools requires computational skills beyond those of many researchers To ease this computational barrier, we have created a dynamic web-based platform, NASQAR (Nucleic Acid SeQuence Analysis Resource) NASQAR offers a collection of custom and publicly available open-source web applications that make extensive use of a variety of R packages to provide interactive data analysis and visualization The platform is publicly accessible at http://nasqarabudhabinyuedu/ Open-source code is on GitHub at https://githubcom/nasqar/NASQAR , and the system is also available as a Docker image at https://hubdockercom/r/aymanm/nasqarall NASQAR is a collaboration between the core bioinformatics teams of the NYU Abu Dhabi and NYU New York Centers for Genomics and Systems Biology NASQAR empowers non-programming experts with a versatile and intuitive toolbox to easily and efficiently explore, analyze, and visualize their Transcriptomics data interactively Popular tools for a variety of applications are currently available, including Transcriptome Data Preprocessing, RNA-seq Analysis (including Single-cell RNA-seq), Metagenomics, and Gene Enrichment

39 citations

Posted ContentDOI
21 Oct 2017-bioRxiv
TL;DR: The present Bioconda, a distribution of bioinformatics software for the lightweight, multiplatform and language-agnostic package manager Conda, improves analysis reproducibility by allowing users to define isolated environments with defined software versions.
Abstract: We present Bioconda (https://bioconda.github.io), a distribution of bioinformatics software for the lightweight, multi-platform and language-agnostic package manager, Conda. Currently, Bioconda offers a collection of over 2900 software tools, which are continuously maintained, updated, and extended by a growing global community of more than 200 contributors. Bioconda improves analysis reproducibility by allowing users to define isolated environments with defined software versions, all of which are easily installed and managed without administrative privileges.

37 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Key statistics on the current data contents and volume of downloads are outlined, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas are outlined.
Abstract: The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world’s largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.

5,735 citations

Journal ArticleDOI
TL;DR: An updated version of the popular Bioconductor package, clusterProfiler 4.0, which provides a universal interface for functional enrichment analysis in thousands of organisms based on internally supported ontologies and pathways as well as annotation data provided by users or derived from online databases.
Abstract: Functional enrichment analysis is pivotal for interpreting high-throughput omics data in life science. It is crucial for this type of tool to use the latest annotation databases for as many organisms as possible. To meet these requirements, we present here an updated version of our popular Bioconductor package, clusterProfiler 4.0. This package has been enhanced considerably compared with its original version published 9 years ago. The new version provides a universal interface for functional enrichment analysis in thousands of organisms based on internally supported ontologies and pathways as well as annotation data provided by users or derived from online databases. It also extends the dplyr and ggplot2 packages to offer tidy interfaces for data operation and visualization. Other new features include gene set enrichment analysis and comparison of enrichment results from multiple gene lists. We anticipate that clusterProfiler 4.0 will be applied to a wide range of scenarios across diverse organisms.

2,448 citations

Journal ArticleDOI
TL;DR: The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines and are freely available on GitHub under the permissive MIT licence, free for both noncommercial and commercial use.
Abstract: Background: SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. They include tools for file format conversion and manipulation, sorting, querying, statistics, variant calling, and effect analysis amongst other methods. Findings: The first version appeared online 12 years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines. Conclusion: Both SAMtools and BCFtools are freely available on GitHub under the permissive MIT licence, free for both non-commercial and commercial use. Both packages have been installed >1 million times via Bioconda. The source code and documentation are available from https://www.htslib.org.

2,448 citations

Journal ArticleDOI
TL;DR: NanoPack, a set of tools developed for visualization and processing of long‐read sequencing data from Oxford Nanopore Technologies and Pacific Biosciences, is described.
Abstract: Summary Here we describe NanoPack, a set of tools developed for visualization and processing of long-read sequencing data from Oxford Nanopore Technologies and Pacific Biosciences. Availability and implementation The NanoPack tools are written in Python3 and released under the GNU GPL3.0 License. The source code can be found at https://github.com/wdecoster/nanopack, together with links to separate scripts and their documentation. The scripts are compatible with Linux, Mac OS and the MS Windows 10 subsystem for Linux and are available as a graphical user interface, a web service at http://nanoplot.bioinf.be and command line tools. Supplementary information Supplementary data are available at Bioinformatics online.

1,296 citations

Journal ArticleDOI
TL;DR: New functionalities and major improvements of the BUSCO software are presented, as well as the renewal and expansion of the underlying datasets in sync with the OrthoDB v10 release.
Abstract: Methods for evaluating the quality of genomic and metagenomic data are essential to aid genome assembly procedures and to correctly interpret the results of subsequent analyses. BUSCO estimates the completeness and redundancy of processed genomic data based on universal single-copy orthologs. Here, we present new functionalities and major improvements of the BUSCO software, as well as the renewal and expansion of the underlying data sets in sync with the OrthoDB v10 release. Among the major novelties, BUSCO now enables phylogenetic placement of the input sequence to automatically select the most appropriate BUSCO data set for the assessment, allowing the analysis of metagenome-assembled genomes of unknown origin. A newly introduced genome workflow increases the efficiency and runtimes especially on large eukaryotic genomes. BUSCO is the only tool capable of assessing both eukaryotic and prokaryotic species, and can be applied to various data types, from genome assemblies and metagenomic bins, to transcriptomes and gene sets.

1,181 citations