scispace - formally typeset
Open AccessJournal ArticleDOI

'Big data', Hadoop and cloud computing in genomics

TLDR
An overview of cloud computing and big data technologies, and how such expertise can be used to deal with biology's big data sets is discussed, together with an overview of the current usage of Hadoop within the bioinformatics community.
About
This article is published in Journal of Biomedical Informatics.The article was published on 2013-10-01 and is currently open access. It has received 403 citations till now. The article focuses on the topics: Data-intensive computing & Big data.

read more

Citations
More filters
Book ChapterDOI

Cloud-Based Computing

TL;DR: This chapter aims to describe the basic concepts of cloud computing and identify its potential applications, benefits, and challenges in healthcare.
Proceedings ArticleDOI

Big data in genomics

TL;DR: Focus on Genomics in cancer testing whether the healthcare applications can scale well on commercial big data platforms that implement Map Reduce framework and short read gene data sequence alignment and assembly workloads in genome analysis and Apache Hadoop distributed parallelized data processing.
Book ChapterDOI

Scalable Reference Genome Assembly from Compressed Pan-Genome Index with Spark

TL;DR: A scalable distributed pipeline, PanGenSpark, for compressing and indexing pan-genomes and assembling a reference genome from thepan-genomic index is proposed and experimentally show the scalability of the Pan GenSpark with human pan- genomes in a distributed Spark cluster comprising 448 cores distributed to 26 computing nodes.
Journal ArticleDOI

Government Cloud Computing Policies: Potential Opportunities for Advancing Military Biomedical Research.

TL;DR: It is recommended that cloud computing be considered by DoD biomedical researchers for increasing connectivity, presumably by facilitating communications and data sharing, among the various intra- and extramural laboratories.
Journal ArticleDOI

The impact of COVID-19 on the protection of rural traditional village

TL;DR: This paper analyzes people's preference for Huizhou cultural resources to better realize the more effective and far-reaching development and exploitation of Huiz Zhou cultural resources.
References
More filters
Journal ArticleDOI

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.
Journal ArticleDOI

Cramming More Components Onto Integrated Circuits

TL;DR: Integrated circuits will lead to such wonders as home computers or at least terminals connected to a central computer, automatic controls for automobiles, and personal portable communications equipment as mentioned in this paper. But the biggest potential lies in the production of large systems.
Journal Article

Cramming More Components onto Integrated Circuits

Gordon E. Moore
- 01 Jan 1965 - 
TL;DR: Integrated circuits will lead to such wonders as home computers or at least terminals connected to a central computer, automatic controls for automobiles, and personal portable communications equipment as discussed by the authors. But the biggest potential lies in the production of large systems.
Book

Big data: The next frontier for innovation, competition, and productivity

James Manyika
TL;DR: The amount of data in the authors' world has been exploding, and analyzing large data sets will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus, according to research by MGI and McKinsey.
Journal ArticleDOI

Galaxy: A platform for interactive large-scale genome analysis

TL;DR: An interactive system, Galaxy, that combines the power of existing genome annotation databases with a simple Web portal to enable users to search remote resources, combine data from independent queries, and visualize the results.
Related Papers (5)