Searching for SNPs with cloud computing
Citations
10,056 citations
Additional excerpts
...Comparison of this calling pipeline to Crossbow To calibrate the additional value of the tools described here, we contrasted our results with SNPs called on our raw NA12878 exome data using Crossbo...
[...]
1,128 citations
Additional excerpts
...Variant calling on 2 billion genomes per year, with 100,000 CPUs in parallel, would require methods that process 2 genomes per CPU-hour, three-to-four orders of magnitude faster than current capabilities [42]....
[...]
970 citations
Cites methods from "Searching for SNPs with cloud compu..."
...Sequence reads of 25 bp for ChIP-Seq and 36 bp for RNA-Seq were generated from an Illumina Genome Analyzer, mapped to mouse genome (mm8) by using Bowtie (Langmead et al., 2009)....
[...]
958 citations
612 citations
References
20,335 citations
"Searching for SNPs with cloud compu..." refers methods in this paper
...For alignment, Crossbow uses Bowtie [17], which employs a Burrows-Wheeler index [25] based on the full-text minute-space (FM) index [26] to enable fast and memory-efficient alignment of short reads to mammalian genomes....
[...]
...We present Crossbow, a Hadoop-based software tool that combines the speed of the short read aligner Bowtie [17] with the accuracy of the SNP caller SOAPsnp [18] to perform alignment and SNP detection for multiple whole-human datasets per day....
[...]
20,309 citations
12,293 citations
"Searching for SNPs with cloud compu..." refers background in this paper
...Technologies from Illumina (San Diego, CA, USA), Applied Biosystems (Foster City, CA, USA) and 454 Life Sciences (Branford, CT, USA) have been used to detect genomic variations among humans [1-5], to profile methylation patterns [6], to map DNA-protein interactions [7], and to identify differentially expressed genes and novel splice junctions [8,9]....
[...]
11,473 citations
"Searching for SNPs with cloud compu..." refers background in this paper
...Technologies from Illumina (San Diego, CA, USA), Applied Biosystems (Foster City, CA, USA) and 454 Life Sciences (Branford, CT, USA) have been used to detect genomic variations among humans [1-5], to profile methylation patterns [6], to map DNA-protein interactions [7], and to identify differentially expressed genes and novel splice junctions [8,9]....
[...]
6,449 citations
"Searching for SNPs with cloud compu..." refers methods in this paper
...Positions for known SNPs were calculated according to data in dbSNP [28] versions 128 and 130, and allele frequencies were calculated according to data from the HapMap project [22]....
[...]
...Files containing known SNP locations and allele frequencies derived from dbSNP [28] are distributed to worker nodes via the same mechanism used to Crossbow workflow Figure 2 Crossbow workflow....
[...]