A fast and accurate heuristic for the single individual snp haplotyping problem with many gaps, high reading error rate and low coverage
Citations
125 citations
Cites methods from "A fast and accurate heuristic for t..."
...Two more algorithms are mentioned in (23), a randomized one called SHRThree (31), and SpeedHap (32) which tries to build first a core solution with variants and fragments with full agreement and evidence of presence of the two alleles for each variant, and then includes the remaining fragments and variants by relaxing constraints....
[...]
...These blocks were used as the input for eight SIH algorithms (namely ReFHap, HapCUT, FastHare, DGS, MLF, 2d-MEC, SHRThree and SpeedHap)....
[...]
73 citations
67 citations
Cites methods from "A fast and accurate heuristic for t..."
...Computational properties of these problems have been analyzed by [16, 15] and several algorithms have been proposed for MEC [1, 6, 23, 26]....
[...]
...A practical exact algorithm for the individual haplotyping problem MEC/GI....
[...]
...The input for this test case is a matrix of 32347 SNPs covered by Table 2: MEC percentage and running time of ReFHap and HapCUT for a real instance with 32347 SNPs and 13905 fragments in chromosome 22 ReFHap HapCUT (1 It) HapCUT (50 It) %MEC 6.32% 6.26% 6.24% Time 73.04 Sec 0.99 Hours 50.4 Hours 13905 fragments....
[...]
...The .rst one is the Minimum Error Correction (MEC), which is the minimum number of changes within the matrix to make it consistent with the answer haplotypes....
[...]
...ReFHap consistently produces lower MEC and switch errors....
[...]
45 citations
Cites methods from "A fast and accurate heuristic for t..."
...Other methods have proposed heuristics (Genovese et al., 2007; Alessandro Panconesi, 2004), but do not...
[...]
18 citations
References
5,479 citations
2,908 citations
213 citations
"A fast and accurate heuristic for t..." refers background or methods in this paper
...MFR is NP-hard for fragments with at most 1 gap, and MSR is NP-hard for fragments with at most 2 gaps [7]....
[...]
...This problem has been tackled both from a theoretical point of view [1, 3, 4, 7, 13] and from a more practical one [8, 11, 14]....
[...]
...In previous papers [7, 11] experiments were based on SNP matrices obtained from the fragmentation of arti cially generated haplotype data....
[...]
...At this point the strings are split in fragments by selecting iteratively the next cut point at an integer distance from the previous one chosen uniformly at random in the range [3, 7], starting from the rst base....
[...]
...Each fragment covers a number of SNP's in the range roughly [3, 7], thus we chose the length of each fragment in this range....
[...]
145 citations
"A fast and accurate heuristic for t..." refers background or methods in this paper
...As future work we plan a comparison of our method with the one in [14]....
[...]
...We are not aware of any publicly available implementation of the methods described in [8, 11, 14, 16], therefore we chose as baseline the method in [11] that is comparable to ours in terms of speed, and does not rely on any statistical model....
[...]
...This problem has been tackled both from a theoretical point of view [1, 3, 4, 7, 13] and from a more practical one [8, 11, 14]....
[...]
...[14] describe a Genetic Algorithm for this problem that in some reported experiments gives good performance for short haplotypes (about 100 SNPs)....
[...]
136 citations