scispace - formally typeset
Book ChapterDOI

A fast and accurate heuristic for the single individual snp haplotyping problem with many gaps, high reading error rate and low coverage

Reads0
Chats0
TLDR
A new heuristic method is described that is able to tackle the case of many gapped fragments and retains its effectiveness even when the input fragments have high rate of reading errors and low coverage.
Abstract
Single nucleotide polymorphism (SNP) is the most frequent form of DNA variation. The set of SNPs present in a chromosome (called the haplotype) is of interest in a wide area of applications in molecular biology and biomedicine, including diagnostic and medical therapy. In this paper we propose a new heuristic method for the problem of haplotype reconstruction for (portions of ) a pair of homologous human chromosomes from a single individual (SIH). The problem is well known in literature and exact algorithms have been proposed for the case when no (or few) gaps are allowed in the input fragments. These algorithms, though exact and of polynomial complexity, are slow in practice. Therefore fast heuristics have been proposed. In this paper we describe a new heuristic method that is able to tackle the case of many gapped fragments and retains its effectiveness even when the input fragments have high rate of reading errors (up to 20%) and low coverage (as low as 3). We test our method on real data from the HapMap Project.

read more

Citations
More filters
Journal ArticleDOI

RadixHap: a radix tree-based heuristic for solving the single individual haplotyping problem

TL;DR: A greedy approach to reconstruct reliable Single Individual Haplotypes, named RadixHap, is introduced, to handle data sets with high error rates and the experimental results show that RadxHap can generate highly reliable results in most cases.
Proceedings ArticleDOI

A Hopfield-Type Neural Network for Haplotype Assembly Problem

TL;DR: A Hopfield-type neural network based on the minimum fragment removal (MFR) model, which constructs a pair of haplotypes by deleting minimum fragments from the data set so that left fragments can be classified into two sets in which there is no confliction.
Journal ArticleDOI

Examination of The Students' Mistakes of Oral Reading

TL;DR: In this article, the authors examined the level of oral reading of the students studying on their 3rd degree, who are at the age range of 60-66 months, and the students who completed their 72nd month and started primary school.
Book ChapterDOI

Reconstruction of Infectious Bronchitis Virus Quasispecies from NGS Data

TL;DR: A computational pipeline for quasispecies (closely related variants to ancestral genome) reconstruction consisting of 3 phases, which shows that varying the parameter settings gets better results in terms of Average Distance to Clones, and Average Prediction Error.
Journal ArticleDOI

Matrix Completion and Performance Guarantees for Single Individual Haplotyping

TL;DR: A binary matrix factorization formulation of the single individual haplotyping problem is considered and shown to outperform existing methods when applied to synthetic as well as real-world Fosmid-based HapMap NA12878 datasets.
References
More filters
Journal ArticleDOI

A haplotype map of the human genome

John W. Belmont, +232 more
TL;DR: A public database of common variation in the human genome: more than one million single nucleotide polymorphisms for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted.
Book ChapterDOI

SNPs Problems, Complexity, and Algorithms

TL;DR: It is shown that the general SNPs Haplotyping Problem is NP-hard for mate-pairs assembly data, and polynomial time algorithms for fragment assembly data are designed, and the Minimum SNPs Removal problem amounts to finding the largest independent set in a weakly triangulated graph.
Journal ArticleDOI

Haplotype reconstruction from SNP fragments by minimum error correction

TL;DR: To improve the MEC model for haplotype reconstruction, a new computational model is proposed, which simultaneously employs genotype information of an individual in the process of SNP correction, and is called MEC with genotypes information (shortly, MEC/GI).
Book ChapterDOI

Fast hare: A fast heuristic for single individual SNP haplotype reconstruction

TL;DR: A simple heuristic is introduced and it is proved experimentally that is very fast and accurate and when compared with a dynamic programming of [8] it is much faster and also more accurate.
Related Papers (5)