scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Median network analysis of defectively sequenced entire mitochondrial genomes from early and contemporary disease studies.

TL;DR: The quality of complete mtDNA sequences generated in the medical field in the past few years is somewhat unsatisfactory and may even fall behind that of pioneer manual sequencing in the early nineties.
Abstract: Sequence analysis of the mitochondrial genome has become a routine method in the study of mitochondrial diseases. Quite often, the sequencing efforts in the search of pathogenic or disease-associated mutations are affected by technical and interpretive problems, caused by sample mix-up, contamination, biochemical problems, incomplete sequencing, misdocumentation and insufficient reference to previously published data. To assess data quality in case studies of mitochondrial diseases, it is recommended to compare any mtDNA sequence under consideration to their phylogenetically closest lineages available in the Web. The median network method has proven useful for visualizing potential problems with the data. We contrast some early reports of complete mtDNA sequences to more recent total mtDNA sequencing efforts in studies of various mitochondrial diseases. We conclude that the quality of complete mtDNA sequences generated in the medical field in the past few years is somewhat unsatisfactory and may even fall behind that of pioneer manual sequencing in the early nineties. Our study provides a paradigm for an a posteriori evaluation of sequence quality and for detection of potential problems with inferring a pathogenic status of a particular mutation.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: HaploGrep is presented, a fast, reliable and straight‐forward algorithm implemented in a Web application to determine the haplogroup affiliation of thousands of mtDNA profiles genotyped for the entire mtDNA or any part of it.
Abstract: An ongoing source of controversy in mitochondrial DNA (mtDNA) research is based on the detection of numerous errors in mtDNA profiles that led to erroneous conclusions and false disease associations. Most of these controversies could be avoided if the samples' haplogroup status would be taken into consideration. Knowing the mtDNA haplogroup affiliation is a critical prerequisite for studying mechanisms of human evolution and discovering genes involved in complex diseases, and validating phylogenetic consistency using haplogroup classification is an important step in quality control. However, despite the availability of Phylotree, a regularly updated classification tree of global mtDNA variation, the process of haplogroup classification is still time-consuming and error-prone, as researchers have to manually compare the polymorphisms found in a population sample to those summarized in Phylotree, polymorphism by polymorphism, sample by sample. We present HaploGrep, a fast, reliable and straight-forward algorithm implemented in a Web application to determine the haplogroup affiliation of thousands of mtDNA profiles genotyped for the entire mtDNA or any part of it. HaploGrep uses the latest version of Phylotree and offers an all-in-one solution for quality assessment of mtDNA profiles in clinical genetics, population genetics and forensics. HaploGrep can be accessed freely at http://haplogrep.uibk.ac.at.

472 citations


Cites background or methods from "Median network analysis of defectiv..."

  • ...A ‘‘phylogenetic bookkeeping of mutations’’ [Bandelt et al., 2009], that is, a systematic mutation by mutation comparison along known evolutionary pathways, as seen in Figure 4, would have helped to identify missed mutations and to check for phantom mutations on the list of ‘‘remaining’’ mutations,…...

    [...]

  • ...These especially recommend double evaluation of mtDNA sequences, checking for phylogenetic consistency using haplogroup classification and the construction of quasi-median networks for the detection of phantom mutations [Bandelt et al., 2009; Brandstätter et al., 2006a]....

    [...]

  • ...These especially recommend double evaluation of mtDNA sequences, checking for phylogenetic consistency using haplogroup classification and the construction of quasi-median networks for the detection of phantom mutations [Bandelt et al., 2009; Brandstätter et al., 2006a]....

    [...]

  • ...A ‘‘phylogenetic bookkeeping of mutations’’ [Bandelt et al., 2009], that is, a systematic mutation by mutation comparison along known evolutionary pathways, as seen in Figure 4, would have helped to identify missed mutations and to check for phantom mutations on the list of ‘‘remaining’’ mutations, which were marked as ‘‘global private mutation....

    [...]

  • ...…2005b; Yao et al., 2003] has led to formulation of guidelines, which especially recommend checking for phylogenetic consistency using haplogroup classification and the construction of quasi-median networks for the detection of phantom mutations [Bandelt et al., 2001, 2009; Salas et al., 2005a]....

    [...]

Journal ArticleDOI
TL;DR: In this paper, the authors evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing.
Abstract: Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64 mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics.

159 citations

01 Jan 2013
TL;DR: Evaluated the performance of mtGenome sequencing using the Personal Genome Machine and compared the resulting haplotypes directly with conventional Sanger-type sequencing and the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes.
Abstract: A B S T R A C T Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64 mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics.

150 citations

Journal ArticleDOI
TL;DR: Since the first sequencing of the complete human mtDNA genome, both the sequencing techniques and the quality of commercial kits have improved greatly, but the overall mtDNA data quality is still far from satisfactory.
Abstract: To the Editor: Since the first sequencing of the complete human mtDNA genome,1 both the sequencing techniques and the quality of commercial kits have improved greatly. This has led to a growing number of reports for complete mtDNA sequences from the fields of molecular anthropology, medical genetics, and forensic science; and there are now over 6700 complete or near-complete mtDNA sequences available for study.2 However, in comparison to the pioneer manual-sequencing efforts in the early nineties, the overall mtDNA data quality, especially in the medical field, is still far from satisfactory.

74 citations

Journal ArticleDOI
TL;DR: It is preferred to maintain the rCRS and to report mtDNA profiles by employing the hitherto predominant circumfix style, to guard against misunderstandings about reporting mtDNA variation.
Abstract: Since the determination in 1981 of the sequence of the human mitochondrial DNA (mtDNA) genome, the Cambridge Reference Sequence (CRS), has been used as the reference sequence to annotate mtDNA in molecular anthropology, forensic science and medical genetics. The CRS was eventually upgraded to the revised version (rCRS) in 1999. This reference sequence is a convenient device for recording mtDNA variation, although it has often been misunderstood as a wild-type (WT) or consensus sequence by medical geneticists. Recently, there has been a proposal to replace the rCRS with the so-called Reconstructed Sapiens Reference Sequence (RSRS). Even if it had been estimated accurately, the RSRS would be a cumbersome substitute for the rCRS, as the new proposal fuses—and thus confuses—the two distinct concepts of ancestral lineage and reference point for human mtDNA. Instead, we prefer to maintain the rCRS and to report mtDNA profiles by employing the hitherto predominant circumfix style. Tree diagrams could display mutations by using either the profile notation (in conventional short forms where appropriate) or in a root-upwards way with two suffixes indicating ancestral and derived nucleotides. This would guard against misunderstandings about reporting mtDNA variation. It is therefore neither necessary nor sensible to change the present reference sequence, the rCRS, in any way. The proposed switch to RSRS would inevitably lead to notational chaos, mistakes and misinterpretations.

69 citations

References
More filters
Journal ArticleDOI
09 Apr 1981
TL;DR: The complete sequence of the 16,569-base pair human mitochondrial genome is presented and shows extreme economy in that the genes have none or only a few noncoding bases between them, and in many cases the termination codons are not coded in the DNA but are created post-transcriptionally by polyadenylation of the mRNAs.
Abstract: The complete sequence of the 16,569-base pair human mitochondrial genome is presented. The genes for the 12S and 16S rRNAs, 22 tRNAs, cytochrome c oxidase subunits I, II and III, ATPase subunit 6, cytochrome b and eight other predicted protein coding genes have been located. The sequence shows extreme economy in that the genes have none or only a few noncoding bases between them, and in many cases the termination codons are not coded in the DNA but are created post-transcriptionally by polyadenylation of the mRNAs.

8,783 citations

Journal ArticleDOI
01 Oct 1995-Genetics
TL;DR: A novel network approach for portraying mtDNA relationships is proposed that reduces the complexity of the network by identifying parallelisms and is guided by a compatibility argument and an additional source of phylogenetic information: the frequencies of the mitochondrial haplotypes.
Abstract: Analysis of variation in the hypervariable region of mitochondrial DNA (mtDNA) has emerged as an important tool for studying human evolution and migration. However, attempts to reconstruct optimal intraspecific mtDNA phylogenies frequently fail because parallel mutation events partly obscure the true evolutionary pathways. This makes it inadvisable to present a single phylogenetic tree at the expense of neglecting equally acceptable ones. As an alternative, we propose a novel network approach for portraying mtDNA relationships. For small sample sizes (< approximately 50), an unmodified median network contains all most parsimonious trees, displays graphically the full information content of the sequence data, and can easily be generated by hand. For larger sample sizes, we reduce the complexity of the network by identifying parallelisms. This reduction procedure is guided by a compatibility argument and an additional source of phylogenetic information: the frequencies of the mitochondrial haplotypes. As a spin-off, our approach can also assist in identifying sequencing errors, which manifest themselves in implausible network substructures. We illustrate the advantages of our approach with several examples from existing data sets.

1,092 citations

Journal ArticleDOI
13 May 2005-Science
TL;DR: It is shown that mitochondrial DNA variation in isolated “relict” populations in southeast Asia supports the view that there was only a single dispersal from Africa, most likely via a southern coastal route, through India and onward into southeast Asia and Australasia.
Abstract: A recent dispersal of modern humans out of Africa is now widely accepted, but the routes taken across Eurasia are still disputed. We show that mitochondrial DNA variation in isolated “relict” populations in southeast Asia supports the view that there was only a single dispersal from Africa, most likely via a southern coastal route, through India and onward into southeast Asia and Australasia. There was an early offshoot, leading ultimately to the settlement of the Near East and Europe, but the main dispersal from India to Australia ∼65,000 years ago was rapid, most likely taking only a few thousand years.

696 citations

Journal ArticleDOI
TL;DR: It is shown that the main indigenous North African cluster is a sister group to the most ancient cluster of European mtDNAs, from which it diverged approximately 50,000 years ago.
Abstract: Variation in the human mitochondrial genome (mtDNA) is now routinely described and used to infer the histories of peoples, by means of one of two procedures, namely, the assaying of RFLPs throughout the genome and the sequencing of parts of the control region (CR). Using 95 samples from the Near East and northwest Caucasus, we present an analysis based on both systems, demonstrate their concordance, and, using additional available information, present the most refined phylogeny to date of west Eurasian mtDNA. We describe and apply a nomenclature for mtDNA clusters. Hypervariable nucleotides are identified, and the relative mutation rates of the two systems are evaluated. We point out where ambiguities remain. The identification of signature mutations for each cluster leads us to apply a hierarchical scheme for determining the cluster composition of a sample of Berber speakers, previously analyzed only for CR variation. We show that the main indigenous North African cluster is a sister group to the most ancient cluster of European mtDNAs, from which it diverged ∼50,000 years ago.

630 citations