scispace - formally typeset
Search or ask a question

Showing papers on "Sanger sequencing published in 2004"


Patent
28 Jan 2004
TL;DR: An apparatus and method for performing rapid DNA sequencing, such as genomic sequencing, is described in this paper, which includes the steps of preparing a sample DNA for sequencing, amplifying the prepared DNA in a representative manner, and performing multiple sequencing reaction on the amplified DNA with only one primer hybridization step.
Abstract: An apparatus and method for performing rapid DNA sequencing, such as genomic sequencing, is provided herein The method includes the steps of preparing a sample DNA for genomic sequencing, amplifying the prepared DNA in a representative manner, and performing multiple sequencing reaction on the amplified DNA with only one primer hybridization step

778 citations


Journal ArticleDOI
TL;DR: Various novel sequencing technologies are being developed, each aspiring to reduce costs to the point at which the genomes of individual humans could be sequenced as part of routine health care.
Abstract: Nearly three decades have passed since the invention of electrophoretic methods for DNA sequencing. The exponential growth in the cost-effectiveness of sequencing has been driven by automation and by numerous creative refinements of Sanger sequencing, rather than through the invention of entirely new methods. Various novel sequencing technologies are being developed, each aspiring to reduce costs to the point at which the genomes of individual humans could be sequenced as part of routine health care. Here, we review these technologies, and discuss the potential impact of such a 'personal genome project' on both the research community and on society.

620 citations


Journal ArticleDOI
TL;DR: Analysis of 16S rDNA amplicons by MALDI-TOF MS after base-specific cleavage of RNA transcripts allowed fast and reliable identification of the Mycobacterium tuberculosis complex and ubiquitous mycobacteria (mycob bacteria other than tuberculosis).
Abstract: Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) after base-specific cleavage of PCR amplified and in vitro-transcribed 16S rRNA gene (rDNA) was used for the identification of mycobacteria. Full-length 16S rDNA reference sequences of 12 type strains of Mycobacterium spp. frequently isolated from clinical specimens were determined by PCR, cloning, and sequencing. For MALDI-TOF MS-based comparative sequence analysis, mycobacterial 16S rDNA signature sequences (∼500 bp) of the 12 type strains and 24 clinical isolates were PCR amplified using RNA promoter-tagged forward primers. T7 RNA polymerase-mediated transcription of forward strands in the presence of 5-methyl ribo-CTP maximized mass differences of fragments generated by base-specific cleavage. In vitro transcripts were subsequently treated with RNase T1, resulting in G-specific cleavage. Sample analysis by MALDI-TOF MS showed a specific mass signal pattern for each of the 12 type strains, allowing unambiguous identification. All 24 clinical isolates were identified unequivocally by comparing their detected mass signal pattern to the reference sequence-derived in silico pattern of the type strains and to the in silico mass patterns of published 16S rDNA sequences. A 16S rDNA microheterogeneity of the Mycobacterium xenopi type strain (DSM 43995) was detected by MALDI-TOF MS and later confirmed by Sanger dideoxy sequencing. In conclusion, analysis of 16S rDNA amplicons by MS after base-specific cleavage of RNA transcripts allowed fast and reliable identification of the Mycobacterium tuberculosis complex and ubiquitous mycobacteria (mycobacteria other than tuberculosis). The technology delivers an open platform for high-throughput microbial identification on the basis of any specific genotypic marker region.

91 citations


Journal ArticleDOI
TL;DR: The results showed that the method could be a useful tool for the detection of lamivudine resistance in the clinical setting and experiments with mixtures of wild-type and resistant viruses indicated that pyrosequencing can detect minor sequence variants in heterogeneous virus populations.
Abstract: Chronic hepatitis B virus (HBV) infection can cause severe liver disease, including cirrhosis and hepatocellular carcinoma. Lamivudine is a relatively recent alternative to alpha interferon for the treatment of HBV infection, but unfortunately, resistance to lamivudine commonly develops during monotherapy. Lamivudine-resistant HBV mutants display specific mutations in the YMDD (tyrosine, methionine, aspartate, aspartate) motif of the viral polymerase (reverse transcriptase [rt]), which is the catalytic site of the enzyme, i.e., methionine 204 to isoleucine (rtM204I) or valine (rtM204V). The latter mutation is often accompanied by a compensatory leucine-to-methionine change at codon 180 (rtL180M). In the present study, a novel sequencing method, pyrosequencing, was applied to the detection of lamivudine resistance mutations and was compared with direct Sanger sequencing. The new pyrosequencing method had advantages in terms of throughput. Experiments with mixtures of wild-type and resistant viruses indicated that pyrosequencing can detect minor sequence variants in heterogeneous virus populations. The new pyrosequencing method was evaluated with a small number of patient samples, and the results showed that the method could be a useful tool for the detection of lamivudine resistance in the clinical setting.

77 citations



Journal ArticleDOI
TL;DR: A new method for DNA sequencing using base-specific cleavage and mass spectrometry that appears to be a promising alternative to classical DNA sequencing approaches is proposed and application of this method to simulated data indicates that it might be capable of sequencing DNA molecules with 200+ nt.
Abstract: One of the main endeavors in today's life science remains the efficient sequencing of long DNA molecules. Today, most de novo sequencing of DNA is still performed using the electrophoresis-based Sanger concept of 1977, in spite of certain restrictions of this method. Methods using mass spectrometry to acquire the Sanger sequencing data are limited by short sequencing lengths of 15-25 nt. We propose a new method for DNA sequencing using base-specific cleavage and mass spectrometry that appears to be a promising alternative to classical DNA sequencing approaches. A single stranded DNA or RNA molecule is cleaved by a base-specific (bio-)chemical reaction using, for example, RNAses. The cleavage reaction is modified such that not all, but only a certain percentage of bases are cleaved. The resulting mixture of fragments is then analyzed using MALDI-TOF mass spectrometry, whereby we acquire the molecular masses of fragments. For every peak in the mass spectrum, we calculate those base compositions that will potentially create a peak of the observed mass and, repeating the cleavage reaction for all four bases, finally try to uniquely reconstruct the underlying sequence from these observed spectra. This leads us to the combinatorial problem of sequencing from compomers and, finally, to the graph-theoretical problem of finding a walk in a subgraph of the de Bruijn graph. Application of this method to simulated data indicates that it might be capable of sequencing DNA molecules with 200+ nt.

39 citations


Journal ArticleDOI
TL;DR: The fast and high-throughput pyrosequencing technology can be used for molecular epidemiological characterization of N. gonorrhoeae and provides evidence of a high discriminatory ability, practically the same as that for sequencing of the entire porB gene.
Abstract: For prevention and control of gonorrhea, an objective, highly discriminating, and reproducible molecular epidemiological characterization of Neisseria gonorrhoeae is essential. In the present study, in pursuance of providing such qualities, pyrosequencing technology, a fast real-time DNA sequence analysis, was applied to six short, highly polymorphic porB gene segments, with subsequent genetic variant (genovar) determination of the bacterial isolates. The sequencing templates were obtained by real-time PCR amplification, which also included fluorescence melting curve analysis of the entire porB gene in order to determine the genogroup (porB1a or porB1b allele) prior to pyrosequencing analysis. The PSQ 96 MA system used allowed rapid (in approximately 1.5 h) determination of 96 sequences of 20 to 65 correct nucleotides each. The results were reproducible and mostly in concordance with the results of conventional Sanger dideoxy sequencing, with the exception of shorter read lengths and some uncertainty in determining the correct number of identical nucleotides in homopolymeric segments. The number of sequence variants identified in each of the six highly polymorphic segments of the porB1a and porB1b alleles (encoding surface-exposed amino acid loops of the mature PorB protein) ranged from 5 to 11 and from 8 to 39, respectively. Among porB1a isolates (n = 22) and porB1b isolates (n = 65), 22 and 64 unique genovars, respectively, were identified. All isolates were typeable. The present results provide evidence of a high discriminatory ability, practically the same as that for sequencing of the entire porB gene. In conclusion, the fast and high-throughput pyrosequencing technology can be used for molecular epidemiological characterization of N. gonorrhoeae.

31 citations


Journal ArticleDOI
TL;DR: In this article, the authors proposed a novel approach to extract target DNA sequence from variants generated by introduction of random mutations, which can be used to read previously intractable poly(A), AT-rich and GC-rich motifs.
Abstract: Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.

30 citations


Journal ArticleDOI
TL;DR: The objective was to identify TP53 mutations in 138 stage I–IV colorectal adenocarcinomas and liver metastases without first enriching for tumor cells by microdissection, and a harmonized protocol involving multiplex polymerase chain reaction/ligase detection reaction (PCR/LDR) with Universal DNA microarray analysis and endonuclease V/ligases mutation scanning was developed.
Abstract: Both the mutational status and the specific mutation of TP53 (p53) have been shown to impact both tumor prognosis and response to therapies. Molecular profiling of solid tumors is confounded by infiltrating wild-type cells, since normal DNA can interfere with detection of mutant sequences. Our objective was to identify TP53 mutations in 138 stage I-IV colorectal adenocarcinomas and liver metastases without first enriching for tumor cells by microdissection. To achieve this, we developed a harmonized protocol involving multiplex polymerase chain reaction/ligase detection reaction (PCR/LDR) with Universal DNA microarray analysis and endonuclease V/ligase mutation scanning. Sequences were verified using dideoxy sequencing. The harmonized protocol detected all 66 mutations. Dideoxy sequencing detected 41 out of 66 mutations (62%) using automated reading, and 59 out of 66 mutations (89%) with manual reading. Data analysis comparing colon cancer entries in the TP53 database (http://p53.curie.fr) with the results reported in this study showed that distribution of mutations and the mutational events were comparable.

25 citations


Journal Article
TL;DR: The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.
Abstract: Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.

20 citations



Journal ArticleDOI
TL;DR: SPC-sequencing is compared with electrophoresis-based sequencing in characterizing the above BRCA1 mutations, demonstrating the significant advantage offered by SPC- sequencing for the accurate identification of frameshift mutations.
Abstract: Deletion or insertion mutations lead to a frameshift that causes misalignment between wild-type and mutated allele sequences, making it difficult to identify such mutations unambiguously by using electrophoresis-based DNA sequencing. We have previously established the feasibility of an accurate DNA sequencing method using solid-phase capturable (SPC) dideoxynucleotides and MALDI-TOF mass spectrometry on synthetic templates, an approach we refer to as SPC-sequencing. Here, we report the application of SPC-sequencing in characterizing frameshift mutations by using the detection of the BRCA1 gene mutations 185delAG and 5382insC as examples. In this method, Sanger DNA sequencing fragments are generated in one tube by using biotinylated dideoxynucleotides. The sequencing fragments carrying a biotin moiety at the 3′ end are captured on a streptavidin-coated solid phase to eliminate excess primer, primer dimers, and false stops. Only correctly terminated DNA fragments are captured, subsequently released, and analyzed by mass spectrometry to obtain digital DNA sequencing data. This method produces distinct doublet mass peaks at each point in the mass spectrum beyond the mutation site, facilitating the accurate characterization of the mutation. We have compared SPC-sequencing with electrophoresis-based sequencing in characterizing the above BRCA1 mutations, demonstrating the significant advantage offered by SPC-sequencing for the accurate identification of frameshift mutations.

Proceedings Article
01 Jan 2004
TL;DR: Here, a natural generalization of SFC is presented, the Weighted Sequencing from Compomers (WSC) Problem, that allows for de-novo sequencing of DNA molecules with 200+ nt and shows that the family of graphs introduced to solve SFC, can be generalized to capture the new aspects of WSC.
Abstract: One of the main endeavors in today’s Life Science remains the efficient sequencing of long DNA molecules. Today, most de-novo sequencing of DNA is still performed using electrophoresis-based Sanger Sequencing introduced in 1977, in spite of certain restrictions of this method. Recently, we proposed a new method for DNA sequencing using base-specific cleavage and mass spectrometry, that appears to be a promising alternative to classical DNA sequencing approaches: Among its benefits is the extremely fast data acquisition of mass spectrometry. This leads to the combinatorial problem of Sequencing From Compomers (SFC), and to the definition of sequencing graphs. Simulations indicate that this method may allow for de-novo sequencing of DNA molecules with 200+ nt. An open problem in the context of SFC is that it does not take into account false negative peaks (missing peaks) that are common for real-world mass spectra. Here, we present a natural generalization of SFC, the Weighted Sequencing from Compomers (WSC) Problem, that allows us to cope with false negative peaks. We also show that the family of graphs introduced to solve SFC, can be generalized to capture the new aspects of WSC. Finally, we present a branch-and-bound algorithm to find all sequences that agree with the sample mass spectra with the exception of some missing peaks.