scispace - formally typeset
Search or ask a question

Showing papers on "genomic DNA published in 2000"


Journal ArticleDOI
TL;DR: Phylogenetic analysis of 16S rRNA gene sequences recovered from one of the libraries indicates that the BAC libraries contain DNA from a wide diversity of microbial phyla, including sequences from diverse taxa such as the low-G+C, gram-positive Acidobacterium,Cytophagales, and Proteobacteria.
Abstract: Recent progress in molecular microbial ecology has revealed that traditional culturing methods fail to represent the scope of microbial diversity in nature, since only a small proportion of viable microorganisms in a sample are recovered by culturing techniques. To develop methods to investigate the full extent of microbial diversity, we used a bacterial artificial chromosome (BAC) vector to construct libraries of genomic DNA isolated directly from soil (termed metagenomic libraries). To date, we have constructed two such libraries, which contain more than 1 Gbp of DNA. Phylogenetic analysis of 16S rRNA gene sequences recovered from one of the libraries indicates that the BAC libraries contain DNA from a wide diversity of microbial phyla, including sequences from diverse taxa such as the low-G1C, gram-positive Acidobacterium, Cytophagales, and Proteobacteria. Initial screening of the libraries in Escherichia coli identified several clones that express heterologous genes from the inserts, confirming that the BAC vector can be used to maintain, express, and analyze environmental DNA. The phenotypes expressed by these clones include antibacterial, lipase, amylase, nuclease, and hemolytic activities. Metagenomic libraries are a powerful tool for exploring soil microbial diversity, providing access to the genetic information of uncultured soil microorganisms. Such libraries will be the basis of new initiatives to conduct genomic studies that link phylogenetic and functional information about the microbiota of environments dominated by microorganisms that are refractory to cultivation. The biosphere is dominated by microorganisms (32), yet most microbes in nature have not been studied. Traditional methods for culturing microorganisms limit analysis to those that grow under laboratory conditions (14, 25). The recent surge of research in molecular microbial ecology provides compelling evidence for the existence of many novel types of microorganisms in the environment in numbers and varieties that dwarf those of the comparatively few microorganisms amenable to laboratory cultivation (7, 13, 31). Corroboration comes from estimates of DNA complexity and the discovery of many unique 16S rRNA gene sequences from numerous environmental sources (8, 10, 28). Collectively, the genomes of the total microbiota found in nature, which we termed the metagenome (11), contain vastly more genetic information than is contained in the culturable subset. Given the profound utility and importance of microorganisms to all biological systems, methods are needed to access the wealth of information within the metagenome. Cloning large fragments of DNA isolated directly from microbes in natural environments provides a method to access soil metagenomic DNA. Previously, we investigated the use of the bacterial artificial chromosome (BAC) vector to express Bacillus cereus genomic DNA (20). The advantage of BAC vectors is that they maintain very large DNA inserts (greater than 100 kb) stably in Escherichia coli (23), facilitating the cloning of large fragments of DNA. Our results demonstrated that expression of heterologous DNA from B. cereus in an E. coli BAC system was detectable at a reasonable frequency (20), validating the idea that the low-copy BAC vector (one to two per cell) (23) could be used to express foreign DNA from foreign promoters in E. coli. Here we describe the construction and initial screening of two BAC libraries made with DNA isolated directly from soil. We found detectable levels of several biochemical activities from BAC library clones. Sequence analysis of selected BAC plasmids encoding such activities and of 16S rRNA genes in one of the libraries confirms the novelty of the genomic information cloned in our libraries. The results show that DNA extracted directly from soil is a valuable source of new genetic information and is accessible by using BAC libraries. Our results demonstrate that both traditional and functional genomics of uncultured microorganisms can be carried out by this approach and that screening of metagenome libraries for activities or gene sequences can provide a basis for conducting genomic analyses of uncultured microorganisms.

1,230 citations


Journal ArticleDOI
TL;DR: Results showed that computational gene prediction can be a reliable tool for annotating new genomic sequences, giving accurate information on 90% of coding sequences with 14% false positives, and exact gene prediction needs additional improvement using gene prediction algorithms.
Abstract: Ab initio gene identification in the genomic sequence of Drosophila melanogaster was obtained using (human gene predictor) and Fgenesh programs that have organism-specific parameters for human, Drosophila, plants, yeast, and nematode. We did not use information about cDNA/EST in most predictions to model a real situation for finding new genes because information about complete cDNA is often absent or based on very small partial fragments. We investigated the accuracy of gene prediction on different levels and designed several schemes to predict an unambiguous set of genes (annotation CGG1), a set of reliable exons (annotation CGG2), and the most complete set of exons (annotation CGG3). For 49 genes, protein products of which have clear homologs in protein databases, predictions were recomputed by Fgenesh+ program. The first annotation serves as the optimal computational description of new sequence to be presented in a database. Reliable exons from the second annotation serve as good candidates for selecting the PCR primers for experimental work for gene structure verification. Our results shows that we can identify approximately 90% of coding nucleotides with 20% false positives. At the exon level we accurately predicted 65% of exons and 89% including overlapping exons with 49% false positives. Optimizing accuracy of prediction, we designed a gene identification scheme using Fgenesh, which provided sensitivity (Sn) = 98% and specificity (Sp) = 86% at the base level, Sn = 81% (97% including overlapping exons) and Sp = 58% at the exon level and Sn = 72% and Sp = 39% at the gene level (estimating sensitivity on std1 set and specificity on std3 set). In general, these results showed that computational gene prediction can be a reliable tool for annotating new genomic sequences, giving accurate information on 90% of coding sequences with 14% false positives. However, exact gene prediction (especially at the gene level) needs additional improvement using gene prediction algorithms. The program was also tested for predicting genes of human Chromosome 22 (the last variant of Fgenesh can analyze the whole chromosome sequence). This analysis has demonstrated that the 88% of manually annotated exons in Chromosome 22 were among the ab initio predicted exons. The suite of gene identification programs is available through the WWW server of Computational Genomics Group at http://genomic.sanger.ac.uk/gf. html.

1,163 citations


Book ChapterDOI
01 Jan 2000
TL;DR: This chapter describes two procedures for the isolation of chromosomal DNA from E. coli that can be used for most gram-negative and gram-positive bacteria or modified to isolate DNA from organisms other than bacteria.
Abstract: This chapter describes two procedures for the isolation of chromosomal DNA from E. coli. These procedures can be used for most gram-negative and gram-positive bacteria or modified to isolate DNA from organisms other than bacteria.

546 citations


Journal ArticleDOI
TL;DR: The general usefulness of expression profiles for nearly 300 Saccharomyces cerevisiae deletion mutants, obtained recently, is demonstrated, with implications for interpreting whole-genome expression data, particularly from cells known to suffer genomic instability, such as malignant or immortalized cells.
Abstract: Expression profiling using DNA microarrays holds great promise for a variety of research applications, including the systematic characterization of genes discovered by sequencing projects. To demonstrate the general usefulness of this approach, we recently obtained expression profiles for nearly 300 Saccharomyces cerevisiae deletion mutants. Approximately 8% of the mutants profiled exhibited chromosome-wide expression biases, leading to spurious correlations among profiles. Competitive hybridization of genomic DNA from the mutant strains and their isogenic parental wild-type strains showed they were aneuploid for whole chromosomes or chromosomal segments. Expression profile data published by several other laboratories also suggest the use of aneuploid strains. In five separate cases, the extra chromosome harboured a close homologue of the deleted gene; in two cases, a clear growth advantage for cells acquiring the extra chromosome was demonstrated. Our results have implications for interpreting whole-genome expression data, particularly from cells known to suffer genomic instability, such as malignant or immortalized cells.

497 citations


Journal ArticleDOI
TL;DR: The Tol2 transposon system should thus be used to develop novel transgenesis and insertional mutagenesis methods in zebrafish and possibly in other fishes.
Abstract: The Tol2 element of the medaka fish Oryzias latipes belongs to the hAT family of transposons (hobo/Ac/Tam3). We report here identification of a functional transposase of Tol2 that is capable of catalyzing its transposition in the germ line of zebrafish Danio rerio. A transcript produced from Tol2 encodes a putative transposase. Zebrafish fertilized eggs were coinjected with mRNA transcribed in vitro, using cDNA of the Tol2 transcript as a template and a plasmid DNA harboring a mutant Tol2, which had a deletion in the putative transposase gene but retained necessary cis sequences. The injected fish were raised to adulthood and mated to noninjected fish, and genomic DNA of the progeny fish were analyzed by PCR and Southern hybridization. Half of F(1) fish obtained from one of eight injected fish contained the Tol2 DNA in their genomes but not the vector portion. Among these F(1) fish, Tol2 insertions at four different loci were identified, and some F(1) fish carried two or three different Tol2 insertions, indicating that the germ line of the founder fish is highly mosaic. Sequencing analyses revealed that, in all cases, Tol2 was surrounded by zebrafish genomic sequences, and an 8-bp duplication was created at the target site, indicating that Tol2 was integrated in the zebrafish genome through transposition. This study identifies an autonomous member of a DNA-based transposable element from a vertebrate genome. The Tol2 transposon system should thus be used to develop novel transgenesis and insertional mutagenesis methods in zebrafish and possibly in other fishes.

475 citations


Journal ArticleDOI
TL;DR: High-density oligonucleotide microarrays were used to analyse the genomes and meiotic expression patterns of two yeast strains that display distinct kinetics and efficiencies of sporulation and yielded approximately 1,600 meiotically regulated genes in each strain.
Abstract: We used high-density oligonucleotide microarrays to analyse the genomes and meiotic expression patterns of two yeast strains, SK1 and W303, that display distinct kinetics and efficiencies of sporulation. Hybridization of genomic DNA to arrays revealed numerous gene deletions and polymorphisms in both backgrounds. The expression analysis yielded approximately 1,600 meiotically regulated genes in each strain, with a core set of approximately 60% displaying similar patterns in both strains. Most of these (95%) are MATa/MATalpha-dependent and are not similarly expressed in near-isogenic meiosis-deficient controls. The transcript profiles correlate with the distribution of defined meiotic promoter elements and with the time of known gene function.

445 citations


Journal ArticleDOI
01 Oct 2000-Genetics
TL;DR: In this article, the type and frequency of simple sequence repeats (SSRs) in plant genomes were investigated using the expanding quantity of DNA sequence data deposited in public databases, and an approach was proposed and demonstrated for the targeted isolation of single or multiple, physically clustered SSRs linked to any gene that has been mapped using low-copy DNA-based markers.
Abstract: The type and frequency of simple sequence repeats (SSRs) in plant genomes was investigated using the expanding quantity of DNA sequence data deposited in public databases. In Arabidopsis, 306 genomic DNA sequences longer than 10 kb and 36,199 EST sequences were searched for all possible mono- to pentanucleotide repeats. The average frequency of SSRs was one every 6.04 kb in genomic DNA, decreasing to one every 14 kb in ESTs. SSR frequency and type differed between coding, intronic, and intergenic DNA. Similar frequencies were found in other plant species. On the basis of these findings, an approach is proposed and demonstrated for the targeted isolation of single or multiple, physically clustered SSRs linked to any gene that has been mapped using low-copy DNA-based markers. The approach involves sample sequencing a small number of subclones of selected randomly sheared large insert DNA clones (e.g., BACs). It is shown to be both feasible and practicable, given the probability of fortuitously sequencing through an SSR. The approach is demonstrated in barley where sample sequencing 34 subclones of a single BAC selected by hybridization to the Big1 gene revealed three SSRs. These allowed Big1 to be located at the top of barley linkage group 6HS.

418 citations


Journal Article
TL;DR: DNA methylation was directly and significantly related to RBC folate concentrations in persons with the T/T genotype, but not in those with wild-type MTHFR, consistent with prior observations, which suggest that the T or T genotype is associated with impaired MTH FR activity in vivo and that the cellular impact of this impairment is determined, in part, by folate status.
Abstract: DNA methylation is an epigenetic feature of DNA that influences cellular development and function, and aberrations of DNA methylation are a candidate mechanism for the development of cancer. Methylenetetrahydrofolate reductase (MTHFR) catalyzes the synthesis of 5-methyltetrahydrofolate, the methyl donor for methionine synthesis and the precursor of S-adenosylmethionine. S-adenosylmethionine is the universal methyl donor for methylation reactions, including that of DNA methylation. In the present study, we investigated whether a common C677T mutation in the MTHFR gene, which results in reduced enzyme activity in vitro, affects genomic DNA methylation. We selected 9 subjects homozygous for the wild-type MTHFR and 10 subjects homozygous for the mutation (T/T). Genomic DNA methylation was determined by an established enzymatic assay that measures the capacity of DNA to accept methyl groups in vitro, which is inversely related to endogenous methylation. DNA from subjects with the T/T MTHFR genotype had a significantly higher methyl group acceptance capacity (12,615 +/- 1836 dpm/2 microg of DNA) compared with wild-type MTHFR (7843 +/- 1043 dpm/2 microg of DNA; P < 0.05), indicating DNA hypomethylation in the T/T genotype. Furthermore, DNA methylation was directly and significantly related to RBC folate concentrations in persons with the T/T genotype, but not in those with wild-type MTHFR. These data are consistent with prior observations, which suggest that the T/T genotype is associated with impaired MTHFR activity in vivo and that the cellular impact of this impairment is determined, in part, by folate status. The relationship of genomic DNA hypomethylation in persons with the T/T MTHFR genotype to the development of cancer remains to be defined.

416 citations


Journal ArticleDOI
TL;DR: The results suggest that the two main groups of resistance genes underwent divergent evolution in cereal and dicot genomes and imply that their cognate signaling pathways have diverged as well.
Abstract: The majority of plant disease resistance genes are members of very large multigene families. They encode structurally related proteins containing nucleotide binding site domains (NBS) and C-terminal leucine rich repeats (LRR). The N-terminal region of some resistance genes contain a short sequence called TIR with homology to the animal innate immunity factors, Toll and interleukin receptor-like genes. Only a few plant resistance genes have been functionally analyzed and the origin and evolution of plant resistance genes remain obscure. We have reconstructed gene phylogeny by exhaustive analysis of available genome and amplified NBS domain sequences. Our study shows that NBS domains faithfully predict whole gene structure and can be divided into two major groups. Group I NBS domains contain group-specific motifs that are always linked with the TIR sequence in the N terminus. Significantly, Group I NBS domains and their associated TIR domains are widely distributed in dicot species but were not detected in cereal databases. Furthermore, Group I specific NBS sequences were readily amplified from dicot genomic DNA but could not be amplified from cereal genomic DNA. In contrast, Group II NBS domains are always associated with putative coiled-coil domains in their N terminus and appear to be present throughout the angiosperms. These results suggest that the two main groups of resistance genes underwent divergent evolution in cereal and dicot genomes and imply that their cognate signaling pathways have diverged as well.

414 citations


Journal ArticleDOI
TL;DR: An approach for recognizing genes within orthologous genomic loci from human and mouse by first aligning the regions using an iterative global alignment system and then identifying genes based on conservation of exonic features at aligned positions in both species.
Abstract: A fundamental task in analyzing genomes is to identify the genes. This is relatively straightforward for organisms with compact genomes (such as bacteria, yeast, flies and worms) because exons tend to be large and the introns are either non-existent or tend to be short. The challenge is much greater for large genomes (such as those of mammals and higher plants), because the exonic 'signal' is scattered in a vast sea of non-genic 'noise'. While coding sequences comprise 75% of the yeast genome, they represent only about 3% of the human genome. Computational approaches have been developed for gene recognition in large genomes, with most employing various statistical tools to identify likely splice sites and to detect tell-tale differences in sequence composition between coding and non-coding DNA (Burset & Guigo 1996). Some programs perform e novo recognition, in that they directly use only information about the input sequence itself. One of the best programs of this sort is GENSCAN (Burge 1997), which uses a Hidden Markov Model to scan large genomic sequences. Other programs employ “homology” approaches, in which exons are identified by comparing a conceptual translation of DNA sequences to databases of known protein sequences (Pachter et al. 1999; Gelfand et al. 1996). In this paper, we explore a powerful new approach to gene recognition by using cross-species sequence comparison, i.e., by simultaneously analyzing homologous loci from two related species. Specifically, we focus on the ability to accurately identify coding exons by comparison of syntenic human and mouse genomic sequences. It is well known that cross-species sequence comparison can help highlight important functional elements such as exons, because such elements tend to be more strongly conserved by evolution than random genomic sequences. If a protein encoded by a gene is already known in one organism, it is relatively simple to search genomic DNA from another organism to identify genes encoding a similar protein (using such computer packages such as Wise2 (http://www.sanger.ac.uk/Software/Wise2). A more challenging problem is to idenitfy exons directly from cross-species comparisons of genomic DNA. Computer programs are available that identify regions of sequence conservation, using simple “dot plots” or more sophisticated “pip plots” (Jang et al. 1999), which can then be individually analyzed in an ad hoc fashion to see whether they may contain such features as exons or regulatory elements. However, these programs simply identify conserved regions and do not systematically use the cross-species information to perform exon recogntion. We sought to develop an automatic approach to exon recognition by using cross-species sequence comparison to identify and align relevant regions and then searching for the presence of exonic features at corresponding positions in both species. We began by undertaking a systematic comparison of the genomic structure of 117 orthologous gene pairs from human and mouse to understand the extent of conservation of the number, length, and sequence of exons and introns. We then used these results to develop algorithms for cross-species gene recognition, consisting of GLASS, a new alignment program designed to provide good global alignments of large genomic regions by using a hierarchical alignment approach, and ROSETTA, a program that identifies coding exons in both species based on coincidence of genomic structure (splice sites, exon number, exon length, coding frame, and sequence similarity). ROSETTA performed extremely well in identifying coding exons, showing 95% sensitivity and 97% specificity at the nucleotide level. The performance was superior to programs that use much more sophisticated signals and statistical analysis but analyze only a single genome (Burset and Guigo 1996, Burge 1997). To our knowledge, ROSETTA is the first program for gene recognition based on cross-species comparison of genomic DNA from two organisms. The approach can be readily generalized to other pairs of organisms, as well as to the study of three or more organisms simultaneously. With the current explosion of knowledge regarding the human and mouse genomic sequences, cross-species comparison is likely to provide one of the most powerful approaches for extracting the information in mammalian genomes.

383 citations


Journal ArticleDOI
TL;DR: The data suggest that the kidney barrier in rodents and humans is permeable to DNA molecules large enough to be analyzed by standard genetic methodologies.
Abstract: Background: Cell-free DNA from dying cells recently has been discovered in human blood plasma. In experiments performed on animals and humans, we examined whether this cell-free DNA can cross the kidney barrier and be used as a diagnostic tool. Methods: Mice received subcutaneous injections of either human Raji cells or purified 32P-labeled DNA. DNA was isolated from urine and analyzed by measurement of radioactivity, agarose gel electrophoresis, and PCR. In humans, the permeability of the kidney barrier to polymeric DNA was assessed by detection in urine of sequences that were different from an organism bulk nuclear DNA. Results: In the experiments on laboratory animals, we found that ∼0.06% of injected DNA was excreted into urine within 3 days in a polymeric form and that human-specific Alu sequences that passed through the kidneys could be amplified by PCR. In humans, male-specific sequences could be detected in the urine of females who had been transfused with male blood as well as in DNA isolated from urine of women pregnant with male fetuses. K- ras mutations were detected in the urine of patients with colon adenocarcinomas and pancreatic carcinomas. Conclusions: The data suggest that the kidney barrier in rodents and humans is permeable to DNA molecules large enough to be analyzed by standard genetic methodologies.

Journal ArticleDOI
TL;DR: Estimates of microsatellite frequencies were determined, and indicate that CT repeats occur every 100 kb, CA repeats every 420 kb, and AGG repeats every 700 kb in the peach genome.
Abstract: Microsatellites have emerged as an important system of molecular markers. We evaluated the potential of microsatellites for use in genetic studies of peach [Prunus persica (L.) Batsch]. Microsatellite loci in peach were identified by screening a pUC8 genomic library, a λZAPII leaf cDNA library, as well as through database searches. Primer sequences for the microsatellite loci were tested from the related Rosaceae species apple (Malus×domestica) and sour cherry (Prunus cerasus L.). The genomic library was screened for CT, CA and AGG repeats, while the cDNA library was screened for (CT)n- and (CA)n-containing clones. Estimates of microsatellite frequencies were determined from the genomic library screening, and indicate that CT repeats occur every 100 kb, CA repeats every 420 kb, and AGG repeats every 700 kb in the peach genome. Microsatellite- containing clones were sequenced, and specific PCR primers were designed to amplify the microsatellite- containing regions from genomic DNA. The level of microsatellite polymorphism was evaluated among 28 scion peach cultivars which displayed one to four alleles per primer pair. Five microsatellites were found to segregate in intraspecific peach-mapping crosses. In addition, these microsatellite markers were tested for their utility in cross-species amplification for use in comparative mapping both within the Rosaceae, and with the un- related species Arabidopsis thaliana L.

Journal ArticleDOI
TL;DR: The alignment confirms that patterns of conservation can be useful in identifying regulatory regions and rarely expressed coding regions and a bimodal size distribution of syntenic regions is revealed.
Abstract: A new algorithm, WABA, was developed for doing large-scale alignments between genomic DNA of different species. WABA was used to align 8 million bases of Caenorhabditis briggsae genomic DNA against the entire 97-million-base Caenorhabditis elegans genome. The alignment, including C. briggsae homologs of 154 genetically characterized C. elegans genes and many times this number of largely uncharacterized ORFs, can be browsed and searched on the Web (http://www.cse.ucsc.edu/ approximately kent/intronerator). The alignment confirms that patterns of conservation can be useful in identifying regulatory regions and rarely expressed coding regions. Conserved regulatory elements can be identified inside coding exons by examining the level of divergence at the wobble position of codons. The alignment reveals a bimodal size distribution of syntenic regions. Over 250 introns are present in one species but not the other. The 3' and 5' intron splice sites have more similarity to each other in introns unique to one species than in C. elegans introns as a whole, suggesting a possible mechanism for intron removal.

Journal ArticleDOI
TL;DR: Using bioinformatic approaches to analyze the structures of L1 element target site duplications and flanking sequence features, evidence is provided suggesting that approximately 15% of full-length L1 elements bear evidence of flanking DNA segment transduction.
Abstract: Human L1 retrotransposons can produce DNA transduction events in which unique DNA segments downstream of L1 elements are mobilized as part of aberrant retrotransposition events. That L1s are capable of carrying out such a reaction in tissue culture cells was elegantly demonstrated. Using bioinformatic approaches to analyze the structures of L1 element target site duplications and flanking sequence features, we provide evidence suggesting that approximately 15% of full-length L1 elements bear evidence of flanking DNA segment transduction. Extrapolating these findings to the 600,000 copies of L1 in the genome, we predict that the amount of DNA transduced by L1 represents approximately 1% of the genome, a fraction comparable with that occupied by exons.

Journal ArticleDOI
TL;DR: It is shown that short probes can be recovered and amplified quantitatively following hybridisation to genomic DNA, and this simple observation forms the basis of a new approach to determining locus copy number in complex genomes.
Abstract: Despite its fundamental importance in genome analysis, it is only recently that systematic approaches have been developed to assess copy number at specific genetic loci, or to examine genomic DNA for submicroscopic deletions of unknown location. In this report we show that short probes can be recovered and amplified quantitatively following hybridisation to genomic DNA. This simple observation forms the basis of a new approach to determining locus copy number in complex genomes. The power and specificity of multiplex amplifiable probe hybridisation is demonstrated by the simultaneous assessment of copy number at a set of 40 human loci, including detection of deletions causing Duchenne muscular dystrophy and Prader–Willi/Angelman syndromes. Assembly of other probe sets will allow novel, technically simple approaches to a wide variety of genetic analyses, including the potential for extension to high resolution genome-wide screens for deletions and amplifications.

Journal ArticleDOI
TL;DR: The findings indicate that the use of rice as the model system for other cereals may sometimes be complicated by the presence of rapidly evolving gene families and microtranslocations.
Abstract: A 2.3-centimorgan (cM) segment of rice chromosome 11 consisting of 340 kb of DNA sequence around the alcohol dehydrogenase Adh1 and Adh2 loci was completely sequenced, revealing the presence of 33 putative genes, including several apparently involved in disease resistance. Fourteen of the genes were confirmed by identifying the corresponding transcripts. Five genes, spanning 1.9 cM of the region, cross-hybridized with maize genomic DNA and were genetically mapped in maize, revealing a stretch of colinearity with maize chromosome 4. The Adh1 gene marked one significant interruption. This gene mapped to maize chromosome 1, indicating a possible translocation of Adh1 after the evolutionary divergence leading to maize and sorghum. Several other genes, most notably genes similar to known disease resistance genes, showed no cross-hybridization with maize genomic DNA, suggesting sequence divergence or absence of these sequences in maize, which is in contrast to several other well-conserved genes, including Adh1 and Adh2. These findings indicate that the use of rice as the model system for other cereals may sometimes be complicated by the presence of rapidly evolving gene families and microtranslocations. Seven retrotransposons and eight transposons were identified in this rice segment, including a Tc1/Mariner–like element, which is new to rice. In contrast to maize, retroelements are less frequent in rice. Only 14.4% of this genome segment consist of retroelements. Miniature inverted repeat transposable elements were found to be the most frequently occurring class of repetitive elements, accounting for 18.8% of the total repetitive DNA.

Journal ArticleDOI
TL;DR: In this paper, seven amino acid substitution variants were identified in the repair domain of human Ape1, and functional characterization revealed that three of the variants, L104R, E126D and R237A, exhibited ∼40-60% reductions in specific incision activity.
Abstract: Apurinic/apyrimidinic (AP) sites are common mutagenic and cytotoxic DNA lesions. Ape1 is the major human repair enzyme for abasic sites and incises the phosphodiester backbone 5′ to the lesion to initiate a cascade of events aimed at removing the AP moiety and maintaining genetic integrity. Through resequencing of genomic DNA from 128 unrelated individuals, and searching published reports and sequence databases, seven amino acid substitution variants were identified in the repair domain of human Ape1. Functional characterization revealed that three of the variants, L104R, E126D and R237A, exhibited ∼40–60% reductions in specific incision activity. A fourth variant, D283G, is similar to the previously characterized mutant D283A found to exhibit ∼10% repair capacity. The most common substitution (D148E; observed at an allele frequency of 0.38) had no impact on endonuclease and DNA binding activities, nor did a G306A substitution. A G241R variant showed slightly enhanced endonuclease activity relative to wild-type. In total, four of seven substitutions in the repair domain of Ape1 imparted reduced function. These reduced function variants may represent low penetrance human polymorphisms that associate with increased disease susceptibility.

Patent
12 May 2000
TL;DR: In this article, an improved high-throughput and quantitative process for determining methylation patterns in genomic DNA samples based on amplifying modified nucleic acid, and detecting methylated nucleic acids based on amplification-dependent displacement of specifically annealed hybridization probes is disclosed.
Abstract: There is disclosed an improved high-throughput and quantitative process for determining methylation patterns in genomic DNA samples based on amplifying modified nucleic acid, and detecting methylated nucleic acid based on amplification-dependent displacement of specifically annealed hybridization probes. Specifically, the inventive process provides for treating genomic DNA samples with sodium bisulfite to create methylation-dependent sequence differences, followed by detection with fluorescence-based quantitative PCR techniques. The process is particularly well suited for the rapid analysis of a large number of nucleic acid samples, such as those from collections of tumor tissues.

Journal ArticleDOI
01 Dec 2000-Genome
TL;DR: A large number of polymerase chain reaction (PCR) primer sets that amplify DNA simple sequence repeat (SSR) loci of Sorghum bicolor (L.) Moench are developed and tested, with results indicating that mapping of SSR loci isolated from BAC clones located to these segments is likely to be the most efficient method for placing SSR Loci in the segments.
Abstract: We report the development, testing, and use (for genetic mapping) of a large number of polymerase chain reaction (PCR) primer sets that amplify DNA simple sequence repeat (SSR) loci of Sorghum bicolor (L.) Moench. Most of the primer sets were developed from clones isolated from two sorghum bacterial artificial chromosome (BAC) libraries and three enriched sorghum genomic-DNA (gDNA) libraries. A few were developed from sorghum DNA sequences present in public databases. The libraries were probed with radiolabeled di- and trinucleotide oligomers, the BAC libraries with four and six oligomers, respectively, and the enriched gDNA libraries with four and three oligomers, respectively. Both types of libraries were markedly enriched for SSRs relative to a size-fractionated gDNA library studied earlier. However, only 2% of the sequenced clones obtained from the size-fractionated gDNA library lacked a SSR, whereas 13% and 17% of the sequenced clones obtained from the BAC and enriched gDNA libraries, respectively, l...

Journal Article
TL;DR: A fast, simple, and reliable mini-prep method for the extraction of DNA from Gossypium species and cultivars has been developed, and it extracts DNA from one to three folded or nearly unfolded young leaves processed in a 1.5mL tube with 0.5 mL of extraction buffer.
Abstract: A fast, simple, and reliable mini-prep method for the extraction of DNA from Gossypium species and cultivars has been developed. This small-scale method is cetyltrimethylammonium bromide (cTAB)-based, and it extracts DNA from one to three folded or nearly unfolded young leaves processed in a 1.5 mL tube with 0.5 mL of extraction buffer and homogenized by an electric drill. Compared with the macro-prep cTAB method, the improved mini-prep method is highly efficient and much cheaper in terms of time, chemical use, and labor input. Easily 200 samples per day can be processed by a single person. The DNA yield is greater (60 )g per 50-100 mg of fresh leaf tissue) than that obtained from the macroprep method (50 )g from 5 g of fresh leaf tissue) and it provides DNA for 3000 to 6000 polymerase chain reactions (PCRs). The DNA quality is sufficient for PCR-based and endonuclease restriction marker analysis. With the development of polymerase chain reaction (PCR) technology, molecular markers J. Zhang and J.McD. Stewart, 115 Plant Sci. Bldg., Dep. of Crop, Soil, and Environ. Sci., Univ. of Arkansas, Fayetteville, AR 72701. Received 10 Mar. 2000. *Corresponding author (jstewart@comp.uark.edu). Abbreviations: PCR, polymerase chain reaction; RAPD, random amplified polymorphic DNA; SSR, simple sequence repeat; AFLP, amplified fragment length polymorphism; RFLP, restriction fragment length polymorphism; cTAB, cetyltrimethylammonium bromide. 194 ZHANG & STEWART: ECONOMICAL, RAPID GENOMIC DNA EXTRACTION based on PCR soon found vast application in plant genetics and breeding (Lee, 1995). To accommodate the need for PCR-based markers, a rapid, simple, and reliable DNA preparation method is required to provide high quality and quantity DNA for the analyses. Although numerous DNA extraction methods for plants have been reported in the literature, the cTAB extraction method is used most often. The traditional macro-preparation of DNA usually requires from 0.5 to several grams of plant tissue, making it impractical to analyze individual plants during early seedling stage. Also, the methods are time consuming and laborious due to their multistep procedures. Furthermore, large amounts of hazardous chemical solvents are required. Modifications have been made for plant species such as cotton that are high in polysaccharides and polyphenols. The compounds form a sticky, brown gelatinous matrix during DNA preparation that interferes with DNA digestion and PCRs. These modified methods usually employ high salt concentrations to remove polysaccharides, and polyvinylpyrrolidone to bind polyphenols (Lodhi et al., 1994; Porebski et al., 1997). Ascorbic acid, emercaptoethanol, and activated charcoal were found to improve extracted DNA quality (Paterson et al., 1993; Bi et al., 1996). For PCR-based DNA markers used in markerassisted selection, a fast DNA extraction method is needed. A reliable method should meet the following criteria: (i) require only a small amount of tissue; (ii) involve simple procedures; (iii) use minimal number and amount of chemicals; (iv) yield high-quality DNA; and (v) yield large quantities of DNA. Many mini-prep methods for obtaining DNA have been developed that have included such modifications as no grinding, no centrifugation, and/or no liquid transfer.

Journal ArticleDOI
21 Jul 2000-Science
TL;DR: It is shown that group II introns can be retargeted to insert efficiently into virtually any target DNA and that the retargeting introns retain activity in human cells.
Abstract: Mobile group II intron RNAs insert directly into DNA target sites and are then reverse-transcribed into genomic DNA by the associated intron-encoded protein. Target site recognition involves modifiable base-pairing interactions between the intron RNA and a >14-nucleotide region of the DNA target site, as well as fixed interactions between the protein and flanking regions. Here, we developed a highly efficient Escherichia coli genetic assay to determine detailed target site recognition rules for the Lactococcus lactis group II intron Ll.LtrB and to select introns that insert into desired target sites. Using human immunodeficiency virus-type 1 (HIV-1) proviral DNA and the human CCR5 gene as examples, we show that group II introns can be retargeted to insert efficiently into virtually any target DNA and that the retargeted introns retain activity in human cells. This work provides the practical basis for potential applications of targeted group II introns in genetic engineering, functional genomics, and gene therapy.

Journal ArticleDOI
TL;DR: A clear correlation of rDNA quantification and bacterial growth was observed, however, cell numbers cannot directly be estimated from quantitative PCR data, given that the cellular genome content varies with the growth phase of the organisms.

Journal ArticleDOI
TL;DR: It is shown here that 5‐methylcytosine occurs in the DNA of D.melanogaster at a level of ∼1 in 1000–2000 cytosine residues in adult flies.
Abstract: It is commonly accepted that the DNA of Drosophila melanogaster does not contain 5-methylcytosine, which is essential in the development of most eukaryotes. We have developed a new, highly specific and sensitive assay to detect the presence of 5-methylcytosine in genomic DNA. The DNA is degraded to nucleosides, 5-methylcytosine purified by HPLC and, for detection by 1D- and 2D-TLC, radiolabeled using deoxynucleoside kinase and [γ-32P]ATP. Using this assay, we show here that 5-methylcytosine occurs in the DNA of D.melanogaster at a level of ∼1 in 1000–2000 cytosine residues in adult flies. DNA methylation is detectable in all stages of D.melanogaster development.

Journal ArticleDOI
TL;DR: The complete genomic sequence of the archaeon Thermoplasma volcanium, possessing optimum growth temperature (OGT) of 60 degrees C, is reported and a number of strong correlations have been identified between characteristics of genomic organization and the OGT.
Abstract: The complete genomic sequence of the archaeon Thermoplasma volcanium, possessing optimum growth temperature (OGT) of 60°C, is reported. By systematically comparing this genomic sequence with the other known genomic sequences of archaea, all possessing higher OGT, a number of strong correlations have been identified between characteristics of genomic organization and the OGT. With increasing OGT, in the genomic DNA, frequency of clustering purines and pyrimidines into separate dinucleotides rises (e.g., by often forming AA and TT, whereas avoiding TA and AT). Proteins coded in a genome are divided into two distinct subpopulations possessing isoelectric points in different ranges (i.e., acidic and basic), and with increasing OGT the size of the basic subpopulation becomes larger. At the metabolic level, genes coding for enzymes mediating pathways for synthesizing some coenzymes, such as heme, start missing. These findings provide insights into the design of individual genomic components, as well as principles for coordinating changes in these designs for the adaptation to new environments.

Journal ArticleDOI
TL;DR: Three hundred cDNAs containing putatively entire open reading frames (ORFs) for previously undefined genes were obtained from CD34+ hematopoietic stem/progenitor cells (HSPCs), based on EST cataloging, clone sequencing, in silico cloning, and rapid amplification of cDNA ends (RACE).
Abstract: Three hundred cDNAs containing putatively entire open reading frames (ORFs) for previously undefined genes were obtained from CD34+ hematopoietic stem/progenitor cells (HSPCs), based on EST cataloging, clone sequencing, in silico cloning, and rapid amplification of cDNA ends (RACE). The cDNA sizes ranged from 360 to 3496 bp and their ORFs coded for peptides of 58-752 amino acids. Public database search indicated that 225 cDNAs exhibited sequence similarities to genes identified across a variety of species. Homology analysis led to the recognition of 50 basic structural motifs/domains among these cDNAs. Genomic exon-intron organization could be established in 243 genes by integration of cDNA data with genome sequence information. Interestingly, a new gene named as HSPC070 on 3p was found to share a sequence of 105bp in 3' UTR with RAF gene in reversed transcription orientation. Chromosomal localizations were obtained using electronic mapping for 192 genes and with radiation hybrid (RH) for 38 genes. Macroarray technique was applied to screen the gene expression patterns in five hematopoietic cell lines (NB4, HL60, U937, K562, and Jurkat) and a number of genes with differential expression were found. The resource work has provided a wide range of information useful not only for expression genomics and annotation of genomic DNA sequence, but also for further research on the function of genes involved in hematopoietic development and differentiation.

Journal ArticleDOI
TL;DR: Phosphate perturbations are consistent with a model of non-specific cation binding in which delocalized polyamines diffuse along DNA while confined by the strong electrostatic potential gradient perpendicular to the helix axis and provide experimental support for models in which polyamine-induced DNA condensation is driven by non- specific electrostatic binding.
Abstract: Spermidine and spermine employ similar DNA-binding mechanisms. The data of Figures ​Figures11–4 show that a given genomic DNA suffers similar spectral perturbations from binding of either Spd or Spm. This indicates that Spd and Spm exploit a common molecular mechanism of DNA binding.

Journal ArticleDOI
TL;DR: The results suggest that the risk of mutation due to integration of plasmid DNA vaccines following intramuscular injection is negligible, at least three orders of magnitude below the spontaneous mutation rate.
Abstract: The primary safety concern for DNA vaccines is their potential to integrate into the host cell genome. We describe an integration assay based on purification of high-molecular-weight genomic DNA away from free plasmid using gel electrophoresis, such that the genomic DNA can then be assayed for integrated plasmid using a sensitive PCR method. The assay sensitivity was approximately 1 plasmid copy/microg DNA (representing approximately 150,000 diploid cells). Using this assay, we carried out integration studies of three different plasmid DNA vaccines, containing either the influenza hemagglutinin, influenza matrix or HIV gag gene. Six weeks after intramuscular injection, free plasmid was detected in treated muscle at levels ranging from approximately 1,000 to 4,000 copies/microg DNA. At 6 months, the plasmid levels ranged between 200 and 800 copies/microg DNA. Gel purification of genomic DNA revealed that essentially all of the detectable plasmid in treated quadriceps was extrachromosomal. If integration had occurred, the frequency was

Journal ArticleDOI
TL;DR: Complete and apparently truncated proviruses were detected in DNA of both clinically normal koalas and those with hematopoietic disease, and phylogenetic analysis showed that KoRV paradoxically clusters with gibbon ape leukemia virus (GALV).
Abstract: A novel retrovirus, morphologically consistent with mammalian C-type retroviruses, was detected by electron microscopy in mitogen-stimulated peripheral blood mononuclear cell cultures from 163 koalas and in lymphoma tissue from 3 koalas. PCR amplified provirus from the blood and tissues of 17 wild and captive koalas, and reverse transcriptase-PCR demonstrated viral mRNA, viral genomic RNA, and reverse transcriptase activity in koala serum and cell culture supernatants. Comparison of viral sequences derived from genomic DNA and mRNA showed identity indicative of a single retroviral species—here designated koala retrovirus (KoRV). Southern blot analysis of koala tissue genomic DNA using labelled KoRV probes demonstrated banding consistent with an endogenous retrovirus. Complete and apparently truncated proviruses were detected in DNA of both clinically normal koalas and those with hematopoietic disease. KoRV-related viruses were not detected in other marsupials, and phylogenetic analysis showed that KoRV paradoxically clusters with gibbon ape leukemia virus (GALV). The strong similarity between GALV and KoRV suggests that these viruses are closely related and that recent cross-host transmission has occurred. The complete proviral DNA sequence of KoRV is reported.

Journal ArticleDOI
01 Feb 2000-Taxon
TL;DR: A simple method for isolation of genomic DNA from wild plants sampled in remote field areas is presented and is suitable for random amplified polymorphic DNA (RAPD) analysis of plant populations as well as for specific amplification of chloroplast DNA sequences.
Abstract: A simple method for isolation of genomic DNA from wild plants sampled in remote field areas is presented. The protocol combines NaCI/CTAB leaf preservation with sorbitol extraction of secondary compounds which often contain inhibitors of Taq DNA polymerase activity. The obtained DNA is suitable for random amplified polymorphic DNA (RAPD) analysis of plant populations as well as for specific amplification of chloroplast DNA sequences. The NaCI/CTAB leaf preservation is a powerful alternative to silica gel dryingbased preservation.

Journal ArticleDOI
TL;DR: The results demonstrated that this protocol is effective in identifying methylated DNA profiles, and up to 25 percent of AFLP bands were derived from methylated sequences, and among those, a few bands unique to either adult trees or in vitro shoots were observed.
Abstract: By using the isoschizomersHpa II andMsp I which display differential sensitivity to cytosine methylation, a modified amplified fragment length polymorphism (AFLP) technique has been developed to investigate DNA methylation profiles in eukaryotic organims. Genomic DNA was digested with a mixture ofEcoR I and one of the isoschizomers, and ligated to oligonucleotide adapters. After two rounds of selective PCR amplification, followed by DNA separation on a Long Ranger gel electrophoresis, a subset of restriction fragments can be displayed on an X-ray film. Comparison of AFLP banding patterns betweenHpa II andMsp I revealed the extent of DNA methylation. The technique has been successfully applied in this study to investigate DNA methylation profiles of apple (Malus domestica cv. Gala) genomic DNA extracted from leaves of field-grown adult trees andin vitro-grown shoot cultures. The results showed that up to 25 percent of AFLP bands were derived from methylated sequences, and among those, a few bands unique to either adult trees orin vitro shoots were observed. These results demonstrated that this protocol is effective in identifying methylated DNA profiles.