scispace - formally typeset
Search or ask a question

Showing papers on "genomic DNA published in 2017"


Journal ArticleDOI
23 Nov 2017-Nature
TL;DR: Adenine base editors (ABEs) that mediate the conversion of A•T to G•C in genomic DNA are described and a transfer RNA adenosine deaminase is evolved to operate on DNA when fused to a catalytically impaired CRISPR–Cas9 mutant.
Abstract: The spontaneous deamination of cytosine is a major source of transitions from C•G to T•A base pairs, which account for half of known pathogenic point mutations in humans. The ability to efficiently convert targeted A•T base pairs to G•C could therefore advance the study and treatment of genetic diseases. The deamination of adenine yields inosine, which is treated as guanine by polymerases, but no enzymes are known to deaminate adenine in DNA. Here we describe adenine base editors (ABEs) that mediate the conversion of A•T to G•C in genomic DNA. We evolved a transfer RNA adenosine deaminase to operate on DNA when fused to a catalytically impaired CRISPR-Cas9 mutant. Extensive directed evolution and protein engineering resulted in seventh-generation ABEs that convert targeted A•T base pairs efficiently to G•C (approximately 50% efficiency in human cells) with high product purity (typically at least 99.9%) and low rates of indels (typically no more than 0.1%). ABEs introduce point mutations more efficiently and cleanly, and with less off-target genome modification, than a current Cas9 nuclease-based method, and can install disease-correcting or disease-suppressing mutations in human cells. Together with previous base editors, ABEs enable the direct, programmable introduction of all four transition mutations without double-stranded DNA cleavage.

2,451 citations


Journal ArticleDOI
TL;DR: The results show that the model is sensitive enough to detect changes in genomic DNA methylation levels as a function of growth phase in Escherichia coli.
Abstract: DNA chemical modifications regulate genomic function. We present a framework for mapping cytosine and adenosine methylation with the Oxford Nanopore Technologies MinION using this nanopore sequencer's ionic current signal. We map three cytosine variants and two adenine variants. The results show that our model is sensitive enough to detect changes in genomic DNA methylation levels as a function of growth phase in Escherichia coli.

383 citations


Journal ArticleDOI
TL;DR: This work developed a biochemical method (SITE-Seq), using Cas9 programmed with single-guide RNAs (sgRNAs), to identify the sequence of cut sites within genomic DNA, and found that the number of sites identified depended on sgRNA sequence and nuclease concentration.
Abstract: RNA-guided CRISPR-Cas9 endonucleases are widely used for genome engineering, but our understanding of Cas9 specificity remains incomplete. Here, we developed a biochemical method (SITE-Seq), using Cas9 programmed with single-guide RNAs (sgRNAs), to identify the sequence of cut sites within genomic DNA. Cells edited with the same Cas9-sgRNA complexes are then assayed for mutations at each cut site using amplicon sequencing. We used SITE-Seq to examine Cas9 specificity with sgRNAs targeting the human genome. The number of sites identified depended on sgRNA sequence and nuclease concentration. Sites identified at lower concentrations showed a higher propensity for off-target mutations in cells. The list of off-target sites showing activity in cells was influenced by sgRNP delivery, cell type and duration of exposure to the nuclease. Collectively, our results underscore the utility of combining comprehensive biochemical identification of off-target sites with independent cell-based measurements of activity at those sites when assessing nuclease activity and specificity.

302 citations


Journal ArticleDOI
TL;DR: DAP-seq is developed, a transcription factor (TF)-binding site (TFBS) discovery assay that couples affinity-purified TFs with next-generation sequencing of a genomic DNA library, enabling low-cost, high-throughput generation of cistrome and epicistrome maps for any organism.
Abstract: To enable low-cost, high-throughput generation of cistrome and epicistrome maps for any organism, we developed DNA affinity purification sequencing (DAP-seq), a transcription factor (TF)-binding site (TFBS) discovery assay that couples affinity-purified TFs with next-generation sequencing of a genomic DNA library. The method is fast, inexpensive, and more easily scaled than chromatin immunoprecipitation sequencing (ChIP-seq). DNA libraries are constructed using native genomic DNA from any source of interest, preserving cell- and tissue-specific chemical modifications that are known to affect TF binding (such as DNA methylation) and providing increased specificity as compared with in silico predictions based on motifs from methods such as protein-binding microarrays (PBMs) and systematic evolution of ligands by exponential enrichment (SELEX). The resulting DNA library is incubated with an affinity-tagged in vitro-expressed TF, and TF-DNA complexes are purified using magnetic separation of the affinity tag. Bound genomic DNA is eluted from the TF and sequenced using next-generation sequencing. Sequence reads are mapped to a reference genome, identifying genome-wide binding locations for each TF assayed, from which sequence motifs can then be derived. A researcher with molecular biology experience should be able to follow this protocol, processing up to 400 samples per week.

267 citations


Journal ArticleDOI
TL;DR: It is found that the rAPOBEC1–nCas9 base editor is highly specific, inducing cytosine-to-uracil conversions at only 18 ± 9 sites in the human genome for each sgRNA.
Abstract: Cas9-linked deaminases, also called base editors, enable targeted mutation of single nucleotides in eukaryotic genomes. However, their off-target activity is largely unknown. Here we modify digested-genome sequencing (Digenome-seq) to assess the specificity of a programmable deaminase composed of a Cas9 nickase (nCas9) and the deaminase APOBEC1 in the human genome. Genomic DNA is treated with the base editor and a mixture of DNA-modifying enzymes in vitro to produce DNA double-strand breaks (DSBs) at uracil-containing sites. Off-target sites are then computationally identified from whole genome sequencing data. Testing seven different single guide RNAs (sgRNAs), we find that the rAPOBEC1-nCas9 base editor is highly specific, inducing cytosine-to-uracil conversions at only 18 ± 9 sites in the human genome for each sgRNA. Digenome-seq is sensitive enough to capture off-target sites with a substitution frequency of 0.1%. Notably, off-target sites of the base editors are often different from those of Cas9 alone, calling for independent assessment of their genome-wide specificities.

228 citations


Journal ArticleDOI
10 Mar 2017-Science
TL;DR: Synthetic yeast chromosome X was chemically synthesized from scratch, a rigorous, incremental step toward complete synthesis of the whole yeast genome, and an efficient mapping method, PoPM, was developed to identify bugs during genome synthesis, generalizable to any watermarked synthetic chromosome.
Abstract: INTRODUCTION Design and construction of an extensively modified yeast genome is a direct means to interrogate the integrity, comprehensiveness, and accuracy of the knowledge amassed by the yeast community to date. The international synthetic yeast genome project (Sc2.0) aims to build an entirely designer, synthetic Saccharomyces cerevisiae genome. The synthetic genome is designed to increase genome stability and genetic flexibility while maintaining cell fitness near that of the wild type. A major challenge for a genome synthesis lies in identifying and eliminating fitness-reducing sequence variants referred to as “bugs.” RATIONALE Debugging is imperative for successfully building a fit strain encoding a synthetic genome. However, it is time-consuming and laborious to replace wild-type genes and measure strain fitness systematically. The Sc2.0 PCRTag system, which specifies recoded sequences within open reading frames (ORFs), is designed to distinguish synthetic from wild-type DNA in a simple polymerase chain reaction (PCR) assay. This system provides an opportunity to efficiently map bugs to the related genes by using a pooling strategy and subsequently correct them. Further, as we identify bugs in designer sequences, we will identify gaps in our knowledge and gain a deeper understanding of genome biology, allowing refinement of future design strategies. RESULTS We chemically synthesized yeast chromosome X, synX, designed to be 707,459 base pairs. A high-throughput mapping strategy called pooled PCRTag mapping (PoPM) was developed to identify unexpected bugs during chromosome assembly. With this method, the genotypes of pools of colonies with normal or defective fitness are assessed by PCRTag analysis. The PoPM method exploits the patchwork structure of synthetic and wild-type sequences observed in the majority of putative synthetic DNA integrants or meiotic progeny derived from synthetic/wild-type strain backcross. PCRTag analysis with both synthetic and wild-type specific primers, carried out with genomic DNA extracted from the two pools of clones (normal fitness versus a specific growth defect), can be used to identify regions of synthetic DNA missing from the normal fitness pool and, analogously, sections of wild-type DNA absent from the specific growth-defect pool. In this way, the defect can be efficiently mapped to a very small overlapping region, and subsequent systematic analysis of designed changes in that region can be used to identify the bug. Several bugs were identified and corrected, including a growth defect mapping to a specific synonymously recoded PCRTag sequence in the essential FIP1 ORF and the effect of introducing a loxPsym site that unexpectedly altered the the promoter function of a nearby gene, ATP2. In addition, meiotic crossover was employed to repair the massive duplications and rearrangements in the synthetic chromosome. The debugged synX strain exhibited high fitness under a variety of conditions tested and in competitive growth with the wild-type strain. CONCLUSION Synthetic yeast chromosome X was chemically synthesized from scratch, a rigorous, incremental step toward complete synthesis of the whole yeast genome. Thousands of designer modifications in synX revealed extensive flexibility of the yeast genome. We developed an efficient mapping method, PoPM, to identify bugs during genome synthesis, generalizable to any watermarked synthetic chromosome, and several details of yeast biology were uncovered by debugging. Considering the numerous gene-associated PCRTags available in the synthetic chromosomes, PoPM may represent a powerful tool to map interesting phenotypes of mutated synthetic strains or even mutated wild-type strains to the relevant genes. It may also be useful to study yeast genetic interactions when an unexpected phenotype is generated by alterations in two or more genes, substantially expanding understanding of yeast genomic and cellular functions. The PoPM method is also likely to be useful for mapping phenotype(s) resulting from the genome SCRaMbLE system.

197 citations


Journal ArticleDOI
TL;DR: An archaeal plasmid from an Antarctic species of haloarchaea that transfers via a mechanism similar to a virus that encodes proteins that can insert into host membranes and membrane vesicles provides insights into the evolutionary link between plasmids and viruses.
Abstract: The major difference between viruses and plasmids is the mechanism of transferring their genomic information between host cells. Here, we describe the archaeal plasmid pR1SE from an Antarctic species of haloarchaea that transfers via a mechanism similar to a virus. pR1SE encodes proteins that are found in regularly shaped membrane vesicles, and the vesicles enclose the plasmid DNA. The released vesicles are capable of infecting a plasmid-free strain, which then gains the ability to produce plasmid-containing vesicles. pR1SE can integrate and replicate as part of the host genome, resolve out with fragments of host DNA incorporated or portions of the plasmid left behind, form vesicles and transfer to new hosts. The pR1SE mechanism of transfer of DNA could represent the predecessor of a strategy used by viruses to pass on their genomic DNA and fulfil roles in gene exchange, supporting a strong evolutionary connection between plasmids and viruses. An archaeal plasmid that can be transported in membrane vesicles, similar to a virus, and encodes proteins that can insert into host membranes and membrane vesicles, provides insights into the evolutionary link between plasmids and viruses.

189 citations


Journal ArticleDOI
TL;DR: The modified Mericon extraction method was found to be the most efficient DNA extraction method, capable to provide high DNA yields with better quality, affordable cost and less time.
Abstract: The world’s top three cereals, based on their monetary value, are rice, wheat, and corn. In cereal crops, DNA extraction is difficult owing to rigid non-cellulose components in the cell wall of leaves and high starch and protein content in grains. The advanced techniques in molecular biology require pure and quick extraction of DNA. The majority of existing DNA extraction methods rely on long incubation and multiple precipitations or commercially available kits to produce contaminant-free high molecular weight DNA. In this study, we compared three different methods used for the isolation of high-quality genomic DNA from the grains of cereal crop, Zea mays, with minor modifications. The DNA from the grains of two maize hybrids, M10 and M321, was extracted using extraction methods DNeasy Qiagen Plant Mini Kit, CTAB-method (with/without 1% PVP) and modified Mericon extraction. Genes coding for 45S ribosomal RNA are organized in tandem arrays of up to several thousand copies and contain codes for 18S, 5.8S and 26S rRNA units separated by internal transcribed spacers ITS1 and ITS2. While the rRNA units are evolutionary conserved, ITS regions show high level of interspecific divergence and have been used frequently in genetic diversity and phylogenetic studies. In this study, the genomic DNA was then amplified with PCR using primers specific for ITS gene. PCR products were then visualized on agarose gel. The modified Mericon extraction method was found to be the most efficient DNA extraction method, capable to provide high DNA yields with better quality, affordable cost and less time.

156 citations


Journal ArticleDOI
TL;DR: A systematic study of cytosine-rich DNA sequences, with varying numbers of consecutive cytosines, is described to gain insights into i-motif DNA sequence and structure stability and postulated a folding rule for i-Motif formation, analogous to (but different from) that for G-quadruplexes.
Abstract: i-Motifs are alternative DNA secondary structures formed in cytosine-rich sequences. Particular examples of these structures, traditionally assumed to be stable only at acidic pH, have been found to form under near-physiological conditions. To determine the potential impact of these structures on physiological processes, investigation of sequences with the capacity to fold under physiological conditions is required. Here we describe a systematic study of cytosine-rich DNA sequences, with varying numbers of consecutive cytosines, to gain insights into i-motif DNA sequence and structure stability. i-Motif formation was assessed using ultraviolet spectroscopy, circular dichroism and native gel electrophoresis. We found that increasing cytosine tract lengths resulted in increased thermal stability; sequences with at least five cytosines per tract folded into i-motif at room temperature and neutral pH. Using these results, we postulated a folding rule for i-motif formation, analogous to (but different from) that for G-quadruplexes. This indicated that thousands of cytosine-rich sequences in the human genome may fold into i-motif structures under physiological conditions. Many of these were found in locations where structure formation is likely to influence gene expression. Characterization of a selection of these identified i-motif forming sequences uncovered 17 genomic i-motif forming sequence examples which were stable at neutral pH.

153 citations


Journal ArticleDOI
TL;DR: 'ExoCET' (Exonuclease Combined with RecET recombination) to directly clone any chosen region from bacterial and mammalian genomes with nucleotide precision into operational plasmids is described.
Abstract: The exponentially increasing volumes of DNA sequence data highlight the need for new DNA cloning methods to explore the new information. Here, we describe 'ExoCET' (Exonuclease Combined with RecET recombination) to directly clone any chosen region from bacterial and mammalian genomes with nucleotide precision into operational plasmids. ExoCET combines in vitro exonuclease and annealing with the remarkable capacity of full length RecET homologous recombination (HR) to retrieve specified regions from genomic DNA preparations. Using T4 polymerase (T4pol) as the in vitro exonuclease for ExoCET, we directly cloned large regions (>50 kb) from bacterial and mammalian genomes, including DNA isolated from blood. Employing RecET HR or Cas9 cleavage in vitro, the directly cloned region can be chosen with nucleotide precision to position, for example, a gene into an expression vector without the need for further subcloning. In addition to its utility for bioprospecting in bacterial genomes, ExoCET presents straightforward access to mammalian genomes for various applications such as region-specific DNA sequencing that retains haplotype phasing, the rapid construction of optimal, haplotypic, isogenic targeting constructs or a new way to genotype that presents advantages over Southern blotting or polymerase chain reaction. The direct cloning capacities of ExoCET present new freedoms in recombinant DNA technology.

107 citations


Journal ArticleDOI
29 Jun 2017-Cell
TL;DR: A chip-hybridized association-mapping platform (CHAMP) that repurposes next-generation sequencing chips to simultaneously measure the interactions between proteins and ∼107 unique DNA sequences provides a framework for high-throughput, quantitative analysis of protein-DNA interactions on synthetic and genomic DNA.

Journal ArticleDOI
TL;DR: The development of a genetic approach to identifying genes involved in eDNA release is reported, and growth in biofilm-inducing medium lowers cyclic-di-AMP levels and does so in a manner that depends on the gdpP phosphodiesterase gene.
Abstract: Staphylococcus aureus is a leading cause of both nosocomial and community-acquired infection. Biofilm formation at the site of infection reduces antimicrobial susceptibility and can lead to chronic infection. During biofilm formation, a subset of cells liberate cytoplasmic proteins and DNA, which are repurposed to form the extracellular matrix that binds the remaining cells together in large clusters. Using a strain that forms robust biofilms in vitro during growth under glucose supplementation, we carried out a genome-wide screen for genes involved in the release of extracellular DNA (eDNA). A high-density transposon insertion library was grown under biofilm-inducing conditions, and the relative frequency of insertions was compared between genomic DNA (gDNA) collected from cells in the biofilm and eDNA from the matrix. Transposon insertions into genes encoding functions necessary for eDNA release were identified by reduced representation in the eDNA. On direct testing, mutants of some of these genes exhibited markedly reduced levels of eDNA and a concomitant reduction in cell clustering. Among the genes with robust mutant phenotypes were gdpP, which encodes a phosphodiesterase that degrades the second messenger cyclic-di-AMP, and xdrA, the gene for a transcription factor that, as revealed by RNA-sequencing analysis, influences the expression of multiple genes, including many involved in cell wall homeostasis. Finally, we report that growth in biofilm-inducing medium lowers cyclic-di-AMP levels and does so in a manner that depends on the gdpP phosphodiesterase gene.

Journal ArticleDOI
TL;DR: This procedure is the method of choice for purification of genomic DNA from mammalian blood when large amounts of DNA are required, for example, for Southern blotting.
Abstract: This procedure is the method of choice for purification of genomic DNA from mammalian blood when large amounts of DNA are required, for example, for Southern blotting. The usual yield of DNA from buffy coat lymphocytes isolated from 20 mL of normal blood is ∼250 µg.

Journal ArticleDOI
TL;DR: Mutational analysis shows that key residues important for guide-dependent target processing are also involved in guide-independent MjAgo function, the first characterization of guide- independent cleavage activity for an Argonaute protein potentially serving as a guide biogenesis pathway in a prokaryotic system.
Abstract: Prokaryotic Argonaute proteins acquire guide strands derived from invading or mobile genetic elements, via an unknown pathway, to direct guide-dependent cleavage of foreign DNA. Here, we report that Argonaute from the archaeal organism Methanocaldococcus jannaschii (MjAgo) possesses two modes of action: the canonical guide-dependent endonuclease activity and a non-guided DNA endonuclease activity. The latter allows MjAgo to process long double-stranded DNAs, including circular plasmid DNAs and genomic DNAs. Degradation of substrates in a guide-independent fashion primes MjAgo for subsequent rounds of DNA cleavage. Chromatinized genomic DNA is resistant to MjAgo degradation, and recombinant histones protect DNA from cleavage in vitro. Mutational analysis shows that key residues important for guide-dependent target processing are also involved in guide-independent MjAgo function. This is the first characterization of guide-independent cleavage activity for an Argonaute protein potentially serving as a guide biogenesis pathway in a prokaryotic system.

Journal ArticleDOI
TL;DR: The use of parallel biophysical, enzymatic, and informatic approaches provides a comprehensive profiling of eccDNA robust to isolation and analysis methodology, and results in human and nematode systems provide quantitative analysis of the eccDNA loci at both unique and repetitive regions.
Abstract: Investigations aimed at defining the 3D configuration of eukaryotic chromosomes have consistently encountered an endogenous population of chromosome-derived circular genomic DNA, referred to as extrachromosomal circular DNA (eccDNA). While the production, distribution, and activities of eccDNAs remain understudied, eccDNA formation from specific regions of the linear genome has profound consequences on the regulatory and coding capabilities for these regions. Here, we define eccDNA distributions in Caenorhabditis elegans and in three human cell types, utilizing a set of DNA topology-dependent approaches for enrichment and characterization. The use of parallel biophysical, enzymatic, and informatic approaches provides a comprehensive profiling of eccDNA robust to isolation and analysis methodology. Results in human and nematode systems provide quantitative analysis of the eccDNA loci at both unique and repetitive regions. Our studies converge on and support a consistent picture, in which endogenous genomic DNA circles are present in normal physiological states, and in which the circles come from both coding and noncoding genomic regions. Prominent among the coding regions generating DNA circles are several genes known to produce a diversity of protein isoforms, with mucin proteins and titin as specific examples.

Journal ArticleDOI
TL;DR: Self-transcribing active core promoter sequencing (STAP-seq), a method to determine the responsiveness of genomic sequences to enhancers, is described and applied to the Drosophila melanogaster genome and identified sequence features that predict enhancer responsiveness.
Abstract: STAP-seq quantifies the ability of genomic DNA sequences to convert enhancer activities into transcription initiation events.

Journal ArticleDOI
23 Aug 2017-Nature
TL;DR: In this article, the extracellular domain of LAIR1, a collagen-binding inhibitory receptor encoded on chromosome 19 (ref. 1), was inserted between the V and DJ segments of an antibody.
Abstract: In two previously described donors, the extracellular domain of LAIR1, a collagen-binding inhibitory receptor encoded on chromosome 19 (ref. 1), was inserted between the V and DJ segments of an antibody. This insertion generated, through somatic mutations, broadly reactive antibodies against RIFINs, a type of variant antigen expressed on the surface of Plasmodium falciparum-infected erythrocytes. To investigate how frequently such antibodies are produced in response to malaria infection, we screened plasma from two large cohorts of individuals living in malaria-endemic regions. Here we report that 5-10% of malaria-exposed individuals, but none of the European blood donors tested, have high levels of LAIR1-containing antibodies that dominate the response to infected erythrocytes without conferring enhanced protection against febrile malaria. By analysing the antibody-producing B cell clones at the protein, cDNA and gDNA levels, we characterized additional LAIR1 insertions between the V and DJ segments and discovered a second insertion modality whereby the LAIR1 exon encoding the extracellular domain and flanking intronic sequences are inserted into the switch region. By exon shuffling, this mechanism leads to the production of bispecific antibodies in which the LAIR1 domain is precisely positioned at the elbow between the VH and CH1 domains. Additionally, in one donor the genomic DNA encoding the VH and CH1 domains was deleted, leading to the production of a camel-like LAIR1-containing antibody. Sequencing of the switch regions of memory B cells from European blood donors revealed frequent templated inserts originating from transcribed genes that, in rare cases, comprised exons with orientations and frames compatible with expression. These results reveal different modalities of LAIR1 insertion that lead to public and dominant antibodies against infected erythrocytes and suggest that insertion of templated DNA represents an additional mechanism of antibody diversification that can be selected in the immune response against pathogens and exploited for B cell engineering.

Journal ArticleDOI
TL;DR: It is proposed that FACT can recognize and bind Z-DNA or DNA in transition from a B to Z form and triggers a p53 response, which may explain the role of FACT in DNA damage prevention.
Abstract: Transitions of B-DNA to alternative DNA structures (ADS) can be triggered by negative torsional strain, which occurs during replication and transcription, and may lead to genomic instability. However, how ADS are recognized in cells is unclear. We found that the binding of candidate anticancer drug, curaxin, to cellular DNA results in uncoiling of nucleosomal DNA, accumulation of negative supercoiling and conversion of multiple regions of genomic DNA into left-handed Z-form. Histone chaperone FACT binds rapidly to the same regions via the SSRP1 subunit in curaxin-treated cells. In vitro binding of purified SSRP1 or its isolated CID domain to a methylated DNA fragment containing alternating purine/pyrimidines, which is prone to Z-DNA transition, is much stronger than to other types of DNA. We propose that FACT can recognize and bind Z-DNA or DNA in transition from a B to Z form. Binding of FACT to these genomic regions triggers a p53 response. Furthermore, FACT has been shown to bind to other types of ADS through a different structural domain, which also leads to p53 activation. Thus, we propose that FACT acts as a sensor of ADS formation in cells. Recognition of ADS by FACT followed by a p53 response may explain the role of FACT in DNA damage prevention.

Journal ArticleDOI
TL;DR: RNA Pol IV mediates the transcriptional balance between maternally and paternally inherited genomes in endosperm, and an inverse relationship between the parent of origin of sRNAs and mRNAs is found, with genes more likely to be associated with maternALLY than paternALLY biased s RNAs.

Journal ArticleDOI
TL;DR: The mechanism that emerges from the analysis explains how the Cas9/sgRNA complex is able to locate the correct target sequence efficiently while interrogating numerous nontarget sequences associated with correct PAMs.
Abstract: The prokaryotic clustered regularly interspaced short palindromic repeats (CRISPR)-associated 9 (Cas9) endonuclease cleaves double-stranded DNA sequences specified by guide RNA molecules and flanked by a protospacer adjacent motif (PAM) and is widely used for genome editing in various organisms. The RNA-programmed Cas9 locates the target site by scanning genomic DNA. We sought to elucidate the mechanism of initial DNA interrogation steps that precede the pairing of target DNA with guide RNA. Using fluorometric and biochemical assays, we studied Cas9/guide RNA complexes with model DNA substrates that mimicked early intermediates on the pathway to the final Cas9/guide RNA–DNA complex. The results show that Cas9/guide RNA binding to PAM favors separation of a few PAM-proximal protospacer base pairs allowing initial target interrogation by guide RNA. The duplex destabilization is mediated, in part, by Cas9/guide RNA affinity for unpaired segments of nontarget strand DNA close to PAM. Furthermore, our data indicate that the entry of double-stranded DNA beyond a short threshold distance from PAM into the Cas9/single-guide RNA (sgRNA) interior is hindered. We suggest that the interactions unfavorable for duplex DNA binding promote DNA bending in the PAM-proximal region during early steps of Cas9/guide RNA–DNA complex formation, thus additionally destabilizing the protospacer duplex. The mechanism that emerges from our analysis explains how the Cas9/sgRNA complex is able to locate the correct target sequence efficiently while interrogating numerous nontarget sequences associated with correct PAMs.

Journal ArticleDOI
TL;DR: Kaempferol (Kae), a natural flavonoid that is present in many fruits and vegetables, exhibits potent anti-cancer effects in bladder cancer and is reported to inhibit DNA methylation by suppressing DNA methyltransferases (DNMTs), but whether Kae can inhibitDNA methylation remains unclear.
Abstract: Background: Genomic DNA methylation plays an important role in both the occurrence and development of bladder cancer. Kaempferol (Kae), a natural flavonoid that i

Patent
14 Feb 2017
TL;DR: In this article, the authors describe DNA constructs comprising a promoter that is operable in the cells of interest and nucleotide sequences that encode a mutated Cpf1 or Csm1 protein with an abolished ability to produce DSBs, optionally linked to a domain that regulates transcriptional activity.
Abstract: Compositions and methods for modifying genomic DNA sequences are provided. The methods produce double-stranded breaks (DSBs) at pre-determined target sites in a genomic DNA sequence, resulting in mutation, insertion, and/or deletion of DNA sequences at the target site(s) in a genome. Compositions comprise DNA constructs comprising nucleotide sequences that encode a Cpf1 or Csm1 protein operably linked to a promoter that is operable in the cells of interest. The DNA constructs can be used to direct the modification of genomic DNA at pre-determined genomic loci. Methods to use these DNA constructs to modify genomic DNA sequences are described herein. Additionally, compositions and methods for modulating the expression of genes are provided. Compositions comprise DNA constructs comprising a promoter that is operable in the cells of interest operably linked to nucleotide sequences that encode a mutated Cpf1 or Csm1 protein with an abolished ability to produce DSBs, optionally linked to a domain that regulates transcriptional activity. The methods can be used to up- or down-regulate the expression of genes at predetermined genomic loci.

Journal ArticleDOI
TL;DR: Light is shed on the diversity and activity of microbial eukaryotes in the water column of a tropical oligotrophic ocean and their potential contributions in the downward transportation of organic material from the surface ocean to the deep via the biological pump.
Abstract: To study the diversity and metabolic activity of microbial eukaryotes in the water column of the South China Sea, genomic DNA and RNA were co-extracted from samples collected down to bathyal depth at two sites. V9 regions of both SSU rRNA gene and its transcript (cDNA) were amplified and sequenced using high throughput sequencing. Our study revealed: (1) DNA and RNA datasets showed significant differences in microbial eukaryote community composition, with the variability between the two datasets for the same sample exceeding that between samples within each dataset, indicating that nucleic acid source overrode environmental factors in determining the composition of microeukaryotes; (2) despite the differences in community composition between the two datasets, both DNA and RNA revealed similar depth-related distribution patterns of microbial eukaryotes; (3) using the ratio of RNA: DNA as a proxy of relative metabolic activity, a depth-related pattern was found for the relative metabolic activity of some but not all groups of microbial eukaryotes, with the highest activity for the groups with depth-related pattern usually found in the middle water layers; and (4) the presence of live and active photoautotrophic microbial eukaryotes in the deep ocean was confirmed, indicating that they play an important role in controlling the deep-sea organic carbon pool. Overall, our study sheds light on the diversity and activity of microbial eukaryotes in the water column of a tropical oligotrophic ocean and their potential contributions in the downward transportation of organic material from the surface ocean to the deep via the biological pump.

Journal ArticleDOI
TL;DR: This work is the first instance of synthetic bacterial recoding beyond the Escherichia coli genome, and reveals that Salmonella is remarkably amenable to genome-scale modification.
Abstract: The ability to rewrite large stretches of genomic DNA enables the creation of new organisms with customized functions. However, few methods currently exist for accumulating such widespread genomic changes in a single organism. In this study, we demonstrate a rapid approach for rewriting bacterial genomes with modified synthetic DNA. We recode 200 kb of the Salmonella typhimurium LT2 genome through a process we term SIRCAS (stepwise integration of rolling circle amplified segments), towards constructing an attenuated and genetically isolated bacterial chassis. The SIRCAS process involves direct iterative recombineering of 10-25 kb synthetic DNA constructs which are assembled in yeast and amplified by rolling circle amplification. Using SIRCAS, we create a Salmonella with 1557 synonymous leucine codon replacements across 176 genes, the largest number of cumulative recoding changes in a single bacterial strain to date. We demonstrate reproducibility over sixteen two-day cycles of integration and parallelization for hierarchical construction of a synthetic genome by conjugation. The resulting recoded strain grows at a similar rate to the wild-type strain and does not exhibit any major growth defects. This work is the first instance of synthetic bacterial recoding beyond the Escherichia coli genome, and reveals that Salmonella is remarkably amenable to genome-scale modification.

Posted ContentDOI
16 Oct 2017-bioRxiv
TL;DR: A novel, amplification-free enrichment technique that employs the CRISPR-Cas9 system for specific targeting multiple genomic loci that enables enrichment and sequencing of complex genomic regions that cannot be investigated with other technologies.
Abstract: Targeted sequencing has proven to be an economical means of obtaining sequence information for one or more defined regions of a larger genome. However, most target enrichment methods require amplification. Some genomic regions, such as those with extreme GC content and repetitive sequences, are recalcitrant to faithful amplification. Yet, many human genetic disorders are caused by repeat expansions, including difficult to sequence tandem repeats. We have developed a novel, amplification-free enrichment technique that employs the CRISPR-Cas9 system for specific targeting multiple genomic loci. This method, in conjunction with long reads generated through Single Molecule, Real-Time (SMRT) sequencing and unbiased coverage, enables enrichment and sequencing of complex genomic regions that cannot be investigated with other technologies. Using human genomic DNA samples, we demonstrate successful targeting of causative loci for Huntington’s disease (HTT; CAG repeat), Fragile X syndrome (FMR1; CGG repeat), amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (C9orf72; GGGGCC repeat), and spinocerebellar ataxia type 10 (SCA10) (ATXN10; variable ATTCT repeat). The method, amenable to multiplexing across multiple genomic loci, uses an amplification-free approach that facilitates the isolation of hundreds of individual on-target molecules in a single SMRT Cell and accurate sequencing through long repeat stretches, regardless of extreme GC percent or sequence complexity content. Our novel targeted sequencing method opens new doors to genomic analyses independent of PCR amplification that will facilitate the study of repeat expansion disorders.

Journal ArticleDOI
TL;DR: This is the first manuscript that describes use of distilled water as the only eliminator of RBCs among all other known DNA extraction techniques.
Abstract: There is no 'one' procedure for extracting DNA from the whole blood of both mammals and birds, since each species has a unique property that require different methods to release its own DNA. Therefore, to obtain genomic DNA, a universal, rapid, and noncostly method was developed. A very simple biological basis is followed in this procedure, in which, when the blood is placed in water, it rapidly enters the RBCs by osmosis and causes cells to burst by hemolysis. The validity of extracting genomic DNA was confirmed by several molecular biological experiments. It was found that this method provides an efficient and versatile alternative for extracting bulk amounts of highly-qualified DNA from the blood of a wide range of species. This is the first manuscript that describes use of distilled water as the only eliminator of RBCs among all other known DNA extraction techniques.

Journal ArticleDOI
TL;DR: CISMR (CRISPR-mediated isolation of specific megabase-sized regions of the genome), which enables the exploration of both DNA sequence and structural diversity of complex genomic regions in any species.
Abstract: Megabase-sized, complex, repetitive regions of genomes are poorly studied, due to the technical and computational challenges inherent to both assembling precise reference sequences and accurately assessing structural variation across contiguous megabase DNA regions. Here we describe a strategy to overcome these challenges, CISMR (CRISPR-mediated isolation of specific megabase-sized regions of the genome), which enables us to perform targeted isolation of contiguous megabase-sized segments of the genome. Direct sequencing of the purified DNA segments can have >100-fold enrichment of the target region, thus enabling the exploration of both DNA sequence and structural diversity of complex genomic regions in any species.

Proceedings ArticleDOI
01 Aug 2017
TL;DR: This research demonstrated that the molecular resonance fingerprints of cancer DNAs exist in the terahertz region, and they can be measured using advanced terAhertz time-domain spectroscopy (THz-TDS) techniques, which can be utilized to diagnose early cancer at the molecular level, and provide a potential cancer biomarker.
Abstract: Cancer is defined as a genetic and epigenetic disease involving the chemical and structural alteration of DNA. Because optical techniques measure overlapped and synthetic signals of materials, regardless of “ordering” features (such as the nucleotide order of a DNA sequence), it is difficult to sequence a genome to detect cancerous genetic changes using optical techniques. For this reason, previous genome sequencing methods, such as nanopore sequencing, have usually used electronic techniques [1, 2]. This only serves to emphasize the importance role of detecting cancerous epigenetic changes to DNA, when the optical molecular fingerprints of cancer are found. Aberrant DNA methylation is an epigenetic modification, a well-known carcinogenic mechanism [3, 4], and a common chemical and structural modification of DNA that does not change the DNA sequence. Terahertz waves can be used to observe modifications to DNA because the characteristic energies of biomolecules occur in the terahertz region. We found resonance fingerprints of methylation in cancer DNAs using improved terahertz spectroscopic methods [5]. The terahertz characteristics of methylated cytidine, a nucleoside, were a clue to observing the resonance fingerprints of DNA methylation. In aqueous solutions, we tracked and monitored the molecular resonances of genomic DNA from two control (293T, M-293T) and five cancer (PC3, A431, A549, MCF-7, SNU-1) cell lines, using freezing techniques and baseline corrections, as shown in Fig. 1 (a). The amplitudes of the resonance signals were dependent on the types of cancer cells the DNA had come from. These signals were quantified to identify cancer cell types, and the results were similar to those of biological quantification methods (Fig. 1 (b)). This research demonstrated that the molecular resonance fingerprints of cancer DNAs exist in the terahertz region, and they can be measured using advanced terahertz time-domain spectroscopy (THz-TDS) techniques. These results can be utilized to diagnose early cancer at the molecular level, and provide a potential cancer biomarker.

Journal ArticleDOI
11 Aug 2017-PLOS ONE
TL;DR: A non-random distribution of Dps binding sites across the bacterial chromosome in exponentially growing cells is reported and it is found that the Dps-bound regions overlap with sites occupied by other nucleoid proteins, and contain overrepresented motifs typical for their consensus sequences.
Abstract: Dps is a multifunctional homododecameric protein that oxidizes Fe2+ ions accumulating them in the form of Fe2O3 within its protein cavity, interacts with DNA tightly condensing bacterial nucleoid upon starvation and performs some other functions. During the last two decades from discovery of this protein, its ferroxidase activity became rather well studied, but the mechanism of Dps interaction with DNA still remains enigmatic. The crucial role of lysine residues in the unstructured N-terminal tails led to the conventional point of view that Dps binds DNA without sequence or structural specificity. However, deletion of dps changed the profile of proteins in starved cells, SELEX screen revealed genomic regions preferentially bound in vitro and certain affinity of Dps for artificial branched molecules was detected by atomic force microscopy. Here we report a non-random distribution of Dps binding sites across the bacterial chromosome in exponentially growing cells and show their enrichment with inverted repeats prone to form secondary structures. We found that the Dps-bound regions overlap with sites occupied by other nucleoid proteins, and contain overrepresented motifs typical for their consensus sequences. Of the two types of genomic domains with extensive protein occupancy, which can be highly expressed or transcriptionally silent only those that are enriched with RNA polymerase molecules were preferentially occupied by Dps. In the dps-null mutant we, therefore, observed a differentially altered expression of several targeted genes and found suppressed transcription from the dps promoter. In most cases this can be explained by the relieved interference with Dps for nucleoid proteins exploiting sequence-specific modes of DNA binding. Thus, protecting bacterial cells from different stresses during exponential growth, Dps can modulate transcriptional integrity of the bacterial chromosome hampering RNA biosynthesis from some genes via competition with RNA polymerase or, vice versa, competing with inhibitors to activate transcription.

Journal ArticleDOI
TL;DR: Compared to the traditional phenol–chloroform/proteinase-K method, the Rapid-Method is indicated to be a more suitable protocol for genomic DNA extraction from human whole blood in terms of DNA quantity, quality, safety, processing time and cost.
Abstract: The extraction of genomic DNA is the crucial first step in large-scale epidemiological studies. Though there are many popular DNA isolation methods from human whole blood, only a few reports have compared their efficiencies using both end-point and real-time PCR assays. Genomic DNA was extracted from coronary artery disease patients using solution-based conventional protocols such as the phenol-chloroform/proteinase-K method and a non-phenolic non-enzymatic Rapid-Method, which were evaluated and compared vis-a-vis a commercially available silica column-based Blood DNA isolation kit. The appropriate method for efficiently extracting relatively pure DNA was assessed based on the total DNA yield, concentration, purity ratios (A260/A280 and A260/A230), spectral profile and agarose gel electrophoresis analysis. The quality of the isolated DNA was further analysed for PCR inhibition using a murine specific ATP1A3 qPCR assay and mtDNA/Y-chromosome ratio determination assay. The suitability of the extracted DNA for downstream applications such as end-point SNP genotyping, was tested using PCR-RFLP analysis of the AGTR1-1166A>C variant, a mirSNP having pharmacogenetic relevance in cardiovascular diseases. Compared to the traditional phenol-chloroform/proteinase-K method, our results indicated the Rapid-Method to be a more suitable protocol for genomic DNA extraction from human whole blood in terms of DNA quantity, quality, safety, processing time and cost. The Rapid-Method, which is based on a simple salting-out procedure, is not only safe and cost-effective, but also has the added advantage of being scaled up to process variable sample volumes, thus enabling it to be applied in large-scale epidemiological studies.