G-quadruplexes and gene expression in Arabidopsis thaliana

Open Access Dissertation

TLDR

This dissertation introduces a novel method for identifying putative G4s (PG4s), a machine learning approach trained on datasets derived from high-throughput sequencing of G4 structures, and applies it to study the prevalence of PG4s in the genome of the model plant Arabidopsis thaliana.

Abstract:

G-quadruplexes (G4s) are four-stranded DNA structures which form in regions with high GC content and high GC skew. Because G4 structure depends on specific sequences, it is possible to predict putative G4s (PG4s) throughout a genomic sequence. PG4s are non-uniformly distributed in genomes, with higher densities within various genic features, particularly promoters, 5’ untranslated regions (UTRs) and coding sequences (CDSs). When they form G4s, these sequences can affect a variety of biological processes, including replication, transcription, translation and splicing. Here, we introduce a novel method for identifying PG4s, which uses a machine learning approach trained on datasets derived from high-throughput sequencing of G4 structures. We apply this and other techniques to study the prevalence of PG4s in the genome of the model plant Arabidopsis thaliana. Finally, we study the effect of G4 stabilisation on gene expression in Arabidopsis, using the G-quadruplex binding agent N-methyl mesoporphyrin (NMM). We identify a family of genes which are strongly downregulated by NMM, and find that they contain large numbers of PG4s in their CDSs.
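
To illustrate what sequence-based PG4 prediction can look like, here is a minimal Python sketch. It is not the machine learning method introduced in the dissertation. It assumes the widely used motif of four runs of at least three guanines separated by loops of 1 to 7 nucleotides, and it scans for the complementary C-rich motif to catch PG4s on the reverse strand. The function names `find_pg4s` and `gc_skew` are hypothetical, and GC skew is taken here as (G - C) / (G + C).

```python
import re

# Assumed motif (not taken from the dissertation): four G-runs of length >= 3
# separated by loops of 1-7 nt. The lookahead lets the scan test every start
# position, so overlapping candidates are reported too.
LOOP = "[ACGT]{1,7}"
G4_PLUS = re.compile("(?=(" + LOOP.join(["G{3,}"] * 4) + "))")
G4_MINUS = re.compile("(?=(" + LOOP.join(["C{3,}"] * 4) + "))")


def find_pg4s(seq):
    """Return sorted (start, end, strand) tuples; end is exclusive."""
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", G4_PLUS), ("-", G4_MINUS)):
        for match in pattern.finditer(seq):
            hits.append((match.start(1), match.end(1), strand))
    return sorted(hits)


def gc_skew(seq):
    """GC skew as (G - C) / (G + C); 0.0 if the sequence has no G or C."""
    seq = seq.upper()
    g, c = seq.count("G"), seq.count("C")
    return (g - c) / (g + c) if (g + c) else 0.0


if __name__ == "__main__":
    example = "TTGGGAGGGTGGGAGGGTT"
    for start, end, strand in find_pg4s(example):
        print(start, end, strand, example[start:end], gc_skew(example[start:end]))
```

A pattern of this kind only captures the canonical motif; it says nothing about whether a given sequence actually folds, which is the gap that approaches trained on G4 sequencing data aim to close.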

Citations

Integrative Genomics Viewer

James T. Robinson, +7 more

TL;DR: The sheer volume and scope of the data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

Chromatin-state discovery and genome annotation with ChromHMM

Jason Ernst, +1 more

TL;DR: ChromHMM combines multiple genome-wide epigenomic maps, and uses combinatorial and spatial mark patterns to infer a complete annotation for each cell type, and provides an automated enrichment analysis of the resulting annotations to facilitate the functional interpretations of each chromatin state.

Journal Article

Python: the tutorial

Michael P. Rogers

- 01 Oct 2009 -

Journal of Computing Sciences in College...

TL;DR: Over the years, programming languages have grown more powerful, but correspondingly more complex; and while that complexity is fine and appropriate for professional programmers, it hinders and discourages beginning Computer Science students.

Journal Article

A Hitchhiker's Guide to…

Cathy Lundmark

- 01 Nov 2005 -

BioScience

TL;DR: In this paper, the authors present a summary of issues that faculty members should review as they begin to consider retirement, including the benefits they consider to be important and the issues that need to be considered.

Journal Article

How long is too long

Tom Reinke

- 01 Jul 2008 -

Managed care (Langhorne, Pa.)

References

Journal Article

Deep sequencing of subcellular RNA fractions shows splicing to be predominantly co-transcriptional in the human genome but inefficient for lncRNAs

Hagen Tilgner, +9 more

- 01 Sep 2012 -

Genome Research

TL;DR: The coSI measure, based on RNA-seq reads mapping to exon junctions and borders, is introduced to assess the degree of splicing completion around internal exons, and significant enrichment of spliceosomal snRNAs in chromatin-associated RNA is found compared with other cellular RNA fractions and other non-spliceosomal snRNAs.

Journal Article

Potent effect of target structure on microRNA function.

Dang Long, +5 more

- 01 Apr 2007 -

Nature Structural & Molecular Biology

TL;DR: A potent effect of target structure on target recognition by miRNAs is indicated and a structure-based framework for genome-wide identification of animal miRNA targets is established.

Journal Article

Re-evaluation of G-quadruplex propensity with G4Hunter

Amina Bedrat, +2 more

- 29 Feb 2016 -

Nucleic Acids Research

TL;DR: The G4Hunter algorithm is applied to genomes of a number of species, including humans, allowing us to conclude that the number of sequences capable of forming stable quadruplexes (at least in vitro) in the human genome is significantly higher, by a factor of 2–10, than previously thought.

Journal Article

Poly(A) signals.

Nick J. Proudfoot

- 22 Feb 1991 -

Cell

Journal Article

A Structural Model of Transcription Elongation

Nataliya Korzheva, +6 more

- 28 Jul 2000 -

Science

TL;DR: The path of the nucleic acids through a transcription elongation complex was tracked by mapping cross-links between bacterial RNA polymerase and transcript RNA or template DNA onto the x-ray crystal structure and the resulting model provides insight into the functional properties of the transcription complex.

Related Papers (4)

Small G-proteins in Arabidopsis thaliana.

The AtMRS2 gene family from Arabidopsis thaliana

PhytochromeA-specific induction of cabgene expression in Arabidopsis thaliana

The Ribosomal Protein L23a Family of Arabidopsis thaliana