scispace - formally typeset

Showing papers on "Exon published in 2013"


Journal ArticleDOI
TL;DR: The data indicate that intragenic DNA methylation operates in exon definition to modulate alternative RNA splicing and can enhance exon recognition via recruitment of the multifunctional protein MeCP2, which thereby maintains local histone hypoacetylation through the subsequent recruitment of HDACs.
Abstract: Although the function of DNA methylation in gene promoter regions is well established in transcriptional repression, the function of the evolutionarily conserved widespread distribution of DNA methylation in gene body regions remains incompletely understood. Here, we show that DNA methylation is enriched in included alternatively spliced exons (ASEs), and that inhibition of DNA methylation results in aberrant splicing of ASEs. The methyl-CpG-binding protein MeCP2 is enriched in included ASEs, particularly those that are also highly methylated, and inhibition of DNA methylation disrupts specific targeting of MeCP2 to exons. Interestingly, ablation of MeCP2 results in increased histone acetylation and aberrant ASE-skipping events. We further show that inhibition of histone deacetylase (HDAC) activity leads to exon skipping that shows a highly significant degree of overlap with that caused by MeCP2 knockdown. Together, our data indicate that intragenic DNA methylation operates in exon definition to modulate alternative RNA splicing and can enhance exon recognition via recruitment of the multifunctional protein MeCP2, which thereby maintains local histone hypoacetylation through the subsequent recruitment of HDACs.

497 citations


Journal ArticleDOI
TL;DR: It is shown that CAG repeat length–dependent aberrant splicing of exon 1 HTT results in a short polyadenylated mRNA that is translated into an exon 1 HTT protein, which provides a mechanistic basis for the molecular pathogenesis of HD.
Abstract: Huntington disease (HD) is a devastating, late-onset, inherited neurodegenerative disorder that manifests with personality changes, movement disorders, and cognitive decline. It is caused by a CAG repeat expansion in exon 1 of the HTT gene that translates to a polyglutamine tract in the huntingtin protein (HTT). The formation of HTT fragments has been implicated as an essential step in the molecular pathogenesis of HD and several proteases that cleave HTT have been identified. However, the importance of smaller N-terminal fragments has been highlighted by their presence in HD postmortem brains and by the fact that nuclear inclusions are only detected by antibodies to the N terminus of HTT. Despite an intense research effort, the precise length of these fragments and the mechanism by which they are generated remains unknown. Here we show that CAG repeat length–dependent aberrant splicing of exon 1 HTT results in a short polyadenylated mRNA that is translated into an exon 1 HTT protein. Given that mutant exon 1 HTT proteins have consistently been shown to be highly pathogenic in HD mouse models, the aberrant splicing of HTT mRNA provides a mechanistic basis for the molecular pathogenesis of HD. RNA-targeted therapeutic strategies designed to lower the levels of HTT are under development. Many of these approaches would not prevent the production of exon 1 HTT and should be reviewed in light of our findings.

405 citations


Journal ArticleDOI
TL;DR: ExoN is identified as the first viral protein distinct from the RdRp that determines the sensitivity of RNA viruses to mutagens. The results show the importance of ExoN as a target for inhibition and suggest that small-molecule inhibitors of ExoN activity could be potential pan-CoV therapeutics in combination with RBV or RNA mutagens.
Abstract: No therapeutics or vaccines currently exist for human coronaviruses (HCoVs). The Severe Acute Respiratory Syndrome-associated coronavirus (SARS-CoV) epidemic in 2002–2003, and the recent emergence of Middle East Respiratory Syndrome coronavirus (MERS-CoV) in April 2012, emphasize the high probability of future zoonotic HCoV emergence causing severe and lethal human disease. Additionally, the resistance of SARS-CoV to ribavirin (RBV) demonstrates the need to define new targets for inhibition of CoV replication. CoVs express a 3′-to-5′ exoribonuclease in nonstructural protein 14 (nsp14-ExoN) that is required for high-fidelity replication and is conserved across the CoV family. All genetic and biochemical data support the hypothesis that nsp14-ExoN has an RNA proofreading function. Thus, we hypothesized that ExoN is responsible for CoV resistance to RNA mutagens. We demonstrate that while wild-type (ExoN+) CoVs were resistant to RBV and 5-fluorouracil (5-FU), CoVs lacking ExoN activity (ExoN−) were up to 300-fold more sensitive. While the primary antiviral activity of RBV against CoVs was not mutagenesis, ExoN− CoVs treated with 5-FU demonstrated both enhanced sensitivity during multi-cycle replication, as well as decreased specific infectivity, consistent with 5-FU functioning as a mutagen. Comparison of full-genome next-generation sequencing of 5-FU treated SARS-CoV populations revealed a 16-fold increase in the number of mutations within the ExoN− population as compared to ExoN+. Ninety percent of these mutations represented A:G and U:C transitions, consistent with 5-FU incorporation during RNA synthesis. Together our results constitute direct evidence that CoV ExoN activity provides a critical proofreading function during virus replication. Furthermore, these studies identify ExoN as the first viral protein distinct from the RdRp that determines the sensitivity of RNA viruses to mutagens. Finally, our results show the importance of ExoN as a target for inhibition, and suggest that small-molecule inhibitors of ExoN activity could be potential pan-CoV therapeutics in combination with RBV or RNA mutagens.

392 citations


Journal ArticleDOI
14 Mar 2013-Cell
TL;DR: This extensive crosstalk between gene regulatory layers takes advantage of dynamic spatial, physical, and temporal organizational properties of the cell nucleus, and further emphasizes the importance of developing a multidimensional understanding of splicing control.

390 citations


Journal ArticleDOI
TL;DR: Quantitative analysis of APA isoforms indicated that promoter-distal pAs, regardless of intron or exon locations, become more abundant during embryonic development and cell differentiation and that upregulated isoforms have stronger pAs, suggesting global modulation of the 3′ end–processing activity in development and differentiation.
Abstract: Alternative cleavage and polyadenylation (APA) generates diverse mRNA isoforms. We developed 3' region extraction and deep sequencing (3'READS) to address mispriming issues that commonly plague poly(A) site (pA) identification, and we used the method to comprehensively map pAs in the mouse genome. Thorough annotation of gene 3' ends revealed over 5,000 previously overlooked pAs (∼8% of total) flanked by A-rich sequences, underscoring the necessity of using an accurate tool for pA mapping. About 79% of mRNA genes and 66% of long noncoding RNA genes undergo APA, but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Quantitative analysis of APA isoforms by 3'READS indicated that promoter-distal pAs, regardless of intron or exon locations, become more abundant during embryonic development and cell differentiation and that upregulated isoforms have stronger pAs, suggesting global modulation of the 3' end-processing activity in development and differentiation.

383 citations


Journal ArticleDOI
TL;DR: Adult-onset motor neuron disease does not require aggregation or loss of nuclear TDP-43, with ALS-linked mutants producing loss and gain of splicing function of selected RNA targets at an early disease stage.
Abstract: Transactivating response region DNA binding protein (TDP-43) is the major protein component of ubiquitinated inclusions found in amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD) with ubiquitinated inclusions. Two ALS-causing mutants (TDP-43(Q331K) and TDP-43(M337V)), but not wild-type human TDP-43, are shown here to provoke age-dependent, mutant-dependent, progressive motor axon degeneration and motor neuron death when expressed in mice at levels and in a cell type-selective pattern similar to endogenous TDP-43. Mutant TDP-43-dependent degeneration of lower motor neurons occurs without: (i) loss of TDP-43 from the corresponding nuclei, (ii) accumulation of TDP-43 aggregates, and (iii) accumulation of insoluble TDP-43. Computational analysis using splicing-sensitive microarrays demonstrates alterations of endogenous TDP-43-dependent alternative splicing events conferred by both human wild-type and mutant TDP-43(Q331K), but with high levels of mutant TDP-43 preferentially enhancing exon exclusion of some target pre-mRNAs affecting genes involved in neurological transmission and function. Comparison with splicing alterations following TDP-43 depletion demonstrates that TDP-43(Q331K) enhances normal TDP-43 splicing function for some RNA targets but loss-of-function for others. Thus, adult-onset motor neuron disease does not require aggregation or loss of nuclear TDP-43, with ALS-linked mutants producing loss and gain of splicing function of selected RNA targets at an early disease stage.

362 citations


Journal ArticleDOI
TL;DR: EGFR exon 20 insertion testing identifies a distinct subset of lung adenocarcinomas, accounting for at least 9% of all EGFR-mutated cases, representing the third most common type of EGFR mutation after exon 19 deletions and L858R.
Abstract: In contrast to other primary epidermal growth factor receptor (EGFR) mutations in lung adenocarcinomas, insertions in exon 20 of EGFR have been generally associated with resistance to EGFR-tyrosine kinase inhibitors. Their molecular spectrum, clinicopathologic characteristics, and prevalence are not well established. Tumors harboring EGFR exon 20 insertions were identified through an algorithmic screen of 1,500 lung adenocarcinomas. Cases were first tested for common mutations in EGFR (exons 19 and 21) and KRAS (exon 2) and, if negative, further analyzed for EGFR exon 20 insertions. All samples underwent extended genotyping for other driver mutations in EGFR, KRAS, BRAF, ERBB2/HER2, NRAS, PIK3CA, MEK1, and AKT by mass spectrometry; a subset was evaluated for ALK rearrangements. We identified 33 EGFR exon 20 insertion cases [2.2%, 95% confidence interval (CI), 1.6-3.1], all mutually exclusive with mutations in the other genes tested (except PIK3CA). They were more common among never-smokers (P < 0.0001). There was no association with age, sex, race, or stage. Morphologically, tumors were similar to those with common EGFR mutations but with frequent solid histology. Insertions were highly variable in position and size, ranging from 3 to 12 bp, resulting in 13 different insertions, which, by molecular modeling, are predicted to have potentially different effects on erlotinib binding. EGFR exon 20 insertion testing identifies a distinct subset of lung adenocarcinomas, accounting for at least 9% of all EGFR-mutated cases, representing the third most common type of EGFR mutation after exon 19 deletions and L858R. Insertions are structurally heterogeneous with potential implications for response to EGFR inhibitors.

344 citations
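
The prevalence figure in the abstract above (33 of 1,500 cases, 2.2%, 95% CI 1.6-3.1) can be checked with a short calculation. The sketch below is illustrative only: the abstract does not say which interval method the authors used, so the Wilson score interval is an assumption, and the function name `wilson_interval` is hypothetical.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion.

    Assumption: the source does not name its CI method; Wilson is used
    here only as a common choice for small proportions.
    """
    p = successes / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return center - half_width, center + half_width


# Counts taken from the abstract: 33 exon 20 insertion cases among 1,500 tumors.
low, high = wilson_interval(33, 1500)
print(f"prevalence = {33 / 1500:.1%}, 95% CI = {low:.1%} to {high:.1%}")
# prints "prevalence = 2.2%, 95% CI = 1.6% to 3.1%"
```

With these inputs the Wilson interval reproduces the reported bounds after rounding, although other interval methods could round to the same values.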


Journal ArticleDOI
TL;DR: It is shown here that binding in distal intronic regions by Rbfox splicing factors important in development is extensive and is an active mode of splicing regulation.
Abstract: Alternative splicing (AS) enables programmed diversity of gene expression across tissues and development. We show here that binding in distal intronic regions (>500 nucleotides (nt) from any exon) by Rbfox splicing factors important in development is extensive and is an active mode of splicing regulation. Similarly to exon-proximal sites, distal sites contain evolutionarily conserved GCATG sequences and are associated with AS activation and repression upon modulation of Rbfox abundance in human and mouse experimental systems. As a proof of principle, we validated the activity of two specific Rbfox enhancers in KIF21A and ENAH distal introns and showed that a conserved long-range RNA-RNA base-pairing interaction (an RNA bridge) is necessary for Rbfox-mediated exon inclusion in the ENAH gene. Thus we demonstrate a previously unknown RNA-mediated mechanism for AS control by distally bound RNA-binding proteins.

328 citations
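
The abstract above defines a distal site as one more than 500 nt from any exon and names GCATG as the conserved sequence. As a rough illustration of that definition only (not the authors' pipeline, which relied on binding data), the sketch below scans a DNA-alphabet sequence for the motif and keeps occurrences that are far from every exon. The function names, the half-open coordinate convention, and the toy sequence are all assumptions made for this example.

```python
def distance_to_exon(site_start: int, site_end: int, exon: tuple[int, int]) -> int:
    """Gap in nt between a site and an exon (0 if they overlap or touch).

    Coordinates are 0-based, half-open (start, end); this convention is
    an assumption of the sketch.
    """
    exon_start, exon_end = exon
    if site_end <= exon_start:
        return exon_start - site_end
    if site_start >= exon_end:
        return site_start - exon_end
    return 0


def distal_motif_sites(seq, exons, motif="GCATG", min_dist=500):
    """Return start positions of motif hits more than min_dist nt from every exon."""
    sites = []
    pos = seq.find(motif)
    while pos != -1:
        end = pos + len(motif)
        if all(distance_to_exon(pos, end, exon) > min_dist for exon in exons):
            sites.append(pos)
        pos = seq.find(motif, pos + 1)
    return sites


# Toy gene: exon (0-100), intron with two motif copies, exon (1360-1460).
toy = "A" * 100 + "T" * 600 + "GCATG" + "T" * 600 + "GCATG" + "T" * 50 + "A" * 100
print(distal_motif_sites(toy, exons=[(0, 100), (1360, 1460)]))
# prints "[700]"; the second copy lies only 50 nt from the downstream exon
```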


Journal ArticleDOI
TL;DR: ASprofile, a new computational system, identifies putative alternative splicing events in 16 different human tissues, providing a dynamic picture of splicing variation across the tissues, and detects 26,989 potential exon skipping events representing differences in splicing patterns among the tissues.
Abstract: Alternative splicing is widely recognized for its roles in regulating genes and creating gene diversity. However, despite many efforts, the repertoire of gene splicing variation is still incompletely characterized, even in humans. Here we describe a new computational system, ASprofile, and its application to RNA-seq data from Illumina’s Human Body Map project (>2.5 billion reads). Using the system, we identified putative alternative splicing events in 16 different human tissues, which provide a dynamic picture of splicing variation across the tissues. We detected 26,989 potential exon skipping events representing differences in splicing patterns among the tissues. A large proportion of the events (>60%) were novel, involving new exons (~3000), new introns (~16000), or both. When tracing these events across the sixteen tissues, only a small number (4-7%) appeared to be differentially expressed (‘switched’) between two tissues, while 30-45% showed little variation, and the remaining 50-65% were not present in one or both tissues compared. Novel exon skipping events appeared to be slightly less variable than known events, but were more tissue-specific. Our study represents the first effort to build a comprehensive catalog of alternative splicing in normal human tissues from RNA-seq data, while providing insights into the role of alternative splicing in shaping tissue transcriptome differences. The catalog of events and the ASprofile software are freely available from the Zenodo repository ( http://zenodo.org/record/7068 ; doi: 10.5281/zenodo.7068 ) and from our web site http://ccb.jhu.edu/software/ASprofile .

305 citations


Journal ArticleDOI
13 Jun 2013-Nature
TL;DR: MBNL proteins are identified as conserved and direct negative regulators of a large program of cassette exon alternative splicing events that are differentially regulated between ES cells and other cell types, consistent with a central and negative regulatory role for MBNL proteins in pluripotency.
Abstract: This study identifies MBNL proteins as negative regulators of alternative splicing events that are differentially regulated between ES cells and other cell types; several lines of evidence show that these proteins repress an ES cell alternative splicing program and the reprogramming of somatic cells to induced pluripotent stem cells. Ben Blencowe and colleagues identify the muscleblind-like RNA binding proteins MBNL1 and MBNL2 as negative regulators of alternative splicing events that are differentially regulated between embryonic stem cells and other cell types. Several lines of evidence show that they are involved in the regulation of embryonic-stem-cell-like alternative splicing patterns. The authors also identify a regulatory role during the reprogramming of fibroblasts to induced pluripotent stem (iPS) cells. Previous investigations of the core gene regulatory circuitry that controls the pluripotency of embryonic stem (ES) cells have largely focused on the roles of transcription, chromatin and non-coding RNA regulators. Alternative splicing represents a widely acting mode of gene regulation, yet its role in regulating ES-cell pluripotency and differentiation is poorly understood. Here we identify the muscleblind-like RNA binding proteins, MBNL1 and MBNL2, as conserved and direct negative regulators of a large program of cassette exon alternative splicing events that are differentially regulated between ES cells and other cell types. Knockdown of MBNL proteins in differentiated cells causes switching to an ES-cell-like alternative splicing pattern for approximately half of these events, whereas overexpression of MBNL proteins in ES cells promotes differentiated-cell-like alternative splicing patterns. Among the MBNL-regulated events is an ES-cell-specific alternative splicing switch in the forkhead family transcription factor FOXP1 that controls pluripotency. Consistent with a central and negative regulatory role for MBNL proteins in pluripotency, their knockdown significantly enhances the expression of key pluripotency genes and the formation of induced pluripotent stem cells during somatic cell reprogramming.

294 citations


Journal ArticleDOI
TL;DR: It is found that these SR proteins promote both inclusion and skipping of exons in vivo, but their binding patterns do not explain such opposite responses, and specific effects on regulated splicing by one SR protein actually depend on a complex set of relationships with multiple other SR proteins in mammalian genomes.

Journal ArticleDOI
TL;DR: This work solved the solution structure of the TDP-43 RRMs in complex with UG-rich RNA and revealed not only how TDP-43 recognizes UG repeats but also how RNA binding–dependent inter-RRM interactions are crucial for TDP-43 function.
Abstract: TDP-43 encodes an alternative-splicing regulator with tandem RNA-recognition motifs (RRMs). The protein regulates cystic fibrosis transmembrane regulator (CFTR) exon 9 splicing through binding to long UG-rich RNA sequences and is found in cytoplasmic inclusions of several neurodegenerative diseases. We solved the solution structure of the TDP-43 RRMs in complex with UG-rich RNA. Ten nucleotides are bound by both RRMs, and six are recognized sequence specifically. Among these, a central G interacts with both RRMs and stabilizes a new tandem RRM arrangement. Mutations that eliminate recognition of this key nucleotide or crucial inter-RRM interactions disrupt RNA binding and TDP-43-dependent splicing regulation. In contrast, point mutations that affect base-specific recognition in either RRM have weaker effects. Our findings reveal not only how TDP-43 recognizes UG repeats but also how RNA binding-dependent inter-RRM interactions are crucial for TDP-43 function.

Journal ArticleDOI
TL;DR: This review discusses how the spliceosome can successfully define exons and introns in a huge variety of pre‐mRNA molecules with nucleotide‐precision through a complex combinatorial control resulting from many different factors/influences.
Abstract: One of the fundamental issues in RNA splicing research is represented by understanding how the spliceosome can successfully define exons and introns in a huge variety of pre-mRNA molecules with nucleotide-precision. Since its first description, researchers in this field have identified and characterized many fundamental elements and players capable of affecting the splicing process, both in a negative and positive manner. Indeed, it can be argued that today we know a great deal about the forces that make an exon, an exon and an intron, an intron. As will be discussed in this review, these decisions are a result of a complex combinatorial control resulting from many different factors/influences. Most importantly, these influences act across several levels of complexity starting from the relatively simple interaction between two consensus 5' and 3' splice sites to much more complex factors: such as the interplay between silencer or enhancer sequences, transcriptional processivity, genomic milieu, nucleosome positioning, and histone modifications at the chromatin level. Depending on local contexts, all these factors will act either antagonistically or synergistically to decide the exon/intron fate of any given RNA sequence. At present, however, what we still lack is a precise understanding of how all these processes add up to help the spliceosome reach a decision. Therefore, it is expected that future challenges in splicing research will be the careful characterization of all these influences to improve our ability to predict splicing choices in different organisms or in specific contexts.

Journal ArticleDOI
28 Mar 2013-Nature
TL;DR: Genetic inactivation of p53 rescues Clp1(K/K) mice from motor neuron loss, muscle denervation and respiratory failure. The experiments uncover a mechanistic link between tRNA processing, formation of a new RNA species and progressive loss of lower motor neurons regulated by p53.
Abstract: CLP1 was the first mammalian RNA kinase to be identified. However, determining its in vivo function has been elusive. Here we generated kinase-dead Clp1 (Clp1(K/K)) mice that show a progressive loss of spinal motor neurons associated with axonal degeneration in the peripheral nerves and denervation of neuromuscular junctions, resulting in impaired motor function, muscle weakness, paralysis and fatal respiratory failure. Transgenic rescue experiments show that CLP1 functions in motor neurons. Mechanistically, loss of CLP1 activity results in accumulation of a novel set of small RNA fragments, derived from aberrant processing of tyrosine pre-transfer RNA. These tRNA fragments sensitize cells to oxidative-stress-induced p53 (also known as TRP53) activation and p53-dependent cell death. Genetic inactivation of p53 rescues Clp1(K/K) mice from the motor neuron loss, muscle denervation and respiratory failure. Our experiments uncover a mechanistic link between tRNA processing, formation of a new RNA species and progressive loss of lower motor neurons regulated by p53.

Journal ArticleDOI
TL;DR: This work elucidates the mechanism of action of the AD-associated polymorphism rs3865444 in the promoter of CD33, a member of the sialic acid-binding Ig-superfamily of lectins (SIGLECs) and suggests a novel model wherein SNP-modulated RNA splicing modulates CD33 function and, thereby, AD risk.
Abstract: Genome-wide association studies are identifying novel Alzheimer's disease (AD) risk factors. Elucidating the mechanism underlying these polymorphisms is critical to the validation process and, by identifying rate-limiting steps in AD risk, may yield novel therapeutic targets. Here, we elucidate the mechanism of action of the AD-associated polymorphism rs3865444 in the promoter of CD33, a member of the sialic acid-binding Ig-superfamily of lectins (SIGLECs). Immunostaining established that CD33 is expressed in microglia in human brain. Consistent with this finding, CD33 mRNA expression correlated well with expression of the microglial genes CD11b and AIF-1 and was modestly increased with AD status and the rs3865444C AD-risk allele. Analysis of CD33 isoforms identified a common isoform lacking exon 2 (D2-CD33). The proportion of CD33 expressed as D2-CD33 correlated robustly with rs3865444 genotype. Because rs3865444 is in the CD33 promoter region, we sought the functional polymorphism by sequencing CD33 from the promoter through exon 4. We identified a single polymorphism that is coinherited with rs3865444, i.e., rs12459419 in exon 2. Minigene RNA splicing studies in BV2 microglial cells established that rs12459419 is a functional single nucleotide polymorphism (SNP) that modulates exon 2 splicing efficiency. Thus, our primary findings are that CD33 is a microglial mRNA and that rs3865444 is a proxy SNP for rs12459419 that modulates CD33 exon 2 splicing. Exon 2 encodes the CD33 IgV domain that typically mediates sialic acid binding in SIGLEC family members. In summary, these results suggest a novel model wherein SNP-modulated RNA splicing modulates CD33 function and, thereby, AD risk.

Journal ArticleDOI
TL;DR: The atomic structure of a closely related bacterial homolog has been solved, and the structural core is common to six unrelated transporters, e.g. members of the SLC6 family of neurotransmitter transporters, leading to the conclusion that these work by a similar mechanism.

Journal ArticleDOI
TL;DR: The results confirm that SQSTM1 gene mutations could be the cause or genetic susceptibility factor of ALS in some patients and identify two novel missense mutations in two SALS that were not detected in 360 control subjects.
Abstract: Mutations in SQSTM1 encoding the sequestosome 1/p62 protein have recently been identified in familial and sporadic cases of amyotrophic lateral sclerosis (ALS). p62 is a component of the ubiquitin inclusions detected in degenerating neurons in ALS patients. We sequenced SQSTM1 in 90 French patients with familial ALS (FALS) and 74 autopsied cases of sporadic ALS (SALS). We identified, in the heterozygous state, one missense mutation, c.1175C>T, p.Pro392Leu (exon 8), in one of our FALS patients and one substitution in intron 7 (c.1165+1G>A, previously called IVS7+1 G-A, A390X) affecting the exon 7 splice site in one SALS patient. These mutations, which are located in the ubiquitin-associated (UBA) domain of the p62 protein, have already been described in Paget’s disease, and both ALS patients carrying them had concomitant Paget’s disease. However, we also identified two novel missense mutations in two SALS patients: c.259A>G, p.Met87Val in exon 2 and c.304A>G, p.Lys102Glu in exon 3. These mutations, which were not detected in 360 control subjects, are possibly pathogenic. Neuropathology analysis of three patients carrying SQSTM1 variants revealed the presence of large round p62 inclusions in motor neurons, and immunoblot analysis showed increased p62 and TDP-43 protein levels in the spinal cord. Our results confirm that SQSTM1 gene mutations could be the cause or genetic susceptibility factor of ALS in some patients.

Journal ArticleDOI
TL;DR: This study highlights the robustness of splicing patterns in plants and the importance of ongoing annotation and visualization of RNA-Seq data using interactive tools such as Integrated Genome Browser.
Abstract: Pollen grains of Arabidopsis (Arabidopsis thaliana) contain two haploid sperm cells enclosed in a haploid vegetative cell. Upon germination, the vegetative cell extrudes a pollen tube that carries the sperm to an ovule for fertilization. Knowing the identity, relative abundance, and splicing patterns of pollen transcripts will improve our understanding of pollen and allow investigation of tissue-specific splicing in plants. Most Arabidopsis pollen transcriptome studies have used the ATH1 microarray, which does not assay splice variants and lacks specific probe sets for many genes. To investigate the pollen transcriptome, we performed high-throughput sequencing (RNA-Seq) of Arabidopsis pollen and seedlings for comparison. Gene expression was more diverse in seedling, and genes involved in cell wall biogenesis were highly expressed in pollen. RNA-Seq detected at least 4,172 protein-coding genes expressed in pollen, including 289 assayed only by nonspecific probe sets. Additional exons and previously unannotated 5′ and 3′ untranslated regions for pollen-expressed genes were revealed. We detected regions in the genome not previously annotated as expressed; 14 were tested and 12 were confirmed by polymerase chain reaction. Gapped read alignments revealed 1,908 high-confidence new splicing events supported by 10 or more spliced read alignments. Alternative splicing patterns in pollen and seedling were highly correlated. For most alternatively spliced genes, the ratio of variants in pollen and seedling was similar, except for some encoding proteins involved in RNA splicing. This study highlights the robustness of splicing patterns in plants and the importance of ongoing annotation and visualization of RNA-Seq data using interactive tools such as Integrated Genome Browser.

Journal ArticleDOI
TL;DR: The results suggest that DNA methylation affects exon recognition and is influenced by the GC architecture of the exon and flanking introns.
Abstract: DNA methylation is known to regulate transcription and was recently found to be involved in exon recognition via cotranscriptional splicing. We recently observed that exon-intron architectures can be grouped into two classes: one with higher GC content in exons compared to the flanking introns, and the other with similar GC content in exons and introns. The first group has higher nucleosome occupancy on exons than introns, whereas the second group exhibits weak nucleosome marking of exons, suggesting another type of epigenetic marker distinguishes exons from introns when GC content is similar. We find different and specific patterns of DNA methylation in each of the GC architectures; yet in both groups, DNA methylation clearly marks the exons. Exons of the leveled GC architecture exhibit a significantly stronger DNA methylation signal in relation to their flanking introns compared to exons of the differential GC architecture. This is accentuated by a reduction of the DNA methylation level in the intronic sequences in proximity to the splice sites and shows that different epigenetic modifications mark the location of exons already at the DNA level. Also, lower levels of methylated CpGs on alternative exons can successfully distinguish alternative exons from constitutive ones. Three positions at the splice sites show high CpG abundance and accompany elevated nucleosome occupancy in a leveled GC architecture. Overall, these results suggest that DNA methylation affects exon recognition and is influenced by the GC architecture of the exon and flanking introns.
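
The two exon-intron GC architectures described above can be illustrated with a toy classifier. This is a minimal sketch under stated assumptions, not the authors' procedure: the abstract gives no numeric cutoff, so the `threshold` value of 0.05 is hypothetical, as are the function names and the example sequences.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc_architecture(upstream_intron: str, exon: str, downstream_intron: str,
                    threshold: float = 0.05) -> str:
    """Label an exon by how its GC content compares with its flanking introns.

    "differential": exon GC exceeds flanking-intron GC by more than threshold.
    "leveled": exon and intron GC are similar.
    The threshold is a hypothetical cutoff chosen for illustration.
    """
    flank_gc = gc_content(upstream_intron + downstream_intron)
    if gc_content(exon) - flank_gc > threshold:
        return "differential"
    return "leveled"


print(gc_architecture("ATATATAT", "GCGCGCAT", "TTAATTAA"))  # prints "differential"
print(gc_architecture("GCAT", "GCAT", "GCAT"))              # prints "leveled"
```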

Journal ArticleDOI
TL;DR: This report describes a novel long noncoding RNA that is induced by cigarette smoke extract both in vitro and in vivo and is elevated in numerous lung cancer cell lines, and identifies a novel noncoding RNA that may act downstream of NRF2 to regulate gene expression and mediate oxidative stress protection in airway epithelial cells.
Abstract: The incidence of lung diseases and cancer caused by cigarette smoke is increasing. The molecular mechanisms of gene regulation induced by cigarette smoke that ultimately lead to cancer remain unclear. This report describes a novel long noncoding RNA (lncRNA) that is induced by cigarette smoke extract (CSE) both in vitro and in vivo and is elevated in numerous lung cancer cell lines. We have termed this lncRNA the smoke and cancer–associated lncRNA–1 (SCAL1). This lncRNA is located in chromosome 5, and initial sequencing analysis reveals a transcript with four exons and three introns. The expression of SCAL1 is regulated transcriptionally by nuclear factor erythroid 2–related factor (NRF2), as determined by the small, interfering RNA (siRNA) knockdown of NRF2 and kelch-like ECH-associated protein 1 (KEAP1). A nuclear factor erythroid-derived 2 (NF-E2) motif was identified in the promoter region that shows binding to NRF2 after its activation. Functionally, the siRNA knockdown of SCAL1 in human bronchial epithelial cells shows a significant potentiation of cytotoxicity induced by CSE in vitro. Altogether, these results identify a novel and intriguing new noncoding RNA that may act downstream of NRF2 to regulate gene expression and mediate oxidative stress protection in airway epithelial cells.

Journal ArticleDOI
TL;DR: Across lifespan, splicing changes followed discrete patterns that could be linked to neural functions, and associated with the expression profiles of the corresponding splicing factors, and were abundant in both of the brain regions studied.
Abstract: While splicing differences between tissues, sexes and species are well documented, little is known about the extent and the nature of splicing changes that take place during human or mammalian development and aging. Here, using high-throughput transcriptome sequencing, we have characterized splicing changes that take place during whole human lifespan in two brain regions: prefrontal cortex and cerebellum. Identified changes were confirmed using independent human and rhesus macaque RNA-seq data sets, exon arrays and PCR, and were detected at the protein level using mass spectrometry. Splicing changes across lifespan were abundant in both of the brain regions studied, affecting more than a third of the genes expressed in the human brain. Approximately 15% of these changes differed between the two brain regions. Across lifespan, splicing changes followed discrete patterns that could be linked to neural functions, and associated with the expression profiles of the corresponding splicing factors. More than 60% of all splicing changes represented a single splicing pattern reflecting preferential inclusion of gene segments potentially targeting transcripts for nonsense-mediated decay in infants and elderly.

Journal ArticleDOI
01 Jan 2013-RNA
TL;DR: It is shown that SR and hnRNP splicing factors exploit similar mechanisms to positively or negatively influence splice site selection, based on their binding location relative to regulated 5' splice sites.
Abstract: Alternative splicing is regulated by splicing factors that modulate splice site selection. In some cases, however, splicing factors show antagonistic activities by either activating or repressing splicing. Here, we show that these opposing outcomes are based on their binding location relative to regulated 5' splice sites. SR proteins enhance splicing only when they are recruited to the exon. However, they interfere with splicing by simply relocating them to the opposite intronic side of the splice site. hnRNP splicing factors display analogous opposing activities, but in a reversed position dependence. Activation by SR or hnRNP proteins increases splice site recognition at the earliest steps of exon definition, whereas splicing repression promotes the assembly of nonproductive complexes that arrest spliceosome assembly prior to splice site pairing. Thus, SR and hnRNP splicing factors exploit similar mechanisms to positively or negatively influence splice site selection.

Journal ArticleDOI
TL;DR: The history of research on 5' splice site selection is reviewed, highlighting the difficulties of establishing how base-pairing strength determines splicing outcomes, and it is proposed that protein complexes propagate along the exon, thereby changing its physical behavior so as to affect 5'ss selection.
Abstract: Splice site selection is fundamental to pre-mRNA splicing and the expansion of genomic coding potential. 5' Splice sites (5'ss) are the critical elements at the 5' end of introns and are extremely diverse, as thousands of different sequences act as bona fide 5'ss in the human transcriptome. Most 5'ss are recognized by base-pairing with the 5' end of the U1 small nuclear RNA (snRNA). Here we review the history of research on 5'ss selection, highlighting the difficulties of establishing how base-pairing strength determines splicing outcomes. We also discuss recent work demonstrating that U1 snRNA:5'ss helices can accommodate noncanonical registers such as bulged duplexes. In addition, we describe the mechanisms by which other snRNAs, regulatory proteins, splicing enhancers, and the relative positions of alternative 5'ss contribute to selection. Moreover, we discuss mechanisms by which the recognition of numerous candidate 5'ss might lead to selection of a single 5'ss and propose that protein complexes propagate along the exon, thereby changing its physical behavior so as to affect 5'ss selection.

Journal ArticleDOI
TL;DR: This study systematically revealed splicing signatures of the three most common types of breast tumors using RNA sequencing and validated the presence of novel hybrid isoforms of critical molecules like CDK4, LARP1, ADD3, and PHLPP2.
Abstract: Breast cancer transcriptome acquires a myriad of regulation changes, and splicing is critical for the cell to "tailor-make" specific functional transcripts. We systematically revealed splicing signatures of the three most common types of breast tumors using RNA sequencing: TNBC, non-TNBC and HER2-positive breast cancer. We discovered subtype specific differentially spliced genes and splice isoforms not previously recognized in human transcriptome. Further, we showed that exon skip and intron retention are predominant splice events in breast cancer. In addition, we found that differential expression of primary transcripts and promoter switching are significantly deregulated in breast cancer compared to normal breast. We validated the presence of novel hybrid isoforms of critical molecules like CDK4, LARP1, ADD3, and PHLPP2. Our study provides the first comprehensive portrait of transcriptional and splicing signatures specific to breast cancer sub-types, as well as previously unknown transcripts that prompt the need for complete annotation of tissue and disease specific transcriptome.

Journal ArticleDOI
08 Aug 2013-Blood
TL;DR: Novel observations support the hypothesis that U2AF1 mutations play a significant role in myeloid leukemogenesis due to selective missplicing of tumor-associated genes.

Journal ArticleDOI
TL;DR: FUS autoregulation by alternative splicing provides insight into a molecular mechanism by which FUS-regulated pre-mRNA processing can impact a significant number of targets important to neurodegeneration.
Abstract: The gene encoding a DNA/RNA binding protein FUS/TLS is frequently mutated in amyotrophic lateral sclerosis (ALS). Mutations commonly affect its carboxy-terminal nuclear localization signal, resulting in varying deficiencies of FUS nuclear localization and abnormal cytoplasmic accumulation. Increasing evidence suggests deficiencies in FUS nuclear function may contribute to neuron degeneration. Here we report a novel FUS autoregulatory mechanism and its deficiency in ALS-associated mutants. Using FUS CLIP-seq, we identified significant FUS binding to a highly conserved region of exon 7 and the flanking introns of its own pre-mRNAs. We demonstrated that FUS is a repressor of exon 7 splicing and that the exon 7-skipped splice variant is subject to nonsense-mediated decay (NMD). Overexpression of FUS led to the repression of exon 7 splicing and a reduction of endogenous FUS protein. Conversely, the repression of exon 7 was reduced by knockdown of FUS protein, and moreover, it was rescued by expression of EGFP-FUS. This dynamic regulation of alternative splicing describes a novel mechanism of FUS autoregulation. Given that ALS-associated FUS mutants are deficient in nuclear localization, we examined whether cells expressing these mutants would be deficient in repressing exon 7 splicing. We showed that FUS harbouring R521G, R522G or ΔExon15 mutation (minor, moderate or severe cytoplasmic localization, respectively) directly correlated with respectively increasing deficiencies in both exon 7 repression and autoregulation of its own protein levels. These data suggest that compromised FUS autoregulation can directly exacerbate the pathogenic accumulation of cytoplasmic FUS protein in ALS. We showed that exon 7 skipping can be induced by antisense oligonucleotides targeting its flanking splice sites, indicating the potential to alleviate abnormal cytoplasmic FUS accumulation in ALS. 
Taken together, FUS autoregulation by alternative splicing provides insight into a molecular mechanism by which FUS-regulated pre-mRNA processing can impact a significant number of targets important to neurodegeneration.

Journal ArticleDOI
TL;DR: This is the first application of RNA capture to perform large-scale validation of novel transcriptome features and provides extensive detail about a previously uncharacterized level of transcript diversity in the human retina.
Abstract: The retina is a complex tissue comprised of multiple cell types that is affected by a diverse set of diseases that are important causes of vision loss. Characterizing the transcripts, both annotated and novel, that are expressed in a given tissue has become vital for understanding the mechanisms underlying the pathology of disease. We sequenced RNA prepared from three normal human retinas and characterized the retinal transcriptome at an unprecedented level due to the increased depth of sampling provided by the RNA-seq approach. We used a non-redundant reference transcriptome from all of the empirically-determined human reference tracks to identify annotated and novel sequences expressed in the retina. We detected 79,915 novel alternative splicing events, including 29,887 novel exons, 21,757 3′ and 5′ alternate splice sites, and 28,271 exon skipping events. We also identified 116 potential novel genes. These data represent a significant addition to the annotated human transcriptome. For example, the novel exons detected increase the number of identified exons by 3%. Using a high-throughput RNA capture approach to validate 14,696 of these novel transcriptome features we found that 99% of the putative novel events can be reproducibly detected. Further, 15-36% of the novel splicing events maintain an open reading frame, suggesting they produce novel protein products. To our knowledge, this is the first application of RNA capture to perform large-scale validation of novel transcriptome features. In total, these analyses provide extensive detail about a previously uncharacterized level of transcript diversity in the human retina.

Journal ArticleDOI
TL;DR: In this paper, the authors demonstrate the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing.
Abstract: Mutations in ABCA4 cause Stargardt disease and other blinding autosomal recessive retinal disorders. However, sequencing of the complete coding sequence in patients with clinical features of Stargardt disease sometimes fails to detect one or both mutations. For example, among 208 individuals with clear clinical evidence of ABCA4 disease ascertained at a single institution, 28 had only one disease-causing allele identified in the exons and splice junctions of the primary retinal transcript of the gene. Haplotype analysis of these 28 probands revealed 3 haplotypes shared among ten families, suggesting that 18 of the 28 missing alleles were rare enough to be present only once in the cohort. We hypothesized that mutations near rare alternate splice junctions in ABCA4 might cause disease by increasing the probability of mis-splicing at these sites. Next-generation sequencing of RNA extracted from human donor eyes revealed more than a dozen alternate exons that are occasionally incorporated into the ABCA4 transcript in normal human retina. We sequenced the genomic DNA containing 15 of these minor exons in the 28 one-allele subjects and observed five instances of two different variations in the splice signals of exon 36.1 that were not present in normal individuals (P < 10−6). Analysis of RNA obtained from the keratinocytes of patients with these mutations revealed the predicted alternate transcript. This study illustrates the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing.

Journal ArticleDOI
TL;DR: The findings show that PIK3R1 mutations are the major cause of SHORT syndrome and suggest that the molecular mechanism of disease might involve downregulation of the PI3K-AKT-mTOR pathway.
Abstract: SHORT syndrome is a rare, multisystem disease characterized by short stature, anterior-chamber eye anomalies, characteristic facial features, lipodystrophy, hernias, hyperextensibility, and delayed dentition. As part of the FORGE (Finding of Rare Disease Genes) Canada Consortium, we studied individuals with clinical features of SHORT syndrome to identify the genetic etiology of this rare disease. Whole-exome sequencing in a family trio of an affected child and unaffected parents identified a de novo frameshift insertion, c.1906_1907insC (p.Asn636Thrfs∗18), in exon 14 of PIK3R1. Heterozygous mutations in exon 14 of PIK3R1 were subsequently identified by Sanger sequencing in three additional affected individuals and two affected family members. One of these mutations, c.1945C>T (p.Arg649Trp), was confirmed to be a de novo mutation in one affected individual and was also identified and shown to segregate with the phenotype in an unrelated family. The other mutation, a de novo truncating mutation (c.1971T>G [p.Tyr657∗]), was identified in another affected individual. PIK3R1 is involved in the phosphatidylinositol 3 kinase (PI3K) signaling cascade and, as such, plays an important role in cell growth, proliferation, and survival. Functional studies on lymphoblastoid cells with the PIK3R1 c.1906_1907insC mutation showed decreased phosphorylation of the downstream S6 target of the PI3K-AKT-mTOR pathway. Our findings show that PIK3R1 mutations are the major cause of SHORT syndrome and suggest that the molecular mechanism of disease might involve downregulation of the PI3K-AKT-mTOR pathway.

Journal ArticleDOI
TL;DR: GGO volume percentage in tumors with exon 21 missense mutation was significantly higher than that in tumors with other EGFR mutation status, which can be related to the fact that exon 21 missense mutations were significantly more frequent in lepidic predominant adenocarcinomas, according to the IASLC/ATS/ERS classification.
Abstract: Ground-glass opacity volume percentage in tumors with exon 21 missense mutation was significantly higher than that in epidermal growth factor receptor wild-type tumors and exon 19–mutated tumors.