scispace - formally typeset
Search or ask a question

Showing papers on "Exon published in 2004"


Journal ArticleDOI
TL;DR: Exons 18–21 of the tyrosine kinase domain of the epidermal growth factor receptor(EGFR) gene define a distinct subset of pulmonary adenocarcinoma without KRAS mutations, which is not caused by tobacco carcinogens.
Abstract: Recently it has been reported that mutations in the tyrosine kinase domain of the epidermal growth factor receptor(EGFR) gene occur in a subset of patients with lung cancer showing a dramatic response to EGFR tyrosine kinase inhibitors. To gain further insights in the role of EGFR in lung carcinogenesis, we sequenced exons 18-21 of the tyrosine kinase domain using total RNA extracted from unselected 277 patients with lung cancer who underwent surgical resection and correlated the results with clinical and pathologic features. EGFR mutations were present in 111 patients (40%). Fifty-two were in-frame deletions around codons 746-750 in exon 19, 54 were point mutations including 49 at codon 858 in exon 21 and 4 at codon 719 in exon 18, and 5 were duplications/insertions mainly in exon 20. They were significantly more frequent in female (P < 0.001), adenocarcinomas (P = 0.0013), and in never-smokers (P < 0.001). Multivariate analysis suggested EGFR mutations were independently associated with adenocarcinoma histology (P = 0.0012) and smoking status (P < 0.001), but not with female gender (P = 0.9917). In adenocarcinomas, EGFR mutations were more frequent in well to moderately differentiated tumors (P < 0.001) but were independent of patient age, disease stages, or patient survival. KRAS and TP53 mutations were present in 13 and 41%, respectively. EGFR mutations never occurred in tumors with KRAS mutations, whereas EGFR mutations were independent of TP53 mutations. EGFR mutations define a distinct subset of pulmonary adenocarcinoma without KRAS mutations, which is not caused by tobacco carcinogens.

1,228 citations


Journal ArticleDOI
TL;DR: Functional properties, like permeabilities, charge selectivity and unitary conductivity were investigated after directed expression of these connexins in cultured cell lines or paired Xenopus oocytes and targeted deletion of their coding sequence in the mouse genome allowed study of the biological relevance of Cx37, CX40, Cx43 and Cx45 with regard to cardiovascular morphology and function.
Abstract: Gap junctions (Gj) form conduits between adjacent cells that are composed of connexin (Cx) protein subunits and allow direct intercellular communication. To date, the connexin gene family comprises 20 members in the mouse and 21 members in the human genome, 19 of which can be grouped as sequence-orthologous pairs. The structure of connexin genes is relatively simple. An untranslated exon 1 is separated by an intron of different length from exon 2, containing the uninterrupted coding region and the 3'-untranslated region (3'-UTR). However, in some connexin genes, the untranslated regions and the reading frame are spliced. Among the known "cardiovascular" connexins, Cx37 and Cx40 were demonstrated to be functionally expressed in mouse and human endothelial cells and Cx40, Cx43 as well as Cx45 in cardiomyocytes of both species. Functional properties, like permeabilities, charge selectivity and unitary conductivity were investigated after directed expression of these connexins in cultured cell lines or paired Xenopus oocytes. Targeted deletion of their coding sequence in the mouse genome allowed study of the biological relevance of Cx37, Cx40, Cx43 and Cx45 with regard to cardiovascular morphology and function. After ablation of Cx37 or Cx40, mice were viable and could be used to study defects in the adult cardiovascular system but loss of Cx43 or Cx45 led to neonatal or embryonic lethality, respectively. Conditional and cell-type specific deletion of both connexins in the heart or blood vessels can help to overcome this obstacle. As yet only little is known about mutations in the human genes for Cx37, Cx40, Cx43 and Cx45. Thus, a profound comparison between human and mouse phenotypes is not yet possible.

945 citations


Journal ArticleDOI
TL;DR: Intramolecular pairs of Alu elements are identified as a major target for editing in the human transcriptome and it is suggested that modification of repetitive elements is a predominant activity for RNA editing with significant implications for cellular gene expression.
Abstract: RNA editing by adenosine deamination generates RNA and protein diversity through the posttranscriptional modification of single nucleotides in RNA sequences. Few mammalian A-to-I edited genes have been identified despite evidence that many more should exist. Here we identify intramolecular pairs of Alu elements as a major target for editing in the human transcriptome. An experimental demonstration in 43 genes was extended by a broader computational analysis of more than 100,000 human mRNAs. We find that 1,445 human mRNAs (1.4%) are subject to RNA editing at more than 14,500 sites, and our data further suggest that the vast majority of pre-mRNAs (greater than 85%) are targeted in introns by the editing machinery. The editing levels of Alu-containing mRNAs correlate with distance and homology between inverted repeats and vary in different tissues. Alu-mediated RNA duplexes targeted by RNA editing are formed intramolecularly, whereas editing due to intermolecular base-pairing appears to be negligible. We present evidence that these editing events can lead to the posttranscriptional creation or elimination of splice signals affecting alternatively spliced Alu-derived exons. The analysis suggests that modification of repetitive elements is a predominant activity for RNA editing with significant implications for cellular gene expression.

697 citations


Journal ArticleDOI
17 Dec 2004-Cell
TL;DR: ExonScan and related bioinformatic analyses suggest that these ESS motifs play important roles in suppression of pseudoexons, in splice site definition, and in AS.

683 citations


Journal ArticleDOI
TL;DR: An increasing amount of evidence indicates that genomic variants in both coding and non-coding sequences can have unexpected deleterious effects on the splicing of the gene transcript.
Abstract: When genome variants are identified in genomic DNA, especially during routine analysis of disease-associated genes, their functional implications might not be immediately evident. Distinguishing between a genomic variant that changes the phenotype and one that does not is a difficult task. An increasing amount of evidence indicates that genomic variants in both coding and non-coding sequences can have unexpected deleterious effects on the splicing of the gene transcript. So how can benign polymorphisms be distinguished from disease-associated splicing mutations?

573 citations


Journal ArticleDOI
TL;DR: This study distinguishes the human brain, testis and liver as having unusually high levels of AS, highlights differences in the types of AS occurring commonly in different tissues, and identifies candidate cis-regulatory elements and trans-acting factors likely to have important roles in tissue-specific AS in human cells.
Abstract: Alternative pre-mRNA splicing (AS) is widely used by higher eukaryotes to generate different protein isoforms in specific cell or tissue types. To compare AS events across human tissues, we analyzed the splicing patterns of genomically aligned expressed sequence tags (ESTs) derived from libraries of cDNAs from different tissues. Controlling for differences in EST coverage among tissues, we found that the brain and testis had the highest levels of exon skipping. The most pronounced differences between tissues were seen for the frequencies of alternative 3' splice site and alternative 5' splice site usage, which were about 50 to 100% higher in the liver than in any other human tissue studied. Quantifying differences in splice junction usage, the brain, pancreas, liver and the peripheral nervous system had the most distinctive patterns of AS. Analysis of available microarray expression data showed that the liver had the most divergent pattern of expression of serine-arginine protein and heterogeneous ribonucleoprotein genes compared to the other human tissues studied, possibly contributing to the unusually high frequency of alternative splice site usage seen in liver. Sequence motifs enriched in alternative exons in genes expressed in the brain, testis and liver suggest specific splicing factors that may be important in AS regulation in these tissues. This study distinguishes the human brain, testis and liver as having unusually high levels of AS, highlights differences in the types of AS occurring commonly in different tissues, and identifies candidate cis-regulatory elements and trans-acting factors likely to have important roles in tissue-specific AS in human cells.

563 citations


Journal ArticleDOI
TL;DR: Alternative splicing is the major source of proteome diversity in humans and thus is highly relevant to disease and therapy, and modified oligonucleotides to inhibit cryptic exons or to activate exons weakened by mutations could eventually lead to effective therapies.
Abstract: Alternative splicing is the major source of proteome diversity in humans and thus is highly relevant to disease and therapy. For example, recent work suggests that the long-sought-after target of the analgesic acetaminophen is a neural-specific, alternatively spliced isoform of cyclooxygenase 1 (COX-1). Several important diseases, such as cystic fibrosis, have been linked with mutations or variations in either cis-acting elements or trans-acting factors that lead to aberrant splicing and abnormal protein production. Correction of erroneous splicing is thus an important goal of molecular therapies. Recent experiments have used modified oligonucleotides to inhibit cryptic exons or to activate exons weakened by mutations, suggesting that these reagents could eventually lead to effective therapies.

558 citations


Journal ArticleDOI
TL;DR: The argument for a re-evaluation of the total number of human genes and an alternative term for "gene" to encompass these growing, novel classes of RNA transcripts in the human genome is strongly supported.
Abstract: In this report, we have achieved a richer view of the transcriptome for Chromosomes 21 and 22 by using high-density oligonucleotide arrays on cytosolic poly(A)(+) RNA. Conservatively, only 31.4% of the observed transcribed nucleotides correspond to well-annotated genes, whereas an additional 4.8% and 14.7% correspond to mRNAs and ESTs, respectively. Approximately 85% of the known exons were detected, and up to 21% of known genes have only a single isoform based on exon-skipping alternative expression. Overall, the expression of the well-characterized exons falls predominately into two categories, uniquely or ubiquitously expressed with an identifiable proportion of antisense transcripts. The remaining observed transcription (49.0%) was outside of any known annotation. These novel transcripts appear to be more cell-line-specific and have lower and less variation in expression than the well-characterized genes. Novel transcripts were further characterized based on their distance to annotations, transcript size, coding capacity, and identification as antisense to intronic sequences. By RT-PCR, 126 novel transcripts were independently verified, resulting in a 65% verification rate. These observations strongly support the argument for a re-evaluation of the total number of human genes and an alternative term for "gene" to encompass these growing, novel classes of RNA transcripts in the human genome.

541 citations


Journal ArticleDOI
03 Dec 2004-Science
TL;DR: In this paper, persistent exon skipping was used to remove the mutated exon on the dystrophin messenger mRNA of the mdx mouse, by a single administration of an AAV vector expressing antisense sequences linked to a modified U7 small nuclear RNA.
Abstract: Most mutations in the dystrophin gene create a frameshift or a stop in the mRNA and are associated with severe Duchenne muscular dystrophy. Exon skipping that naturally occurs at low frequency sometimes eliminates the mutation and leads to the production of a rescued protein. We have achieved persistent exon skipping that removes the mutated exon on the dystrophin messenger mRNA of the mdx mouse, by a single administration of an AAV vector expressing antisense sequences linked to a modified U7 small nuclear RNA. We report the sustained production of functional dystrophin at physiological levels in entire groups of muscles and the correction of the muscular dystrophy.

524 citations


Journal ArticleDOI
TL;DR: Alu-associated RNA editing may be a mechanism for marking nonstandard transcripts, not destined for translation, that are primarily associated with retained introns, extended UTRs, or with transcripts that have no corresponding known gene.
Abstract: More than one million copies of the approximately 300-bp Alu element are interspersed throughout the human genome, with up to 75% of all known genes having Alu insertions within their introns and/or UTRs. Transcribed Alu sequences can alter splicing patterns by generating new exons, but other impacts of intragenic Alu elements on their host RNA are largely unexplored. Recently, repeat elements present in the introns or 3'-UTRs of 15 human brain RNAs have been shown to be targets for multiple adenosine to inosine (A-to-I) editing. Using a statistical approach, we find that editing of transcripts with embedded Alu sequences is a global phenomenon in the human transcriptome, observed in 2674 ( approximately 2%) of all publicly available full-length human cDNAs (n = 128,406), from >250 libraries and >30 tissue sources. In the vast majority of edited RNAs, A-to-I substitutions are clustered within transcribed sense or antisense Alu sequences. Edited bases are primarily associated with retained introns, extended UTRs, or with transcripts that have no corresponding known gene. Therefore, Alu-associated RNA editing may be a mechanism for marking nonstandard transcripts, not destined for translation.

503 citations


Journal ArticleDOI
TL;DR: It is demonstrated that MBNL proteins regulate alternative splicing of two pre‐mRNAs that are misregulated in DM, cardiac troponin T (cTNT) and insulin receptor (IR).
Abstract: Although the muscleblind (MBNL) protein family has been implicated in myotonic dystrophy (DM), a specific function for these proteins has not been reported. A key feature of the RNA-mediated pathogenesis model for DM is the disrupted splicing of specific pre-mRNA targets. Here we demonstrate that MBNL proteins regulate alternative splicing of two pre-mRNAs that are misregulated in DM, cardiac troponin T (cTNT) and insulin receptor (IR). Alternative cTNT and IR exons are also regulated by CELF proteins, which were previously implicated in DM pathogenesis. MBNL proteins promote opposite splicing patterns for cTNT and IR alternative exons, both of which are antagonized by CELF proteins. CELF- and MBNL-binding sites are distinct and regulation by MBNL does not require the CELF-binding site. The results are consistent with a mechanism for DM pathogenesis in which expanded repeats cause a loss of MBNL and/or gain of CELF activities, leading to misregulation of alternative splicing of specific pre-mRNA targets.

Journal ArticleDOI
01 Jul 2004-Nature
TL;DR: The X-ray crystal structure of a complete group I bacterial intron in complex with both the 5′- and the 3′-exons is reported, representing the first splicing complex to include a complete intron, both exons and an organized active site occupied with metal ions.
Abstract: The discovery of the RNA self-splicing group I intron provided the first demonstration that not all enzymes are proteins. Here we report the X-ray crystal structure (3.1-A resolution) of a complete group I bacterial intron in complex with both the 5'- and the 3'-exons. This complex corresponds to the splicing intermediate before the exon ligation step. It reveals how the intron uses structurally unprecedented RNA motifs to select the 5'- and 3'-splice sites. The 5'-exon's 3'-OH is positioned for inline nucleophilic attack on the conformationally constrained scissile phosphate at the intron-3'-exon junction. Six phosphates from three disparate RNA strands converge to coordinate two metal ions that are asymmetrically positioned on opposing sides of the reactive phosphate. This structure represents the first splicing complex to include a complete intron, both exons and an organized active site occupied with metal ions.

Journal ArticleDOI
TL;DR: The large number of effective sequences implied by these results suggests that sequences that influence splicing may be very abundant in pre-mRNA.
Abstract: We have searched for sequence motifs that contribute to the recognition of human pre-mRNA splice sites by comparing the frequency of 8-mers in internal noncoding exons versus unspliced pseudo exons and 5' untranslated regions (5' untranslated regions [UTRs]) of transcripts of intronless genes. This type of comparison avoids the isolation of sequences that are distinguished by their protein-coding information. We classified sequence families comprising 2069 putative exonic enhancers and 974 putative exonic silencers. Representatives of each class functioned as enhancers or silencers when inserted into a test exon and assayed in transfected mammalian cells. As a class, the enhancer sequencers were more prevalent and the silencer elements less prevalent in all exons compared with introns. A survey of 58 reported exonic splicing mutations showed good agreement between the splicing phenotype and the effect of the mutation on the motifs defined here. The large number of effective sequences implied by these results suggests that sequences that influence splicing may be very abundant in pre-mRNA.

Journal ArticleDOI
TL;DR: The experimental basis for the current understanding of group II intron mobility mechanisms is discussed, beginning with genetic observations in yeast mitochondria, and culminating with a detailed understanding of molecular mechanisms shared by organellar and bacterial group II introns.
Abstract: Mobile group II introns, found in bacterial and organellar genomes, are both catalytic RNAs and retrotransposable elements. They use an extraordinary mobility mechanism in which the excised intron RNA reverse splices directly into a DNA target site and is then reverse transcribed by the intron-encoded protein. After DNA insertion, the introns remove themselves by protein-assisted, autocatalytic RNA splicing, thereby minimizing host damage. Here we discuss the experimental basis for our current understanding of group II intron mobility mechanisms, beginning with genetic observations in yeast mitochondria, and culminating with a detailed understanding of molecular mechanisms shared by organellar and bacterial group II introns. We also discuss recently discovered links between group II intron mobility and DNA replication, new insights into group II intron evolution arising from bacterial genome sequencing, and the evolutionary relationship between group II introns and both eukaryotic spliceosomal introns and non-LTR-retrotransposons. Finally, we describe the development of mobile group II introns into gene-targeting vectors, "targetrons," which have programmable target specificity.

Journal ArticleDOI
TL;DR: In two unrelated pedigrees of Alpers' syndrome, each affected child was found to carry a homozygous mutation in exon 17 of the POLG locus that led to a Glu873Stop mutation just upstream of the polymerase domain of the protein.
Abstract: Alpers' syndrome is a fatal neurogenetic disorder first described more than 70 years ago. It is an autosomal recessive, developmental mitochondrial DNA depletion disorder characterized by deficiency in mitochondrial DNA polymerase γ (POLG) catalytic activity, refractory seizures, neurodegeneration, and liver disease. In two unrelated pedigrees of Alpers' syndrome, each affected child was found to carry a homozygous mutation in exon 17 of the POLG locus that led to a Glu873Stop mutation just upstream of the polymerase domain of the protein. In addition, each affected child was heterozygous for the G1681A mutation in exon 7 that led to an Ala467Thr substitution in POLG, within the linker region of the protein. Ann Neurol 2004

Journal ArticleDOI
TL;DR: The reason why so many factors are needed reflects the observation that exon recognition can be affected by many pre-mRNA features such as exon length, the presence of enhancer and silencer elements, the strength of splicing signals, the promoter architecture, and the rate of RNA processivity.
Abstract: Pre-mRNA splicing in eukaryotes requires joining together the nucleotides of the various mRNA-coding regions (exons) after recognizing them from the normally vastly superior number of non-mRNA-coding sequences (introns). For three excellent reviews on general splicing and its regulation, refer to references 14, 62, and 70. In eukaryotes, the vast majority of splicing processes are catalyzed by the spliceosome, a very complex RNA-protein aggregate which has been estimated to contain several hundred different proteins in addition to five spliceosomal snRNAs (1, 54, 62, 63, 81, 109). These factors are responsible for the accurate positioning of the spliceosome on the 5′ and 3′ splice site sequences. The reason why so many factors are needed reflects the observation that exon recognition can be affected by many pre-mRNA features such as exon length (5, 97), the presence of enhancer and silencer elements (8, 62), the strength of splicing signals (45), the promoter architecture (29, 55), and the rate of RNA processivity (86). In addition, the general cellular environment also exerts an effect, as recent observations suggest the existence of extensive coupling between splicing and many other gene expression steps (69) and even its modification by external stimuli (96).

Journal ArticleDOI
TL;DR: In mammalian cells, increased translational yield correlates with enhanced cytoplasmic polysome association of spliced mRNAs, and is attributable to deposition of exon junction complexes (EJCs).
Abstract: In mammalian cells, spliced mRNAs yield greater quantities of protein per mRNA molecule than do otherwise identical mRNAs not made by splicing. This increased translational yield correlates with enhanced cytoplasmic polysome association of spliced mRNAs, and is attributable to deposition of exon junction complexes (EJCs). Translational stimulation can be replicated by tethering the EJC proteins Y14, Magoh, and RNPS1 or the nonsense-mediated decay (NMD) factors Upf1, Upf2, and Upf3b to an intronless reporter mRNA. Thus, in addition to its previously characterized role in NMD, the EJC also promotes mRNA polysome association. Furthermore, the ability to stimulate translation when bound inside an open reading frame appears to be a general feature of factors required for NMD.

Journal ArticleDOI
TL;DR: It is shown that alternative skipping of PTB exon 11 leads to an mRNA that is removed by NMD and that this pathway consumes at least 20% of the PTB mRNA in HeLa cells, indicating a high prevalence of human alternative splicing leading to NMD.

Journal ArticleDOI
TL;DR: Interestingly, it is found that transcription and AS act independently on different sets of genes in order to define tissue-specific expression profiles, which reflect tissue identity.

Journal ArticleDOI
TL;DR: The results of a systematic search for interspecies orthologues of miRNA precursors of miRNAs are reported, leading to the identification of 35 human and 45 mouse new putative miRNA genes.
Abstract: Conservation of microRNAs (miRNAs) among species suggests that they bear conserved biological functions. However, sequencing of new miRNAs has not always been accompanied by a search for orthologues in other species. I report herein the results of a systematic search for interspecies orthologues of miRNA precursors, leading to the identification of 35 human and 45 mouse new putative miRNA genes. MicroRNA tracks were written to visualize miRNAs in human and mouse genomes on the UCSC Genome Browser. Based on their localization, miRNA precursors can be excised either from introns or exons of mRNAs. When intronic miRNAs are antisense to the apparent host gene, they appear to originate from ill-characterized antisense transcription units. Exonic miRNAs are, in general, nonprotein-coding, poorly conserved genes in sense orientation. In three cases, the excision of an miRNA from a protein-coding mRNA might lead to the degradation of the rest of the transcript. Moreover, three new examples of miRNAs fully complementary to an mRNA are reported. Among these, miR135a might control the stability and/or translation of an alternative form of the glycerate kinase mRNA by RNA interference. I also discuss the presence of human miRNAs in introns of paralogous genes and in miRNA clusters.

Journal ArticleDOI
TL;DR: It is concluded that a significant portion of cassette exons evident in EST databases is not functional, and might result from aberrant rather than regulated splicing.

Journal ArticleDOI
TL;DR: This analysis explored the sequence of the human genome to define the composition of the PTP family and discovered one novel human PTP gene and defined chromosomal loci and exon structure of the additional 37 genes encoding known PTP transcripts.
Abstract: The protein tyrosine phosphatases (PTPs) are now recognized as critical regulators of signal transduction under normal and pathophysiological conditions. In this analysis we have explored the sequence of the human genome to define the composition of the PTP family. Using public and proprietary sequence databases, we discovered one novel human PTP gene and defined chromosomal loci and exon structure of the additional 37 genes encoding known PTP transcripts. Direct orthologs were present in the mouse genome for all 38 human PTP genes. In addition, we identified 12 PTP pseudogenes unique to humans that have probably contaminated previous bioinformatics analysis of this gene family. PCR amplification and transcript sequencing indicate that some PTP pseudogenes are expressed, but their function (if any) is unknown. Furthermore, we analyzed the enhanced diversity generated by alternative splicing and provide predicted amino acid sequences for four human PTPs that are currently defined by fragments only. Finally, we correlated each PTP locus with genetic disease markers and identified 4 PTPs that map to known susceptibility loci for type 2 diabetes and 19 PTPs that map to regions frequently deleted in human cancers. We have made our analysis available at http://ptp.cshl.edu or http://science.novonordisk.com/ptp and we hope this resource will facilitate the functional characterization of these key enzymes.

Journal Article
TL;DR: The human genome is revisited using exon and intron distribution profiles to suggest constraints on the splicing machinery to splice out very long or very short introns and provide insight to optimal intron length selection.
Abstract: The human genome is revisited using exon and intron distribution profiles. The 26,564 annotated genes in the human genome (build October, 2003) contain 233,785 exons and 207,344 introns. On average, there are 8.8 exons and 7.8 introns per gene. About 80% of the exons on each chromosome are < 200 bp in length. < 0.01% of the introns are < 20 bp in length and < 10% of introns are more than 11,000 bp in length. These results suggest constraints on the splicing machinery to splice out very long or very short introns and provide insight to optimal intron length selection. Interestingly, the total length in introns and intergenic DNA on each chromosome is significantly correlated to the determined chromosome size with a coefficient of correlation r = 0.95 and r = 0.97, respectively. These results suggest their implication in genome design.

Journal ArticleDOI
TL;DR: Interestingly, the functional distribution of the transcripts with retained introns is skewed towards stress and external/internal stimuli-related functions, and as such may play a regulatory function.
Abstract: Alternative splicing (AS) combines different transcript splice junctions that result in transcripts with shuffled exons, alternative 5' or 3' splicing sites, retained introns and different transcript termini. In this way, multiple mRNA species and proteins can be created from a single gene expanding the potential informational content of eukaryotic genomes. Search algorithms of AS forms in a variety of Arabidopsis databases showed they contained an unusually high fraction of retained introns (above 30%), compared with 10% that was reported for humans. The preponderance of retained introns (65%) were either part of open reading frames, present in the UTR region or present as the last intron in the transcript, indicating that their occurrence would not participate in non-sense-mediated decay. Interestingly, the functional distribution of the transcripts with retained introns is skewed towards stress and external/internal stimuli-related functions. A sampling of the alternative transcripts with retained introns were confirmed by RT-PCR and were shown to co-purify with polyribosomes, indicating their nuclear export. Thus, retained introns are a prominent feature of AS in Arabidopsis and as such may play a regulatory function.

Journal ArticleDOI
01 Jan 2004-Genomics
TL;DR: Combined database analysis and cDNA cloning have demonstrated that the primary transcript of the mammalian TDP genes undergoes alternative splicing to generate 11 mRNAs, including the one encoding TDP-43, which provides further support for the functional complexity of the eukaryotic T DP genes.

Journal ArticleDOI
22 Oct 2004-Science
TL;DR: A maskless photolithography method is used to produce DNA oligonucleotide microarrays with unique probe sequences tiled throughout the genome of Drosophila melanogaster and across predicted splice junctions to provide a draft expression map for the entire nonrepetitive genome, which reveals a much more extensive and diverse set of expressed sequences than was previously predicted.
Abstract: We used a maskless photolithography method to produce DNA oligonucleotide microarrays with unique probe sequences tiled throughout the genome of Drosophila melanogaster and across predicted splice junctions. RNA expression of protein coding and nonprotein coding sequences was determined for each major stage of the life cycle, including adult males and females. We detected transcriptional activity for 93% of annotated genes and RNA expression for 41% of the probes in intronic and intergenic sequences. Comparison to genome-wide RNA interference data and to gene annotations revealed distinguishable levels of expression for different classes of genes and higher levels of expression for genes with essential cellular functions. Differential splicing was observed in about 40% of predicted genes, and 5440 previously unknown splice forms were detected. Genes within conserved regions of synteny with D. pseudoobscura had highly correlated expression; these regions ranged in length from 10 to 900 kilobase pairs. The expressed intergenic and intronic sequences are more likely to be evolutionarily conserved than nonexpressed ones, and about 15% of them appear to be developmentally regulated. Our results provide a draft expression map for the entire nonrepetitive genome, which reveals a much more extensive and diverse set of expressed sequences than was previously predicted.

Journal ArticleDOI
TL;DR: A survey of RNA editing in human brain by comparing sequences of clones from a human brain cDNA library to the reference human genome sequence and to genomic DNA from the same individual strongly supports the idea that formation of intramolecular double-stranded RNA with an inverted copy underlies most A-->I editing.
Abstract: We have conducted a survey of RNA editing in human brain by comparing sequences of clones from a human brain cDNA library to the reference human genome sequence and to genomic DNA from the same individual. In the RNA sample from which the library was constructed, approximately 1:2000 nucleotides were edited out of >3 Mb surveyed. All edits were adenosine to inosine (A-->I) and were predominantly in intronic and in intergenic RNAs. No edits were found in translated exons and few in untranslated exons. Most edits were in high-copy-number repeats, usually Alus. Analysis of the genome in the vicinity of edited sequences strongly supports the idea that formation of intramolecular double-stranded RNA with an inverted copy underlies most A-->I editing. The likelihood of editing is increased by the presence of two inverted copies of a sequence within the same intron, proximity of the two sequences to each other (preferably within 2 kb), and by a high density of inverted copies in the vicinity. Editing exhibits sequence preferences and is less likely at an adenosine 3' to a guanosine and more likely at an adenosine 5' to a guanosine. Simulation by BLAST alignment of the double-stranded RNA molecules that underlie known edits indicates that there is a greater likelihood of A-->I editing at A:C mismatches than editing at other mismatches or at A:U matches. However, because A:U matches in double-stranded RNA are more common than all mismatches, overall the likely effect of editing is to increase the number of mismatches in double-stranded RNA.

Journal ArticleDOI
TL;DR: An online ESE analysis tool is described that annotates RESCUE-ESE hexamers in vertebrate exons and can be used to predict splicing phenotypes by identifying sequence changes that disrupt or alter predicted ESEs.
Abstract: A typical gene contains two levels of information: a sequence that encodes a particular protein and a host of other signals that are necessary for the correct expression of the transcript. While much attention has been focused on the effects of sequence variation on the amino acid sequence, variations that disrupt gene processing signals can dramatically impact gene function. A variation that disrupts an exonic splicing enhancer (ESE), for example, could cause exon skipping which would result in the exclusion of an entire exon from the mRNA transcript. RESCUE-ESE, a computational approach used in conjunction with experimental validation, previously identified 238 candidate ESE hexamers in human genes. The RESCUE-ESE method has recently been implemented in three additional species: mouse, zebrafish and pufferfish. Here we describe an online ESE analysis tool (http://genes.mit.edu/burgelab/rescue-ese/) that annotates RESCUE-ESE hexamers in vertebrate exons and can be used to predict splicing phenotypes by identifying sequence changes that disrupt or alter predicted ESEs.

Journal ArticleDOI
TL;DR: A potential role for chIL-10 in changing the Th bias during infection with an intracellular protozoan, thereby contributing to susceptibility of line 15I chickens is indicated.
Abstract: We isolated the full-length chicken IL-10 (chIL-10) cDNA from an expressed sequence tag library derived from RNA from cecal tonsils of Eimeria tenella -infected chickens. It encodes a 178-aa polypeptide, with a predicted 162-aa mature peptide. Chicken IL-10 has 45 and 42% aa identity with human and murine IL-10, respectively. The structures of the chIL-10 gene and its promoter were determined by direct sequencing of a bacterial artificial chromosome containing chIL-10. The chIL-10 gene structure is similar to (five exons, four introns), but more compact than, that of its mammalian orthologues. The promoter is more similar to that of Fugu IL-10 than human IL-10. Chicken IL-10 mRNA expression was identified mainly in the bursa of Fabricius and cecal tonsils, with low levels of expression also seen in thymus, liver, and lung. Expression was also detected in PHA-activated thymocytes and LPS-stimulated monocyte-derived macrophages, with high expression in an LPS-stimulated macrophage cell line. Recombinant chIL-10 was produced and bioactivity demonstrated through IL-10-induced inhibition of IFN-γ synthesis by mitogen-activated lymphocytes. We measured the expression of mRNA for chIL-10 and other signature cytokines in gut and spleen of resistant (line C.B12) and susceptible (line 15I) chickens during the course of an E. maxima infection. Susceptible chickens showed higher levels of chIL-10 mRNA expression in the spleen, both constitutively and after infection, and in the small intestine after infection than did resistant chickens. These data indicate a potential role for chIL-10 in changing the Th bias during infection with an intracellular protozoan, thereby contributing to susceptibility of line 15I chickens.

Journal ArticleDOI
TL;DR: It is now apparent that the means of connection is a set of proteins, collectively called the exon junction complex, which are deposited as a consequence of splicing upstream of mRNA exon-exon junctions.