scispace - formally typeset
Search or ask a question
Journal ArticleDOI

The African coelacanth genome provides insights into tetrapod evolution

Chris T. Amemiya1, Chris T. Amemiya2, Jessica Alföldi3, Alison P. Lee4, Shaohua Fan5, Hervé Philippe6, Iain MacCallum3, Ingo Braasch7, Tereza Manousaki5, Igor Schneider8, Nicolas Rohner9, Chris L. Organ10, Domitille Chalopin11, J. Joshua Smith12, Mark Robinson1, Rosemary A. Dorrington13, Marco Gerdol14, Bronwen Aken15, Maria Assunta Biscotti16, Marco Barucca16, Denis Baurain17, Aaron M. Berlin3, Gregory L. Blatch13, Gregory L. Blatch18, Francesco Buonocore, Thorsten Burmester19, Michael S. Campbell10, Adriana Canapa16, John P. Cannon20, Alan Christoffels21, Gianluca De Moro14, Adrienne L. Edkins13, Lin Fan3, Anna Maria Fausto, Nathalie Feiner5, Mariko Forconi16, Junaid Gamieldien21, Sante Gnerre3, Andreas Gnirke3, Jared V. Goldstone22, Wilfried Haerty23, Mark E. Hahn22, Uljana Hesse21, Steve Hoffmann24, Jeremy Johnson3, Sibel I. Karchner22, Shigehiro Kuraku5, Marcia Lara3, Joshua Z. Levin3, Gary W. Litman20, Evan Mauceli3, Evan Mauceli9, Tsutomu Miyake25, M. Gail Mueller26, David R. Nelson27, Anne Nitsche24, Ettore Olmo16, Tatsuya Ota28, Alberto Pallavicini14, Sumir Panji21, Barbara Picone21, Chris P. Ponting23, Sonja J. Prohaska24, Dariusz Przybylski3, Nil Ratan Saha1, Vydianathan Ravi4, Filipe J. Ribeiro3, Tatjana Sauka-Spengler23, Giuseppe Scapigliati, Stephen M. J. Searle15, Ted Sharpe3, Oleg Simakov5, Peter F. Stadler24, John J. Stegeman22, Kenta Sumiyama29, Diana Tabbaa3, Hakim Tafer24, Jason Turner-Maier3, Peter van Heusden21, Simon D. M. White15, Louise Williams3, Mark Yandell10, Henner Brinkmann6, Jean Nicolas Volff11, Clifford J. Tabin9, Neil H. Shubin30, Manfred Schartl31, David B. Jaffe3, John H. Postlethwait7, Byrappa Venkatesh4, Federica Di Palma3, Eric S. Lander3, Axel Meyer5, Kerstin Lindblad-Toh3, Kerstin Lindblad-Toh32 
18 Apr 2013-Nature (Nature Publishing Group)-Vol. 496, Iss: 7445, pp 311-316
TL;DR: Through a phylogenomic analysis, it is concluded that the lungfish, and not the coelacanth, is the closest living relative of tetrapods.
Abstract: The discovery of a living coelacanth specimen in 1938 was remarkable, as this lineage of lobe-finned fish was thought to have become extinct 70 million years ago. The modern coelacanth looks remarkably similar to many of its ancient relatives, and its evolutionary proximity to our own fish ancestors provides a glimpse of the fish that first walked on land. Here we report the genome sequence of the African coelacanth, Latimeria chalumnae. Through a phylogenomic analysis, we conclude that the lungfish, and not the coelacanth, is the closest living relative of tetrapods. Coelacanth protein-coding genes are significantly more slowly evolving than those of tetrapods, unlike other genomic features. Analyses of changes in genes and regulatory elements during the vertebrate adaptation to land highlight genes involved in immunity, nitrogen excretion and the development of fins, tail, ear, eye, brain and olfaction. Functional assays of enhancers involved in the fin-to-limb transition and in the emergence of extra-embryonic tissues show the importance of the coelacanth genome as a blueprint for understanding tetrapod evolution.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
01 Nov 2014-RNA
TL;DR: A database and website, "circBase," where merged and unified data sets of circRNAs and the evidence supporting their expression can be accessed, downloaded, and browsed within the genomic context.
Abstract: Recently, several laboratories have reported thousands of circular RNAs (circRNAs) in animals. Numerous circRNAs are highly stable and have specific spatiotemporal expression patterns. Even though a function for circRNAs is unknown, these features make circRNAs an interesting class of RNAs as possible biomarkers and for further research. We developed a database and website, "circBase," where merged and unified data sets of circRNAs and the evidence supporting their expression can be accessed, downloaded, and browsed within the genomic context. circBase also provides scripts to identify known and novel circRNAs in sequencing data. The database is freely accessible through the web server at http://www.circbase.org/.

1,285 citations

Journal ArticleDOI
01 Jan 2016-Database
TL;DR: The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects and generates the automatic alignment-based annotation for the human and mouse GENCODE gene sets.
Abstract: The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects. Furthermore, it generates the automatic alignment-based ann ...

849 citations

Journal ArticleDOI
09 Jan 2014-Nature
TL;DR: The whole-genome analysis of a cartilaginous fish, the elephant shark (Callorhinchus milii), finds that the C. milii genome is the slowest evolving of all known vertebrates, and features extensive synteny conservation with tetrapod genomes, making it a good model for comparative analyses of gnathostome genomes.
Abstract: The emergence of jawed vertebrates (gnathostomes) from jawless vertebrates was accompanied by major morphological and physiological innovations, such as hinged jaws, paired fins and immunoglobulin-based adaptive immunity. Gnathostomes subsequently diverged into two groups, the cartilaginous fishes and the bony vertebrates. Here we report the whole-genome analysis of a cartilaginous fish, the elephant shark (Callorhinchus milii). We find that the C. milii genome is the slowest evolving of all known vertebrates, including the ‘living fossil’ coelacanth, and features extensive synteny conservation with tetrapod genomes, making it a good model for comparative analyses of gnathostome genomes. Our functional studies suggest that the lack of genes encoding secreted calcium-binding phosphoproteins in cartilaginous fishes explains the absence of bone in their endoskeleton. Furthermore, the adaptive immune system of cartilaginous fishes is unusual: it lacks the canonical CD4 co-receptor and most transcription factors, cytokines and cytokine receptors related to the CD4 lineage, despite the presence of polymorphic major histocompatibility complex class II molecules. It thus presents a new model for understanding the origin of adaptive immunity. Whole-genome analysis of the elephant shark, a cartilaginous fish, shows that it is the slowest evolving of all known vertebrates, lacks critical bone formation genes and has an unusual adaptive immune system. The elephant shark (Callorhinchus milii) is a cartilaginous fish native to the temperate waters off southern Australia and New Zealand, living at depths of 200 to 500 metres and migrating into shallow waters during spring for breeding. The genome sequence is published in this issue of Nature. Comparison with other vertebrate genomes shows that it is the slowest evolving genome of all known vertebrates — coelacanth included. Genome analysis points to an unusual adaptive immune system lacking the CD4 receptor and some associated cytokines, indicating that cartilaginous fishes possess a primordial gnathostome adaptive immune system. Also absent are genes encoding secreted calcium-binding phosphoproteins, in line with the absence of bone in cartilaginous fish.

616 citations

Journal ArticleDOI
Hans Ellegren1
TL;DR: High-throughput sequencing technologies are revolutionizing the life sciences, and the past 12 months have seen a burst of genome sequences from non-model organisms, in each case representing a fundamental source of data of significant importance to biological research.
Abstract: High-throughput sequencing technologies are revolutionizing the life sciences. The past 12 months have seen a burst of genome sequences from non-model organisms, in each case representing a fundamental source of data of significant importance to biological research. This has bearing on several aspects of evolutionary biology, and we are now beginning to see patterns emerging from these studies. These include significant heterogeneity in the rate of recombination that affects adaptive evolution and base composition, the role of population size in adaptive evolution, and the importance of expansion of gene families in lineage-specific adaptation. Moreover, resequencing of population samples (population genomics) has enabled the identification of the genetic basis of critical phenotypes and cast light on the landscape of genomic divergence during speciation.

607 citations

Journal ArticleDOI
TL;DR: In this article, the authors sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD).
Abstract: To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences.

494 citations

References
More filters
Journal ArticleDOI
TL;DR: The Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.
Abstract: Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

15,665 citations

Journal ArticleDOI
01 Sep 2005-Nature
TL;DR: It is found that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles.
Abstract: Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differenc ...

2,267 citations

Journal ArticleDOI
28 May 2004-Science
TL;DR: There are 481 segments longer than 200 base pairs that are absolutely conserved between orthologous regions of the human, rat, and mouse genomes, which represent a class of genetic elements whose functions and evolutionary origins are yet to be determined, but which are more highly conserving between these species than are proteins.
Abstract: There are 481 segments longer than 200 base pairs (bp) that are absolutely conserved (100% identity with no insertions or deletions) between orthologous regions of the human, rat, and mouse genomes. Nearly all of these segments are also conserved in the chicken and dog genomes, with an average of 95 and 99% identity, respectively. Many are also significantly conserved in fish. These ultraconserved elements of the human genome are most often located either overlapping exons in genes involved in RNA processing or in introns or nearby genes involved in the regulation of transcription and development. Along with more than 5000 sequences of over 100 bp that are absolutely conserved among the three sequenced mammals, these represent a class of genetic elements whose functions and evolutionary origins are yet to be determined, but which are more highly conserved between these species than are proteins and appear to be essential for the ontogeny of mammals and other vertebrates.

1,690 citations

Journal ArticleDOI
TL;DR: The development of an algorithm for genome assembly, ALLPATHS-LG, and its application to massively parallel DNA sequence data from the human and mouse genomes, generated on the Illumina platform, have good accuracy, short-range contiguity, long-range connectivity, and coverage of the genome.
Abstract: Massively parallel DNA sequencing technologies are revolutionizing genomics by making it possible to generate billions of relatively short (~100-base) sequence reads at very low cost. Whereas such data can be readily used for a wide range of biomedical applications, it has proven difficult to use them to generate high-quality de novo genome assemblies of large, repeat-rich vertebrate genomes. To date, the genome assemblies generated from such data have fallen far short of those obtained with the older (but much more expensive) capillary-based sequencing approach. Here, we report the development of an algorithm for genome assembly, ALLPATHS-LG, and its application to massively parallel DNA sequence data from the human and mouse genomes, generated on the Illumina platform. The resulting draft genome assemblies have good accuracy, short-range contiguity, long-range connectivity, and coverage of the genome. In particular, the base accuracy is high (≥99.95%) and the scaffold sizes (N50 size = 11.5 Mb for human and 7.2 Mb for mouse) approach those obtained with capillary-based sequencing. The combination of improved sequencing technology and improved computational methods should now make it possible to increase dramatically the de novo sequencing of large genomes. The ALLPATHS-LG program is available at http://www.broadinstitute.org/science/programs/genome-biology/crd.

1,616 citations

Journal ArticleDOI
05 Apr 2012-Nature
TL;DR: A high-quality reference genome assembly for threespine stickleback fish is developed and it is indicated that reuse of globally shared standing genetic variation has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation.
Abstract: Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine-freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine-freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.

1,557 citations

Related Papers (5)