scispace - formally typeset
Search or ask a question
Journal ArticleDOI

The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions

TL;DR: A high-quality draft genome sequence of the east Asia watermelon cultivar 97103 containing 23,440 predicted protein-coding genes is reported, which yielded important insights into aspects of phloem-based vascular signaling in common between watermelon and cucumber and identified genes crucial to valuable fruit-quality traits.
Abstract: Zhangjun Fei and colleagues report the draft genome of a Chinese elite watermelon inbred line 97103 and resequencing of 20 diverse accessions that represent the three subspecies of Citrullus lunatus. Comparative genome-wide analyses identify the extent of genetic diversity and population structure of watermelon germplasm.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
Hans Ellegren1
TL;DR: High-throughput sequencing technologies are revolutionizing the life sciences, and the past 12 months have seen a burst of genome sequences from non-model organisms, in each case representing a fundamental source of data of significant importance to biological research.
Abstract: High-throughput sequencing technologies are revolutionizing the life sciences. The past 12 months have seen a burst of genome sequences from non-model organisms, in each case representing a fundamental source of data of significant importance to biological research. This has bearing on several aspects of evolutionary biology, and we are now beginning to see patterns emerging from these studies. These include significant heterogeneity in the rate of recombination that affects adaptive evolution and base composition, the role of population size in adaptive evolution, and the importance of expansion of gene families in lineage-specific adaptation. Moreover, resequencing of population samples (population genomics) has enabled the identification of the genetic basis of critical phenotypes and cast light on the landscape of genomic divergence during speciation.

607 citations

Journal ArticleDOI
05 Sep 2014-Science
TL;DR: The Coffea canephora (coffee) genome was sequenced and identified a conserved gene order, and comparative analyses of caffeine NMTs demonstrate that these genes expanded through sequential tandem duplications independently of genes from cacao and tea, suggesting that caffeine in eudicots is of polyphyletic origin.
Abstract: Coffee is a valuable beverage crop due to its characteristic flavor, aroma, and the stimulating effects of caffeine. We generated a high-quality draft genome of the species Coffea canephora, which displays a conserved chromosomal gene order among asterid angiosperms. Although it shows no sign of the whole-genome triplication identified in Solanaceae species such as tomato, the genome includes several species-specific gene family expansions, among them N-methyltransferases (NMTs) involved in caffeine production, defense-related genes, and alkaloid and flavonoid enzymes involved in secondary compound synthesis. Comparative analyses of caffeine NMTs demonstrate that these genes expanded through sequential tandem duplications independently of genes from cacao and tea, suggesting that caffeine in eudicots is of polyphyletic origin.

513 citations

Journal ArticleDOI
TL;DR: The evolutionary events that gave rise to the tracheophytes are examined, followed by analysis of the genetic and hormonal networks that cooperate to orchestrate vascular development in the gymnosperms and angiosperms, in a comprehensive picture of the state-of-the-art in the area of plant vascular biology.
Abstract: The emergence of the tracheophyte-based vascular system of land plants had major impacts on the evolution of terrestrial biology, in general, through its role in facilitating the development of plants with increased stature, photosynthetic output, and ability to colonize a greatly expanded range of environmental habitats. Recently, considerable progress has been made in terms of our understanding of the developmental and physiological programs involved in the formation and function of the plant vascular system. In this review, we first examine the evolutionary events that gave rise to the tracheophytes, followed by analysis of the genetic and hormonal networks that cooperate to orchestrate vascular development in the gymnosperms and angiosperms. The two essential functions performed by the vascular system, namely the delivery of resources (water, essential mineral nutrients, sugars and amino acids) to the various plant organs and provision of mechanical support are next discussed. Here, we focus on critical questions relating to structural and physiological properties controlling the delivery of material through the xylem and phloem. Recent discoveries into the role of the vascular system as an effective long-distance communication system are next assessed in terms of the coordination of developmental, physiological and defense-related processes, at the whole-plant level. A concerted effort has been made to integrate all these new findings into a comprehensive picture of the state-of-the-art in the area of plant vascular biology. Finally, areas important for future research are highlighted in terms of their likely contribution both to basic knowledge and applications to primary industry.

491 citations

Journal ArticleDOI
TL;DR: A comprehensive landscape of different modes of gene duplication across the plant kingdom is identified by comparing 141 genomes, which provides a solid foundation for further investigation of the dynamic evolution of duplicate genes.
Abstract: The sharp increase of plant genome and transcriptome data provide valuable resources to investigate evolutionary consequences of gene duplication in a range of taxa, and unravel common principles underlying duplicate gene retention. We survey 141 sequenced plant genomes to elucidate consequences of gene and genome duplication, processes central to the evolution of biodiversity. We develop a pipeline named DupGen_finder to identify different modes of gene duplication in plants. Genes derived from whole-genome, tandem, proximal, transposed, or dispersed duplication differ in abundance, selection pressure, expression divergence, and gene conversion rate among genomes. The number of WGD-derived duplicate genes decreases exponentially with increasing age of duplication events—transposed duplication- and dispersed duplication-derived genes declined in parallel. In contrast, the frequency of tandem and proximal duplications showed no significant decrease over time, providing a continuous supply of variants available for adaptation to continuously changing environments. Moreover, tandem and proximal duplicates experienced stronger selective pressure than genes formed by other modes and evolved toward biased functional roles involved in plant self-defense. The rate of gene conversion among WGD-derived gene pairs declined over time, peaking shortly after polyploidization. To provide a platform for accessing duplicated gene pairs in different plants, we constructed the Plant Duplicate Gene Database. We identify a comprehensive landscape of different modes of gene duplication across the plant kingdom by comparing 141 genomes, which provides a solid foundation for further investigation of the dynamic evolution of duplicate genes.

461 citations

Journal ArticleDOI
TL;DR: As the first sequenced species in the Ericales, the kiwifruit genome sequence provides a valuable resource not only for biological discovery and crop improvement but also for evolutionary and comparative genomics analysis, particularly in the asterid lineage.
Abstract: The kiwifruit is an economically and nutritionally important fruit crop with high vitamin C content. Here, the authors report the draft genome sequence of a heterozygous kiwifruit and through comparative genomic analysis provide valuable insight into kiwifruit evolution.

402 citations

References
More filters
Journal ArticleDOI
TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses -the false discovery rate, which is equivalent to the FWER when all hypotheses are true but is smaller otherwise.
Abstract: SUMMARY The common approach to the multiplicity problem calls for controlling the familywise error rate (FWER). This approach, though, has faults, and we point out a few. A different approach to problems of multiple significance testing is presented. It calls for controlling the expected proportion of falsely rejected hypotheses -the false discovery rate. This error rate is equivalent to the FWER when all hypotheses are true but is smaller otherwise. Therefore, in problems where the control of the false discovery rate rather than that of the FWER is desired, there is potential for a gain in power. A simple sequential Bonferronitype procedure is proved to control the false discovery rate for independent test statistics, and a simulation study shows that the gain in power is substantial. The use of the new procedure and the appropriateness of the criterion are illustrated with examples.

83,420 citations

Journal ArticleDOI
TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Abstract: Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: [email protected]

45,957 citations

Journal ArticleDOI
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

43,862 citations

Journal ArticleDOI
TL;DR: The newest addition in MEGA5 is a collection of maximum likelihood (ML) analyses for inferring evolutionary trees, selecting best-fit substitution models, inferring ancestral states and sequences, and estimating evolutionary rates site-by-site.
Abstract: Comparative analysis of molecular sequence data is essential for reconstructing the evolutionary histories of species and inferring the nature and extent of selective forces shaping the evolution of genes and species. Here, we announce the release of Molecular Evolutionary Genetics Analysis version 5 (MEGA5), which is a user-friendly software for mining online databases, building sequence alignments and phylogenetic trees, and using methods of evolutionary bioinformatics in basic biology, biomedicine, and evolution. The newest addition in MEGA5 is a collection of maximum likelihood (ML) analyses for inferring evolutionary trees, selecting best-fit substitution models (nucleotide or amino acid), inferring ancestral states and sequences (along with probabilities), and estimating evolutionary rates site-by-site. In computer simulation analyses, ML tree inference algorithms in MEGA5 compared favorably with other software packages in terms of computational efficiency and the accuracy of the estimates of phylogenetic trees, substitution parameters, and rate variation among sites. The MEGA user interface has now been enhanced to be activity driven to make it easier for the use of both beginners and experienced scientists. This version of MEGA is intended for the Windows platform, and it has been configured for effective use on Mac OS X and Linux desktops. It is available free of charge from http://www.megasoftware.net.

39,110 citations

Journal ArticleDOI
TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.
Abstract: Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source http://bowtie.cbcb.umd.edu.

20,335 citations

Related Papers (5)
Riccardo Velasco, Andrey Zharkikh, Jason P. Affourtit, Amit Dhingra, Alessandro Cestaro, Ananth Kalyanaraman, Paolo Fontana, Satish Bhatnagar, Michela Troggio, Dmitry Pruss, Silvio Salvi, Massimo Pindo, Paolo Baldi, Sara Castelletti, Marina Cavaiuolo, G. Coppola, Fabrizio Costa, V. Cova, Antonio Dal Ri, Vadim V. Goremykin, M. Komjanc, Sara Longhi, P. Magnago, Giulia Malacarne, Mickael Malnoy, Diego Micheletti, Marco Moretto, Michele Perazzolli, Azeddine Si-Ammour, Silvia Vezzulli, E. Zini, Glenn Eldredge, Lisa M. Fitzgerald, N. Gutin, Jerry S. Lanchbury, Teresita Macalma, J.T. Mitchell, Julia Reid, Bryan Wardell, Chinnappa D. Kodira, Zhoutao Chen, Brian Desany, Faheem Niazi, Melinda Palmer, Tyson Koepke, Derick Jiwan, Scott Schaeffer, Vandhana Krishnan, Changjun Wu, Vu T. Chu, Stephen T. King, Jessica Vick, Quanzhou Tao, Amy Mraz, Aimee Stormo, Keith E. Stormo, Robert Bogden, Davide Ederle, Alessandra Stella, Alberto Vecchietti, Martin M. Kater, Simona Masiero, Pauline Lasserre, Yves Lespinasse, Andrew C. Allan, Vincent G. M. Bus, David Chagné, Ross N. Crowhurst, Andrew P. Gleave, Enrico Lavezzo, Jeffrey A. Fawcett, Jeffrey A. Fawcett, Sebastian Proost, Sebastian Proost, Pierre Rouzé, Pierre Rouzé, Lieven Sterck, Lieven Sterck, Stefano Toppo, Barbara Lazzari, Roger P. Hellens, Charles-Eric Durel, Alexander Gutin, Roger E. Bumgarner, Susan E. Gardiner, Mark H. Skolnick, Michael Egholm, Yves Van de Peer, Yves Van de Peer, Francesco Salamini, Roberto Viola