Showing papers by "David Goodstein published in 2014"
••
United States Department of Energy1, North Dakota State University2, United States Department of Agriculture3, University of Georgia4, Institut national de la recherche agronomique5, Tennessee State University6, Colorado State University7, University of California, Davis8, Michigan State University9, University of Arizona10, University of Paris-Sud11, University of Nebraska–Lincoln12
TL;DR: 2 independent domestications from genetic pools that diverged before human colonization are confirmed and a set of genes linked with increased leaf and seed size are identified and combined with quantitative trait locus data from Mesoamerican cultivars.
Abstract: Common bean (Phaseolus vulgaris L.) is the most important grain legume for human consumption and has a role in sustainable agriculture owing to its ability to fix atmospheric nitrogen. We assembled 473 Mb of the 587-Mb genome and genetically anchored 98% of this sequence in 11 chromosome-scale pseudomolecules. We compared the genome for the common bean against the soybean genome to find changes in soybean resulting from polyploidy. Using resequencing of 60 wild individuals and 100 landraces from the genetically differentiated Mesoamerican and Andean gene pools, we confirmed 2 independent domestications from genetic pools that diverged before human colonization. Less than 10% of the 74 Mb of sequence putatively involved in domestication was shared by the two domestication events. We identified a set of genes linked with increased leaf and seed size and combined these results with quantitative trait locus data from Mesoamerican cultivars. Genes affected by domestication may be useful for genomics-enabled crop improvement.
1,012 citations
••
University of Pretoria1, Universidade Católica de Brasília2, Empresa Brasileira de Pesquisa Agropecuária3, Oak Ridge National Laboratory4, United States Department of Energy5, Joint Genome Institute6, Ghent University7, Institut national de la recherche agronomique8, University of Toulouse9, University of British Columbia10, University of Münster11, University of Düsseldorf12, Oregon State University13, Federal University of Rio de Janeiro14, University of São Paulo15, Australian National University16, Indian Institute of Chemical Technology17, University of Arizona18, Universidade Federal de Viçosa19, Universidade Federal do Rio Grande do Sul20, Department of Environment and Primary Industries21, University of Melbourne22, University of Tasmania23, University of the Sunshine Coast24, University of Brasília25
TL;DR: Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes, which shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils.
Abstract: Eucalypts are the world's most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.
679 citations
••
University of California, Los Angeles1, Centre national de la recherche scientifique2, Harvard University3, Carnegie Institution for Science4, Donald Danforth Plant Science Center5, Washington University in St. Louis6, University of Minnesota7, University of Connecticut Health Center8, University of Massachusetts Medical School9, University of Göttingen10, Duke University11, United States Department of Energy12, Pierre-and-Marie-Curie University13
TL;DR: The past, present, and future of Chlamydomonas genomics are presented and progress on genome assembly and gene model refinement is detailed, resources for gene annotations, functional predictions, and locus ID mapping between versions are discussed and a standardized framework for naming genes is outlined.
157 citations
09 Sep 2014
TL;DR: The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to understand and accelerate the improvement of bioenergy crops and characterize and moderate plant response to climate change.
Abstract: The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: 1. Understand and accelerate the improvement (domestication) of bioenergy crops 2. Characterize and moderate plant response to climate change 3. Use comparative genomics to identify constrained elements and infer gene function 4. Build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work 5. Expand functional genomic resources for Plant Flagship genomes
5 citations
14 Jul 2014
TL;DR: JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is the effort toward this aim with aid of a RNA-seq transcriptome assembly pipeline that utilizes several gene predictors based on homolog peptides and transcript ORFs.
Abstract: Plant genomes vary in size and are highly complex with a high amount of repeats, genome duplication and tandem duplication. Gene encodes a wealth of information useful in studying organism and it is critical to have high quality and stable gene annotation. Thanks to advancement of sequencing technology, many plant species genomes have been sequenced and transcriptomes are also sequenced. To use these vastly large amounts of sequence data to make gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward this aim with aid of a RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.
3 citations