Open Access Journal Article

Genome sequence of the palaeopolyploid soybean

Abstract
Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.
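The figures quoted in the abstract imply a few quantities that are not stated outright. The following is a back-of-the-envelope sketch in Python, not part of the original analysis. The function name `implied_figures` and the constant names are illustrative. The Arabidopsis gene count is only inferred from the "70% more" comparison, and the percentages are treated as exact even though the abstract qualifies them with "about" and "nearly".

```python
# Figures quoted in the abstract (percentages are approximate in the source).
GENOME_SIZE_MB = 1100                   # "1.1-gigabase genome"
PREDICTED_GENES = 46_430                # predicted protein-coding genes
FRACTION_MORE_THAN_ARABIDOPSIS = 0.70   # "70% more than Arabidopsis"
FRACTION_IN_CHROMOSOME_ENDS = 0.78      # "About 78% ... in chromosome ends"
FRACTION_MULTI_COPY = 0.75              # "nearly 75% ... in multiple copies"


def implied_figures():
    """Derive rough quantities implied by the abstract's numbers.

    All results are estimates: they inherit the rounding of the
    percentages quoted in the abstract.
    """
    return {
        # Gene count Arabidopsis would need for soybean to have 70% more.
        "arabidopsis_genes": PREDICTED_GENES / (1 + FRACTION_MORE_THAN_ARABIDOPSIS),
        # Predicted genes located in chromosome ends.
        "genes_in_chromosome_ends": PREDICTED_GENES * FRACTION_IN_CHROMOSOME_ENDS,
        # Predicted genes present in more than one copy.
        "multi_copy_genes": PREDICTED_GENES * FRACTION_MULTI_COPY,
        # Average gene density across the whole genome.
        "genes_per_megabase": PREDICTED_GENES / GENOME_SIZE_MB,
    }


if __name__ == "__main__":
    for name, value in implied_figures().items():
        print(f"{name}: {value:,.1f}")
```

Under these assumptions the implied Arabidopsis count is a little over 27,000 genes, roughly 36,000 soybean genes fall in chromosome ends, and the genome-wide average is about 42 genes per megabase. Because most genes sit in less than half of the genome, the local density in chromosome ends must be well above that average.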


Citations

The Role of Deleterious Substitutions in Crop Genomes

TL;DR: It is concluded that individual cultivars carry hundreds of deleterious SNPs on average, and that nonsense variants make up a minority of the deleterious variants in the protein-coding regions of the genomes of two crops.

Large-scale development of expressed sequence tag-derived simple sequence repeat markers and diversity analysis in Arachis spp.

TL;DR: Large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed in peanut to obtain more informative genetic markers, and 1,571 markers showing clear polymorphisms were selected for further polymorphism analysis.

Genome-level and biochemical diversity of the acyl-activating enzyme superfamily in plants

TL;DR: Gene duplication and evolution of novel functions in Arabidopsis appears to have occurred rapidly, because acquisition of new substrate specificity is relatively easy in this class of proteins, which makes it difficult to use homology searches and other genomics tools to predict enzyme function.

Analysis of Proteome Profile in Germinating Soybean Seed, and Its Comparison with Rice Showing the Styles of Reserves Mobilization in Different Crops

TL;DR: This study is the first comprehensive analysis of the proteome profile of germinating soybean seeds to date and will improve understanding of the physiological and biochemical status of imbibed soybean seeds just prior to germination.

Evolution of GOLDEN2-LIKE gene function in C3 and C4 plants.

TL;DR: It is proposed that the ancestral state is a single GLK gene, and hypothesized that GLK gene duplication enabled sub-functionalization, which in turn enabled cell-specific function in C4 plants with dimorphic chloroplasts and preconditioned the evolution of the C4 physiology associated with chloroplast dimorphism.
Related Papers

The B73 Maize Genome: Complexity, Diversity, and Dynamics

Patrick S. Schnable et al. (20 Nov 2009)

The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)

Gerald A. Tuskan et al. (15 Sep 2006)