scispace - formally typeset
Open AccessJournal ArticleDOI

Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.

Arabidopsis Genome Initiative
- 14 Dec 2000 - 
- Vol. 408, Iss: 6814, pp 796-815
Reads0
Chats0
TLDR
This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
Abstract
The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans--the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

MUMmer4: A fast and versatile genome alignment system.

TL;DR: MUMmer4 is described, a substantially improved version of MUMmer that addresses genome size constraints by changing the 32-bit suffix tree data structure at the core of Mummer to a 48- bit suffix array, and that offers improved speed through parallel processing of input query sequences.
Journal ArticleDOI

Widespread Paleopolyploidy in Model Plant Species Inferred from Age Distributions of Duplicate Genes

TL;DR: The unusual age profile of tandem gene duplications in Arabidopsis indicates that other scenarios, such as variation in the rate at which duplicated genes are deleted, must also be considered.
Journal ArticleDOI

The frequency of polyploid speciation in vascular plants

TL;DR: It is established that 15% of angiosperm and 31% of fern speciation events are accompanied by ploidy increase, and frequency estimates are higher by a factor of four than earlier estimates and lead to a standing incidence of polyploid species within genera of 35% (n = 1,506).
Journal ArticleDOI

A chromosome conformation capture ordered sequence of the barley genome

Martin Mascher, +81 more
- 27 Apr 2017 - 
TL;DR: The importance of the barley reference sequence for breeding is demonstrated by inspecting the genomic partitioning of sequence variation in modern elite germplasm, highlighting regions vulnerable to genetic erosion.
Journal ArticleDOI

Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes.

TL;DR: In this article, the abundance and relative distribution of microsatellites between transcribed and nontranscribed regions and the relationship of these features to haploid genome size was evaluated in plants.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

TL;DR: A program is described, tRNAscan-SE, which identifies 99-100% of transfer RNA genes in DNA sequence while giving less than one false positive per 15 gigabases.
Journal ArticleDOI

The Complete Genome Sequence of Escherichia coli K-12

TL;DR: The 4,639,221-base pair sequence of Escherichia coli K-12 is presented and reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident.
Journal ArticleDOI

SCOP: a structural classification of proteins database for the investigation of sequences and structures.

TL;DR: This database provides a detailed and comprehensive description of the structural and evolutionary relationships of the proteins of known structure and provides for each entry links to co-ordinates, images of the structure, interactive viewers, sequence data and literature references.
Journal ArticleDOI

The genome sequence of Drosophila melanogaster

Mark Raymond Adams, +194 more
- 24 Mar 2000 - 
TL;DR: The nucleotide sequence of nearly all of the approximately 120-megabase euchromatic portion of the Drosophila genome is determined using a whole-genome shotgun sequencing strategy supported by extensive clone-based sequence and a high-quality bacterial artificial chromosome physical map.
Related Papers (5)

Initial sequencing and analysis of the human genome.

Eric S. Lander, +248 more
- 15 Feb 2001 -