Open Access Journal Article

Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.

Arabidopsis Genome Initiative
14 Dec 2000, Vol. 408, Iss. 6814, pp. 796-815
Abstract
The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, a functional diversity similar to that of Drosophila and Caenorhabditis elegans, the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant. It provides the foundation for a more comprehensive comparison of conserved processes in all eukaryotes, for identifying a wide range of plant-specific gene functions, and for establishing rapid, systematic ways to identify genes for crop improvement.
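The headline figures in the abstract can be turned into a few derived ratios. The sketch below is a back-of-envelope illustration only: the four constants are taken from the abstract, while the function name `summarize` and the derived quantities are assumptions introduced here, not values reported by the authors. In particular, it assumes all 25,498 genes lie within the 115.4 sequenced megabases.

```python
# Figures quoted in the abstract.
SEQUENCED_MB = 115.4           # megabases covered by the sequenced regions
GENOME_MB = 125.0              # estimated total genome size in megabases
PROTEIN_CODING_GENES = 25_498  # predicted protein-coding genes
PROTEIN_FAMILIES = 11_000      # protein families (rounded in the abstract)


def summarize():
    """Return rough ratios derived from the abstract's numbers (illustrative only)."""
    return {
        # Fraction of the estimated genome covered by sequenced regions.
        "coverage": SEQUENCED_MB / GENOME_MB,
        # Average gene density, assuming every gene lies in the sequenced regions.
        "genes_per_mb": PROTEIN_CODING_GENES / SEQUENCED_MB,
        # Average spacing between gene starts, in kilobases, under the same assumption.
        "kb_per_gene": SEQUENCED_MB * 1000 / PROTEIN_CODING_GENES,
        # Mean number of genes per protein family.
        "genes_per_family": PROTEIN_CODING_GENES / PROTEIN_FAMILIES,
    }


if __name__ == "__main__":
    for name, value in summarize().items():
        print(f"{name}: {value:.3f}")
```

Under these assumptions the sequenced regions cover a little over 92% of the estimated genome, with roughly one gene every 4.5 kilobases on average.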

