scispace - formally typeset
Open AccessJournal ArticleDOI

Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.

Arabidopsis Genome Initiative
- 14 Dec 2000 - 
- Vol. 408, Iss: 6814, pp 796-815
Reads0
Chats0
TLDR
This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
Abstract
The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans--the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Assessing the gene space in draft genomes

TL;DR: From an analysis of a phylogenetically diverse set of eukaryotic genome assemblies, it is found that the proportion of CEGs mapped in draft genomes provides a useful metric for describing the gene space, and complements the commonly used N50 length and x-fold coverage values.
Journal ArticleDOI

Using next-generation sequencing approaches to isolate simple sequence repeat (SSR) loci in the plant sciences

TL;DR: In the future, NGS technologies will massively increase the number of SSRs and other genetic markers available to conduct genetic research in understudied but economically important crops such as cranberry.
Journal ArticleDOI

Abundance, Distribution, and Transcriptional Activity of Repetitive Elements in the Maize Genome

TL;DR: A sequenced library of randomly sheared genomic DNA from maize demonstrated that the maize genome is composed of diverse sequences that represent numerous families of retrotransposons and indicated that retroelements abundant in the genome are poorly represented in hypomethylated regions.
Journal ArticleDOI

Interphase chromosomes in Arabidopsis are organized as well defined chromocenters from which euchromatin loops emanate

TL;DR: The arrangement of interphase chromosomes in Arabidopsis provides a well defined system to investigate chromatin organization and its role in epigenetic processes.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

TL;DR: A program is described, tRNAscan-SE, which identifies 99-100% of transfer RNA genes in DNA sequence while giving less than one false positive per 15 gigabases.
Journal ArticleDOI

The Complete Genome Sequence of Escherichia coli K-12

TL;DR: The 4,639,221-base pair sequence of Escherichia coli K-12 is presented and reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident.
Journal ArticleDOI

SCOP: a structural classification of proteins database for the investigation of sequences and structures.

TL;DR: This database provides a detailed and comprehensive description of the structural and evolutionary relationships of the proteins of known structure and provides for each entry links to co-ordinates, images of the structure, interactive viewers, sequence data and literature references.
Journal ArticleDOI

The genome sequence of Drosophila melanogaster

Mark Raymond Adams, +194 more
- 24 Mar 2000 - 
TL;DR: The nucleotide sequence of nearly all of the approximately 120-megabase euchromatic portion of the Drosophila genome is determined using a whole-genome shotgun sequencing strategy supported by extensive clone-based sequence and a high-quality bacterial artificial chromosome physical map.
Related Papers (5)

Initial sequencing and analysis of the human genome.

Eric S. Lander, +248 more
- 15 Feb 2001 -