Open Access, Journal Article

Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.

Arabidopsis Genome Initiative
14 Dec 2000, Vol. 408, Iss. 6814, pp. 796-815
TLDR
This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
Abstract
The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans--the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.


Citations
Journal Article

Regulatory Role of SGT1 in Early R Gene-Mediated Plant Defenses

TL;DR: In this article, SGT1 is a component of Skp1-Cullin-F-box protein (SCF) ubiquitin ligases that target regulatory proteins for degradation.
Journal Article

Millions of Years of Evolution Preserved: A Comprehensive Catalog of the Processed Pseudogenes in the Human Genome

TL;DR: Overall, processed pseudogenes are very similar to their closest corresponding human gene, being 94% complete in coding regions, with sequence similarity of 75% for amino acids and 86% for nucleotides. Their occurrence does vary with GC content, however: processed pseudogenes occur mostly in intermediate GC-content regions.
Journal Article

The Medicago truncatula SUNN gene encodes a CLV1-like leucine-rich repeat receptor kinase that regulates nodule number and root length.

TL;DR: Comparison of related LRR receptor kinases determined that all nodulation autoregulation genes identified to date are the closest legume relatives of AtCLV1 by sequence, yet sunn, har and nark mutants do not display the fasciated clv phenotype.
Journal Article

AGRIS: Arabidopsis gene regulatory information server, an information resource of Arabidopsis cis-regulatory elements and transcription factors.

TL;DR: AGRIS provides the necessary software tools on Arabidopsis transcription factors and their putative binding sites on all genes to initiate the identification of transcriptional regulatory networks in the model dicotyledonous plant Arabidopsis thaliana.
Journal Article

Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes

TL;DR: It is found that islands of retention contain "connected genes," those genes predicted by the gene balance hypothesis to be resistant to removal because the products they encode interact with other products in a dose-sensitive manner, creating a web of dependency.
References
Journal Article

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal Article

tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

TL;DR: A program is described, tRNAscan-SE, which identifies 99-100% of transfer RNA genes in DNA sequence while giving less than one false positive per 15 gigabases.
Journal Article

The Complete Genome Sequence of Escherichia coli K-12

TL;DR: The 4,639,221-base pair sequence of Escherichia coli K-12 is presented and reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident.
Journal Article

SCOP: a structural classification of proteins database for the investigation of sequences and structures.

TL;DR: This database provides a detailed and comprehensive description of the structural and evolutionary relationships of the proteins of known structure and provides for each entry links to co-ordinates, images of the structure, interactive viewers, sequence data and literature references.
Journal Article

The genome sequence of Drosophila melanogaster

Mark Raymond Adams, +194 more
24 Mar 2000
TL;DR: The nucleotide sequence of nearly all of the approximately 120-megabase euchromatic portion of the Drosophila genome is determined using a whole-genome shotgun sequencing strategy supported by extensive clone-based sequence and a high-quality bacterial artificial chromosome physical map.
Related Papers (5)

Initial sequencing and analysis of the human genome.

Eric S. Lander, +248 more
15 Feb 2001