Journal Article

Genome-Wide Analysis of NBS-LRR–Encoding Genes in Arabidopsis

01 Apr 2003-The Plant Cell (American Society of Plant Biologists)-Vol. 15, Iss: 4, pp 809-834
TL;DR: The observed diversity of these NBS-LRR proteins indicates the variety of recognition molecules available in an individual genotype to detect diverse biotic challenges.
Abstract: The Arabidopsis genome contains ∼200 genes that encode proteins with similarity to the nucleotide binding site and other domains characteristic of plant resistance proteins. Through a reiterative process of sequence analysis and reannotation, we identified 149 NBS-LRR–encoding genes in the Arabidopsis (ecotype Columbia) genomic sequence. Fifty-six of these genes were corrected from earlier annotations. At least 12 are predicted to be pseudogenes. As described previously, two distinct groups of sequences were identified: those that encoded an N-terminal domain with Toll/Interleukin-1 Receptor homology (TIR-NBS-LRR, or TNL), and those that encoded an N-terminal coiled-coil motif (CC-NBS-LRR, or CNL). The encoded proteins are distinct from the 58 predicted adapter proteins in the previously described TIR-X, TIR-NBS, and CC-NBS groups. Classification based on protein domains, intron positions, sequence conservation, and genome distribution defined four subgroups of CNL proteins, eight subgroups of TNL proteins, and a pair of divergent NL proteins that lack a defined N-terminal motif. CNL proteins generally were encoded in single exons, although two subclasses were identified that contained introns in unique positions. TNL proteins were encoded in modular exons, with conserved intron positions separating distinct protein domains. Conserved motifs were identified in the LRRs of both CNL and TNL proteins. In contrast to CNL proteins, TNL proteins contained large and variable C-terminal domains. The extant distribution and diversity of the NBS-LRR sequences has been generated by extensive duplication and ectopic rearrangements that involved segmental duplications as well as microscale events. The observed diversity of these NBS-LRR proteins indicates the variety of recognition molecules available in an individual genotype to detect diverse biotic challenges.
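
The domain-based grouping described in the abstract can be illustrated with a minimal sketch. It assumes each protein has already been annotated with a set of domain labels; the label strings ("TIR", "CC", "NBS", "LRR"), the function name, and the precedence given to TIR when both N-terminal labels are present are hypothetical choices made for illustration, not the authors' pipeline.

```python
from typing import Iterable


def classify_nbs_protein(domains: Iterable[str]) -> str:
    """Assign a coarse class from a collection of domain labels.

    Labels are hypothetical: "TIR", "CC", "NBS", "LRR".
    """
    found = {d.upper() for d in domains}
    has_tir = "TIR" in found
    has_cc = "CC" in found
    has_nbs = "NBS" in found
    has_lrr = "LRR" in found

    if has_nbs and has_lrr:
        # Full-length NBS-LRR proteins, split by N-terminal domain.
        if has_tir:  # assumption: TIR takes precedence if both are annotated
            return "TNL"
        if has_cc:
            return "CNL"
        return "NL"  # no defined N-terminal motif
    # Shorter architectures that the abstract treats as separate groups.
    if has_tir and has_nbs:
        return "TIR-NBS"
    if has_cc and has_nbs:
        return "CC-NBS"
    if has_tir:
        return "TIR-X"
    return "unclassified"


if __name__ == "__main__":
    print(classify_nbs_protein(["TIR", "NBS", "LRR"]))
    print(classify_nbs_protein(["CC", "NBS", "LRR"]))
    print(classify_nbs_protein(["NBS", "LRR"]))
```

The subgroup assignments reported in the abstract also draw on intron positions, sequence conservation, and genome distribution, none of which this sketch models.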


Citations
Journal Article
29 Jan 2009-Nature
TL;DR: An initial analysis of the ∼730-megabase Sorghum bicolor (L.) Moench genome is presented, placing ∼98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information.
Abstract: Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.

2,809 citations

Journal Article
TL;DR: Current evidence indicates that MAMPs, DAMPs, and effectors are all perceived as danger signals and induce a stereotypic defense response, and the importance of MAMP/PRR signaling for plant immunity is highlighted.
Abstract: Microbe-associated molecular patterns (MAMPs) are molecular signatures typical of whole classes of microbes, and their recognition plays a key role in innate immunity. Endogenous elicitors are similarly recognized as damage-associated molecular patterns (DAMPs). This review focuses on the diversity of MAMPs/DAMPs and on progress to identify the corresponding pattern recognition receptors (PRRs) in plants. The two best-characterized MAMP/PRR pairs, flagellin/FLS2 and EF-Tu/EFR, are discussed in detail and put into a phylogenetic perspective. Both FLS2 and EFR are leucine-rich repeat receptor kinases (LRR-RKs). Upon treatment with flagellin, FLS2 forms a heteromeric complex with BAK1, an LRR-RK that also acts as coreceptor for the brassinolide receptor BRI1. The importance of MAMP/PRR signaling for plant immunity is highlighted by the finding that plant pathogens use effectors to inhibit PRR complexes or downstream signaling events. Current evidence indicates that MAMPs, DAMPs, and effectors are all perceived as danger signals and induce a stereotypic defense response.

2,801 citations

Journal Article
Boulos Chalhoub, Shengyi Liu, Isobel A. P. Parkin, Haibao Tang, Xiyin Wang, Julien Chiquet, Harry Belcram, Chaobo Tong, Birgit Samans, Margot Correa, Corinne Da Silva, Jérémy Just, Cyril Falentin, Chu Shin Koh, Isabelle Le Clainche, Maria Bernard, Pascal Bento, Benjamin Noel, Karine Labadie, Adriana Alberti, Mathieu Charles, Dominique Arnaud, Hui Guo, Christian Daviaud, Salman Alamery, Kamel Jabbari, Meixia Zhao, Patrick P. Edger, Houda Chelaifa, David C. Tack, Gilles Lassalle, Imen Mestiri, Nicolas Schnel, Marie-Christine Le Paslier, Guangyi Fan, Victor Renault, Philippe E. Bayer, Agnieszka A. Golicz, Sahana Manoli, Tae-Ho Lee, Vinh Ha Dinh Thi, Smahane Chalabi, Qiong Hu, Chuchuan Fan, Reece Tollenaere, Yunhai Lu, Christophe Battail, Jinxiong Shen, Christine Sidebottom, Xinfa Wang, Aurélie Canaguier, Aurélie Chauveau, Aurélie Bérard, G. Deniot, Mei Guan, Zhongsong Liu, Fengming Sun, Yong Pyo Lim, Eric Lyons, Christopher D. Town, Ian Bancroft, Xiaowu Wang, Jinling Meng, Jianxin Ma, J. Chris Pires, Graham J.W. King, Dominique Brunel, Régine Delourme, Michel Renard, Jean-Marc Aury, Keith L. Adams, Jacqueline Batley, Rod J. Snowdon, Jörg Tost, David Edwards, Yongming Zhou, Wei Hua, Andrew G. Sharpe, Andrew H. Paterson, Chunyun Guan, Patrick Wincker
22 Aug 2014-Science
TL;DR: The polyploid genome of Brassica napus, which originated from a recent combination of two distinct genomes approximately 7500 years ago and gave rise to the oilseed rape crop, is sequenced.
Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.

1,743 citations

Journal Article
TL;DR: Progress on several aspects of elicitor signal transduction leading to the production of plant secondary metabolites is summarized, including the integration of multiple signaling pathways into or by transcription factors, as well as the linkage of these signal components in the elicitation network through protein phosphorylation and dephosphorylation.

1,649 citations

Journal Article
John P. Vogel, David F. Garvin, Todd C. Mockler, Jeremy Schmutz, Daniel S. Rokhsar, Michael W. Bevan, Kerrie Barry, Susan Lucas, Miranda Harmon-Smith, Kathleen Lail, Hope Tice, Jane Grimwood, Neil McKenzie, Naxin Huo, Yong Q. Gu, Gerard R. Lazo, Olin D. Anderson, Frank M. You, Ming-Cheng Luo, Jan Dvorak, Jonathan M. Wright, Melanie Febrer, Dominika Idziak, Robert Hasterok, Erika Lindquist, Mei Wang, Samuel E. Fox, Henry D. Priest, Sergei A. Filichkin, Scott A. Givan, Douglas W. Bryant, Jeff H. Chang, Haiyan Wu, Wei Wu, An-Ping Hsia, Patrick S. Schnable, Anantharaman Kalyanaraman, Brad Barbazuk, Todd P. Michael, Samuel P. Hazen, Jennifer N. Bragg, Debbie Laudencia-Chingcuanco, Yiqun Weng, Georg Haberer, Manuel Spannagl, Klaus F. X. Mayer, Thomas Rattei, Therese Mitros, Sang-Jik Lee, Jocelyn K. C. Rose, Lukas A. Mueller, Thomas L. York, Thomas Wicker, Jan P. Buchmann, Jaakko Tanskanen, Alan H. Schulman, Heidrun Gundlach, Michael W. Bevan, Antonio Costa de Oliveira, Luciano da C. Maia, William R. Belknap, Ning Jiang, Jinsheng Lai, Liucun Zhu, Jianxin Ma, Cheng Sun, Ellen J. Pritham, Jérôme Salse, Florent Murat, Michael Abrouk, Rémy Bruggmann, Joachim Messing, Noah Fahlgren, Christopher M. Sullivan, James C. Carrington, Elisabeth J. Chapman, Greg D. May, Jixian Zhai, Matthias Ganssmann, Sai Guna Ranjan Gurazada, Marcelo A German, Blake C. Meyers, Pamela J. Green, Ludmila Tyler, Jiajie Wu, James A. Thomson, Shan Chen, Henrik Vibe Scheller, Jesper Harholt, Peter Ulvskov, Jeffrey A. Kimbrel, Laura E. Bartley, Peijian Cao, Ki-Hong Jung, Manoj Sharma, Miguel E. Vega-Sánchez, Pamela C. Ronald, Chris Dardick, Stefanie De Bodt, Wim Verelst, Dirk Inzé, Maren Heese, Arp Schnittger, Xiaohan Yang, Udaya C. Kalluri, Gerald A. Tuskan, Zhihua Hua, Richard D. Vierstra, Yu Cui, Shuhong Ouyang, Qixin Sun, Zhiyong Liu, Alper Yilmaz, Erich Grotewold, Richard Sibout, Kian Hématy, Grégory Mouille, Herman Höfte, Todd P. Michael, Jérôme Pelloux, Devin O'Connor, James C. Schnable, Scott C. Rowe, Frank G. Harmon, Cynthia L. Cass, John C. Sedbrook, Mary E. Byrne, Sean Walsh, Janet Higgins, Pinghua Li, Thomas P. Brutnell, Turgay Unver, Hikmet Budak, Harry Belcram, Mathieu Charles, Boulos Chalhoub, Ivan Baxter
11 Feb 2010-Nature
TL;DR: The high-quality genome sequence will help Brachypodium reach its potential as an important model system for developing new energy and food crops and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat.
Abstract: Three subfamilies of grasses, the Ehrhartoideae, Panicoideae and Pooideae, provide the bulk of human nutrition and are poised to become major sources of renewable energy. Here we describe the genome sequence of the wild grass Brachypodium distachyon (Brachypodium), which is, to our knowledge, the first member of the Pooideae subfamily to be sequenced. Comparison of the Brachypodium, rice and sorghum genomes shows a precise history of genome evolution across a broad diversity of the grasses, and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat. The high-quality genome sequence, coupled with ease of cultivation and transformation, small size and rapid life cycle, will help Brachypodium reach its potential as an important model system for developing new energy and food crops.

1,603 citations

References
Journal Article
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Abstract: The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSIBLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.
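
The idea of turning a set of aligned hits into a position-specific score matrix can be sketched as follows. This is a simplified illustration only: it uses unweighted column counts, a flat pseudocount, and a toy four-letter alphabet for brevity, none of which should be read as the scheme PSI-BLAST actually uses. The function names `build_pssm` and `score_segment` are hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List


def build_pssm(
    alignment: List[str],
    background: Dict[str, float],
    pseudocount: float = 1.0,
) -> List[Dict[str, float]]:
    """Build log-odds scores (base 2) for each column of an ungapped alignment."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    alphabet = sorted(background)
    pssm = []
    for col in range(length):
        # Count residues in this column, ignoring symbols outside the alphabet.
        counts = Counter(seq[col] for seq in alignment if seq[col] in background)
        total = sum(counts.values()) + pseudocount * len(alphabet)
        column = {}
        for res in alphabet:
            freq = (counts[res] + pseudocount) / total
            column[res] = math.log2(freq / background[res])
        pssm.append(column)
    return pssm


def score_segment(pssm: List[Dict[str, float]], segment: str) -> float:
    """Sum column scores for a segment; unknown residues contribute 0."""
    return sum(col.get(res, 0.0) for col, res in zip(pssm, segment))


if __name__ == "__main__":
    bg = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
    matrix = build_pssm(["ACA", "ACA", "ACG"], bg)
    print(score_segment(matrix, "ACA"), score_segment(matrix, "GTT"))
```

A segment resembling the aligned sequences scores higher than an unrelated one, which is the property an iterated search relies on when it rescans the database with the matrix.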

70,111 citations


"Genome-Wide Analysis of NBS-LRR–Enc..." refers to methods in this paper

  • ...BLAST (Basic Local Alignment Search Tool) version 2.0.3 (Altschul et al., 1997) was used to search the Arabidopsis thaliana genomic sequence using servers available from MIPS (http://mips.gsf.de) and TAIR (http://www.arabidopsis.org)....


Journal Article
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Abstract: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved for the alignment of divergent protein sequences. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening up of new gaps at these positions. These modifications are incorporated into a new program, CLUSTAL W which is freely available.

63,427 citations


"Genome-Wide Analysis of NBS-LRR–Enc..." refers to methods in this paper

  • ...Sequences then were aligned using CLUSTAL W (Thompson et al., 1994) with default options, and the alignment was corrected manually using the alignment editor in GeneDoc (Nicholas et al., 1997)....


Journal Article
TL;DR: Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.
Abstract: Some simple formulae were obtained which enable us to estimate evolutionary distances in terms of the number of nucleotide substitutions (and, also, the evolutionary rates when the divergence times are known). In comparing a pair of nucleotide sequences, we distinguish two types of differences; if homologous sites are occupied by different nucleotide bases but both are purines or both pyrimidines, the difference is called type I (or “transition” type), while, if one of the two is a purine and the other is a pyrimidine, the difference is called type II (or “transversion” type). Letting P and Q be respectively the fractions of nucleotide sites showing type I and type II differences between two sequences compared, then the evolutionary distance per site is $K = -\frac{1}{2} \ln\{(1 - 2P - Q)\sqrt{1 - 2Q}\}$. The evolutionary rate per year is then given by $k = K/(2T)$, where T is the time since the divergence of the two sequences. If only the third codon positions are compared, the synonymous component of the evolutionary base substitutions per site is estimated by $K'_S = -\frac{1}{2} \ln(1 - 2P - Q)$. Also, formulae for standard errors were obtained. Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.
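
As a worked illustration of the distance formula, the sketch below counts transition and transversion differences between two aligned nucleotide sequences and evaluates the two-parameter distance in its standard form, K = -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)]. The function names and the choice to skip sites containing gaps or ambiguous symbols are assumptions made for this example.

```python
import math
from typing import Tuple

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def transition_transversion_fractions(seq1: str, seq2: str) -> Tuple[float, float]:
    """Return (P, Q): fractions of compared sites with transition / transversion differences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    compared = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in BASES or b not in BASES:
            continue  # assumption: skip gaps and ambiguous symbols
        compared += 1
        if a == b:
            continue
        if (a in PURINES) == (b in PURINES):
            transitions += 1  # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    return transitions / compared, transversions / compared


def kimura_2p_distance(p: float, q: float) -> float:
    """Evolutionary distance per site from transition (p) and transversion (q) fractions."""
    a = 1.0 - 2.0 * p - q
    b = 1.0 - 2.0 * q
    if a <= 0.0 or b <= 0.0:
        raise ValueError("sequences too divergent for the two-parameter estimate")
    return -0.5 * math.log(a * math.sqrt(b))


if __name__ == "__main__":
    p, q = transition_transversion_fractions("AAGT", "AGCT")
    print(p, q, kimura_2p_distance(0.1, 0.05))
```

The distance is undefined once the logarithm's argument reaches zero, so the sketch raises an error for highly divergent pairs rather than returning a misleading value.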

26,016 citations


"Genome-Wide Analysis of NBS-LRR–Enc..." refers to methods in this paper

  • ...Neighbor-joining trees from distance matrices constructed according to the two-parameter method of Kimura (1980) using the aligned NBS protein sequences....


Journal Article
TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Abstract: Pfam is a widely used database of protein families and domains. This article describes a set of major updates that we have implemented in the latest release (version 24.0). The most important change is that we now use HMMER3, the latest version of the popular profile hidden Markov model package. This software is approximately 100 times faster than HMMER2 and is more sensitive due to the routine use of the forward algorithm. The move to HMMER3 has necessitated numerous changes to Pfam that are described in detail. Pfam release 24.0 contains 11,912 families, of which a large number have been significantly updated during the past two years. Pfam is available via servers in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/).

14,075 citations

Journal Article
14 Dec 2000-Nature
TL;DR: This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
Abstract: The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans--the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

8,742 citations