Home
/
Authors
/
Alexander Kozik

Author

Alexander Kozik

Other affiliations: University of California, Berkeley, University of California

Bio: Alexander Kozik is an academic researcher from University of California, Davis. The author has contributed to research in topics: Genome & Population. The author has an hindex of 32, co-authored 52 publications receiving 5003 citations. Previous affiliations of Alexander Kozik include University of California, Berkeley & University of California.

Topics: Genome, Population, Gene, Expressed sequence tag, DNA sequencing ...read more

Papers published on a yearly basis

2021
2020
2019
2017
2016
2015
2014
2013
2012
2011
2009
2008
2007
2006
2005
2003
2002
2000
1998
1996
1995
1994
1993

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genome-Wide Analysis of NBS-LRR–Encoding Genes in Arabidopsis

[...]

Blake C. Meyers¹, Alexander Kozik¹, Alyssa Griego¹, Hanhui Kuang¹, Richard W Michelmore¹ - Show less +1 more•Institutions (1)

University of California, Davis¹

01 Apr 2003-The Plant Cell

TL;DR: The observed diversity of these NBS-LRR proteins indicates the variety of recognition molecules available in an individual genotype to detect diverse biotic challenges.

...read moreread less

Abstract: The Arabidopsis genome contains ∼200 genes that encode proteins with similarity to the nucleotide binding site and other domains characteristic of plant resistance proteins. Through a reiterative process of sequence analysis and reannotation, we identified 149 NBS-LRR–encoding genes in the Arabidopsis (ecotype Columbia) genomic sequence. Fifty-six of these genes were corrected from earlier annotations. At least 12 are predicted to be pseudogenes. As described previously, two distinct groups of sequences were identified: those that encoded an N-terminal domain with Toll/Interleukin-1 Receptor homology (TIR-NBS-LRR, or TNL), and those that encoded an N-terminal coiled-coil motif (CC-NBS-LRR, or CNL). The encoded proteins are distinct from the 58 predicted adapter proteins in the previously described TIR-X, TIR-NBS, and CC-NBS groups. Classification based on protein domains, intron positions, sequence conservation, and genome distribution defined four subgroups of CNL proteins, eight subgroups of TNL proteins, and a pair of divergent NL proteins that lack a defined N-terminal motif. CNL proteins generally were encoded in single exons, although two subclasses were identified that contained introns in unique positions. TNL proteins were encoded in modular exons, with conserved intron positions separating distinct protein domains. Conserved motifs were identified in the LRRs of both CNL and TNL proteins. In contrast to CNL proteins, TNL proteins contained large and variable C-terminal domains. The extant distribution and diversity of the NBS-LRR sequences has been generated by extensive duplication and ectopic rearrangements that involved segmental duplications as well as microscale events. The observed diversity of these NBS-LRR proteins indicates the variety of recognition molecules available in an individual genotype to detect diverse biotic challenges.

...read moreread less

1,503 citations

Journal Article•DOI•

The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut.

[...]

David J. Bertioli¹, David J. Bertioli², Steven B. Cannon³, Lutz Froenicke⁴, Guodong Huang, Andrew Farmer⁵, Ethalinda K. S. Cannon⁶, Xin Liu, Dongying Gao¹, Josh Clevenger¹, Sudhansu Dash⁵, Longhui Ren⁶, Márcio C. Moretzsohn⁷, Kenta Shirasawa, Wei Huang⁶, Bruna Vidigal⁷, Bruna Vidigal², Brian Abernathy¹, Ye Chu¹, Chad E. Niederhuth¹, Pooja E. Umale⁵, Ana Claudia Guerra Araujo⁷, Alexander Kozik⁴, Kyung Do Kim¹, Mark D. Burow⁸, Mark D. Burow⁹, Rajeev K. Varshney¹⁰, Xingjun Wang, Xinyou Zhang, Noelle A. Barkley³, Noelle A. Barkley¹¹, Patricia M. Guimarães⁷, Sachiko Isobe, Baozhu Guo³, Boshou Liao¹², H. Thomas Stalker¹³, Robert J. Schmitz¹, Brian E. Scheffler³, Soraya C. M. Leal-Bertioli⁷, Soraya C. M. Leal-Bertioli¹, Xu Xun, Scott A. Jackson¹, Richard W Michelmore⁴, Peggy Ozias-Akins¹ - Show less +40 more•Institutions (13)

University of Georgia¹, University of Brasília², United States Department of Agriculture³, University of California, Davis⁴, National Center for Genome Resources⁵, Iowa State University⁶, Empresa Brasileira de Pesquisa Agropecuária⁷, Texas Tech University⁸, Texas A&M University⁹, International Crops Research Institute for the Semi-Arid Tropics¹⁰, International Potato Center¹¹, Crops Research Institute¹², North Carolina State University¹³

01 Apr 2016-Nature Genetics

TL;DR: The genome sequences of its diploid ancestors are reported to show that these genomes are similar to cultivated peanut's A and B subgenomes and used to identify candidate disease resistance genes, to guide tetraploid transcript assemblies and to detect genetic exchange between cultivated peanuts' subgenome.

...read moreread less

Abstract: Cultivated peanut (Arachis hypogaea) is an allotetraploid with closely related subgenomes of a total size of ∼2.7 Gb. This makes the assembly of chromosomal pseudomolecules very challenging. As a foundation to understanding the genome of cultivated peanut, we report the genome sequences of its diploid ancestors (Arachis duranensis and Arachis ipaensis). We show that these genomes are similar to cultivated peanut's A and B subgenomes and use them to identify candidate disease resistance genes, to guide tetraploid transcript assemblies and to detect genetic exchange between cultivated peanut's subgenomes. On the basis of remarkably high DNA identity of the A. ipaensis genome and the B subgenome of cultivated peanut and biogeographic evidence, we conclude that A. ipaensis may be a direct descendant of the same population that contributed the B subgenome to cultivated peanut.

...read moreread less

643 citations

Journal Article•DOI•

Multiple Paleopolyploidizations during the Evolution of the Compositae Reveal Parallel Patterns of Duplicate Gene Retention after Millions of Years

[...]

Michael S. Barker¹, Nolan C. Kane², Nolan C. Kane¹, Marta Matvienko³, Alexander Kozik³, Richard W Michelmore³, Steven J. Knapp⁴, Loren H. Rieseberg¹, Loren H. Rieseberg² - Show less +5 more•Institutions (4)

University of British Columbia¹, Indiana University², University of California, Davis³, University of Georgia⁴

01 Nov 2008-Molecular Biology and Evolution

TL;DR: It is suggested that paleopolyploidy can yield strikingly consistent signatures of gene retention in plant genomes despite extensive lineage radiations and recurrent genome duplications but that these patterns vary substantially among higher taxonomic categories.

...read moreread less

Abstract: Of the approximately 250,000 species of flowering plants, nearly one in ten are members of the Compositae (Asteraceae), a diverse family found in almost every habitat on all continents except Antarctica. With an origin in the mid Eocene, the Compositae is also a relatively young family with remarkable diversifications during the last 40 My. Previous cytologic and systematic investigations suggested that paleopolyploidy may have occurred in at least one Compositae lineage, but a recent analysis of genomic data was equivocal. We tested for evidence of paleopolyploidy in the evolutionary history of the family using recently available expressed sequence tag (EST) data from the Compositae Genome Project. Combined with data available on GenBank, we analyzed nearly 1 million ESTs from 18 species representing seven genera and four tribes. Our analyses revealed at least three ancient whole-genome duplications in the Compositae-a paleopolyploidization shared by all analyzed taxa and placed near the origin of the family just prior to the rapid radiation of its tribes and independent genome duplications near the base of the tribes Mutisieae and Heliantheae. These results are consistent with previous research implicating paleopolyploidy in the evolution and diversification of the Heliantheae. Further, we observed parallel retention of duplicate genes from the basal Compositae genome duplication across all tribes, despite divergence times of 33-38 My among these lineages. This pattern of retention was also repeated for the paleologs from the Heliantheae duplication. Intriguingly, the categories of genes retained in duplicate were substantially different from those in Arabidopsis. In particular, we found that genes annotated to structural components or cellular organization Gene Ontology categories were significantly enriched among paleologs, whereas genes associated with transcription and other regulatory functions were significantly underrepresented. Our results suggest that paleopolyploidy can yield strikingly consistent signatures of gene retention in plant genomes despite extensive lineage radiations and recurrent genome duplications but that these patterns vary substantially among higher taxonomic categories.

...read moreread less

331 citations

Journal Article•DOI•

Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce.

[...]

Sebastian Reyes-Chin-Wo¹, Zhiwen Wang, Xinhua Yang, Alexander Kozik¹, Siwaret Arikit², Chi Song, Liangfeng Xia, Lutz Froenicke¹, Dean Lavelle¹, Maria Jose Truco¹, Rui Xia³, Shilin Zhu, Chunyan Xu, Huaqin Xu¹, Xun Xu, Kyle Cox¹, Ian F Korf¹, Blake C. Meyers³, Blake C. Meyers², Richard W Michelmore - Show less +16 more•Institutions (3)

University of California, Davis¹, Delaware Biotechnology Institute², Donald Danforth Plant Science Center³

12 Apr 2017-Nature Communications

TL;DR: This work identifies several genomic features that may have contributed to the success of the Compositae family of flowering plants, including genes encoding Cycloidea-like transcription factors, kinases, enzymes involved in rubber biosynthesis and disease resistance proteins that are expanded in the genome.

...read moreread less

Abstract: Lettuce (Lactuca sativa) is a major crop and a member of the large, highly successful Compositae family of flowering plants. Here we present a reference assembly for the species and family. This was generated using whole-genome shotgun Illumina reads plus in vitro proximity ligation data to create large superscaffolds; it was validated genetically and superscaffolds were oriented in genetic bins ordered along nine chromosomal pseudomolecules. We identify several genomic features that may have contributed to the success of the family, including genes encoding Cycloidea-like transcription factors, kinases, enzymes involved in rubber biosynthesis and disease resistance proteins that are expanded in the genome. We characterize 21 novel microRNAs, one of which may trigger phasiRNAs from numerous kinase transcripts. We provide evidence for a whole-genome triplication event specific but basal to the Compositae. We detect 26% of the genome in triplicated regions containing 30% of all genes that are enriched for regulatory sequences and depleted for genes involved in defence.

...read moreread less

281 citations

Journal Article•DOI•

Next generation sequencing provides rapid access to the genome of Puccinia striiformis f. sp. tritici, the causal agent of wheat stripe rust.

[...]

Dario Cantu¹, Manjula Govindarajulu¹, Alexander Kozik¹, Meinan Wang², Xianming Chen², Xianming Chen³, Kenji K. Kojima⁴, Jerzy Jurka⁴, Richard W Michelmore¹, Jorge Dubcovsky⁵, Jorge Dubcovsky¹, Jorge Dubcovsky⁶ - Show less +8 more•Institutions (6)

University of California, Davis¹, Washington State University², United States Department of Agriculture³, Genetic Information Research Institute⁴, Howard Hughes Medical Institute⁵, Gordon and Betty Moore Foundation⁶

31 Aug 2011-PLOS ONE

TL;DR: Cantu et al. as discussed by the authors used Illumina sequencing to rapidly access the genomic sequence of the highly virulent PST race 130 (PST-130), which was assembled into 29,178 contigs (64.8 Mb).

...read moreread less

Abstract: Author(s): Cantu, Dario; Govindarajulu, Manjula; Kozik, Alex; Wang, Meinan; Chen, Xianming; Kojima, Kenji K; Jurka, Jerzy; Michelmore, Richard W; Dubcovsky, Jorge | Abstract: BackgroundThe wheat stripe rust fungus (Puccinia striiformis f. sp. tritici, PST) is responsible for significant yield losses in wheat production worldwide. In spite of its economic importance, the PST genomic sequence is not currently available. Fortunately Next Generation Sequencing (NGS) has radically improved sequencing speed and efficiency with a great reduction in costs compared to traditional sequencing technologies. We used Illumina sequencing to rapidly access the genomic sequence of the highly virulent PST race 130 (PST-130).Methodology/principal findingsWe obtained nearly 80 million high quality paired-end reads (g50x coverage) that were assembled into 29,178 contigs (64.8 Mb), which provide an estimated coverage of at least 88% of the PST genes and are available through GenBank. Extensive micro-synteny with the Puccinia graminis f. sp. tritici (PGTG) genome and high sequence similarity with annotated PGTG genes support the quality of the PST-130 contigs. We characterized the transposable elements present in the PST-130 contigs and using an ab initio gene prediction program we identified and tentatively annotated 22,815 putative coding sequences. We provide examples on the use of comparative approaches to improve gene annotation for both PST and PGTG and to identify candidate effectors. Finally, the assembled contigs provided an inventory of PST repetitive elements, which were annotated and deposited in Repbase.Conclusions/significanceThe assembly of the PST-130 genome and the predicted proteins provide useful resources to rapidly identify and clone PST genes and their regulatory regions. Although the automatic gene prediction has limitations, we show that a comparative genomics approach using multiple rust species can greatly improve the quality of gene annotation in these species. The PST-130 sequence will also be useful for comparative studies within PST as more races are sequenced. This study illustrates the power of NGS for rapid and efficient access to genomic sequence in non-model organisms.

...read moreread less

181 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Circos: An information aesthetic for comparative genomics

[...]

Martin Krzywinski, Jacqueline E. Schein, Inanc Birol, Joseph M. Connors, Randy D. Gascoyne, Doug Horsman, Steven J.M. Jones, Marco A. Marra - Show less +4 more

01 Sep 2009-Genome Research

TL;DR: Circos uses a circular ideogram layout to facilitate the display of relationships between pairs of positions by the use of ribbons, which encode the position, size, and orientation of related genomic elements.

...read moreread less

Abstract: We created a visualization tool called Circos to facilitate the identification and analysis of similarities and differences arising from comparisons of genomes. Our tool is effective in displaying variation in genome structure and, generally, any other kind of positional relationships between genomic intervals. Such data are routinely produced by sequence alignments, hybridization arrays, genome mapping, and genotyping studies. Circos uses a circular ideogram layout to facilitate the display of relationships between pairs of positions by the use of ribbons, which encode the position, size, and orientation of related genomic elements. Circos is capable of displaying data as scatter, line, and histogram plots, heat maps, tiles, connectors, and text. Bitmap or vector images can be created from GFF-style data inputs and hierarchical configuration files, which can be easily generated by automated tools, making Circos suitable for rapid deployment in data analysis and reporting pipelines.

...read moreread less

8,315 citations

Journal Article•DOI•

MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity

[...]

Yupeng Wang¹, Haibao Tang¹, Jeremy D. DeBarry¹, Xu-fei Tan¹, Jingping Li¹, Xiyin Wang¹, Tae-Ho Lee¹, Huizhe Jin¹, Barry S. Marler¹, Hui Guo¹, Jessica C. Kissinger¹, Andrew H. Paterson¹ - Show less +8 more•Institutions (1)

Plant Genome Mapping Laboratory¹

01 Apr 2012-Nucleic Acids Research

TL;DR: The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses.

...read moreread less

Abstract: MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/.

...read moreread less

3,388 citations

Journal Article•DOI•

The Sorghum bicolor genome and the diversification of grasses

[...]

Andrew H. Paterson¹, John E. Bowers¹, Rémy Bruggmann², Inna Dubchak³, Jane Grimwood⁴, Heidrun Gundlach, Georg Haberer, Uffe Hellsten³, Therese Mitros⁵, Alexander Poliakov³, Jeremy Schmutz⁴, Manuel Spannagl, Haibao Tang¹, Xiyin Wang⁶, Xiyin Wang¹, Thomas Wicker⁷, Arvind K. Bharti², Jarrod Chapman³, F. Alex Feltus¹, F. Alex Feltus⁸, Udo Gowik⁹, Igor V. Grigoriev³, Eric Lyons⁵, Christopher G. Maher¹⁰, Mihaela Martis, Apurva Narechania¹⁰, Robert Otillar³, Bryan W. Penning¹¹, Asaf Salamov³, Yu Wang, Lifang Zhang¹⁰, Nicholas C. Carpita¹¹, Michael Freeling⁵, Alan R. Gingle¹, C. Thomas Hash¹², Beat Keller⁷, Patricia E. Klein¹³, Stephen Kresovich¹⁴, Maureen C. McCann¹¹, Ray Ming¹⁵, Daniel G. Peterson¹, Daniel G. Peterson¹⁶, Mehboob-ur-Rahman¹⁷, Mehboob-ur-Rahman¹, Doreen Ware¹⁰, Doreen Ware¹⁸, Peter Westhoff⁹, Klaus F. X. Mayer, Joachim Messing², Daniel S. Rokhsar³, Daniel S. Rokhsar⁴ - Show less +47 more•Institutions (18)

University of Georgia¹, Rutgers University², United States Department of Energy³, Stanford University⁴, University of California, Berkeley⁵, North China University of Science and Technology⁶, University of Zurich⁷, Clemson University⁸, University of Düsseldorf⁹, Cold Spring Harbor Laboratory¹⁰, Purdue University¹¹, International Crops Research Institute for the Semi-Arid Tropics¹², Texas A&M University¹³, Cornell University¹⁴, University of Illinois at Urbana–Champaign¹⁵, Mississippi State University¹⁶, National Institute for Biotechnology and Genetic Engineering¹⁷, United States Department of Agriculture¹⁸

29 Jan 2009-Nature

TL;DR: An initial analysis of the ∼730-megabase Sorghum bicolor (L.) Moench genome is presented, placing ∼98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information.

...read moreread less

Abstract: Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.

...read moreread less

2,809 citations

Journal Article•DOI•

The Top 10 fungal pathogens in molecular plant pathology

[...]

Ralph A. Dean¹, Jan A. L. van Kan², Zacharias A. Pretorius³, Kim E. Hammond-Kosack⁴, Antonio Di Pietro⁵, Pietro Spanu⁶, Jason J. Rudd⁴, Martin B. Dickman⁷, Regine Kahmann⁸, Jeff Ellis⁹, Gary D. Foster¹⁰ - Show less +7 more•Institutions (10)

North Carolina State University¹, Wageningen University and Research Centre², University of the Free State³, Rothamsted Research⁴, University of Córdoba (Spain)⁵, Imperial College London⁶, Texas A&M University⁷, Max Planck Society⁸, Commonwealth Scientific and Industrial Research Organisation⁹, University of Bristol¹⁰

01 May 2012-Molecular Plant Pathology

TL;DR: A short resumé of each fungus in the Top 10 list and its importance is presented, with the intent of initiating discussion and debate amongst the plant mycology community, as well as laying down a bench-mark.

...read moreread less

Abstract: The aim of this review was to survey all fungal pathologists with an association with the journal Molecular Plant Pathology and ask them to nominate which fungal pathogens they would place in a 'Top 10' based on scientific/economic importance. The survey generated 495 votes from the international community, and resulted in the generation of a Top 10 fungal plant pathogen list for Molecular Plant Pathology. The Top 10 list includes, in rank order, (1) Magnaporthe oryzae; (2) Botrytis cinerea; (3) Puccinia spp.; (4) Fusarium graminearum; (5) Fusarium oxysporum; (6) Blumeria graminis; (7) Mycosphaerella graminicola; (8) Colletotrichum spp.; (9) Ustilago maydis; (10) Melampsora lini, with honourable mentions for fungi just missing out on the Top 10, including Phakopsora pachyrhizi and Rhizoctonia solani. This article presents a short resume of each fungus in the Top 10 list and its importance, with the intent of initiating discussion and debate amongst the plant mycology community, as well as laying down a bench-mark. It will be interesting to see in future years how perceptions change and what fungi will comprise any future Top 10.

...read moreread less

2,807 citations

Journal Article•DOI•

A Renaissance of Elicitors: Perception of Microbe-Associated Molecular Patterns and Danger Signals by Pattern-Recognition Receptors

[...]

Thomas Boller¹, Georg Felix²•Institutions (2)

University of Basel¹, University of Tübingen²

28 Apr 2009-Annual Review of Plant Biology

TL;DR: Current evidence indicates that MAMPs, DAMPs, and effectors are all perceived as danger signals and induce a stereotypic defense response, and the importance of MAMP/PRR signaling for plant immunity is highlighted.

...read moreread less

Abstract: Microbe-associated molecular patterns (MAMPs) are molecular signatures typical of whole classes of microbes, and their recognition plays a key role in innate immunity. Endogenous elicitors are similarly recognized as damage-associated molecular patterns (DAMPs). This review focuses on the diversity of MAMPs/DAMPs and on progress to identify the corresponding pattern recognition receptors (PRRs) in plants. The two best-characterized MAMP/PRR pairs, flagellin/FLS2 and EF-Tu/EFR, are discussed in detail and put into a phylogenetic perspective. Both FLS2 and EFR are leucine-rich repeat receptor kinases (LRR-RKs). Upon treatment with flagellin, FLS2 forms a heteromeric complex with BAK1, an LRR-RK that also acts as coreceptor for the brassinolide receptor BRI1. The importance of MAMP/PRR signaling for plant immunity is highlighted by the finding that plant pathogens use effectors to inhibit PRR complexes or downstream signaling events. Current evidence indicates that MAMPs, DAMPs, and effectors are all perceived as danger signals and induce a stereotypic defense response.

...read moreread less

2,801 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse