scispace - formally typeset
Search or ask a question
Author

Frank Guzman

Bio: Frank Guzman is an academic researcher from Universidade Federal do Rio Grande do Sul. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 13, co-authored 33 publications receiving 2457 citations. Previous affiliations of Frank Guzman include Cayetano Heredia University & International Potato Center.

Papers
More filters
Journal ArticleDOI
Xun Xu1, Shengkai Pan1, Shifeng Cheng1, Bo Zhang1, Mu D1, Peixiang Ni1, Gengyun Zhang1, Shuang Yang1, Ruiqiang Li1, Jun Wang1, Gisella Orjeda2, Frank Guzman2, Torres M2, Roberto Lozano2, Olga Ponce2, Diana Martinez2, De la Cruz G3, Chakrabarti Sk3, Patil Vu3, Konstantin G. Skryabin4, Boris B. Kuznetsov4, Nikolai V. Ravin4, Tatjana V. Kolganova4, Alexey V. Beletsky4, Andrey V. Mardanov4, Di Genova A5, Dan Bolser5, David M. A. Martin5, Li G, Yang Y, Hanhui Kuang6, Hu Q6, Xiong X7, Gerard J. Bishop8, Boris Sagredo, Nilo Mejía, Zagorski W9, Robert Gromadka9, Jan Gawor9, Pawel Szczesny9, Sanwen Huang, Zhang Z, Liang C, He J, Li Y, He Y, Xu J, Youjun Zhang, Xie B, Du Y, Qu D, Merideth Bonierbale10, Marc Ghislain10, Herrera Mdel R, Giovanni Giuliano, Marco Pietrella, Gaetano Perrotta, Paolo Facella, O'Brien K11, Sergio Enrique Feingold, Barreiro Le, Massa Ga, Luis Aníbal Diambra12, Brett R Whitty13, Brieanne Vaillancourt13, Lin H13, Alicia N. Massa13, Geoffroy M13, Lundback S13, Dean DellaPenna13, Buell Cr14, Sanjeev Kumar Sharma14, David Marshall14, Robbie Waugh14, Glenn J. Bryan14, Destefanis M15, Istvan Nagy15, Dan Milbourne15, Susan Thomson16, Mark Fiers16, Jeanne M. E. Jacobs16, Kåre Lehmann Nielsen17, Mads Sønderkær17, Marina Iovene18, Giovana Augusta Torres18, Jiming Jiang18, Richard E. Veilleux19, Christian W. B. Bachem20, de Boer J20, Theo Borm20, Bjorn Kloosterman20, van Eck H20, Erwin Datema20, Hekkert Bt20, Aska Goverse20, van Ham Rc20, Richard G. F. Visser20 
10 Jul 2011-Nature
TL;DR: The potato genome sequence provides a platform for genetic improvement of this vital crop and predicts 39,031 protein-coding genes and presents evidence for at least two genome duplication events indicative of a palaeopolyploid origin.
Abstract: Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop.

1,813 citations

Journal ArticleDOI
TL;DR: The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal “pseudomolecules”.
Abstract: The genome of potato, a major global food crop, was recently sequenced. The work presented here details the integration of the potato reference genome (DM) with a new sequence-tagged site marker−based linkage map and other physical and genetic maps of potato and the closely related species tomato. Primary anchoring of the DM genome assembly was accomplished by the use of a diploid segregating population, which was genotyped with several types of molecular genetic markers to construct a new ~936 cM linkage map comprising 2469 marker loci. In silico anchoring approaches used genetic and physical maps from the diploid potato genotype RH89-039-16 (RH) and tomato. This combined approach has allowed 951 superscaffolds to be ordered into pseudomolecules corresponding to the 12 potato chromosomes. These pseudomolecules represent 674 Mb (~93%) of the 723 Mb genome assembly and 37,482 (~96%) of the 39,031 predicted genes. The superscaffold order and orientation within the pseudomolecules are closely collinear with independently constructed high density linkage maps. Comparisons between marker distribution and physical location reveal regions of greater and lesser recombination, as well as regions exhibiting significant segregation distortion. The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal “pseudomolecules”.

236 citations

Journal ArticleDOI
TL;DR: One of the largest molecular marker studies of any crop landraces to date is provided, to include an extensive study of 742 landracing of all cultivated species (or Cultivar Groups) and 8 closely related wild species progenitors, with 50 nuclear simple sequence repeat (SSR) primer pairs and a plastid DNA deletion marker that distinguishes most lowland Chilean from upland Andean landrace.
Abstract: Contrasting taxonomic treatments of potato landraces have continued over the last century, with the recognition of anywhere from 1 to 21 distinct Linnean species, or of Cultivar Groups within the single species Solanum tuberosum. We provide one of the largest molecular marker studies of any crop landraces to date, to include an extensive study of 742 landraces of all cultivated species (or Cultivar Groups) and 8 closely related wild species progenitors, with 50 nuclear simple sequence repeat (SSR) (also known as microsatellite) primer pairs and a plastid DNA deletion marker that distinguishes most lowland Chilean from upland Andean landraces. Neighbor-joining results highlight a tendency to separate three groups: (i) putative diploids, (ii) putative tetraploids, and (iii) the hybrid cultivated species S. ajanhuiri (diploid), S. juzepczukii (triploid), and S. curtilobum (pentaploid). However, there are many exceptions to grouping by ploidy. Strong statistical support occurs only for S. ajanhuiri, S. juzepczukii, and S. curtilobum. In combination with recent morphological analyses and an examination of the identification history of these collections, we support the reclassification of the cultivated potatoes into four species: (i) S. tuberosum, with two Cultivar Groups (Andigenum Group of upland Andean genotypes containing diploids, triploids, and tetraploids, and the Chilotanum Group of lowland tetraploid Chilean landraces); (ii) S. ajanhuiri (diploid); (iii) S. juzepczukii (triploid); and (iv) S. curtilobum (pentaploid). For other classifications, consistent and stable identifications are impossible, and their classification as species is artificial and only maintains the confusion of users of the gene banks and literature.

223 citations

Journal ArticleDOI
TL;DR: A marker-specific set of allele size standards is made that conveniently and unambiguously provide accurate sizing of all alleles of the 24 SSR markers across laboratories and platforms to standardize the choice and allele sizing of microsatellites in potato and aid in collaborative projects by allowing cumulative analysis of independently generated data.
Abstract: The fingerprinting of 742 potato landraces with 51 simple sequence repeat (SSR, or microsatellite) markers resulted in improving a previously constructed potato genetic identity kit. All SSR marker loci were assayed with a collection of highly diverse landraces of all species of cultivated potato with ploidies ranging from diploid to pentaploid. Loci number, amplification reproducibility, and polymorphic information content were recorded. Out of 148 SSR markers of which 30 are new, we identified 58 new SSR marker locations on at least one of three potato genetic linkage maps. These results permitted the selection of a new potato genetic identity kit based on 24 SSR markers with two per chromosome separated by at least 10 cM, single locus, high polymorphic information content, and high quality of amplicons as determined by clarity and reproducibility. The comparison of a similarity matrix of 742 landraces obtained with the 24 SSR markers of the new kit and with the entire dataset of 51 SSR markers showed a high correlation (r = 0.94) by a Mantel test and even higher correlations (r = 0.99) regarding topological comparisons of major branches of a neighbor joining tree. This new potato genetic identity kit is able to discriminate 93.5% of the 742 landraces compared to 98.8% with 51 SSR markers. In addition, we made a marker-specific set of allele size standards that conveniently and unambiguously provide accurate sizing of all alleles of the 24 SSR markers across laboratories and platforms. The new potato genetic identity kit will be of particular utility to standardize the choice and allele sizing of microsatellites in potato and aid in collaborative projects by allowing cumulative analysis of independently generated data.

150 citations

Journal ArticleDOI
30 Nov 2012-PLOS ONE
TL;DR: A comparative study of the miRNA transcriptome of mature and developing B. napus seeds is presented and provides a basis for future research on individual miRNAs and their functions in embryogenesis, seed maturation and lipid accumulation in B. Napus.
Abstract: MicroRNAs (miRNAs) are important post-transcriptional regulators of plant development and seed formation. In Brassica napus, an important edible oil crop, valuable lipids are synthesized and stored in specific seed tissues during embryogenesis. The miRNA transcriptome of B. napus is currently poorly characterized, especially at different seed developmental stages. This work aims to describe the miRNAome of developing seeds of B. napus by identifying plant-conserved and novel miRNAs and comparing miRNA abundance in mature versus developing seeds. Members of 59 miRNA families were detected through a computational analysis of a large number of reads obtained from deep sequencing two small RNA and two RNA-seq libraries of (i) pooled immature developing stages and (ii) mature B. napus seeds. Among these miRNA families, 17 families are currently known to exist in B. napus; additionally 29 families not reported in B. napus but conserved in other plant species were identified by alignment with known plant mature miRNAs. Assembled mRNA-seq contigs allowed for a search of putative new precursors and led to the identification of 13 novel miRNA families. Analysis of miRNA population between libraries reveals that several miRNAs and isomiRNAs have different abundance in developing stages compared to mature seeds. The predicted miRNA target genes encode a broad range of proteins related to seed development and energy storage. This work presents a comparative study of the miRNA transcriptome of mature and developing B. napus seeds and provides a basis for future research on individual miRNAs and their functions in embryogenesis, seed maturation and lipid accumulation in B. napus.

64 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number of complete plant genomes.
Abstract: The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance.

3,728 citations

Journal ArticleDOI
Shusei Sato, Satoshi Tabata, Hideki Hirakawa, Erika Asamizu  +320 moreInstitutions (51)
31 May 2012-Nature
TL;DR: A high-quality genome sequence of domesticated tomato is presented, a draft sequence of its closest wild relative, Solanum pimpinellifolium, is compared, and the two tomato genomes are compared to each other and to the potato genome.
Abstract: Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera1 and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium2, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.

2,687 citations

Journal ArticleDOI
04 Oct 2012-Nature
TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.
Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.

1,806 citations

Journal ArticleDOI
TL;DR: The computational problems surrounding repeats are discussed and strategies used by current bioinformatics systems to solve them are described.
Abstract: Repetitive DNA sequences are abundant in a broad range of species, from bacteria to mammals, and they cover nearly half of the human genome. Repeats have always presented technical challenges for sequence alignment and assembly programs. Next-generation sequencing projects, with their short read lengths and high data volumes, have made these challenges more difficult. From a computational perspective, repeats create ambiguities in alignment and assembly, which, in turn, can produce biases and errors when interpreting results. Simply ignoring repeats is not an option, as this creates problems of its own and may mean that important biological phenomena are missed. We discuss the computational problems surrounding repeats and describe strategies used by current bioinformatics systems to solve them.

1,451 citations

Journal ArticleDOI
Klaus F. X. Mayer, Jane Rogers, Jaroslav Doležel1, Curtis J. Pozniak2, Kellye Eversole, Catherine Feuillet3, Bikram S. Gill4, Bernd Friebe4, Adam J. Lukaszewski5, Pierre Sourdille6, Takashi R. Endo7, M. Kubaláková1, Jarmila Číhalíková1, Zdeňka Dubská1, Jan Vrána1, Romana Šperková1, Hana Šimková1, Melanie Febrer8, Leah Clissold, Kirsten McLay, Kuldeep Singh9, Parveen Chhuneja9, Nagendra K. Singh10, Jitendra P. Khurana11, Eduard Akhunov4, Frédéric Choulet6, Adriana Alberti, Valérie Barbe, Patrick Wincker, Hiroyuki Kanamori12, Fuminori Kobayashi12, Takeshi Itoh12, Takashi Matsumoto12, Hiroaki Sakai12, Tsuyoshi Tanaka12, Jianzhong Wu12, Yasunari Ogihara13, Hirokazu Handa12, P. Ron Maclachlan2, Andrew G. Sharpe14, Darrin Klassen14, David Edwards, Jacqueline Batley, Odd-Arne Olsen, Simen Rød Sandve15, Sigbjørn Lien15, Burkhard Steuernagel16, Brande B. H. Wulff16, Mario Caccamo, Sarah Ayling, Ricardo H. Ramirez-Gonzalez, Bernardo J. Clavijo, Jonathan M. Wright, Matthias Pfeifer, Manuel Spannagl, Mihaela Martis, Martin Mascher17, Jarrod Chapman18, Jesse Poland4, Uwe Scholz17, Kerrie Barry18, Robbie Waugh19, Daniel S. Rokhsar18, Gary J. Muehlbauer, Nils Stein17, Heidrun Gundlach, Matthias Zytnicki20, Véronique Jamilloux20, Hadi Quesneville20, Thomas Wicker21, Primetta Faccioli, Moreno Colaiacovo, Antonio Michele Stanca, Hikmet Budak22, Luigi Cattivelli, Natasha Glover6, Lise Pingault6, Etienne Paux6, Sapna Sharma, Rudi Appels23, Matthew I. Bellgard23, Brett Chapman23, Thomas Nussbaumer, Kai Christian Bader, Hélène Rimbert, Shichen Wang4, Ron Knox, Andrzej Kilian, Michael Alaux20, Françoise Alfama20, Loïc Couderc20, Nicolas Guilhot6, Claire Viseux20, Mikaël Loaec20, Beat Keller21, Sébastien Praud 
18 Jul 2014-Science
TL;DR: Insight into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.
Abstract: An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance was found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.

1,421 citations