scispace - formally typeset
Search or ask a question
Author

Theo Borm

Bio: Theo Borm is an academic researcher from Wageningen University and Research Centre. The author has contributed to research in topics: Genome & Reference genome. The author has an hindex of 12, co-authored 24 publications receiving 3013 citations.

Papers
More filters
Journal ArticleDOI
Xun Xu1, Shengkai Pan1, Shifeng Cheng1, Bo Zhang1, Mu D1, Peixiang Ni1, Gengyun Zhang1, Shuang Yang1, Ruiqiang Li1, Jun Wang1, Gisella Orjeda2, Frank Guzman2, Torres M2, Roberto Lozano2, Olga Ponce2, Diana Martinez2, De la Cruz G3, Chakrabarti Sk3, Patil Vu3, Konstantin G. Skryabin4, Boris B. Kuznetsov4, Nikolai V. Ravin4, Tatjana V. Kolganova4, Alexey V. Beletsky4, Andrey V. Mardanov4, Di Genova A5, Dan Bolser5, David M. A. Martin5, Li G, Yang Y, Hanhui Kuang6, Hu Q6, Xiong X7, Gerard J. Bishop8, Boris Sagredo, Nilo Mejía, Zagorski W9, Robert Gromadka9, Jan Gawor9, Pawel Szczesny9, Sanwen Huang, Zhang Z, Liang C, He J, Li Y, He Y, Xu J, Youjun Zhang, Xie B, Du Y, Qu D, Merideth Bonierbale10, Marc Ghislain10, Herrera Mdel R, Giovanni Giuliano, Marco Pietrella, Gaetano Perrotta, Paolo Facella, O'Brien K11, Sergio Enrique Feingold, Barreiro Le, Massa Ga, Luis Aníbal Diambra12, Brett R Whitty13, Brieanne Vaillancourt13, Lin H13, Alicia N. Massa13, Geoffroy M13, Lundback S13, Dean DellaPenna13, Buell Cr14, Sanjeev Kumar Sharma14, David Marshall14, Robbie Waugh14, Glenn J. Bryan14, Destefanis M15, Istvan Nagy15, Dan Milbourne15, Susan Thomson16, Mark Fiers16, Jeanne M. E. Jacobs16, Kåre Lehmann Nielsen17, Mads Sønderkær17, Marina Iovene18, Giovana Augusta Torres18, Jiming Jiang18, Richard E. Veilleux19, Christian W. B. Bachem20, de Boer J20, Theo Borm20, Bjorn Kloosterman20, van Eck H20, Erwin Datema20, Hekkert Bt20, Aska Goverse20, van Ham Rc20, Richard G. F. Visser20 
10 Jul 2011-Nature
TL;DR: The potato genome sequence provides a platform for genetic improvement of this vital crop and predicts 39,031 protein-coding genes and presents evidence for at least two genome duplication events indicative of a palaeopolyploid origin.
Abstract: Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop.

1,813 citations

Journal ArticleDOI
16 Feb 2017-Nature
TL;DR: The assembly of a high-quality, chromosome-scale reference genome sequence for quinoa was produced using single-molecule real-time sequencing in combination with optical, chromosomes-contact and genetic maps, which facilitated the identification of the transcription factor likely to control the production of anti-nutritional triterpenoid saponins found in quinoa seeds.
Abstract: Chenopodium quinoa (quinoa) is a highly nutritious grain identified as an important crop to improve world food security. Unfortunately, few resources are available to facilitate its genetic improvement. Here we report the assembly of a high-quality, chromosome-scale reference genome sequence for quinoa, which was produced using single-molecule real-time sequencing in combination with optical, chromosome-contact and genetic maps. We also report the sequencing of two diploids from the ancestral gene pools of quinoa, which enables the identification of sub-genomes in quinoa, and reduced-coverage genome sequences for 22 other samples of the allotetraploid goosefoot complex. The genome sequence facilitated the identification of the transcription factor likely to control the production of anti-nutritional triterpenoid saponins found in quinoa seeds, including a mutation that appears to cause alternative splicing and a premature stop codon in sweet quinoa strains. These genomic resources are an important first step towards the genetic improvement of quinoa.

467 citations

Journal ArticleDOI
TL;DR: A local RGA approach was adopted using genomic information from the model Solanaceous plant tomato to isolate R3a, a potato gene that confers race-specific resistance to the late blight pathogen Phytophthora infestans and some of its paralogues whose functions remain unknown.
Abstract: Comparative genomics provides a tool to utilize the exponentially increasing sequence information from model plants to clone agronomically important genes from less studied crop species. Plant disease resistance (R) loci frequently lack synteny between related species of cereals and crucifers but appear to be positionally well conserved in the Solanaceae. In this report, we adopted a local RGA approach using genomic information from the model Solanaceous plant tomato to isolate R3a, a potato gene that confers race-specific resistance to the late blight pathogen Phytophthora infestans. R3a is a member of the R3 complex locus on chromosome 11. Comparative analyses of the R3 complex locus with the corresponding I2 complex locus in tomato suggest that this is an ancient locus involved in plant innate immunity against oomycete and fungal pathogens. However, the R3 complex locus has evolved after divergence from tomato and the locus has experienced a significant expansion in potato without disruption of the flanking colinearity. This expansion has resulted in an increase in the number of R genes and in functional diversification, which has probably been driven by the co-evolutionary history between P. infestans and its host potato. Constitutive expression was observed for the R3a gene, as well as some of its paralogues whose functions remain unknown.

365 citations

Journal ArticleDOI
08 May 2013-PLOS ONE
TL;DR: This study demonstrates the accuracy of genotyping-by-sequencing (GBS) of a large collection of autotetraploid potato cultivars using next-generation sequencing and validation showed that read depths of ∼60–80× can be used as a lower boundary for reliable assessment of allele copy number of sequence variants in autottraploids
Abstract: Assessment of genomic DNA sequence variation and genotype calling in autotetraploids implies the ability to distinguish among five possible alternative allele copy number states. This study demonstrates the accuracy of genotyping-by-sequencing (GBS) of a large collection of autotetraploid potato cultivars using next-generation sequencing. It is still costly to reach sufficient read depths on a genome wide scale, across the cultivated gene pool. Therefore, we enriched cultivar-specific DNA sequencing libraries using an in-solution hybridisation method (SureSelect). This complexity reduction allowed to confine our study to 807 target genes distributed across the genomes of 83 tetraploid cultivars and one reference (DM 1–3 511). Indexed sequencing libraries were paired-end sequenced in 7 pools of 12 samples using Illumina HiSeq2000. After filtering and processing the raw sequence data, 12.4 Gigabases of high-quality sequence data was obtained, which mapped to 2.1 Mb of the potato reference genome, with a median average read depth of 63× per cultivar. We detected 129,156 sequence variants and genotyped the allele copy number of each variant for every cultivar. In this cultivar panel a variant density of 1 SNP/24 bp in exons and 1 SNP/15 bp in introns was obtained. The average minor allele frequency (MAF) of a variant was 0.14. Potato germplasm displayed a large number of relatively rare variants and/or haplotypes, with 61% of the variants having a MAF below 0.05. A very high average nucleotide diversity (p = 0.0107) was observed. Nucleotide diversity varied among potato chromosomes. Several genes under selection were identified. Genotyping-by-sequencing results, with allele copy number estimates, were validated with a KASP genotyping assay. This validation showed that read depths of ~60–80× can be used as a lower boundary for reliable assessment of allele copy number of sequence variants in autotetraploids. Genotypic data were associated with traits, and alleles strongly influencing maturity and flesh colour were identified.

299 citations

Journal ArticleDOI
TL;DR: It is demonstrated that the mesohexaploidization of the two Brassica genomes contributed to their diversification into heading and tuber-forming morphotypes through convergent subgenome parallel selection of paralogous genes.
Abstract: Brassica species, including crops such as cabbage, turnip and oilseed, display enormous phenotypic variation. Brassica genomes have all undergone a whole-genome triplication (WGT) event with unknown effects on phenotype diversification. We resequenced 199 Brassica rapa and 119 Brassica oleracea accessions representing various morphotypes and identified signals of selection at the mesohexaploid subgenome level. For cabbage morphotypes with their typical leaf-heading trait, we identified four subgenome loci that show signs of parallel selection among subgenomes within B. rapa, as well as four such loci within B. oleracea. Fifteen subgenome loci are under selection and are shared by these two species. We also detected strong subgenome parallel selection linked to the domestication of the tuberous morphotypes, turnip (B. rapa) and kohlrabi (B. oleracea). Overall, we demonstrated that the mesohexaploidization of the two Brassica genomes contributed to their diversification into heading and tuber-forming morphotypes through convergent subgenome parallel selection of paralogous genes.

259 citations


Cited by
More filters
01 Jun 2012
TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).
Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

10,124 citations

Journal ArticleDOI
TL;DR: Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number of complete plant genomes.
Abstract: The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance.

3,728 citations

Journal ArticleDOI
Shusei Sato, Satoshi Tabata, Hideki Hirakawa, Erika Asamizu  +320 moreInstitutions (51)
31 May 2012-Nature
TL;DR: A high-quality genome sequence of domesticated tomato is presented, a draft sequence of its closest wild relative, Solanum pimpinellifolium, is compared, and the two tomato genomes are compared to each other and to the potato genome.
Abstract: Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera1 and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium2, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.

2,687 citations

Journal ArticleDOI
04 Oct 2012-Nature
TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.
Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.

1,806 citations

10 Dec 2007
TL;DR: The experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
Abstract: EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

1,528 citations