Author
Sanjeev Kumar Sharma
Other affiliations: National Botanical Research Institute, Seattle Children's Research Institute, Scottish Crop Research Institute
Bio: Sanjeev Kumar Sharma is an academic researcher from James Hutton Institute. The author has contributed to research in topics: Biology & Population. The author has an hindex of 14, co-authored 21 publications receiving 2681 citations. Previous affiliations of Sanjeev Kumar Sharma include National Botanical Research Institute & Seattle Children's Research Institute.
Topics: Biology, Population, Genome, Somatic embryogenesis, Horticulture
Papers
More filters
••
Beijing Institute of Genomics1, Cayetano Heredia University2, Indian Council of Agricultural Research3, Russian Academy of Sciences4, University of Dundee5, Huazhong Agricultural University6, Hunan Agricultural University7, Imperial College London8, Polish Academy of Sciences9, International Potato Center10, J. Craig Venter Institute11, National University of La Plata12, Michigan State University13, James Hutton Institute14, Teagasc15, Plant & Food Research16, Aalborg University17, University of Wisconsin-Madison18, Virginia Tech19, Wageningen University and Research Centre20
TL;DR: The potato genome sequence provides a platform for genetic improvement of this vital crop and predicts 39,031 protein-coding genes and presents evidence for at least two genome duplication events indicative of a palaeopolyploid origin.
Abstract: Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop.
1,813 citations
••
TL;DR: By establishing the phylogenetic and positional relationship of potato NB-LRRs, the analysis offers significant insight into the evolution of potato R genes and provides a blueprint for future efforts to identify and more rapidly clone functional NB- LRR genes from Solanum species.
Abstract: The potato genome sequence derived from the Solanum tuberosum Group Phureja clone DM1-3 516 R44 provides unparalleled insight into the genome composition and organisation of this important crop. A key class of genes that comprises the vast majority of plant resistance (R) genes contains a nucleotide-binding and leucine-rich repeat domain, and is collectively known as NB-LRRs. As part of an effort to accelerate the process of functional R gene isolation, we performed an amino acid motif based search of the annotated potato genome and identified 438 NB-LRR type genes among the ~39,000 potato gene models. Of the predicted genes, 77 contain an N-terminal toll/interleukin 1 receptor (TIR)-like domain, and 107 of the remaining 361 non-TIR genes contain an N-terminal coiled-coil (CC) domain. Physical map positions were established for 370 predicted NB-LRR genes across all 12 potato chromosomes. The majority of NB-LRRs are physically organised within 63 identified clusters, of which 50 are homogeneous in that they contain NB-LRRs derived from a recent common ancestor. By establishing the phylogenetic and positional relationship of potato NB-LRRs, our analysis offers significant insight into the evolution of potato R genes. Furthermore, the data provide a blueprint for future efforts to identify and more rapidly clone functional NB-LRR genes from Solanum species.
276 citations
••
James Hutton Institute1, University of Dundee2, Wageningen University and Research Centre3, Aalborg University4, International Potato Center5, International Trademark Association6, University of Chile7, Michigan State University8, Cayetano Heredia University9, Teagasc10, Virginia Tech11, Plant & Food Research12
TL;DR: The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal “pseudomolecules”.
Abstract: The genome of potato, a major global food crop, was recently sequenced. The work presented here details the integration of the potato reference genome (DM) with a new sequence-tagged site marker−based linkage map and other physical and genetic maps of potato and the closely related species tomato. Primary anchoring of the DM genome assembly was accomplished by the use of a diploid segregating population, which was genotyped with several types of molecular genetic markers to construct a new ~936 cM linkage map comprising 2469 marker loci. In silico anchoring approaches used genetic and physical maps from the diploid potato genotype RH89-039-16 (RH) and tomato. This combined approach has allowed 951 superscaffolds to be ordered into pseudomolecules corresponding to the 12 potato chromosomes. These pseudomolecules represent 674 Mb (~93%) of the 723 Mb genome assembly and 37,482 (~96%) of the 39,031 predicted genes. The superscaffold order and orientation within the pseudomolecules are closely collinear with independently constructed high density linkage maps. Comparisons between marker distribution and physical location reveal regions of greater and lesser recombination, as well as regions exhibiting significant segregation distortion. The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal “pseudomolecules”.
236 citations
••
TL;DR: The level of variation detected within cultivated lentils suggests that RAPD markers may be an appropriate technology for the construction of genetic linkage maps between closely related Lens accessions.
Abstract: RAPD markers were used to distinguish between six different Lens taxa, representing cultivated lentil and its wild relatives. Twenty-four arbitrary sequence 10-mer primers were identified which revealed robust and easily interpretable amplification-product profiles. These generated a total of 88 polymorphic bands in 54 accessions and were used to partition variation within and among Lens taxa. The data showed that, of the taxa examined, ssp. orientalis is most similar to cultivated lentil. L. ervoides was the most divergent wild taxon followed by L. nigricans. The genetic similarity between the latter two species was of the same magnitude as between ssp. orientalis and cultivated lentil. In addition, species-diagnostic amplification products specific to L. odemensis, L. ervoides and L. nigricans were identified. These results correspond well with previous isozyme and RFLP studies. RAPDs, however, appear to provide a greater degree of resolution at a sub-species level. The level of variation detected within cultivated lentils suggests that RAPD markers may be an appropriate technology for the construction of genetic linkage maps between closely related Lens accessions.
135 citations
••
TL;DR: A very low level of AFLP marker profile variation was seen amongst the somatic embryo and microtuber derived plants, and is discussed in the context of possible methylation changes occurring during the process of somatic embryogenesis.
Abstract: The stability, both genetic and phenotypic, of potato (Solanum tuberosum L.) cultivar Desiree plants derived from alternative propagation methodologies has been compared. Plants obtained through three clonal propagation routes-axillary-bud-proliferation, microtuberisation and a novel somatic embryogenesis system, and through true potato seeds (TPS) produced by selfing were evaluated at three levels: gross phenotype and minituber yield, changes in ploidy (measured by flow cytometry) and by molecular marker analysis [measured using AFLP (amplified fragment length polymorphism)]. The clonally propagated plants exhibited no phenotypic variation while the TPS-derived plants showed obvious phenotypic segregation. Significant differences were observed with respect to minituber yield while average plant height, at the time of harvesting, was not significantly different among plants propagated through four different routes. None of the plant types varied with respect to gross genome constitution as assessed by flow cytometry. However, a very low level of AFLP marker profile variation was seen amongst the somatic embryo (3 out of 451 bands) and microtuber (2 out of 451 bands) derived plants. Intriguingly, only AFLP markers generated using methylation sensitive restriction enzymes were found to show polymorphism. No polymorphism was observed in plants regenerated through axillary-bud-proliferation. The low level of molecular variation observed could be significant on a genome-wide scale, and is discussed in the context of possible methylation changes occurring during the process of somatic embryogenesis.
84 citations
Cited by
More filters
••
TL;DR: Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number of complete plant genomes.
Abstract: The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance.
3,728 citations
••
TL;DR: A high-quality genome sequence of domesticated tomato is presented, a draft sequence of its closest wild relative, Solanum pimpinellifolium, is compared, and the two tomato genomes are compared to each other and to the potato genome.
Abstract: Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera1 and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium2, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.
2,687 citations
••
TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.
Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.
1,806 citations
••
University of Évry Val d'Essonne1, Crops Research Institute2, Agriculture and Agri-Food Canada3, J. Craig Venter Institute4, Fujian Agriculture and Forestry University5, Plant Genome Mapping Laboratory6, University of Giessen7, French Alternative Energies and Atomic Energy Commission8, Institut national de la recherche agronomique9, National Research Council10, Australian Centre for Plant Functional Genomics11, University of Cologne12, Purdue University13, University of California, Berkeley14, University of British Columbia15, Fondation Jean Dausset Centre d'Etude du Polymorphisme Humain16, Huazhong Agricultural University17, Hunan Agricultural University18, Chungnam National University19, University of Arizona20, University of York21, University of Missouri22, Southern Cross University23, University of Western Australia24, Centre national de la recherche scientifique25
TL;DR: The polyploid genome of Brassica napus, which originated from a recent combination of two distinct genomes approximately 7500 years ago and gave rise to the crops of rape oilseed, is sequenced.
Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.
1,743 citations
••
TL;DR: The computational problems surrounding repeats are discussed and strategies used by current bioinformatics systems to solve them are described.
Abstract: Repetitive DNA sequences are abundant in a broad range of species, from bacteria to mammals, and they cover nearly half of the human genome. Repeats have always presented technical challenges for sequence alignment and assembly programs. Next-generation sequencing projects, with their short read lengths and high data volumes, have made these challenges more difficult. From a computational perspective, repeats create ambiguities in alignment and assembly, which, in turn, can produce biases and errors when interpreting results. Simply ignoring repeats is not an option, as this creates problems of its own and may mean that important biological phenomena are missed. We discuss the computational problems surrounding repeats and describe strategies used by current bioinformatics systems to solve them.
1,451 citations