Home
/
Authors
/
Sanjeev Kumar Sharma

Author

Sanjeev Kumar Sharma

Other affiliations: National Botanical Research Institute, Seattle Children's Research Institute, Scottish Crop Research Institute

Bio: Sanjeev Kumar Sharma is an academic researcher from James Hutton Institute. The author has contributed to research in topics: Biology & Population. The author has an hindex of 14, co-authored 21 publications receiving 2681 citations. Previous affiliations of Sanjeev Kumar Sharma include National Botanical Research Institute & Seattle Children's Research Institute.

Topics: Biology, Population, Genome, Somatic embryogenesis, Horticulture ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genome sequence and analysis of the tuber crop potato.

[...]

Xun Xu¹, Shengkai Pan¹, Shifeng Cheng¹, Bo Zhang¹, Mu D¹, Peixiang Ni¹, Gengyun Zhang¹, Shuang Yang¹, Ruiqiang Li¹, Jun Wang¹, Gisella Orjeda², Frank Guzman², Torres M², Roberto Lozano², Olga Ponce², Diana Martinez², De la Cruz G³, Chakrabarti Sk³, Patil Vu³, Konstantin G. Skryabin⁴, Boris B. Kuznetsov⁴, Nikolai V. Ravin⁴, Tatjana V. Kolganova⁴, Alexey V. Beletsky⁴, Andrey V. Mardanov⁴, Di Genova A⁵, Dan Bolser⁵, David M. A. Martin⁵, Li G, Yang Y, Hanhui Kuang⁶, Hu Q⁶, Xiong X⁷, Gerard J. Bishop⁸, Boris Sagredo, Nilo Mejía, Zagorski W⁹, Robert Gromadka⁹, Jan Gawor⁹, Pawel Szczesny⁹, Sanwen Huang, Zhang Z, Liang C, He J, Li Y, He Y, Xu J, Youjun Zhang, Xie B, Du Y, Qu D, Merideth Bonierbale¹⁰, Marc Ghislain¹⁰, Herrera Mdel R, Giovanni Giuliano, Marco Pietrella, Gaetano Perrotta, Paolo Facella, O'Brien K¹¹, Sergio Enrique Feingold, Barreiro Le, Massa Ga, Luis Aníbal Diambra¹², Brett R Whitty¹³, Brieanne Vaillancourt¹³, Lin H¹³, Alicia N. Massa¹³, Geoffroy M¹³, Lundback S¹³, Dean DellaPenna¹³, Buell Cr¹⁴, Sanjeev Kumar Sharma¹⁴, David Marshall¹⁴, Robbie Waugh¹⁴, Glenn J. Bryan¹⁴, Destefanis M¹⁵, Istvan Nagy¹⁵, Dan Milbourne¹⁵, Susan Thomson¹⁶, Mark Fiers¹⁶, Jeanne M. E. Jacobs¹⁶, Kåre Lehmann Nielsen¹⁷, Mads Sønderkær¹⁷, Marina Iovene¹⁸, Giovana Augusta Torres¹⁸, Jiming Jiang¹⁸, Richard E. Veilleux¹⁹, Christian W. B. Bachem²⁰, de Boer J²⁰, Theo Borm²⁰, Bjorn Kloosterman²⁰, van Eck H²⁰, Erwin Datema²⁰, Hekkert Bt²⁰, Aska Goverse²⁰, van Ham Rc²⁰, Richard G. F. Visser²⁰ - Show less +93 more•Institutions (20)

Beijing Institute of Genomics¹, Cayetano Heredia University², Indian Council of Agricultural Research³, Russian Academy of Sciences⁴, University of Dundee⁵, Huazhong Agricultural University⁶, Hunan Agricultural University⁷, Imperial College London⁸, Polish Academy of Sciences⁹, International Potato Center¹⁰, J. Craig Venter Institute¹¹, National University of La Plata¹², Michigan State University¹³, James Hutton Institute¹⁴, Teagasc¹⁵, Plant & Food Research¹⁶, Aalborg University¹⁷, University of Wisconsin-Madison¹⁸, Virginia Tech¹⁹, Wageningen University and Research Centre²⁰

10 Jul 2011-Nature

TL;DR: The potato genome sequence provides a platform for genetic improvement of this vital crop and predicts 39,031 protein-coding genes and presents evidence for at least two genome duplication events indicative of a palaeopolyploid origin.

...read moreread less

Abstract: Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop.

...read moreread less

1,813 citations

Journal Article•DOI•

Identification and localisation of the NB-LRR gene family within the potato genome

[...]

Florian Jupe¹, Florian Jupe², Leighton Pritchard¹, Graham J Etherington², Katrin MacKenzie¹, Peter J. A. Cock¹, Frank Wright¹, Sanjeev Kumar Sharma¹, Dan Bolser³, Glenn J. Bryan¹, Jonathan D. G. Jones², Ingo Hein¹ - Show less +8 more•Institutions (3)

James Hutton Institute¹, Sainsbury Laboratory², University of Dundee³

15 Feb 2012-BMC Genomics

TL;DR: By establishing the phylogenetic and positional relationship of potato NB-LRRs, the analysis offers significant insight into the evolution of potato R genes and provides a blueprint for future efforts to identify and more rapidly clone functional NB- LRR genes from Solanum species.

...read moreread less

Abstract: The potato genome sequence derived from the Solanum tuberosum Group Phureja clone DM1-3 516 R44 provides unparalleled insight into the genome composition and organisation of this important crop. A key class of genes that comprises the vast majority of plant resistance (R) genes contains a nucleotide-binding and leucine-rich repeat domain, and is collectively known as NB-LRRs. As part of an effort to accelerate the process of functional R gene isolation, we performed an amino acid motif based search of the annotated potato genome and identified 438 NB-LRR type genes among the ~39,000 potato gene models. Of the predicted genes, 77 contain an N-terminal toll/interleukin 1 receptor (TIR)-like domain, and 107 of the remaining 361 non-TIR genes contain an N-terminal coiled-coil (CC) domain. Physical map positions were established for 370 predicted NB-LRR genes across all 12 potato chromosomes. The majority of NB-LRRs are physically organised within 63 identified clusters, of which 50 are homogeneous in that they contain NB-LRRs derived from a recent common ancestor. By establishing the phylogenetic and positional relationship of potato NB-LRRs, our analysis offers significant insight into the evolution of potato R genes. Furthermore, the data provide a blueprint for future efforts to identify and more rapidly clone functional NB-LRR genes from Solanum species.

...read moreread less

276 citations

Journal Article•DOI•

Construction of Reference Chromosome-Scale Pseudomolecules for Potato: Integrating the Potato Genome with Genetic and Physical Maps

[...]

Sanjeev Kumar Sharma¹, Dan Bolser², Jan M. de Boer³, Mads Sønderkær⁴, Walter Amoros⁵, Martín Federico Carboni⁶, Juan Martín D'Ambrosio, Germán De la Cruz, Alex Di Genova⁷, David S. Douches⁸, Maria Eguiluz⁹, Xiao-Qiang Guo, Frank Guzman⁹, Christine A. Hackett², John P. Hamilton⁸, Guangcun Li, Ying Li, Roberto Lozano⁹, Alejandro Maass⁷, David Marshall¹, Diana Martinez⁹, Karen McLean¹, Nilo Mejía, Linda Milne¹, Susan Munive⁵, Istvan Nagy¹⁰, Olga Ponce⁹, Manuel Ramirez⁹, Reinhard Simon⁵, Susan Thomson, Yerisf Torres⁹, Robbie Waugh¹, Zhonghua Zhang, Sanwen Huang, Richard G. F. Visser³, Christian W. B. Bachem³, Boris Sagredo, Sergio Enrique Feingold⁶, Gisella Orjeda⁹, Richard E. Veilleux¹¹, Merideth Bonierbale⁵, Jeanne M. E. Jacobs¹², Dan Milbourne¹⁰, David M. A. Martin², Glenn J. Bryan¹ - Show less +41 more•Institutions (12)

James Hutton Institute¹, University of Dundee², Wageningen University and Research Centre³, Aalborg University⁴, International Potato Center⁵, International Trademark Association⁶, University of Chile⁷, Michigan State University⁸, Cayetano Heredia University⁹, Teagasc¹⁰, Virginia Tech¹¹, Plant & Food Research¹²

01 Nov 2013-G3: Genes, Genomes, Genetics

TL;DR: The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal “pseudomolecules”.

...read moreread less

Abstract: The genome of potato, a major global food crop, was recently sequenced. The work presented here details the integration of the potato reference genome (DM) with a new sequence-tagged site marker−based linkage map and other physical and genetic maps of potato and the closely related species tomato. Primary anchoring of the DM genome assembly was accomplished by the use of a diploid segregating population, which was genotyped with several types of molecular genetic markers to construct a new ~936 cM linkage map comprising 2469 marker loci. In silico anchoring approaches used genetic and physical maps from the diploid potato genotype RH89-039-16 (RH) and tomato. This combined approach has allowed 951 superscaffolds to be ordered into pseudomolecules corresponding to the 12 potato chromosomes. These pseudomolecules represent 674 Mb (~93%) of the 723 Mb genome assembly and 37,482 (~96%) of the 39,031 predicted genes. The superscaffold order and orientation within the pseudomolecules are closely collinear with independently constructed high density linkage maps. Comparisons between marker distribution and physical location reveal regions of greater and lesser recombination, as well as regions exhibiting significant segregation distortion. The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal “pseudomolecules”.

...read moreread less

236 citations

Journal Article•DOI•

Relationships among cultivated and wild lentils revealed by RAPD analysis.

[...]

Sanjeev Kumar Sharma¹, Ian K. Dawson¹, R. Waugh¹•Institutions (1)

Scottish Crop Research Institute¹

01 Sep 1995-Theoretical and Applied Genetics

TL;DR: The level of variation detected within cultivated lentils suggests that RAPD markers may be an appropriate technology for the construction of genetic linkage maps between closely related Lens accessions.

...read moreread less

Abstract: RAPD markers were used to distinguish between six different Lens taxa, representing cultivated lentil and its wild relatives. Twenty-four arbitrary sequence 10-mer primers were identified which revealed robust and easily interpretable amplification-product profiles. These generated a total of 88 polymorphic bands in 54 accessions and were used to partition variation within and among Lens taxa. The data showed that, of the taxa examined, ssp. orientalis is most similar to cultivated lentil. L. ervoides was the most divergent wild taxon followed by L. nigricans. The genetic similarity between the latter two species was of the same magnitude as between ssp. orientalis and cultivated lentil. In addition, species-diagnostic amplification products specific to L. odemensis, L. ervoides and L. nigricans were identified. These results correspond well with previous isozyme and RFLP studies. RAPDs, however, appear to provide a greater degree of resolution at a sub-species level. The level of variation detected within cultivated lentils suggests that RAPD markers may be an appropriate technology for the construction of genetic linkage maps between closely related Lens accessions.

...read moreread less

135 citations

Journal Article•DOI•

Stability of potato ( Solanum tuberosum L.) plants regenerated via somatic embryos, axillary bud proliferated shoots, microtubers and true potato seeds: a comparative phenotypic, cytogenetic and molecular assessment

[...]

Sanjeev Kumar Sharma¹, Glenn J. Bryan¹, Mark O. Winfield¹, Steve Millam¹, Steve Millam² - Show less +1 more•Institutions (2)

Scottish Crop Research Institute¹, University of Edinburgh²

01 Aug 2007-Planta

TL;DR: A very low level of AFLP marker profile variation was seen amongst the somatic embryo and microtuber derived plants, and is discussed in the context of possible methylation changes occurring during the process of somatic embryogenesis.

...read moreread less

Abstract: The stability, both genetic and phenotypic, of potato (Solanum tuberosum L.) cultivar Desiree plants derived from alternative propagation methodologies has been compared. Plants obtained through three clonal propagation routes-axillary-bud-proliferation, microtuberisation and a novel somatic embryogenesis system, and through true potato seeds (TPS) produced by selfing were evaluated at three levels: gross phenotype and minituber yield, changes in ploidy (measured by flow cytometry) and by molecular marker analysis [measured using AFLP (amplified fragment length polymorphism)]. The clonally propagated plants exhibited no phenotypic variation while the TPS-derived plants showed obvious phenotypic segregation. Significant differences were observed with respect to minituber yield while average plant height, at the time of harvesting, was not significantly different among plants propagated through four different routes. None of the plant types varied with respect to gross genome constitution as assessed by flow cytometry. However, a very low level of AFLP marker profile variation was seen amongst the somatic embryo (3 out of 451 bands) and microtuber (2 out of 451 bands) derived plants. Intriguingly, only AFLP markers generated using methylation sensitive restriction enzymes were found to show polymorphism. No polymorphism was observed in plants regenerated through axillary-bud-proliferation. The low level of molecular variation observed could be significant on a genome-wide scale, and is discussed in the context of possible methylation changes occurring during the process of somatic embryogenesis.

...read moreread less

84 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Phytozome: a comparative platform for green plant genomics

[...]

David Goodstein¹, Shengqiang Shu¹, Russell Howson¹, Rochak Neupane¹, Richard D. Hayes¹, Joni Fazo¹, Therese Mitros¹, William Dirks¹, Uffe Hellsten¹, Nicholas H. Putnam¹, Daniel S. Rokhsar¹ - Show less +7 more•Institutions (1)

United States Department of Energy¹

01 Jan 2012-Nucleic Acids Research

TL;DR: Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number of complete plant genomes.

...read moreread less

Abstract: The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance.

...read moreread less

3,728 citations

Journal Article•DOI•

The tomato genome sequence provides insights into fleshy fruit evolution

[...]

Shusei Sato, Satoshi Tabata, Hideki Hirakawa, Erika Asamizu +320 more•Institutions (51)

31 May 2012-Nature

TL;DR: A high-quality genome sequence of domesticated tomato is presented, a draft sequence of its closest wild relative, Solanum pimpinellifolium, is compared, and the two tomato genomes are compared to each other and to the potato genome.

...read moreread less

Abstract: Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera1 and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium2, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.

...read moreread less

2,687 citations

Journal Article•DOI•

The oyster genome reveals stress adaptation and complexity of shell formation

[...]

Guofan Zhang¹, Xiaodong Fang, Ximing Guo², Li Li, Ruibang Luo, Fei Xu, Pengcheng Yang, Linlin Zhang, Xiaotong Wang, Haigang Qi, Zhiqiang Xiong, Huayong Que, Yinlong Xie, Peter W. H. Holland³, Jordi Paps³, Yabing Zhu, Fucun Wu, Yuanxin Chen, Jiafeng Wang, Chunfang Peng, Jie Meng, Lan Yang, Jun Liu, Bo Wen, Na Zhang, Zhiyong Huang, Qihui Zhu, Yue Feng, Andrew S. Mount⁴, Dennis Hedgecock⁵, Zhe Xu⁶, Yunjie Liu, Tomislav Domazet-Lošo, Yishuai Du, Xiaoqing Sun, Shoudu Zhang, Binghang Liu, Peizhou Cheng, Xuanting Jiang, Juan Li, Dingding Fan, Wei Wang, Wenjing Fu, Tong Wang, Bo Wang, Jibiao Zhang, Zhiyu Peng, Yingxiang Li, Na Li, Jinpeng Wang, Maoshan Chen, Yan He², Fengji Tan, Xiaorui Song, Qiumei Zheng, Ronglian Huang, Hailong Yang, Du Xuedi, Li Chen, Mei Yang, Patrick M. Gaffney⁷, Shan Wang², Longhai Luo, Zhicai She, Yao Ming, Huang Wen, Shu Zhang, Baoyu Huang, Yong Zhang, Tao Qu, Peixiang Ni, Guoying Miao, Junyi Wang, Qiang Wang, Christian E. W. Steinberg⁸, Haiyan Wang, Ning Li, Lumin Qian², Guojie Zhang, Yingrui Li, Huanming Yang, Xiao Liu, Jian Wang, Ye Yin, Jun Wang⁹ - Show less +81 more•Institutions (9)

Chinese Academy of Sciences¹, Rutgers University², University of Oxford³, Clemson University⁴, University of Southern California⁵, Atlantic Cape Community College⁶, University of Delaware⁷, Humboldt University of Berlin⁸, University of Copenhagen⁹

04 Oct 2012-Nature

TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.

...read moreread less

Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.

...read moreread less

1,806 citations

Journal Article•DOI•

Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome

[...]

Boulos Chalhoub¹, Shengyi Liu², Isobel A. P. Parkin³, Haibao Tang⁴, Haibao Tang⁵, Xiyin Wang⁶, Julien Chiquet¹, Harry Belcram¹, Chaobo Tong², Birgit Samans⁷, Margot Correa⁸, Corinne Da Silva⁸, Jérémy Just¹, Cyril Falentin⁹, Chu Shin Koh¹⁰, Isabelle Le Clainche¹, Maria Bernard⁸, Pascal Bento⁸, Benjamin Noel⁸, Karine Labadie⁸, Adriana Alberti⁸, Mathieu Charles⁹, Dominique Arnaud¹, Hui Guo⁶, Christian Daviaud, Salman Alamery¹¹, Kamel Jabbari¹, Kamel Jabbari¹², Meixia Zhao¹³, Patrick P. Edger¹⁴, Houda Chelaifa¹, David C. Tack¹⁵, Gilles Lassalle⁹, Imen Mestiri¹, Nicolas Schnel⁹, Marie-Christine Le Paslier⁹, Guangyi Fan, Victor Renault¹⁶, Philippe E. Bayer¹¹, Agnieszka A. Golicz¹¹, Sahana Manoli¹¹, Tae-Ho Lee⁶, Vinh Ha Dinh Thi¹, Smahane Chalabi¹, Qiong Hu², Chuchuan Fan¹⁷, Reece Tollenaere¹¹, Yunhai Lu¹, Christophe Battail⁸, Jinxiong Shen¹⁷, Christine Sidebottom¹⁰, Xinfa Wang², Aurélie Canaguier¹, Aurélie Chauveau⁹, Aurélie Bérard⁹, G. Deniot⁹, Mei Guan¹⁸, Zhongsong Liu¹⁸, Fengming Sun, Yong Pyo Lim¹⁹, Eric Lyons²⁰, Christopher D. Town⁴, Ian Bancroft²¹, Xiaowu Wang, Jinling Meng¹⁷, Jianxin Ma¹³, J. Chris Pires²², Graham J.W. King²³, Dominique Brunel⁹, Régine Delourme⁹, Michel Renard⁹, Jean-Marc Aury⁸, Keith L. Adams¹⁵, Jacqueline Batley²⁴, Jacqueline Batley¹¹, Rod J. Snowdon⁷, Jörg Tost, David Edwards¹¹, David Edwards²⁴, Yongming Zhou¹⁷, Wei Hua², Andrew G. Sharpe¹⁰, Andrew H. Paterson⁶, Chunyun Guan¹⁸, Patrick Wincker⁸, Patrick Wincker²⁵, Patrick Wincker¹ - Show less +83 more•Institutions (25)

University of Évry Val d'Essonne¹, Crops Research Institute², Agriculture and Agri-Food Canada³, J. Craig Venter Institute⁴, Fujian Agriculture and Forestry University⁵, Plant Genome Mapping Laboratory⁶, University of Giessen⁷, French Alternative Energies and Atomic Energy Commission⁸, Institut national de la recherche agronomique⁹, National Research Council¹⁰, Australian Centre for Plant Functional Genomics¹¹, University of Cologne¹², Purdue University¹³, University of California, Berkeley¹⁴, University of British Columbia¹⁵, Fondation Jean Dausset Centre d'Etude du Polymorphisme Humain¹⁶, Huazhong Agricultural University¹⁷, Hunan Agricultural University¹⁸, Chungnam National University¹⁹, University of Arizona²⁰, University of York²¹, University of Missouri²², Southern Cross University²³, University of Western Australia²⁴, Centre national de la recherche scientifique²⁵

22 Aug 2014-Science

TL;DR: The polyploid genome of Brassica napus, which originated from a recent combination of two distinct genomes approximately 7500 years ago and gave rise to the crops of rape oilseed, is sequenced.

...read moreread less

Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.

...read moreread less

1,743 citations

Journal Article•DOI•

Repetitive DNA and next-generation sequencing: computational challenges and solutions.

[...]

Todd J. Treangen¹, Steven L. Salzberg², Steven L. Salzberg¹•Institutions (2)

Johns Hopkins University School of Medicine¹, Johns Hopkins University²

01 Jan 2012-Nature Reviews Genetics

TL;DR: The computational problems surrounding repeats are discussed and strategies used by current bioinformatics systems to solve them are described.

...read moreread less

Abstract: Repetitive DNA sequences are abundant in a broad range of species, from bacteria to mammals, and they cover nearly half of the human genome. Repeats have always presented technical challenges for sequence alignment and assembly programs. Next-generation sequencing projects, with their short read lengths and high data volumes, have made these challenges more difficult. From a computational perspective, repeats create ambiguities in alignment and assembly, which, in turn, can produce biases and errors when interpreting results. Simply ignoring repeats is not an option, as this creates problems of its own and may mean that important biological phenomena are missed. We discuss the computational problems surrounding repeats and describe strategies used by current bioinformatics systems to solve them.

...read moreread less

1,451 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse