Author

Ankit Malhotra

Bio: Ankit Malhotra is an academic researcher from the University of Virginia. The author has contributed to research on topics including structural variation and genomics. The author has an h-index of 17 and has co-authored 27 publications receiving 20,446 citations.

Topics: Structural variation, Genomics, 1000 Genomes Project, Gene, Human genome

Papers

Journal Article

A global reference for human genetic variation.

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

Ewan Birney, John A. Stamatoyannopoulos¹, Anindya Dutta², Roderic Guigó³ +317 more•Institutions (44)

14 Jun 2007-Nature

TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.

Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

5,091 citations

A global reference for human genetic variation

Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin +476 more

01 Oct 2015

TL;DR: This paper reports the completion of the 1000 Genomes Project, which set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and which reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

3,247 citations

Journal Article

An integrated map of structural variation in 2,504 human genomes

Peter H. Sudmant¹, Tobias Rausch, Eugene J. Gardner², Robert E. Handsaker³,⁴, Alexej Abyzov⁵, John Huddleston¹, Yan Zhang⁶, Kai Ye⁷, Goo Jun⁸,⁹, Markus Hsi-Yang Fritz, Miriam K. Konkel¹⁰, Ankit Malhotra, Adrian M. Stütz, Xinghua Shi¹¹, Francesco Paolo Casale¹², Jieming Chen⁶, Fereydoun Hormozdiari¹, Gargi Dayama⁹, Ken Chen¹³, Maika Malig¹, Mark Chaisson¹, Klaudia Walter¹², Sascha Meiers, Seva Kashin³,⁴, Erik Garrison¹⁴, Adam Auton¹⁵, Hugo Y. K. Lam, Xinmeng Jasmine Mu⁶,³, Can Alkan¹⁶, Danny Antaki¹⁷, Taejeong Bae⁵, Eliza Cerveira, Peter S. Chines¹⁸, Zechen Chong¹³, Laura Clarke¹², Elif Dal¹⁶, Li Ding⁷, S. Emery⁹, Xian Fan¹³, Madhusudan Gujral¹⁷, Fatma Kahveci¹⁶, Jeffrey M. Kidd⁹, Yu Kong¹⁵, Eric-Wubbo Lameijer¹⁹, Shane A. McCarthy¹², Paul Flicek¹², Richard A. Gibbs²⁰, Gabor T. Marth¹⁴, Christopher E. Mason²¹, Androniki Menelaou²²,²³, Donna M. Muzny²⁴, Bradley J. Nelson¹, Amina Noor¹⁷, Nicholas F. Parrish²⁵, Matthew Pendleton²⁴, Andrew Quitadamo¹¹, Benjamin Raeder, Eric E. Schadt²⁴, Mallory Romanovitch, Andreas Schlattl, Robert Sebra²⁴, Andrey A. Shabalin²⁶, Andreas Untergasser²⁷, Jerilyn A. Walker¹⁰, Min Wang²⁰, Fuli Yu²⁰, Chengsheng Zhang, Jing Zhang⁶, Xiangqun Zheng-Bradley¹², Wanding Zhou¹³, Thomas Zichner, Jonathan Sebat¹⁷, Mark A. Batzer¹⁰, Steven A. McCarroll³,⁴, Ryan E. Mills⁹, Mark Gerstein⁶, Ali Bashir²⁴, Oliver Stegle¹², Scott E. Devine², Charles Lee²⁸, Evan E. Eichler¹, Jan O. Korbel¹²•Institutions (28)

University of Washington¹, University of Maryland, Baltimore², Broad Institute³, Harvard University⁴, Mayo Clinic⁵, Yale University⁶, Washington University in St. Louis⁷, University of Texas Health Science Center at Houston⁸, University of Michigan⁹, Louisiana State University¹⁰, University of North Carolina at Charlotte¹¹, Wellcome Trust¹², University of Texas MD Anderson Cancer Center¹³, Boston College¹⁴, Yeshiva University¹⁵, Bilkent University¹⁶, University of California, San Diego¹⁷, National Institutes of Health¹⁸, Leiden University¹⁹, Baylor College of Medicine²⁰, Cornell University²¹, University of Oxford²², Utrecht University²³, Icahn School of Medicine at Mount Sinai²⁴, Kyoto University²⁵, Virginia Commonwealth University²⁶, Heidelberg University²⁷, Ewha Womans University²⁸

01 Oct 2015-Nature

TL;DR: In this paper, the authors describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which are constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations.

Abstract: Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.

1,971 citations

Journal Article

A novel class of small RNAs: tRNA-derived RNA fragments (tRFs)

Yong Sun Lee¹, Yoshiyuki Shibata¹, Ankit Malhotra¹, Anindya Dutta¹•Institutions (1)

University of Virginia¹

15 Nov 2009-Genes & Development

TL;DR: The data suggest that tRFs are not random by-products of tRNA degradation or biogenesis, but an abundant and novel class of short RNAs with precise sequence structure that have specific expression patterns and specific biological roles.

Abstract: New types of small RNAs distinct from microRNAs (miRNAs) are progressively being discovered in various organisms. In order to discover such novel small RNAs, a library of 17- to 26-base-long RNAs was created from prostate cancer cell lines and sequenced by ultra-high-throughput sequencing. A significant number of the sequences are derived from precise processing at the 5' or 3' end of mature or precursor tRNAs to form three series of tRFs (tRNA-derived RNA fragments): the tRF-5, tRF-3, and tRF-1 series. These sequences constitute a class of short RNAs that are second most abundant to miRNAs. Northern hybridization, quantitative RT-PCR, and splinted ligation assays independently measured the levels of at least 17 tRFs. To demonstrate the biological importance of tRFs, we further investigated tRF-1001, derived from the 3' end of a Ser-TGA tRNA precursor transcript that is not retained in the mature tRNA. tRF-1001 is expressed highly in a wide range of cancer cell lines but much less in tissues, and its expression in cell lines was tightly correlated with cell proliferation. siRNA-mediated knockdown of tRF-1001 impaired cell proliferation with the specific accumulation of cells in G2, phenotypes that were reversed specifically by cointroducing a synthetic 2'-O-methyl tRF-1001 oligoribonucleotide resistant to the siRNA. tRF-1001 is generated in the cytoplasm by tRNA 3'-endonuclease ELAC2, a prostate cancer susceptibility gene. Our data suggest that tRFs are not random by-products of tRNA degradation or biogenesis, but an abundant and novel class of short RNAs with precise sequence structure that have specific expression patterns and specific biological roles.

923 citations

Cited by

Journal Article

An integrated encyclopedia of DNA elements in the human genome

Principal investigators¹, NHGRI groups², Data production leads³, Lead analysts³•Institutions (3)

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

TL;DR: The Encyclopedia of DNA Elements (ENCODE) project provides new insights into the organization and regulation of human genes and the human genome, and is an expansive resource of functional annotations for biomedical research.

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

13,548 citations

Journal Article

A global reference for human genetic variation.

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

12,661 citations

Journal Article

Induced Pluripotent Stem Cell Lines Derived from Human Somatic Cells

Junying Yu¹, Maxim A. Vodyanik, Kim Smuga-Otto, Jessica Antosiewicz-Bourget, Jennifer L. Frane, Shulan Tian, Jeff Nie, Gudrun A. Jonsdottir, Victor Ruotti, Ron Stewart, Igor I. Slukvin, James A. Thomson•Institutions (1)

University of Wisconsin-Madison¹

21 Dec 2007-Science

TL;DR: This article showed that OCT4, SOX2, NANOG, and LIN28 factors are sufficient to reprogram human somatic cells to pluripotent stem cells that exhibit the essential characteristics of embryonic stem (ES) cells.

Abstract: Somatic cell nuclear transfer allows trans-acting factors present in the mammalian oocyte to reprogram somatic cell nuclei to an undifferentiated state. We show that four factors (OCT4, SOX2, NANOG, and LIN28) are sufficient to reprogram human somatic cells to pluripotent stem cells that exhibit the essential characteristics of embryonic stem (ES) cells. These induced pluripotent human stem cells have normal karyotypes, express telomerase activity, express cell surface markers and genes that characterize human ES cells, and maintain the developmental potential to differentiate into advanced derivatives of all three primary germ layers. Such induced pluripotent human cell lines should be useful in the production of new disease models and in drug development, as well as for applications in transplantation medicine, once technical limitations (for example, mutation through viral integration) are eliminated.

9,836 citations

Journal Article

Tissue-based map of the human proteome

Mathias Uhlén¹,², Linn Fagerberg¹, Björn M. Hallström¹, Cecilia Lindskog³, Per Oksvold¹, Adil Mardinoglu⁴, Åsa Sivertsson¹, Caroline Kampf³, Evelina Sjöstedt³,¹, Anna Asplund³, IngMarie Olsson³, Karolina Edlund, Emma Lundberg¹, Sanjay Navani, Cristina Al-Khalili Szigyarto¹, Jacob Odeberg¹, Dijana Djureinovic³, Jenny Ottosson Takanen¹, Sophia Hober¹, Tove Alm¹, Per-Henrik Edqvist³, Holger Berling¹, Hanna Tegel¹, Jan Mulder³, Johan Rockberg¹, Peter Nilsson¹, Jochen M. Schwenk¹, Marica Hamsten¹, Kalle von Feilitzen¹, Mattias Forsberg¹, Lukas Persson¹, Fredric Johansson¹, Martin Zwahlen¹, Gunnar von Heijne⁵, Jens Nielsen⁴,², Fredrik Pontén³•Institutions (5)

Royal Institute of Technology¹, Technical University of Denmark², Science for Life Laboratory³, Chalmers University of Technology⁴, Stockholm University⁵

23 Jan 2015-Science

TL;DR: In this paper, the authors present a map of the human tissue proteome based on an integrated omics approach that combines quantitative transcriptomics at the tissue and organ level with tissue microarray-based immunohistochemistry to achieve spatial localization of proteins down to the single-cell level.

Abstract: Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.

9,745 citations

Journal Article

Analysis of protein-coding genetic variation in 60,706 humans

Monkol Lek, Konrad J. Karczewski¹,², Eric Vallabh Minikel¹,², Kaitlin E. Samocha, Eric Banks², Timothy Fennell², Anne H. O’Donnell-Luria¹,²,³, James S. Ware, Andrew J. Hill⁴,¹,², Beryl B. Cummings²,¹, Taru Tukiainen²,¹, Daniel P. Birnbaum², Jack A. Kosmicki, Laramie E. Duncan²,¹, Karol Estrada²,¹, Fengmei Zhao²,¹, James Zou², Emma Pierce-Hoffman¹,², Joanne Berghout⁵, David Neil Cooper⁶, Nicole A. Deflaux⁷, Mark A. DePristo², Ron Do, Jason Flannick¹,², Menachem Fromer, Laura D. Gauthier², Jackie Goldstein¹,², Namrata Gupta², Daniel P. Howrigan²,¹, Adam Kiezun², Mitja I. Kurki¹,², Ami Levy Moonshine², Pradeep Natarajan, Lorena Orozco, Gina M. Peloso¹,², Ryan Poplin², Manuel A. Rivas², Valentin Ruano-Rubio², Samuel A. Rose², Douglas M. Ruderfer⁸, Khalid Shakir², Peter D. Stenson⁶, Christine Stevens², Brett Thomas¹,², Grace Tiao², María Teresa Tusié-Luna, Ben Weisburd², Hong-Hee Won⁹, Dongmei Yu, David Altshuler²,¹⁰, Diego Ardissino, Michael Boehnke¹¹, John Danesh¹², Stacey Donnelly², Roberto Elosua, Jose C. Florez¹,², Stacey Gabriel², Gad Getz¹,², Stephen J. Glatt¹³, Christina M. Hultman¹⁴, Sekar Kathiresan, Markku Laakso¹⁵, Steven A. McCarroll²,¹, Mark I. McCarthy¹⁶,¹⁷, Dermot P.B. McGovern¹⁸, Ruth McPherson¹⁹, Benjamin M. Neale¹,², Aarno Palotie, Shaun Purcell⁸, Danish Saleheen²⁰, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan¹⁴,²¹, Jaakko Tuomilehto²², Ming T. Tsuang²³, Hugh Watkins¹⁶,¹⁷, James G. Wilson²⁴, Mark J. Daly²,¹, Daniel G. MacArthur²,¹•Institutions (24)

Harvard University¹, Broad Institute², Boston Children's Hospital³, University of Washington⁴, University of Arizona⁵, Cardiff University⁶, Google⁷, Icahn School of Medicine at Mount Sinai⁸, Samsung Medical Center⁹, Vertex Pharmaceuticals¹⁰, University of Michigan¹¹, University of Cambridge¹², State University of New York Upstate Medical University¹³, Karolinska Institutet¹⁴, University of Eastern Finland¹⁵, Wellcome Trust Centre for Human Genetics¹⁶, University of Oxford¹⁷, Cedars-Sinai Medical Center¹⁸, University of Ottawa¹⁹, University of Pennsylvania²⁰, University of North Carolina at Chapel Hill²¹, University of Helsinki²², University of California, San Diego²³, University of Mississippi Medical Center²⁴

18 Aug 2016-Nature

TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.

Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations
