Author

Sara Mostafavi

Other affiliations: University of Zanjan, University of Washington, Harvard University ...

Bio: Sara Mostafavi is an academic researcher from the University of British Columbia. The author has contributed to research on genome-wide association studies and expression quantitative trait loci. The author has an h-index of 39 and has co-authored 127 publications that have received 21,436 citations. Previous affiliations of Sara Mostafavi include the University of Zanjan and the University of Washington.

Papers published on a yearly basis: chart spanning 2006 to 2023 (per-year counts not preserved)

Papers

Journal Article•DOI•

The Genotype-Tissue Expression (GTEx) project

John T. Lonsdale, Jeffrey Thomas, Mike Salvatore, Rebecca Phillips, Edmund Lo, Saboor Shad, Richard Hasz, Gary Walters, Fernando U. Garcia¹, Nancy Young², Barbara A. Foster³, Mike Moser³, Ellen Karasik³, Bryan Gillard³, Kimberley Ramsey³, Susan L. Sullivan, Jason Bridge, Harold Magazine, John Syron, Johnelle Fleming, Laura A. Siminoff⁴, Heather M. Traino⁴, Maghboeba Mosavel⁴, Laura Barker⁴, Scott D. Jewell⁵, Daniel C. Rohrer⁵, Dan Maxim⁵, Dana Filkins⁵, Philip Harbach⁵, Eddie Cortadillo⁵, Bree Berghuis⁵, Lisa Turner⁵, Eric Hudson⁵, Kristin Feenstra⁵, Leslie H. Sobin⁶, James A. Robb⁶, Phillip Branton, Greg E. Korzeniewski⁶, Charles Shive⁶, David Tabor⁶, Liqun Qi⁶, Kevin Groch⁶, Sreenath Nampally⁶, Steve Buia⁶, Angela Zimmerman⁶, Anna M. Smith⁶, Robin Burges⁶, Karna Robinson⁶, Kim Valentino⁶, Deborah Bradbury⁶, Mark Cosentino⁶, Norma Diaz-Mayoral⁶, Mary Kennedy⁶, Theresa Engel⁶, Penelope Williams⁶, Kenyon Erickson, Kristin G. Ardlie⁷, Wendy Winckler⁷, Gad Getz⁷,⁸, David S. DeLuca⁷, Daniel MacArthur⁷,⁸, Manolis Kellis⁷, Alexander Thomson⁷, Taylor Young⁷, Ellen Gelfand⁷, Molly Donovan⁷, Yan Meng⁷, George B. Grant⁷, Deborah C. Mash⁹, Yvonne Marcus⁹, Margaret J. Basile⁹, Jun Liu⁸, Jun Zhu¹⁰, Zhidong Tu¹⁰, Nancy J. Cox¹¹, Dan L. Nicolae¹¹, Eric R. Gamazon¹¹, Hae Kyung Im¹¹, Anuar Konkashbaev¹¹, Jonathan K. Pritchard¹¹,¹², Matthew Stevens¹¹, Timothée Flutre¹¹, Xiaoquan Wen¹¹, Emmanouil T. Dermitzakis¹³, Tuuli Lappalainen¹³, Roderic Guigó, Jean Monlong, Michael Sammeth, Daphne Koller¹⁴, Alexis Battle¹⁴, Sara Mostafavi¹⁴, Mark I. McCarthy¹⁵, Manual Rivas¹⁵, Julian Maller¹⁵, Ivan Rusyn¹⁶, Andrew B. Nobel¹⁶, Fred A. Wright¹⁶, Andrey A. Shabalin¹⁶, Mike Feolo¹⁷, Nataliya Sharopova¹⁷, Anne Sturcke¹⁷, Justin Paschal¹⁷, James M. Anderson¹⁷, Elizabeth L. Wilder¹⁷, Leslie Derr¹⁷, Eric D. Green¹⁷, Jeffery P. Struewing¹⁷, Gary F. Temple¹⁷, Simona Volpi¹⁷, Joy T. Boyer¹⁷, Elizabeth J. Thomson¹⁷, Mark S. Guyer¹⁷, Cathy Ng¹⁷, Assya Abdallah¹⁷, Deborah Colantuoni¹⁷, Thomas R. Insel¹⁷, Susan E. Koester¹⁷, Roger Little¹⁷, Patrick Bender¹⁷, Thomas Lehner¹⁷, Yin Yao¹⁷, Carolyn C. Compton¹⁷, Jimmie B. Vaught¹⁷, Sherilyn Sawyer¹⁷, Nicole C. Lockhart¹⁷, Joanne P. Demchok¹⁷, Helen F. Moore¹⁷

Drexel University¹, Yeshiva University², Roswell Park Cancer Institute³, Virginia Commonwealth University⁴, Van Andel Institute⁵, Science Applications International Corporation⁶, Massachusetts Institute of Technology⁷, Harvard University⁸, University of Miami⁹, Icahn School of Medicine at Mount Sinai¹⁰, University of Chicago¹¹, Howard Hughes Medical Institute¹², University of Geneva¹³, Stanford University¹⁴, University of Oxford¹⁵, University of North Carolina at Chapel Hill¹⁶, National Institutes of Health¹⁷

29 May 2013-Nature Genetics

TL;DR: The Genotype-Tissue Expression (GTEx) project is described, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.

Abstract: Genome-wide association studies have identified thousands of loci for common diseases, but, for the majority of these, the mechanisms underlying disease susceptibility remain unknown. Most associated variants are not correlated with protein-coding changes, suggesting that polymorphisms in regulatory regions probably contribute to many disease phenotypes. Here we describe the Genotype-Tissue Expression (GTEx) project, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.

6,545 citations

Journal Article•DOI•

The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans

Kristin G. Ardlie, David S. DeLuca, Ayellet V. Segrè, Timothy J. Sullivan, Taylor Young, Ellen Gelfand, Casandra A. Trowbridge, Julian Maller, Taru Tukiainen, Monkol Lek, Lucas D. Ward, Pouya Kheradpour, Benjamin Iriarte, Yan Meng, Cameron D. Palmer, Tõnu Esko, Wendy Winckler, Joel N. Hirschhorn, Manolis Kellis, Daniel G. MacArthur, Gad Getz, Andrey A. Shabalin, Gen Li, Yi-Hui Zhou, Andrew B. Nobel, Ivan Rusyn, Fred A. Wright, Tuuli Lappalainen, Pedro G. Ferreira, Halit Ongen, Manuel A. Rivas, Alexis Battle, Sara Mostafavi, Jean Monlong, Michael Sammeth, Marta Melé, Ferran Reverter, Jakob M. Goldmann, Daphne Koller, Roderic Guigó, Mark I. McCarthy, Emmanouil T. Dermitzakis, Eric R. Gamazon, Hae Kyung Im, Anuar Konkashbaev, Dan L. Nicolae, Nancy J. Cox, Timothée Flutre, Xiaoquan Wen, Matthew Stephens, Jonathan K. Pritchard, Zhidong Tu, Bin Zhang, Tao Huang, Quan Long, Luan Lin, Jialiang Yang, Jun Zhu, Jun Liu, Amanda Brown, Bernadette Mestichelli, Denee Tidwell, Edmund Lo, Mike Salvatore, Saboor Shad, Jeffrey A. Thomas, John T. Lonsdale, Michael T. Moser, Bryan Gillard, Ellen Karasik, Kimberly Ramsey, Christopher Choi, Barbara A. Foster, John Syron, Johnell Fleming, Harold Magazine, Rick Hasz, Gary Walters, Jason Bridge, Mark Miklos, Susan L. Sullivan, Laura Barker, Heather M. Traino, Maghboeba Mosavel, Laura A. Siminoff, Dana R. Valley, Daniel C. Rohrer, Scott D. Jewell, Philip A. Branton, Leslie H. Sobin, Mary Barcus, Liqun Qi, Jeffrey McLean, Pushpa Hariharan, Ki Sung Um, Shenpei Wu, David Tabor, Charles Shive, Anna M. Smith, Stephen A. Buia, Anita H. Undale, Karna Robinson, Nancy Roche, Kimberly M. Valentino, Angela Britton, Robin Burges, Debra Bradbury, Kenneth W. Hambright, John Seleski, Greg E. Korzeniewski, Kenyon Erickson, Yvonne Marcus, Jorge Tejada, Mehran Taherian, Chunrong Lu, Margaret J. Basile, Deborah C. Mash, Simona Volpi, Jeffery P. Struewing, Gary F. Temple, Joy T. Boyer, Deborah Colantuoni, Roger Little, Susan E. Koester, Latarsha J. Carithers, Helen M. Moore, Ping Guan, Carolyn C. Compton, Sherilyn Sawyer, Joanne P. Demchok, Jimmie B. Vaught, Chana A. Rabiner, Nicole C. Lockhart

08 May 2015-Science

TL;DR: The landscape of gene expression across tissues is described, thousands of tissue-specific and shared regulatory expression quantitative trait loci (eQTL) variants are cataloged, complex network relationships are described, and signals from genome-wide association studies explained by eQTLs are identified.

Abstract: Understanding the functional consequences of genetic variation, and how it affects complex human disease and quantitative traits, remains a critical challenge for biomedicine. We present an analysi...

4,418 citations

Journal Article•DOI•

The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function.

David Warde-Farley¹, Sylva L. Donaldson¹, Ovi Comes¹, Khalid Zuberi¹, Rashad Badrawi¹, Pauline Chao¹, Max Franz¹, Chris Grouios¹, Farzana Kazi¹, Christian Lopes¹, Anson Maitland¹, Sara Mostafavi¹, Jason Montojo¹, Quentin Shao¹, George Wright¹, Gary D. Bader¹, Quaid Morris¹

University of Toronto¹

01 Jul 2010-Nucleic Acids Research

TL;DR: The high accuracy of the GeneMANIA prediction algorithm, an intuitive user interface and large database make GeneMANIA a useful tool for any biologist.

Abstract: GeneMANIA (http://www.genemania.org) is a flexible, user-friendly web interface for generating hypotheses about gene function, analyzing gene lists and prioritizing genes for functional assays. Given a query list, GeneMANIA extends the list with functionally similar genes that it identifies using available genomics and proteomics data. GeneMANIA also reports weights that indicate the predictive value of each selected data set for the query. Six organisms are currently supported (Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus, Homo sapiens and Saccharomyces cerevisiae) and hundreds of data sets have been collected from GEO, BioGRID, Pathway Commons and I2D, as well as organism-specific functional genomics data sets. Users can select arbitrary subsets of the data sets associated with an organism to perform their analyses and can upload their own data sets to analyze. The GeneMANIA algorithm performs as well or better than other gene function prediction methods on yeast and mouse benchmarks. The high accuracy of the GeneMANIA prediction algorithm, an intuitive user interface and large database make GeneMANIA a useful tool for any biologist.

3,211 citations

Journal Article•DOI•

The genetic landscape of a cell.

Michael Costanzo¹, Anastasia Baryshnikova¹, Jeremy Bellay², Yungil Kim², Eric D. Spear³, Carolyn S. Sevier³, Huiming Ding¹, Judice L. Y. Koh¹, Kiana Toufighi¹, Sara Mostafavi¹, Jeany Prinz¹, Robert P. St.Onge⁴, Benjamin VanderSluis², Taras Makhnevych¹, Franco J. Vizeacoumar¹, Solmaz Alizadeh¹, Sondra Bahr¹, Renee L. Brost¹, Yiqun Chen¹, Murat Cokol⁵, Raamesh Deshpande², Zhijian Li¹, Zhen Yuan Lin¹, Wendy Liang¹, Michaela Marback¹, Jadine Paw¹, Bryan Joseph San Luis¹, Ermira Shuteriqi¹, Amy Hin Yan Tong¹, Nydia Van Dyk¹, Iain M. Wallace¹, Joseph Whitney¹, Matthew T. Weirauch⁶, Guoqing Zhong¹, Hongwei Zhu¹, Walid A. Houry¹, Michael Brudno¹, Sasan Ragibizadeh, Balázs Papp⁷, Csaba Pál⁷, Frederick P. Roth⁵, Guri Giaever¹, Corey Nislow¹, Olga G. Troyanskaya⁸, Howard Bussey⁹, Gary D. Bader¹, Anne-Claude Gingras¹, Quaid Morris¹, Philip M. Kim¹, Chris A. Kaiser³, Chad L. Myers², Brenda J. Andrews¹, Charles Boone¹

University of Toronto¹, University of Minnesota², Massachusetts Institute of Technology³, Stanford University⁴, Harvard University⁵, University of California, Santa Cruz⁶, Hungarian Academy of Sciences⁷, Princeton University⁸, McGill University⁹

22 Jan 2010-Science

TL;DR: A network based on genetic interaction profiles reveals a functional map of the cell in which genes of similar biological processes cluster together in coherent subsets, and highly correlated profiles delineate specific pathways to define gene function.

Abstract: A genome-scale genetic interaction map was constructed by examining 5.4 million gene-gene pairs for synthetic genetic interactions, generating quantitative genetic interaction profiles for ~75% of all genes in the budding yeast, Saccharomyces cerevisiae. A network based on genetic interaction profiles reveals a functional map of the cell in which genes of similar biological processes cluster together in coherent subsets, and highly correlated profiles delineate specific pathways to define gene function. The global network identifies functional cross-connections between all bioprocesses, mapping a cellular wiring diagram of pleiotropy. Genetic interaction degree correlated with a number of different gene attributes, which may be informative about genetic network hubs in other organisms. We also demonstrate that extensive and unbiased mapping of the genetic landscape provides a key for interpretation of chemical-genetic interactions and drug target identification.

2,225 citations

Journal Article•DOI•

Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression

Naomi R. Wray, Stephan Ripke, +259 more (79 institutions)

26 Apr 2018-Nature Genetics

TL;DR: A genome-wide association meta-analysis of individuals with clinically assessed or self-reported depression identifies 44 independent and significant loci and finds important relationships of genetic risk for major depression with educational attainment, body mass, and schizophrenia.

Abstract: Major depressive disorder (MDD) is a common illness accompanied by considerable morbidity, mortality, costs, and heightened risk of suicide. We conducted a genome-wide association meta-analysis based in 135,458 cases and 344,901 controls and identified 44 independent and significant loci. The genetic findings were associated with clinical features of major depression and implicated brain regions exhibiting anatomical differences in cases. Targets of antidepressant medications and genes involved in gene splicing were enriched for smaller association signal. We found important relationships of genetic risk for major depression with educational attainment, body mass, and schizophrenia: lower educational attainment and higher body mass were putatively causal, whereas major depression and schizophrenia reflected a partly shared biological etiology. All humans carry lesser or greater numbers of genetic risk factors for major depression. These findings help refine the basis of major depression and imply that a continuous measure of risk underlies the clinical phenotype.

1,898 citations

Cited by

Variation in the switching rates of Plasmodium var genes gives rise to antigenic variation [English] / Paul H, Robert P, Christodoulou Z, et al. // Proc Natl Acad Sci U S A

Ning Beifang (宁北芳), Zhu Huaimin (朱淮民)

28 Jul 2005

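TL;DR: Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1) interacts with one or more receptors on infected erythrocytes, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion.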

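Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion. The var gene family encodes about 60 members per haploid genome, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.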

18,940 citations

Journal Article•DOI•

STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.

Damian Szklarczyk¹, Annika L. Gable¹, David Lyon¹, Alexander Junge², Stefan Wyder¹, Jaime Huerta-Cepas³, Milan Simonovic¹, Nadezhda Tsankova Doncheva², John H. Morris⁴, Peer Bork, Lars Juhl Jensen², Christian von Mering¹

Swiss Institute of Bioinformatics¹, University of Copenhagen², Technical University of Madrid³, University of California, San Francisco⁴

08 Jan 2019-Nucleic Acids Research

TL;DR: The latest version of STRING more than doubles the number of organisms it covers, and offers an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input.

Abstract: Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein-protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein-protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.

10,584 citations

Journal Article•

Human biochemical genetics

Grüneberg H

01 Jul 1960-The Eugenics Review

TL;DR: For the next few weeks the course is going to be exploring a field that’s actually older than classical population genetics, although the approach it’ll be taking to it involves the use of population genetic machinery.

Abstract: So far in this course we have dealt entirely with the evolution of characters that are controlled by simple Mendelian inheritance at a single locus. There are notes on the course website about gametic disequilibrium and how allele frequencies change at two loci simultaneously, but we didn’t discuss them. In every example we’ve considered we’ve imagined that we could understand something about evolution by examining the evolution of a single gene. That’s the domain of classical population genetics. For the next few weeks we’re going to be exploring a field that’s actually older than classical population genetics, although the approach we’ll be taking to it involves the use of population genetic machinery. If you know a little about the history of evolutionary biology, you may know that after the rediscovery of Mendel’s work in 1900 there was a heated debate between the “biometricians” (e.g., Galton and Pearson) and the “Mendelians” (e.g., de Vries, Correns, Bateson, and Morgan). Biometricians asserted that the really important variation in evolution didn’t follow Mendelian rules. Height, weight, skin color, and similar traits seemed to

9,847 citations

Journal Article•DOI•

Analysis of protein-coding genetic variation in 60,706 humans

Monkol Lek, Konrad J. Karczewski¹,², Eric Vallabh Minikel¹,², Kaitlin E. Samocha, Eric Banks¹, Timothy Fennell¹, Anne H. O’Donnell-Luria¹,²,³, James S. Ware, Andrew J. Hill¹,²,⁴, Beryl B. Cummings¹,², Taru Tukiainen¹,², Daniel P. Birnbaum¹, Jack A. Kosmicki, Laramie E. Duncan¹,², Karol Estrada¹,², Fengmei Zhao¹,², James Zou¹, Emma Pierce-Hoffman¹,², Joanne Berghout⁵, David Neil Cooper⁶, Nicole A. Deflaux⁷, Mark A. DePristo¹, Ron Do, Jason Flannick¹,², Menachem Fromer, Laura D. Gauthier¹, Jackie Goldstein¹,², Namrata Gupta¹, Daniel P. Howrigan¹,², Adam Kiezun¹, Mitja I. Kurki¹,², Ami Levy Moonshine¹, Pradeep Natarajan, Lorena Orozco, Gina M. Peloso¹,², Ryan Poplin¹, Manuel A. Rivas¹, Valentin Ruano-Rubio¹, Samuel A. Rose¹, Douglas M. Ruderfer⁸, Khalid Shakir¹, Peter D. Stenson⁶, Christine Stevens¹, Brett Thomas¹,², Grace Tiao¹, María Teresa Tusié-Luna, Ben Weisburd¹, Hong-Hee Won⁹, Dongmei Yu, David Altshuler¹,¹⁰, Diego Ardissino, Michael Boehnke¹¹, John Danesh¹², Stacey Donnelly¹, Roberto Elosua, Jose C. Florez¹,², Stacey Gabriel¹, Gad Getz¹,², Stephen J. Glatt¹³, Christina M. Hultman¹⁴, Sekar Kathiresan, Markku Laakso¹⁵, Steven A. McCarroll¹,², Mark I. McCarthy¹⁶,¹⁷, Dermot P.B. McGovern¹⁸, Ruth McPherson¹⁹, Benjamin M. Neale¹,², Aarno Palotie, Shaun Purcell⁸, Danish Saleheen²⁰, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan¹⁴,²¹, Jaakko Tuomilehto²², Ming T. Tsuang²³, Hugh Watkins¹⁶,¹⁷, James G. Wilson²⁴, Mark J. Daly¹,², Daniel G. MacArthur¹,²

Broad Institute¹, Harvard University², Boston Children's Hospital³, University of Washington⁴, University of Arizona⁵, Cardiff University⁶, Google⁷, Icahn School of Medicine at Mount Sinai⁸, Samsung Medical Center⁹, Vertex Pharmaceuticals¹⁰, University of Michigan¹¹, University of Cambridge¹², State University of New York Upstate Medical University¹³, Karolinska Institutet¹⁴, University of Eastern Finland¹⁵, Wellcome Trust Centre for Human Genetics¹⁶, University of Oxford¹⁷, Cedars-Sinai Medical Center¹⁸, University of Ottawa¹⁹, University of Pennsylvania²⁰, University of North Carolina at Chapel Hill²¹, University of Helsinki²², University of California, San Diego²³, University of Mississippi Medical Center²⁴

18 Aug 2016-Nature

TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.

Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations

Journal Article•DOI•

Integrating single-cell transcriptomic data across different conditions, technologies, and species.

Andrew Butler, Paul J. Hoffman, Peter Smibert, Efthymia Papalexi¹, Rahul Satija¹

New York University¹

02 Apr 2018-Nature Biotechnology

TL;DR: An analytical strategy for integrating scRNA-seq data sets based on common sources of variation is introduced, enabling the identification of shared populations across data sets and downstream comparative analysis.

Abstract: Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. However, identifying subpopulations of cells that are present across multiple data sets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq data sets based on common sources of variation, enabling the identification of shared populations across data sets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq data sets of peripheral blood mononuclear cells under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell 'atlases' generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across data sets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq data sets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.

7,741 citations
