Home
/
Authors
/
Marc R. J. Carlson

Author

Marc R. J. Carlson

Other affiliations: University of California, Seattle Children's Research Institute, University of California, Los Angeles ...read more

Bio: Marc R. J. Carlson is an academic researcher from Fred Hutchinson Cancer Research Center. The author has contributed to research in topics: Bioconductor & Regeneration (biology). The author has an hindex of 19, co-authored 20 publications receiving 7458 citations. Previous affiliations of Marc R. J. Carlson include University of California & Seattle Children's Research Institute.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Software for computing and annotating genomic ranges.

[...]

Michael F. Lawrence¹, Wolfgang Huber², Hervé Pagès³, Patrick Aboyoun³, Marc R. J. Carlson³, Robert Gentleman¹, Martin Morgan³, Vincent J. Carey⁴ - Show less +4 more•Institutions (4)

Genentech¹, European Bioinformatics Institute², Fred Hutchinson Cancer Research Center³, Brigham and Women's Hospital⁴

08 Aug 2013-PLOS Computational Biology

TL;DR: This work describes Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions, including those for sequence analysis, differential expression analysis and visualization.

...read moreread less

Abstract: We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.

...read moreread less

3,005 citations

Journal Article•DOI•

Orchestrating high-throughput genomic analysis with Bioconductor

[...]

Wolfgang Huber, Vincent J. Carey¹, Robert Gentleman², Simon Anders, Marc R. J. Carlson³, Benilton S. Carvalho⁴, Héctor Corrada Bravo⁵, Sean Davis⁶, Laurent Gatto⁷, Thomas Girke⁸, Raphael Gottardo³, Florian Hahne⁹, Kasper D. Hansen¹⁰, Rafael A. Irizarry¹, Michael S. Lawrence², Michael I. Love¹, James W. MacDonald¹¹, Valerie Obenchain³, Andrzej K. Oleś, Hervé Pagès³, Alejandro Reyes, Paul Shannon³, Gordon K. Smyth¹², Dan Tenenbaum³, Levi Waldron¹³, Martin Morgan³ - Show less +22 more•Institutions (13)

Harvard University¹, Genentech², Fred Hutchinson Cancer Research Center³, State University of Campinas⁴, University of Maryland, College Park⁵, National Institutes of Health⁶, University of Cambridge⁷, University of California, Riverside⁸, Novartis⁹, Johns Hopkins University¹⁰, University of Washington¹¹, Walter and Eliza Hall Institute of Medical Research¹², City University of New York¹³

01 Feb 2015-Nature Methods

TL;DR: An overview of Bioconductor, an open-source, open-development software project for the analysis and comprehension of high-throughput data in genomics and molecular biology, which comprises 934 interoperable packages contributed by a large, diverse community of scientists.

...read moreread less

Abstract: Bioconductor is an open-source, open-development software project for the analysis and comprehension of high-throughput data in genomics and molecular biology. The project aims to enable interdisciplinary research, collaboration and rapid development of scientific software. Based on the statistical programming language R, Bioconductor comprises 934 interoperable packages contributed by a large, diverse community of scientists. Packages cover a range of bioinformatic and statistical applications. They undergo formal initial review and continuous automated testing. We present an overview for prospective users and contributors.

...read moreread less

2,818 citations

Journal Article•DOI•

Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target

[...]

Steve Horvath¹, Bin Zhang, Marc R. J. Carlson, K. V. Lu, Shaojun Zhu, R. M. Felciano, M. F. Laurance, W. Zhao, S. Qi, Zhihong Chen, Yohan Lee, Adrienne C. Scheck, Linda M. Liau, Hong Wu, Daniel H. Geschwind, Phillip G. Febbo, Harley I. Kornblum, Timothy F. Cloughesy, Stanley F. Nelson, Paul S. Mischel - Show less +16 more•Institutions (1)

University of California, Los Angeles¹

14 Nov 2006-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The weighted gene coexpression network analysis provides a blueprint for leveraging genomic data to identify key control networks and molecular targets for glioblastoma, and the principle eluted from this work can be applied to other cancers.

...read moreread less

Abstract: Glioblastoma is the most common primary malignant brain tumor of adults and one of the most lethal of all cancers. Patients with this disease have a median survival of 15 months from the time of diagnosis despite surgery, radiation, and chemotherapy. New treatment approaches are needed. Recent works suggest that glioblastoma patients may benefit from molecularly targeted therapies. Here, we address the compelling need for identification of new molecular targets. Leveraging global gene expression data from two independent sets of clinical tumor samples (n = 55 and n = 65), we identify a gene coexpression module in glioblastoma that is also present in breast cancer and significantly overlaps with the "metasignature" for undifferentiated cancer. Studies in an isogenic model system demonstrate that this module is downstream of the mutant epidermal growth factor receptor, EGFRvIII, and that it can be inhibited by the epidermal growth factor receptor tyrosine kinase inhibitor Erlotinib. We identify ASPM (abnormal spindle-like microcephaly associated) as a key gene within this module and demonstrate its overexpression in glioblastoma relative to normal brain (or body tissues). Finally, we show that ASPM inhibition by siRNA-mediated knockdown inhibits tumor cell proliferation and neural stem cell proliferation, supporting ASPM as a potential molecular target in glioblastoma. Our weighted gene coexpression network analysis provides a blueprint for leveraging genomic data to identify key control networks and molecular targets for glioblastoma, and the principle eluted from our work can be applied to other cancers.

...read moreread less

595 citations

Journal Article•DOI•

Regulatory T cell development in the absence of functional Foxp3

[...]

Wen Lin¹, Dipica Haribhai², Lance M. Relland², Nga Truong¹, Marc R. J. Carlson¹, Calvin B. Williams², Talal A. Chatila¹ - Show less +3 more•Institutions (2)

University of California, Los Angeles¹, Medical College of Wisconsin²

02 Feb 2007-Nature Immunology

TL;DR: The results indicate that Treg cell effector function but not lineage commitment requires the expression of functional Foxp3 protein.

...read moreread less

Abstract: Although the development of regulatory T cells (T(reg) cells) in the thymus is defined by expression of the lineage marker Foxp3, the precise function of Foxp3 in T(reg) cell lineage commitment is unknown. Here we examined T(reg) cell development and function in mice with a Foxp3 allele that directs expression of a nonfunctional fusion protein of Foxp3 and enhanced green fluorescent protein (Foxp3DeltaEGFP). Thymocyte development in Foxp3DeltaEGFP male mice and Foxp3DeltaEGFP/+ female mice recapitulated that of wild-type mice. Although mature EGFP(+) CD4(+) T cells from Foxp3DeltaEGFP mice lacked suppressor function, they maintained the characteristic T(reg) cell 'genetic signature' and failed to develop from EGFP(-) CD4(+) T cells when transferred into lymphopenic hosts, indicative of their common ontogeny with T(reg) cells. Our results indicate that T(reg) cell effector function but not lineage commitment requires the expression of functional Foxp3 protein.

...read moreread less

456 citations

Journal Article•DOI•

New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing

[...]

Andreas Keller¹, Angela Graefen, Markus Ball², Mark Matzas, Valesca Boisguerin, Frank Maixner, Petra Leidinger¹, Christina Backes¹, Rabab Khairat², Michael Forster³, Bjoern Stade³, Andre Franke³, Jens Mayer¹, Jessica Spangler⁴, Stephen F. McLaughlin⁴, Minita Shah⁴, Clarence Lee⁴, Timothy T. Harkins⁴, Alexander Sartori⁴, Andrés Moreno-Estrada⁵, Brenna M. Henn⁵, Martin Sikora⁵, Ornella Semino⁶, Jacques Chiaroni⁷, Siiri Rootsi⁸, Natalie M. Myres⁹, Vicente M. Cabrera¹⁰, Peter A. Underhill⁵, Carlos Bustamante⁵, Eduard Egarter Vigl, Marco Samadelli, Giovanna Cipollini, Jan Haas¹¹, Hugo A. Katus¹¹, Brian O'Connor¹², Marc R. J. Carlson¹³, Benjamin Meder¹¹, Nikolaus Blin², Nikolaus Blin¹⁴, Eckart Meese¹, Carsten M. Pusch², Albert Zink - Show less +38 more•Institutions (14)

Saarland University¹, University of Tübingen², University of Kiel³, Life Technologies⁴, Stanford University⁵, University of Pavia⁶, Centre national de la recherche scientifique⁷, University of Tartu⁸, Sorenson Molecular Genealogy Foundation⁹, University of La Laguna¹⁰, Heidelberg University¹¹, Ontario Institute for Cancer Research¹², Fred Hutchinson Cancer Research Center¹³, Wrocław Medical University¹⁴

28 Feb 2012-Nature Communications

TL;DR: The complete genome sequence of the Iceman is reported and 100% concordance between the previously reported mitochondrial genome sequence and the consensus sequence generated from the genomic data is shown.

...read moreread less

Abstract: The Tyrolean Iceman, a 5,300-year-old Copper age individual, was discovered in 1991 on the Tisenjoch Pass in the Italian part of the Otztal Alps. Here we report the complete genome sequence of the Iceman and show 100% concordance between the previously reported mitochondrial genome sequence and the consensus sequence generated from our genomic data. We present indications for recent common ancestry between the Iceman and present-day inhabitants of the Tyrrhenian Sea, that the Iceman probably had brown eyes, belonged to blood group O and was lactose intolerant. His genetic predisposition shows an increased risk for coronary heart disease and may have contributed to the development of previously reported vascular calcifications. Sequences corresponding to ~60% of the genome of Borrelia burgdorferi are indicative of the earliest human case of infection with the pathogen for Lyme borreliosis.

...read moreread less

413 citations

1
2
3
4
…

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

[...]

Michael I. Love¹, Michael I. Love², Wolfgang Huber, Simon Anders•Institutions (2)

Harvard University¹, Max Planck Society²

05 Dec 2014-Genome Biology

TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.

...read moreread less

Abstract: In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html .

...read moreread less

47,038 citations

Posted Content•DOI•

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

[...]

Michael I. Love¹, Wolfgang Huber, Simon Anders•Institutions (1)

Harvard University¹

17 Nov 2014-bioRxiv

...read moreread less

Abstract: In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-Seq data, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data. DESeq2 uses shrinkage estimation for dispersions and fold changes to improve stability and interpretability of the estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression and facilitates downstream tasks such as gene ranking and visualization. DESeq2 is available as an R/Bioconductor package.

...read moreread less

17,014 citations

Journal Article•DOI•

HTSeq—a Python framework to work with high-throughput sequencing data

[...]

Simon Anders, Paul Theodor Pyl, Wolfgang Huber

15 Jan 2015-Bioinformatics

TL;DR: This work presents HTSeq, a Python library to facilitate the rapid development of custom scripts for high-throughput sequencing data analysis, and presents htseq-count, a tool developed with HTSequ that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.

...read moreread less

Abstract: Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed. Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes. Availability and implementation: HTSeq is released as an opensource software under the GNU General Public Licence and available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq. Contact: sanders@fs.tum.de

...read moreread less

15,744 citations

Journal Article•DOI•

WGCNA: an R package for weighted correlation network analysis.

[...]

Peter Langfelder¹, Steve Horvath¹•Institutions (1)

University of California, Los Angeles¹

29 Dec 2008-BMC Bioinformatics

TL;DR: The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis that includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software.

...read moreread less

Abstract: Correlation networks are increasingly being used in bioinformatics applications For example, weighted gene co-expression network analysis is a systems biology method for describing the correlation patterns among genes across microarray samples Weighted correlation network analysis (WGCNA) can be used for finding clusters (modules) of highly correlated genes, for summarizing such clusters using the module eigengene or an intramodular hub gene, for relating modules to one another and to external sample traits (using eigengene network methodology), and for calculating module membership measures Correlation networks facilitate network based gene screening methods that can be used to identify candidate biomarkers or therapeutic targets These methods have been successfully applied in various biological contexts, eg cancer, mouse genetics, yeast genetics, and analysis of brain imaging data While parts of the correlation network methodology have been described in separate publications, there is a need to provide a user-friendly, comprehensive, and consistent software implementation and an accompanying tutorial The WGCNA R software package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis The package includes functions for network construction, module detection, gene selection, calculations of topological properties, data simulation, visualization, and interfacing with external software Along with the R package we also present R software tutorials While the methods development was motivated by gene expression data, the underlying data mining approach can be applied to a variety of different settings The WGCNA package provides R functions for weighted correlation network analysis, eg co-expression network analysis of gene expression data The R package along with its source code and additional material are freely available at http://wwwgeneticsuclaedu/labs/horvath/CoexpressionNetwork/Rpackages/WGCNA

...read moreread less

14,243 citations

Journal Article•DOI•

Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2

[...]

Evan Bolyen¹, Jai Ram Rideout¹, Matthew R. Dillon¹, Nicholas A. Bokulich¹, Christian C. Abnet², Gabriel A. Al-Ghalith³, Harriet Alexander⁴, Harriet Alexander⁵, Eric J. Alm⁶, Manimozhiyan Arumugam⁷, Francesco Asnicar⁸, Yang Bai⁹, Jordan E. Bisanz¹⁰, Kyle Bittinger¹¹, Asker Daniel Brejnrod⁷, Colin J. Brislawn¹², C. Titus Brown⁴, Benjamin J. Callahan¹³, Andrés Mauricio Caraballo-Rodríguez¹⁴, John Chase¹, Emily K. Cope¹, Ricardo Silva¹⁴, Christian Diener¹⁵, Pieter C. Dorrestein¹⁴, Gavin M. Douglas¹⁶, Daniel M. Durall¹⁷, Claire Duvallet⁶, Christian F. Edwardson, Madeleine Ernst¹⁸, Madeleine Ernst¹⁴, Mehrbod Estaki¹⁷, Jennifer Fouquier¹⁹, Julia M. Gauglitz¹⁴, Sean M. Gibbons²⁰, Sean M. Gibbons¹⁵, Deanna L. Gibson¹⁷, Antonio Gonzalez¹⁴, Kestrel Gorlick¹, Jiarong Guo²¹, Benjamin Hillmann³, Susan Holmes²², Hannes Holste¹⁴, Curtis Huttenhower²³, Curtis Huttenhower²⁴, Gavin A. Huttley²⁵, Stefan Janssen²⁶, Alan K. Jarmusch¹⁴, Lingjing Jiang¹⁴, Benjamin D. Kaehler²⁵, Benjamin D. Kaehler²⁷, Kyo Bin Kang¹⁴, Kyo Bin Kang²⁸, Christopher R. Keefe¹, Paul Keim¹, Scott T. Kelley²⁹, Dan Knights³, Irina Koester¹⁴, Tomasz Kosciolek¹⁴, Jorden Kreps¹, Morgan G. I. Langille¹⁶, Joslynn S. Lee³⁰, Ruth E. Ley³¹, Ruth E. Ley³², Yong-Xin Liu, Erikka Loftfield², Catherine A. Lozupone¹⁹, Massoud Maher¹⁴, Clarisse Marotz¹⁴, Bryan D Martin²⁰, Daniel McDonald¹⁴, Lauren J. McIver²³, Lauren J. McIver²⁴, Alexey V. Melnik¹⁴, Jessica L. Metcalf³³, Sydney C. Morgan¹⁷, Jamie Morton¹⁴, Ahmad Turan Naimey¹, Jose A. Navas-Molina³⁴, Jose A. Navas-Molina¹⁴, Louis-Félix Nothias¹⁴, Stephanie B. Orchanian, Talima Pearson¹, Samuel L. Peoples³⁵, Samuel L. Peoples²⁰, Daniel Petras¹⁴, Mary L. Preuss³⁶, Elmar Pruesse¹⁹, Lasse Buur Rasmussen⁷, Adam R. Rivers³⁷, Michael S. Robeson³⁸, Patrick Rosenthal³⁶, Nicola Segata⁸, Michael Shaffer¹⁹, Arron Shiffer¹, Rashmi Sinha², Se Jin Song¹⁴, John R. Spear³⁹, Austin D. Swafford, Luke R. Thompson⁴⁰, Luke R. Thompson⁴¹, Pedro J. Torres²⁹, Pauline Trinh²⁰, Anupriya Tripathi¹⁴, Peter J. Turnbaugh¹⁰, Sabah Ul-Hasan⁴², Justin J. J. van der Hooft⁴³, Fernando Vargas, Yoshiki Vázquez-Baeza¹⁴, Emily Vogtmann², Max von Hippel⁴⁴, William A. Walters³¹, Yunhu Wan², Mingxun Wang¹⁴, Jonathan Warren⁴⁵, Kyle C. Weber³⁷, Kyle C. Weber⁴⁶, Charles H. D. Williamson¹, Amy D. Willis²⁰, Zhenjiang Zech Xu¹⁴, Jesse R. Zaneveld²⁰, Yilong Zhang⁴⁷, Qiyun Zhu¹⁴, Rob Knight¹⁴, J. Gregory Caporaso¹ - Show less +120 more•Institutions (47)

Northern Arizona University¹, National Institutes of Health², University of Minnesota³, University of California, Davis⁴, Woods Hole Oceanographic Institution⁵, Massachusetts Institute of Technology⁶, University of Copenhagen⁷, University of Trento⁸, Chinese Academy of Sciences⁹, University of California, San Francisco¹⁰, University of Pennsylvania¹¹, Pacific Northwest National Laboratory¹², North Carolina State University¹³, University of California, San Diego¹⁴, Institute for Systems Biology¹⁵, Dalhousie University¹⁶, University of British Columbia¹⁷, Statens Serum Institut¹⁸, Anschutz Medical Campus¹⁹, University of Washington²⁰, Michigan State University²¹, Stanford University²², Harvard University²³, Broad Institute²⁴, Australian National University²⁵, University of Düsseldorf²⁶, University of New South Wales²⁷, Sookmyung Women's University²⁸, San Diego State University²⁹, Howard Hughes Medical Institute³⁰, Max Planck Society³¹, Cornell University³², Colorado State University³³, Google³⁴, Syracuse University³⁵, Webster University³⁶, United States Department of Agriculture³⁷, University of Arkansas for Medical Sciences³⁸, Colorado School of Mines³⁹, University of Southern Mississippi⁴⁰, National Oceanic and Atmospheric Administration⁴¹, University of California, Merced⁴², Wageningen University and Research Centre⁴³, University of Arizona⁴⁴, Environment Agency⁴⁵, University of Florida⁴⁶, Merck & Co.⁴⁷

01 Aug 2019-Nature Biotechnology

TL;DR: QIIME 2 development was primarily funded by NSF Awards 1565100 to J.G.C. and R.K.P. and partial support was also provided by the following: grants NIH U54CA143925 and U54MD012388.

...read moreread less

Abstract: QIIME 2 development was primarily funded by NSF Awards 1565100 to J.G.C. and 1565057 to R.K. Partial support was also provided by the following: grants NIH U54CA143925 (J.G.C. and T.P.) and U54MD012388 (J.G.C. and T.P.); grants from the Alfred P. Sloan Foundation (J.G.C. and R.K.); ERCSTG project MetaPG (N.S.); the Strategic Priority Research Program of the Chinese Academy of Sciences QYZDB-SSW-SMC021 (Y.B.); the Australian National Health and Medical Research Council APP1085372 (G.A.H., J.G.C., Von Bing Yap and R.K.); the Natural Sciences and Engineering Research Council (NSERC) to D.L.G.; and the State of Arizona Technology and Research Initiative Fund (TRIF), administered by the Arizona Board of Regents, through Northern Arizona University. All NCI coauthors were supported by the Intramural Research Program of the National Cancer Institute. S.M.G. and C. Diener were supported by the Washington Research Foundation Distinguished Investigator Award.

...read moreread less

8,821 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse