Institution

Broad Institute

Nonprofit • Cambridge, Massachusetts, United States

About: Broad Institute is a nonprofit organization based in Cambridge, Massachusetts, United States. It is known for research contributions in the topics Population and Genome-wide association study. The organization has 6584 authors who have published 11618 publications receiving 1522743 citations. It is also known as the Eli and Edythe L. Broad Institute of MIT and Harvard.

Topics: Population, Genome-wide association study, Genome, Gene, Chromatin ...

Papers

Journal Article

Sustainable data analysis with Snakemake.

Felix Mölder¹, Kim Philipp Jablonski²,³, Brice Letcher⁴, Michael B Hall⁴, Christopher Tomkins-Tinch⁵,⁶, Vanessa Sochat⁷, Jan Forster¹,⁸, Soohyun Lee⁵, Sven Twardziok⁹, Alexander Kanitz³,¹⁰, Andreas Wilm¹¹, Manuel Holtgrewe⁹, Sven Rahmann¹, Sven Nahnsen¹², Johannes Köster¹,⁵

Institutions (12)

University of Duisburg-Essen¹, ETH Zurich², Swiss Institute of Bioinformatics³, European Bioinformatics Institute⁴, Harvard University⁵, Broad Institute⁶, Stanford University⁷, German Cancer Research Center⁸, Humboldt University of Berlin⁹, University of Basel¹⁰, Microsoft¹¹, University of Tübingen¹²

19 Apr 2021-F1000Research

TL;DR: It is shown how the popular workflow management system Snakemake can be used to guarantee reproducibility, and how it enables an ergonomic, combined, unified representation of all steps involved in data analysis, ranging from raw data processing, to quality control and fine-grained, interactive exploration and plotting of final results.

Abstract: Data analysis often entails a multitude of heterogeneous steps, from the application of various command line tools to the usage of scripting languages like R or Python for the generation of plots and tables. It is widely recognized that data analyses should ideally be conducted in a reproducible way. Reproducibility enables technical validation and regeneration of results on the original or even new data. However, reproducibility alone is by no means sufficient to deliver an analysis that is of lasting impact (i.e., sustainable) for the field, or even just one research group. We postulate that it is equally important to ensure adaptability and transparency. The former describes the ability to modify the analysis to answer extended or slightly different research questions. The latter describes the ability to understand the analysis in order to judge whether it is not only technically, but methodologically valid. Here, we analyze the properties needed for a data analysis to become reproducible, adaptable, and transparent. We show how the popular workflow management system Snakemake can be used to guarantee this, and how it enables an ergonomic, combined, unified representation of all steps involved in data analysis, ranging from raw data processing, to quality control and fine-grained, interactive exploration and plotting of final results.

519 citations

Journal Article

Detectable clonal mosaicism from birth to old age and its relationship to cancer.

Cathy C. Laurie¹, Cecelia A. Laurie¹, Kenneth Rice¹, Kimberly F. Doheny², Leila R. Zelnick¹, Caitlin P. McHugh¹, Hua Ling², Kurt N. Hetrick², Elizabeth W. Pugh², Christopher I. Amos³, Qingyi Wei³, Li-E Wang³, Jeffrey E. Lee, Kathleen C. Barnes², Nadia N. Hansel², Rasika A. Mathias², Denise Daley⁴, Terri H. Beaty², Alan F. Scott², Ingo Ruczinski², Robert B. Scharpf², Laura J. Bierut⁵, Sarah M. Hartz⁵, Maria Teresa Landi⁶, Neal D. Freedman⁶, Lynn R. Goldin⁶, David Ginsburg⁷, Jun-Jun Li⁷, Karl C. Desch⁷, Sara S. Strom³, William J. Blot⁸, Lisa B. Signorello⁸, Sue A. Ingles⁹, Stephen J. Chanock⁶, Sonja I. Berndt⁶, Loic Le Marchand¹⁰, Brian E. Henderson⁹, Kristine R. Monroe⁹, John A. Heit¹¹, Mariza de Andrade¹¹, Sebastian M. Armasu¹¹, Cynthia Regnier¹¹, William L. Lowe¹², M. Geoffrey Hayes¹², Mary L. Marazita¹³, Eleanor Feingold¹³, Jeffrey C. Murray¹⁴, Mads Melbye¹⁵, Bjarke Feenstra¹⁵, Jae H. Kang¹⁶, Janey L. Wiggs¹⁶, Gail P. Jarvik¹, Andrew McDavid¹⁷, Venkatraman E. Seshan¹⁸, Daniel B. Mirel¹⁹, Andrew Crenshaw¹⁹, Nataliya Sharopova⁶, Anastasia L. Wise⁶, Jess Shen¹, David R. Crosslin¹, David M. Levine¹, Xiuwen Zheng¹, Jenna Udren¹, Siiri N. Bennett¹, Sarah C. Nelson¹, Stephanie M. Gogarten¹, Matthew P. Conomos¹, Patrick J. Heagerty¹, Teri A. Manolio⁶, Louis R. Pasquale¹⁶, Christopher A. Haiman⁹, Neil E. Caporaso⁶, Bruce S. Weir¹

Institutions (19)

University of Washington¹, Johns Hopkins University², University of Texas Health Science Center at Houston³, University of British Columbia⁴, Washington University in St. Louis⁵, National Institutes of Health⁶, University of Michigan⁷, Vanderbilt University⁸, University of Southern California⁹, University of Hawaii at Manoa¹⁰, Mayo Clinic¹¹, Northwestern University¹², University of Pittsburgh¹³, University of Iowa¹⁴, Statens Serum Institut¹⁵, Harvard University¹⁶, Fred Hutchinson Cancer Research Center¹⁷, Memorial Sloan Kettering Cancer Center¹⁸, Broad Institute¹⁹

01 Jun 2012-Nature Genetics

TL;DR: Clonal mosaicism for large chromosomal anomalies (duplications, deletions and uniparental disomy) is detected using SNP microarray data from over 50,000 subjects recruited for genome-wide association studies to identify common deleted regions with genes previously associated with hematological cancers.

Abstract: We detected clonal mosaicism for large chromosomal anomalies (duplications, deletions and uniparental disomy) using SNP microarray data from over 50,000 subjects recruited for genome-wide association studies. This detection method requires a relatively high frequency of cells with the same abnormal karyotype (>5-10%; presumably of clonal origin) in the presence of normal cells. The frequency of detectable clonal mosaicism in peripheral blood is low (<0.5%) from birth until 50 years of age, after which it rapidly rises to 2-3% in the elderly. Many of the mosaic anomalies are characteristic of those found in hematological cancers and identify common deleted regions with genes previously associated with these cancers. Although only 3% of subjects with detectable clonal mosaicism had any record of hematological cancer before DNA sampling, those without a previous diagnosis have an estimated tenfold higher risk of a subsequent hematological cancer (95% confidence interval = 6-18).

519 citations

Journal Article

Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors

Viktor A. Adalsteinsson¹,², Gavin Ha¹,³, Samuel S. Freeman¹,³, Atish D. Choudhury³, Daniel G. Stover³, Heather A. Parsons³, Gregory Gydush¹, Sarah C. Reed¹, Denisse Rotem¹, Justin Rhoades¹, Denis Loginov¹,², Dimitri Livitz¹, Daniel Rosebrock¹,³, Ignaty Leshchiner¹, Jaegil Kim¹, Chip Stewart¹, Mara Rosenberg¹, Joshua M. Francis¹,³, Cheng-Zhong Zhang¹,³, Ofir Cohen¹,³, Coyin Oh¹, Huiming Ding², Paz Polak¹,³, Max Lloyd³, Sairah Mahmud³, Karla Helvie³, Margaret S. Merrill³, Rebecca A. Santiago³, Edward P. O’Connor³, Seong Ho Jeong³, Rachel Leeson², Rachel M. Barry², Joseph F. Kramkowski³, Zhenwei Zhang³, Laura Polacek³, Jens G. Lohr¹,³, Molly Schleicher¹, Emily Lipscomb¹, Andrea Saltzman¹, Nelly Oliver³, Lori Marini³, Adrienne G. Waks³,⁴, Lauren C. Harshman³, Sara M. Tolaney³, Eliezer M. Van Allen, Eric P. Winer³, Nan Lin³, Mari Nakabayashi³, Mary-Ellen Taplin³, Cory M. Johannessen¹, Levi A. Garraway, Todd R. Golub, Jesse S. Boehm¹, Nikhil Wagle¹,³, Gad Getz¹,³, J. Christopher Love¹,², Matthew Meyerson

Institutions (4)

Broad Institute¹, Massachusetts Institute of Technology², Harvard University³, Brigham and Women's Hospital⁴

06 Nov 2017-Nature Communications

TL;DR: This paper reports ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations.

Abstract: Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.

519 citations

Journal Article

Colocalization of GWAS and eQTL Signals Detects Target Genes

Farhad Hormozdiari¹, Martijn van de Bunt²,³, Ayellet V. Segrè⁴, Xiao Li⁴, Jong Wha J. Joo¹, Michael Bilow¹, Jae Hoon Sul¹, Sriram Sankararaman¹, Bogdan Pasaniuc¹, Eleazar Eskin¹

Institutions (4)

University of California, Los Angeles¹, Wellcome Trust Centre for Human Genetics², University of Oxford³, Broad Institute⁴

01 Dec 2016-American Journal of Human Genetics

TL;DR: eCAVIAR is presented, a probabilistic method that can account for more than one causal variant in any given locus and can leverage summary statistics without accessing the individual genotype data.

Abstract: The vast majority of genome-wide association study (GWAS) risk loci fall in non-coding regions of the genome. One possible hypothesis is that these GWAS risk loci alter the individual's disease risk through their effect on gene expression in different tissues. In order to understand the mechanisms driving a GWAS risk locus, it is helpful to determine which gene is affected in specific tissue types. For example, the relevant gene and tissue could play a role in the disease mechanism if the same variant responsible for a GWAS locus also affects gene expression. Identifying whether or not the same variant is causal in both GWASs and expression quantitative trait locus (eQTL) studies is challenging because of the uncertainty induced by linkage disequilibrium and the fact that some loci harbor multiple causal variants. However, current methods that address this problem assume that each locus contains a single causal variant. In this paper, we present eCAVIAR, a probabilistic method that has several key advantages over existing methods. First, our method can account for more than one causal variant in any given locus. Second, it can leverage summary statistics without accessing the individual genotype data. We use both simulated and real datasets to demonstrate the utility of our method. Using publicly available eQTL data on 45 different tissues, we demonstrate that eCAVIAR can prioritize likely relevant tissues and target genes for a set of glucose- and insulin-related trait loci.

519 citations

Journal Article

Comprehensive analysis of cancer-associated somatic mutations in class I HLA genes

Sachet A. Shukla¹,²,³, Michael S. Rooney¹,⁴, Mohini Rajasagi², Grace Tiao¹, Philip M. Dixon³, Michael S. Lawrence¹, Jonathan Stevens⁵, William J. Lane²,⁵, Jamie L. DellaGatta⁵, Scott Steelman¹, Carrie Sougnez¹, Kristian Cibulskis¹, Adam Kiezun¹, Nir Hacohen¹,², Vladimir Brusic², Catherine J. Wu, Gad Getz¹,²

Institutions (5)

Broad Institute¹, Harvard University², Iowa State University³, Massachusetts Institute of Technology⁴, Brigham and Women's Hospital⁵

01 Nov 2015-Nature Biotechnology

TL;DR: Cancers with recurrent somatic HLA mutations were associated with upregulation of signatures of cytolytic activity characteristic of tumor infiltration by effector lymphocytes, supporting immune evasion by altered HLA function as a contributory mechanism in cancer.

Abstract: Detection of somatic mutations in human leukocyte antigen (HLA) genes using whole-exome sequencing (WES) is hampered by the high polymorphism of the HLA loci, which prevents alignment of sequencing reads to the human reference genome. We describe a computational pipeline that enables accurate inference of germline alleles of class I HLA-A, B and C genes and subsequent detection of mutations in these genes using the inferred alleles as a reference. Analysis of WES data from 7,930 pairs of tumor and healthy tissue from the same patient revealed 298 nonsilent HLA mutations in tumors from 266 patients. These 298 mutations are enriched for likely functional mutations, including putative loss-of-function events. Recurrence of mutations suggested that these 'hotspot' sites were positively selected. Cancers with recurrent somatic HLA mutations were associated with upregulation of signatures of cytolytic activity characteristic of tumor infiltration by effector lymphocytes, supporting immune evasion by altered HLA function as a contributory mechanism in cancer.

518 citations

Authors

Showing all 7146 results

Name	H-index	Papers	Citations
Eric S. Lander	301	826	525976
Albert Hofman	267	2530	321405
Frank B. Hu	250	1675	253464
David J. Hunter	213	1836	207050
Kari Stefansson	206	794	174819
Mark J. Daly	204	763	304452
Lewis C. Cantley	196	748	169037
Matthew Meyerson	194	553	243726
Gad Getz	189	520	247560
Stacey Gabriel	187	383	294284
Stuart H. Orkin	186	715	112182
Ralph Weissleder	184	1160	142508
Chris Sander	178	713	233287
Michael I. Jordan	176	1016	216204
Richard A. Young	173	520	126642