Home
/
Authors
/
Rachel Maupin

Author

Rachel Maupin

Bio: Rachel Maupin is an academic researcher from Washington University in St. Louis. The author has contributed to research in topics: Chromosome 20 & Sequence analysis. The author has an hindex of 5, co-authored 6 publications receiving 2562 citations.

Topics: Chromosome 20, Sequence analysis, Euchromatin, Heterochromatin, Sequence logo ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes.

[...]

Helen Skaletsky¹, Tomoko Kuroda-Kawaguchi¹, Patrick Minx², Holland S. Cordum², LaDeana W. Hillier², Laura G. Brown¹, Sjoerd Repping, Tatyana Pyntikova¹, Johar Ali², Tamberlyn Bieri², Asif T. Chinwalla², Andrew Delehaunty², Kim D. Delehaunty², Hui Du², Ginger A. Fewell², Lucinda Fulton², Robert S. Fulton², Tina Graves², Shunfang Hou², Philip Latrielle², Shawn Leonard², Elaine R. Mardis², Rachel Maupin², John Douglas Mcpherson², Tracie L. Miner², William E. Nash², Christine Nguyen², Philip Ozersky², Kymberlie H. Pepin², Susan M. Rock², Tracy Rohlfing², Kelsi Scott², Brian Schultz², Cindy Strong², Aye Mon Tin-Wollam², Shiaw-Pyng Yang², Robert H. Waterston², Richard K. Wilson², Steve Rozen¹, David C. Page¹ - Show less +36 more•Institutions (2)

Massachusetts Institute of Technology¹, Washington University in St. Louis²

19 Jun 2003-Nature

TL;DR: The male-specific region of the Y chromosome, the MSY, differentiates the sexes and comprises 95% of the chromosome's length, and is a mosaic of heterochromatic sequences and three classes of euchromatics sequences: X-transposed, X-degenerate and ampliconic.

...read moreread less

Abstract: The male-specific region of the Y chromosome, the MSY, differentiates the sexes and comprises 95% of the chromosome's length. Here, we report that the MSY is a mosaic of heterochromatic sequences and three classes of euchromatic sequences: X-transposed, X-degenerate and ampliconic. These classes contain all 156 known transcription units, which include 78 protein-coding genes that collectively encode 27 distinct proteins. The X-transposed sequences exhibit 99% identity to the X chromosome. The X-degenerate sequences are remnants of ancient autosomes from which the modern X and Y chromosomes evolved. The ampliconic class includes large regions (about 30% of the MSY euchromatin) where sequence pairs show greater than 99.9% identity, which is maintained by frequent gene conversion (non-reciprocal transfer). The most prominent features here are eight massive palindromes, at least six of which contain testis genes.

...read moreread less

2,022 citations

Journal Article•DOI•

The DNA sequence of human chromosome 7

[...]

LaDeana W. Hillier¹, Robert S. Fulton¹, Lucinda Fulton¹, Tina Graves¹, Kymberlie H. Pepin¹, Caryn Wagner-McPherson¹, Dan Layman¹, Jason Maas¹, Sara Jaeger¹, Rebecca S. Walker¹, Kristine M. Wylie¹, Mandeep Sekhon¹, Michael C. Becker¹, Michelle O'Laughlin¹, Mark E. Schaller¹, Ginger A. Fewell¹, Kimberly D. Delehaunty¹, Tracie L. Miner¹, William E. Nash¹, Matt Cordes¹, Hui Du¹, Hui Sun¹, Jennifer Edwards¹, Holland Bradshaw-Cordum¹, Johar Ali¹, Stephanie Andrews¹, Amber Isak¹, Andrew Vanbrunt¹, Christine Nguyen¹, Feiyu Du¹, Betty Lamar¹, Laura Courtney¹, Joelle Kalicki¹, Philip Ozersky¹, Lauren Bielicki¹, Kelsi Scott¹, Andrea Holmes¹, Richard Harkins¹, Anthony R. Harris¹, Cindy Strong¹, Shunfang Hou¹, Chad Tomlinson¹, Sara Dauphin-Kohlberg¹, Amy Kozlowicz-Reilly¹, Shawn Leonard¹, Theresa Rohlfing¹, Susan M. Rock¹, Aye-Mon Tin-Wollam¹, Amanda Abbott¹, Patrick Minx¹, Rachel Maupin¹, Catrina Strowmatt¹, Phil Latreille¹, Nancy Miller¹, Doug Johnson¹, Jennifer Murray¹, Jeffrey Woessner¹, Michael C. Wendl¹, Shiaw-Pyng Yang¹, Brian Schultz¹, John W. Wallis¹, John Spieth¹, Tamberlyn Bieri¹, Joanne O. Nelson¹, Nicolas Berkowicz¹, Patricia Wohldmann¹, Lisa Cook¹, Matthew T. Hickenbotham¹, James M. Eldred¹, Donald Williams¹, Joseph A. Bedell¹, Elaine R. Mardis¹, Sandra W. Clifton¹, Stephanie L. Chissoe¹, Marco A. Marra¹, Marco A. Marra², Christopher K. Raymond³, Eric Haugen³, Will Gillett³, Yang Zhou³, R. James³, Karen A. Phelps³, Shawn Iadanoto³, Kerry L. Bubb³, Elizabeth Simms³, Ruth Levy³, James B. Clendenning³, Rajinder Kaul³, W. James Kent⁴, Terrence S. Furey⁴, Robert Baertsch⁴, Michael R. Brent¹, Evan Keibler¹, Paul Flicek¹, Peer Bork⁵, Mikita Suyama⁵, Jeffrey A. Bailey⁶, Matthew E. Portnoy⁷, David Torrents⁵, Asif T. Chinwalla¹, Warren Gish¹, Sean R. Eddy¹, John Douglas Mcpherson¹, John Douglas Mcpherson⁸, Maynard V. Olson³, Evan E. Eichler⁶, Eric D. Green⁷, Robert H. Waterston³, Robert H. Waterston¹, Richard K. Wilson¹ - Show less +106 more•Institutions (8)

Washington University in St. Louis¹, BC Cancer Agency², University of Washington³, University of California, Santa Cruz⁴, European Bioinformatics Institute⁵, Case Western Reserve University⁶, National Institutes of Health⁷, Human Genome Sequencing Center⁸

10 Jul 2003-Nature

TL;DR: The euchromatic sequence of chromosome 7, the first metacentric chromosome completed so far, has excellent concordance with previously established physical and genetic maps, and it exhibits an unusual amount of segmentally duplicated sequence.

...read moreread less

Abstract: Human chromosome 7 has historically received prominent attention in the human genetics community, primarily related to the search for the cystic fibrosis gene and the frequent cytogenetic changes associated with various forms of cancer. Here we present more than 153 million base pairs representing 99.4% of the euchromatic sequence of chromosome 7, the first metacentric chromosome completed so far. The sequence has excellent concordance with previously established physical and genetic maps, and it exhibits an unusual amount of segmentally duplicated sequence (8.2%), with marked differences between the two arms. Our initial analyses have identified 1,150 protein-coding genes, 605 of which have been confirmed by complementary DNA sequences, and an additional 941 pseudogenes. Of genes confirmed by transcript sequences, some are polymorphic for mutations that disrupt the reading frame.

...read moreread less

244 citations

Journal Article•DOI•

Acquired copy number alterations in adult acute myeloid leukemia genomes

[...]

Matthew J. Walter¹, Jacqueline E. Payton¹, Rhonda E. Ries¹, William D. Shannon¹, Hrishikesh Deshmukh¹, Yu Zhao¹, Jack Baty¹, Sharon Heath¹, Peter Westervelt¹, Mark A. Watson¹, Michael H. Tomasson¹, Rakesh Nagarajan¹, Brian O’Gara¹, Clara D. Bloomfield², Krzysztof Mrózek², Rebecca R. Selzer³, Todd Richmond³, Jacob O. Kitzman³, Joel Geoghegan³, Peggy S. Eis³, Rachel Maupin¹, Robert S. Fulton¹, Michael D. McLellan¹, Richard K. Wilson¹, Elaine R. Mardis¹, Daniel C. Link¹, Timothy A. Graubert¹, John F. DiPersio¹, Timothy J. Ley¹ - Show less +25 more•Institutions (3)

Washington University in St. Louis¹, Ohio State University², Hoffmann-La Roche³

04 Aug 2009-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The use of an unbiased high-resolution genomic screen identified many genes not previously implicated in AML that may be relevant for pathogenesis, along with many known oncogenes and tumor suppressor genes.

...read moreread less

Abstract: Cytogenetic analysis of acute myeloid leukemia (AML) cells has accelerated the identification of genes important for AML pathogenesis. To complement cytogenetic studies and to identify genes altered in AML genomes, we performed genome-wide copy number analysis with paired normal and tumor DNA obtained from 86 adult patients with de novo AML using 1.85 million feature SNP arrays. Acquired copy number alterations (CNAs) were confirmed using an ultra-dense array comparative genomic hybridization platform. A total of 201 somatic CNAs were found in the 86 AML genomes (mean, 2.34 CNAs per genome), with French-American-British system M6 and M7 genomes containing the most changes (10–29 CNAs per genome). Twenty-four percent of AML patients with normal cytogenetics had CNA, whereas 40% of patients with an abnormal karyotype had additional CNA detected by SNP array, and several CNA regions were recurrent. The mRNA expression levels of 57 genes were significantly altered in 27 of 50 recurrent CNA regions <5 megabases in size. A total of 8 uniparental disomy (UPD) segments were identified in the 86 genomes; 6 of 8 UPD calls occurred in samples with a normal karyotype. Collectively, 34 of 86 AML genomes (40%) contained alterations not found with cytogenetics, and 98% of these regions contained genes. Of 86 genomes, 43 (50%) had no CNA or UPD at this level of resolution. In this study of 86 adult AML genomes, the use of an unbiased high-resolution genomic screen identified many genes not previously implicated in AML that may be relevant for pathogenesis, along with many known oncogenes and tumor suppressor genes.

...read moreread less

241 citations

Journal Article•DOI•

Generation and annotation of the DNA sequences of human chromosomes 2 and 4

[...]

LaDeana W. Hillier¹, Tina Graves¹, Robert S. Fulton¹, Lucinda Fulton¹, Kymberlie H. Pepin¹, Patrick Minx¹, Caryn Wagner-McPherson¹, Dan Layman¹, Kristine M. Wylie¹, Mandeep Sekhon¹, Michael C. Becker¹, Ginger A. Fewell¹, Kimberly D. Delehaunty¹, Tracie L. Miner¹, William E. Nash¹, Colin Kremitzki¹, Lachlan G. Oddy¹, Hui Du¹, Hui Sun¹, Holland Bradshaw-Cordum¹, Johar Ali¹, Jason Carter¹, Matt Cordes¹, Anthony R. Harris¹, Amber Isak¹, Andrew Van Brunt¹, Christine Nguyen¹, Feiyu Du¹, Laura Courtney¹, Joelle Kalicki¹, Philip Ozersky¹, Scott Abbott¹, Jon R. Armstrong¹, Edward A. Belter¹, Lauren Caruso¹, Maria Cedroni¹, Marc Cotton¹, Teresa Davidson¹, Anu Desai¹, Glendoria Elliott¹, Thomas Erb¹, Catrina Fronick¹, Tony Gaige¹, William Haakenson¹, Krista Haglund¹, Andrea Holmes¹, Richard Harkins¹, Kyung Kim¹, Scott Kruchowski¹, Cindy Strong¹, Neenu Grewal¹, Ernest Goyea¹, Shunfang Hou¹, Andrew Levy¹, Scott Martinka¹, Kelly Mead¹, Michael D. McLellan¹, Rick Meyer¹, Jennifer Randall-Maher¹, Chad Tomlinson¹, Sara Dauphin-Kohlberg¹, Amy Kozlowicz-Reilly¹, Neha Shah¹, Sharhonda Swearengen-Shahid¹, Jacqueline E. Snider¹, Joseph T. Strong¹, Johanna Thompson¹, Martin Yoakum¹, Shawn Leonard¹, Charlene Pearman¹, Lee Trani¹, Maxim Radionenko¹, Jason Waligorski¹, Chunyan Wang¹, Susan M. Rock¹, Aye Mon Tin-Wollam¹, Rachel Maupin¹, Phil Latreille¹, Michael C. Wendl¹, Shiaw Pyng Yang¹, Craig Pohl¹, John W. Wallis¹, John Spieth¹, Tamberlyn Bieri¹, Nicolas Berkowicz¹, Joanne O. Nelson¹, John R. Osborne¹, Li Ding¹, Rekha Meyer¹, Aniko Sabo¹, Yoram Shotland¹, Prashant R. Sinha¹, Patricia Wohldmann¹, Lisa Cook¹, Matthew T. Hickenbotham¹, James M. Eldred¹, Donald Williams¹, Thomas A. Jones¹, Xinwei She², Francesca D. Ciccarelli, Elisa Izaurralde, James Taylor³, Jeremy Schmutz⁴, Richard M. Myers⁴, David R. Cox⁴, Xiaoqiu Huang⁵, John Douglas Mcpherson⁶, John Douglas Mcpherson¹, Elaine R. Mardis¹, Sandra W. Clifton¹, Wesley C. Warren¹, Asif T. Chinwalla¹, Sean R. Eddy¹, Marco A. Marra⁷, Marco A. Marra¹, Ivan Ovcharenko⁸, Terrence S. Furey⁹, Webb Miller³, Evan E. Eichler², Peer Bork, Mikita Suyama, David Torrents, Robert H. Waterston¹, Robert H. Waterston², Richard K. Wilson¹ - Show less +121 more•Institutions (9)

Washington University in St. Louis¹, University of Washington², Pennsylvania State University³, Stanford University⁴, Iowa State University⁵, Baylor College of Medicine⁶, University of British Columbia⁷, Lawrence Livermore National Laboratory⁸, University of California, Santa Cruz⁹

07 Apr 2005-Nature

TL;DR: Extensive analyses confirm the underlying construction of the sequence, and expand the understanding of the structure and evolution of mammalian chromosomes, including gene deserts, segmental duplications and highly variant regions.

...read moreread less

Abstract: Human chromosome 2 is unique to the human lineage in being the product of a head-to-head fusion of two intermediate-sized ancestral chromosomes. Chromosome 4 has received attention primarily related to the search for the Huntington's disease gene, but also for genes associated with Wolf-Hirschhorn syndrome, polycystic kidney disease and a form of muscular dystrophy. Here we present approximately 237 million base pairs of sequence for chromosome 2, and 186 million base pairs for chromosome 4, representing more than 99.6% of their euchromatic sequences. Our initial analyses have identified 1,346 protein-coding genes and 1,239 pseudogenes on chromosome 2, and 796 protein-coding genes and 778 pseudogenes on chromosome 4. Extensive analyses confirm the underlying construction of the sequence, and expand our understanding of the structure and evolution of mammalian chromosomes, including gene deserts, segmental duplications and highly variant regions.

...read moreread less

107 citations

Journal Article•DOI•

Comparative genomic sequence analysis of the human and mouse cystic fibrosis transmembrane conductance regulator genes.

[...]

Rachel E. Ellsworth¹, D. Curtis Jamison¹, Jeffrey W. Touchman¹, Stephanie L. Chissoe², Valerie Maduro¹, Gerard G. Bouffard¹, Nicole Dietrich¹, Stephen M. Beckstrom-Sternberg¹, Leslie M. Iyer¹, Lauren A. Weintraub¹, Marc Cotton², Laura Courtney², Jennifer Edwards², Rachel Maupin², Philip Ozersky², Theresa Rohlfing², Patricia Wohldmann², Tracie L. Miner², Kimberley Kemp², Jason B. Kramer², Ian F Korf², Kimberlie Pepin², Lucinda Antonacci-Fulton², Robert S. Fulton², Patrick Minx², LaDeana W. Hillier², Richard K. Wilson², Robert H. Waterston², Webb Miller³, Eric D. Green¹ - Show less +26 more•Institutions (3)

National Institutes of Health¹, Washington University in St. Louis², Pennsylvania State University³

01 Feb 2000-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The generated sequence reveals the precise architecture of genes residing near CFTR/Cftr, including one known gene (WNT2/Wnt2) and two previously unknown genes that immediately flank CFTR or Cftr.

...read moreread less

Abstract: The identification of the cystic fibrosis transmembrane conductance regulator gene (CFTR) in 1989 represents a landmark accomplishment in human genetics. Since that time, there have been numerous advances in elucidating the function of the encoded protein and the physiological basis of cystic fibrosis. However, numerous areas of cystic fibrosis biology require additional investigation, some of which would be facilitated by information about the long-range sequence context of the CFTR gene. For example, the latter might provide clues about the sequence elements responsible for the temporal and spatial regulation of CFTR expression. We thus sought to establish the sequence of the chromosomal segments encompassing the human CFTR and mouse Cftr genes, with the hope of identifying conserved regions of biologic interest by sequence comparison. Bacterial clone-based physical maps of the relevant human and mouse genomic regions were constructed, and minimally overlapping sets of clones were selected and sequenced, eventually yielding ≈1.6 Mb and ≈358 kb of contiguous human and mouse sequence, respectively. These efforts have produced the complete sequence of the ≈189-kb and ≈152-kb segments containing the human CFTR and mouse Cftr genes, respectively, as well as significant amounts of flanking DNA. Analyses of the resulting data provide insights about the organization of the CFTR/Cftr genes and potential sequence elements regulating their expression. Furthermore, the generated sequence reveals the precise architecture of genes residing near CFTR/Cftr, including one known gene (WNT2/Wnt2) and two previously unknown genes that immediately flank CFTR/Cftr.

...read moreread less

81 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Initial sequencing and comparative analysis of the mouse genome.

[...]

Robert H. Waterston¹, Kerstin Lindblad-Toh², Ewan Birney, Jane Rogers³ +219 more•Institutions (26)

05 Dec 2002-Nature

TL;DR: The results of an international collaboration to produce a high-quality draft sequence of the mouse genome are reported and an initial comparative analysis of the Mouse and human genomes is presented, describing some of the insights that can be gleaned from the two sequences.

...read moreread less

Abstract: The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

...read moreread less

6,643 citations

Journal Article•DOI•

GSVA: gene set variation analysis for microarray and RNA-seq data.

[...]

Sonja Hänzelmann, Robert Castelo¹, Justin Guinney²•Institutions (2)

Pompeu Fabra University¹, Sage Bionetworks²

16 Jan 2013-BMC Bioinformatics

TL;DR: This work introduces Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner and constitutes a starting point to build pathway-centric models of biology.

...read moreread less

Abstract: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org .

...read moreread less

6,125 citations

Journal Article•DOI•

voom: precision weights unlock linear model analysis tools for RNA-seq read counts

[...]

Charity W. Law¹, Charity W. Law², Yunshun Chen¹, Yunshun Chen², Wei Shi², Wei Shi¹, Gordon K. Smyth², Gordon K. Smyth¹ - Show less +4 more•Institutions (2)

University of Melbourne¹, Walter and Eliza Hall Institute of Medical Research²

03 Feb 2014-Genome Biology

TL;DR: New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments, and the voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline.

...read moreread less

Abstract: New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments. The voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline. This opens access for RNA-seq analysts to a large body of methodology developed for microarrays. Simulation studies show that voom performs as well or better than count-based RNA-seq methods even when the data are generated according to the assumptions of the earlier methods. Two case studies illustrate the use of linear modeling and gene set testing methods.

...read moreread less

4,475 citations

Journal Article•DOI•

Finishing the euchromatic sequence of the human genome

[...]

Chris P. Ponting, Daniel Barker

21 Oct 2004-Nature

TL;DR: The current human genome sequence (Build 35) as discussed by the authors contains 2.85 billion nucleotides interrupted by only 341 gaps and is accurate to an error rate of approximately 1 event per 100,000 bases.

...read moreread less

Abstract: The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.

...read moreread less

3,989 citations

Journal Article•DOI•

Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia

[...]

Timothy J. Ley¹, Christopher A. Miller¹, Li Ding¹, Benjamin J. Raphael², Andrew J. Mungall³, Gordon Robertson³, Katherine A. Hoadley⁴, Timothy J. Triche⁵, Peter W. Laird⁵, Jack Baty¹, Lucinda Fulton¹, Robert S. Fulton¹, Sharon Heath¹, Joelle Kalicki-Veizer¹, Cyriac Kandoth¹, Jeffery M. Klco¹, Daniel C. Koboldt¹, Krishna L. Kanchi¹, Shashikant Kulkarni¹, Tamara Lamprecht¹, David E. Larson¹, G. Lin¹, Charles Lu¹, Michael D. McLellan¹, Joshua F. McMichael¹, Jacqueline E. Payton¹, Heather Schmidt¹, David H. Spencer¹, Michael H. Tomasson¹, John W. Wallis¹, Lukas D. Wartman¹, Mark A. Watson¹, John S. Welch¹, Michael C. Wendl¹, Adrian Ally³, Miruna Balasundaram³, Inanc Birol³, Yaron S.N. Butterfield³, Readman Chiu³, Andy Chu³, Eric Chuah³, Hye Jung E. Chun³, Richard Corbett³, Noreen Dhalla³, Ranabir Guin³, An He³, Carrie Hirst³, Martin Hirst³, Robert A. Holt³, Steven J.M. Jones³, Aly Karsan³, Darlene Lee³, Haiyan I. Li³, Marco A. Marra³, Michael Mayo³, Richard A. Moore³, Karen Mungall³, Jeremy Parker³, Erin Pleasance³, Patrick Plettner³, Jacquie Schein³, Dominik Stoll³, Lucas Swanson³, Angela Tam³, Nina Thiessen³, Richard Varhol³, Natasja Wye³, Yongjun Zhao³, Stacey Gabriel⁶, Gad Getz⁶, Carrie Sougnez⁶, Lihua Zou⁶, Mark D.M. Leiserson², Fabio Vandin², Hsin-Ta Wu², Frederick Applebaum⁷, Stephen B. Baylin⁸, Rehan Akbani⁹, Bradley M. Broom⁹, Ken Chen⁹, Thomas C. Motter⁹, Khanh Thi-Thuy Nguyen⁹, John N. Weinstein⁹, Nianziang Zhang⁹, Martin L. Ferguson, Christopher Adams¹⁰, Aaron D. Black¹⁰, Jay Bowen¹⁰, Julie M. Gastier-Foster¹⁰, Thomas Grossman¹⁰, Tara M. Lichtenberg¹⁰, Lisa Wise¹⁰, Tanja Davidsen¹¹, John A. Demchok¹¹, Kenna R. Mills Shaw¹¹, Margi Sheth¹¹, Heidi J. Sofia, Liming Yang¹¹, James R. Downing, Greg Eley, Shelley Alonso¹², Brenda Ayala¹², Julien Baboud¹², Mark Backus¹², Sean P. Barletta¹², Dominique L. Berton¹², Anna L. Chu¹², Stanley Girshik¹², Mark A. Jensen¹², Ari B. Kahn¹², Prachi Kothiyal¹², Matthew C. Nicholls¹², Todd Pihl¹², David Pot¹², Rohini Raman¹², Rashmi N. Sanbhadti¹², Eric E. Snyder¹², Deepak Srinivasan¹², Jessica Walton¹², Yunhu Wan¹², Zhining Wang¹², Jean Pierre J. Issa¹³, Michelle M. Le Beau¹⁴, Martin Carroll¹⁵, Hagop M. Kantarjian, Steven M. Kornblau, Moiz S. Bootwalla⁵, Phillip H. Lai⁵, Hui Shen⁵, David Van Den Berg⁵, Daniel J. Weisenberger⁵, Daniel C. Link¹, Matthew J. Walter¹, Bradley A. Ozenberger¹¹, Elaine R. Mardis¹, Peter Westervelt¹, Timothy A. Graubert¹, John F. DiPersio¹, Richard K. Wilson¹ - Show less +135 more•Institutions (15)

Washington University in St. Louis¹, Brown University², University of British Columbia³, University of North Carolina at Chapel Hill⁴, University of Southern California⁵, Massachusetts Institute of Technology⁶, Seattle Cancer Care Alliance⁷, Johns Hopkins University⁸, University of Texas MD Anderson Cancer Center⁹, Nationwide Children's Hospital¹⁰, National Institutes of Health¹¹, SRA International¹², Temple University¹³, University of Chicago¹⁴, University of Pennsylvania¹⁵

30 May 2013-The New England Journal of Medicine

TL;DR: It is found that a complex interplay of genetic events contributes to AML pathogenesis in individual patients and the databases from this study are widely available to serve as a foundation for further investigations of AMl pathogenesis, classification, and risk stratification.

...read moreread less

Abstract: BACKGROUND—Many mutations that contribute to the pathogenesis of acute myeloid leukemia (AML) are undefined The relationships between patterns of mutations and epigenetic phenotypes are not yet clear METHODS—We analyzed the genomes of 200 clinically annotated adult cases of de novo AML, using either whole-genome sequencing (50 cases) or whole-exome sequencing (150 cases), along with RNA and microRNA sequencing and DNA-methylation analysis RESULTS—AML genomes have fewer mutations than most other adult cancers, with an average of only 13 mutations found in genes Of these, an average of 5 are in genes that are recurrently mutated in AML A total of 23 genes were significantly mutated, and another 237 were mutated in two or more samples Nearly all samples had at least 1 nonsynonymous mutation in one of nine categories of genes that are almost certainly relevant for pathogenesis, including transcriptionfactor fusions (18% of cases), the gene encoding nucleophosmin (NPM1) (27%), tumorsuppressor genes (16%), DNA-methylation–related genes (44%), signaling genes (59%), chromatin-modifying genes (30%), myeloid transcription-factor genes (22%), cohesin-complex genes (13%), and spliceosome-complex genes (14%) Patterns of cooperation and mutual exclusivity suggested strong biologic relationships among several of the genes and categories CONCLUSIONS—We identified at least one potential driver mutation in nearly all AML samples and found that a complex interplay of genetic events contributes to AML pathogenesis in individual patients The databases from this study are widely available to serve as a foundation for further investigations of AML pathogenesis, classification, and risk stratification (Funded by the National Institutes of Health) The molecular pathogenesis of acute myeloid leukemia (AML) has been studied with the use of cytogenetic analysis for more than three decades Recurrent chromosomal structural variations are well established as diagnostic and prognostic markers, suggesting that acquired genetic abnormalities (ie, somatic mutations) have an essential role in pathogenesis 1,2 However, nearly 50% of AML samples have a normal karyotype, and many of these genomes lack structural abnormalities, even when assessed with high-density comparative genomic hybridization or single-nucleotide polymorphism (SNP) arrays 3-5 (see Glossary) Targeted sequencing has identified recurrent mutations in FLT3, NPM1, KIT, CEBPA, and TET2 6-8 Massively parallel sequencing enabled the discovery of recurrent mutations in DNMT3A 9,10 and IDH1 11 Recent studies have shown that many patients with

...read moreread less

3,980 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse