Author

Marcel Martin

Other affiliations: Max Planck Society, University of Duisburg-Essen, Technical University of Dortmund ...

Bio: Marcel Martin is an academic researcher at Science for Life Laboratory. The author has contributed to research on topics including exome sequencing and population, has an h-index of 24, and has co-authored 42 publications that have received 15,979 citations. Previous affiliations of Marcel Martin include Max Planck Society and University of Duisburg-Essen.

Topics: Exome sequencing, Population, Gene, Exome, Deep sequencing ...

Papers

Journal Article

Cutadapt removes adapter sequences from high-throughput sequencing reads

Marcel Martin¹

Technical University of Dortmund¹

02 May 2011-EMBnet.journal

TL;DR: The command-line tool cutadapt is developed, which supports 454, Illumina and SOLiD (color space) data, offers two adapter trimming algorithms, and has other useful features.

Abstract: When small RNA is sequenced on current sequencing machines, the resulting reads are usually longer than the RNA and therefore contain parts of the 3' adapter. That adapter must be found and removed error-tolerantly from each read before read mapping. Previous solutions are either hard to use or do not offer required features, in particular support for color space data. As an easy to use alternative, we developed the command-line tool cutadapt, which supports 454, Illumina and SOLiD (color space) data, offers two adapter trimming algorithms, and has other useful features. Cutadapt, including its MIT-licensed source code, is available for download at http://code.google.com/p/cutadapt/

20,255 citations

Journal Article

Exome sequencing identifies recurrent somatic mutations in EIF1AX and SF3B1 in uveal melanoma with disomy 3.

Marcel Martin¹,², Lars Maßhöfer¹, Petra Temming, Sven Rahmann¹, Claudia H D Metz¹, Norbert Bornfeld¹, Johannes van de Nes¹, Ludger Klein-Hitpass¹, Alan G. Hinnebusch³, Bernhard Horsthemke¹, Dietmar R. Lohmann¹, Michael Zeschnigk¹

University of Duisburg-Essen¹, Technical University of Dortmund², National Institutes of Health³

01 Aug 2013-Nature Genetics

TL;DR: Exome sequencing identifies recurrent somatic mutations in EIF1AX and SF3B1 that occur in uveal melanomas with disomy 3, a class of tumors that rarely metastasizes.

Abstract: Michael Zeschnigk and colleagues identify recurrent somatic mutations of EIF1AX and SF3B1 in uveal melanomas with disomy 3. The EIF1AX mutations specifically alter the N-terminal tail of the protein and were found exclusively in tumors lacking SF3B1 mutations.

407 citations

Posted Content

WhatsHap: fast and accurate read-based phasing

Marcel Martin¹, Patterson², Shilpa Garg³, S. Fischer⁴, Nadia Pisanti⁵, Gunnar W. Klau⁶, Alexander Schönhuth⁶, Tobias Marschall³

Science for Life Laboratory¹, University of Lyon², Max Planck Society³, Saarland University⁴, University of Pisa⁵, Centrum Wiskunde & Informatica⁶

02 Nov 2016-bioRxiv

TL;DR: WhatsHap is a production-ready tool for highly accurate read-based phasing that was designed from the beginning to leverage third-generation sequencing technologies, whose long reads can span many variants and are therefore ideal for phasing.

Abstract: Read-based phasing reconstructs the haplotype structure of a sample purely from sequencing reads. While phasing is a required step for answering questions about population genetics and compound heterozygosity, and for aiding clinical decision making, accurate, usable and standards-based software has been lacking. WhatsHap is a production-ready tool for highly accurate read-based phasing. It was designed from the beginning to leverage third-generation sequencing technologies, whose long reads can span many variants and are therefore ideal for phasing. WhatsHap also works well with second-generation data, is easy to use and phases not only SNVs but also indels and other variants. It is unique in its ability to combine read-based with genetic phasing, which further improves accuracy if multiple related samples are provided.

230 citations

Journal Article

Computational pan-genomics: status, promises and challenges.

Tobias Marschall¹, Manja Marz¹,², Thomas Abeel³, Louis Dijkstra, Bas E. Dutilh⁴, Ali Ghaffaari¹,⁵, Paul Kersey⁶, Wigard P. Kloosterman, Veli Mäkinen⁷, Adam M. Novak⁸, Benedict Paten⁸, David Porubsky, Eric Rivals, Can Alkan, Jasmijn A. Baaijens, Paul I.W. de Bakker, Valentina Boeva, Raoul J. P. Bonnal, Francesca Chiaromonte, Rayan Chikhi⁹, Francesca D. Ciccarelli, Robin Cijvat, Erwin Datema, Cornelia M. van Duijn, Evan E. Eichler⁸,¹⁰, Corinna Ernst, Eleazar Eskin, Erik Garrison¹¹, Mohammed El-Kebir, Gunnar W. Klau, Jan O. Korbel¹¹, Eric-Wubbo Lameijer¹², Benjamin Langmead, Marcel Martin, Paul Medvedev¹³, John C. Mu¹⁴, Pieter B. Neerincx¹⁵, Klaasjan G. Ouwens, Pierre Peterlongo, Nadia Pisanti, Sven Rahmann, Ben Raphael, Knut Reinert, Dick de Ridder¹⁶, Jeroen de Ridder¹⁷, Matthias Schlesner, Ole Schulz-Trieglaff¹⁸, Ashley D. Sanders, Siavash Sheikhizadeh, Carl Shneider, Sandra Smit, Daniel Valenzuela¹⁹, Jiayin Wang²⁰, Lodewyk F. A. Wessels²¹, Y. Zhang, Victor Guryev, Fabio Vandin²², Kai Ye²⁰, Alexander Schönhuth

Max Planck Society¹, Karlsruhe Institute of Technology², Broad Institute³, Federal University of Rio de Janeiro⁴, Saarland University⁵, European Bioinformatics Institute⁶, Helsinki Institute for Information Technology⁷, Howard Hughes Medical Institute⁸, Centre national de la recherche scientifique⁹, University of Washington¹⁰, Wellcome Trust Sanger Institute¹¹, Leiden University¹², University of Pennsylvania¹³, China Agricultural University¹⁴, University of Groningen¹⁵, Wageningen University and Research Centre¹⁶, Catholic University of Leuven¹⁷, Illumina¹⁸, Regeneron¹⁹, Shaanxi University of Science and Technology²⁰, Netherlands Cancer Institute²¹, University of Padua²²

01 Jan 2018-Briefings in Bioinformatics

TL;DR: Already available approaches to construct and use pan-genomes are examined, the potential benefits of future technologies and methodologies are discussed, and open challenges from the vantage point of the above-mentioned biological disciplines are reviewed.

Abstract: Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains.

220 citations

Journal Article

A comprehensive molecular study on Coffin-Siris and Nicolaides-Baraitser syndromes identifies a broad molecular and clinical spectrum converging on altered chromatin remodeling.

Dagmar Wieczorek, Nina Bögershausen¹, Filippo Beleggia¹, Sabine Steiner-Haldenstätt, Esther Pohl¹, Yun Li¹, Esther Milz¹, Marcel Martin², Holger Thiele¹, Janine Altmüller¹, Yasemin Alanay³,⁴, Hülya Kayserili⁵, Ludger Klein-Hitpass⁶, Stefan Böhringer⁷, Andreas Wollstein⁸, Beate Albrecht, Koray Boduroğlu⁴, Almuth Caliebe⁹, Krystyna H. Chrzanowska, Ozgur Cogulu, Francesca Cristofoli¹⁰, Johanna Christina Czeschik, Koenraad Devriendt¹⁰, Maria Teresa Dotti¹¹, Nursel Elcioglu¹², Blanca Gener, Timm O. Goecke, Małgorzata Krajewska-Walasek, Encarnación Guillén-Navarro, Joussef Hayek, Gunnar Houge¹³, Esra Kılıç⁴, Pelin Ozlem Simsek-Kiper⁴, Vanesa López-González, Alma Kuechler, Stanislas Lyonnet¹⁴, Francesca Mari¹¹, Annabella Marozza, Michèle Mathieu Dramard, Barbara Mikat, Gilles Morin, Fanny Morice-Picard¹⁵, Ferda Ozkinay¹⁶, Anita Rauch¹⁷, Alessandra Renieri¹¹, Sigrid Tinschert¹⁸, G. Eda Utine⁴, Catheline Vilain, R. Vivarelli, Christiane Zweier¹⁹, Peter Nürnberg¹, Sven Rahmann⁶, Joris Vermeesch¹⁰, Hermann-Josef Lüdecke, Michael Zeschnigk, Bernd Wollnik¹

University of Cologne¹, Technical University of Dortmund², Acıbadem University³, Boston Children's Hospital⁴, Istanbul University⁵, University of Duisburg-Essen⁶, Leiden University Medical Center⁷, Ludwig Maximilian University of Munich⁸, University of Kiel⁹, Katholieke Universiteit Leuven¹⁰, University of Siena¹¹, Marmara University¹², Haukeland University Hospital¹³, Necker-Enfants Malades Hospital¹⁴, University of Bordeaux¹⁵, Ege University¹⁶, University of Zurich¹⁷, Dresden University of Technology¹⁸, University of Erlangen-Nuremberg¹⁹

20 Dec 2013-Human Molecular Genetics

TL;DR: It is shown that mutations in ARID1B are the main cause of CSS, accounting for 76% of identified mutations, and proposed genotype-phenotype correlations are important for molecular screening strategies.

Abstract: Chromatin remodeling complexes are known to modify chemical marks on histones or to induce conformational changes in the chromatin in order to regulate transcription. De novo dominant mutations in different members of the SWI/SNF chromatin remodeling complex have recently been described in individuals with Coffin-Siris (CSS) and Nicolaides-Baraitser (NCBRS) syndromes. Using a combination of whole-exome sequencing, NGS-based sequencing of 23 SWI/SNF complex genes, and molecular karyotyping in 46 previously undescribed individuals with CSS and NCBRS, we identified a de novo 1-bp deletion (c.677delG, p.Gly226Glufs*53) and a de novo missense mutation (c.914G>T, p.Cys305Phe) in PHF6 in two individuals diagnosed with CSS. PHF6 interacts with the nucleosome remodeling and deacetylation (NuRD) complex implicating dysfunction of a second chromatin remodeling complex in the pathogenesis of CSS-like phenotypes. Altogether, we identified mutations in 60% of the studied individuals (28/46), located in the genes ARID1A, ARID1B, SMARCB1, SMARCE1, SMARCA2, and PHF6. We show that mutations in ARID1B are the main cause of CSS, accounting for 76% of identified mutations. ARID1B and SMARCB1 mutations were also found in individuals with the initial diagnosis of NCBRS. These individuals apparently belong to a small subset who display an intermediate CSS/NCBRS phenotype. Our proposed genotype-phenotype correlations are important for molecular screening strategies.

181 citations

Cited by

Journal Article

Trimmomatic: a flexible trimmer for Illumina sequence data

Anthony Bolger¹, Marc Lohse¹, Bjoern Usadel¹

Max Planck Society¹

01 Aug 2014-Bioinformatics

TL;DR: Trimmomatic is developed as a more flexible and efficient preprocessing tool that correctly handles paired-end data, and is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools in all scenarios tested.

Abstract: Motivation: Although many next-generation sequencing (NGS) read preprocessing tools already existed, we could not find any tool or combination of tools that met our requirements in terms of flexibility, correct handling of paired-end data and high performance. We have developed Trimmomatic as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data. Results: The value of NGS read preprocessing is demonstrated for both reference-based and reference-free tasks. Trimmomatic is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested. Availability and implementation: Trimmomatic is licensed under GPL V3. It is cross-platform (Java 1.5+ required) and available at http://www.usadellab.org/cms/index.php?page=trimmomatic Contact: usadel@bio1.rwth-aachen.de Supplementary information: Supplementary data are available at Bioinformatics online.

39,291 citations

Journal Article

Cutadapt removes adapter sequences from high-throughput sequencing reads

Marcel Martin¹

Technical University of Dortmund¹

02 May 2011-EMBnet.journal

TL;DR: The command-line tool cutadapt is developed, which supports 454, Illumina and SOLiD (color space) data, offers two adapter trimming algorithms, and has other useful features.

20,255 citations

Journal Article

fastp: an ultra-fast all-in-one FASTQ preprocessor.

Shifu Chen¹, Yanqing Zhou, Yaru Chen, Jia Gu¹

Chinese Academy of Sciences¹

01 Sep 2018-Bioinformatics

TL;DR: fastp is developed as an ultra-fast FASTQ preprocessor with useful quality control and data-filtering features; it can perform quality control, adapter trimming, quality filtering, per-read quality pruning and many other operations in a single scan of the FASTQ data.

Abstract: Motivation: Quality control and preprocessing of FASTQ files are essential to providing clean data for downstream analysis. Traditionally, a different tool is used for each operation, such as quality control, adapter trimming and quality filtering. These tools are often insufficiently fast as most are developed using high-level programming languages (e.g. Python and Java) and provide limited multi-threading support. Reading and loading data multiple times also renders preprocessing slow and I/O inefficient. Results: We developed fastp as an ultra-fast FASTQ preprocessor with useful quality control and data-filtering features. It can perform quality control, adapter trimming, quality filtering, per-read quality pruning and many other operations with a single scan of the FASTQ data. This tool is developed in C++ and has multi-threading support. Based on our evaluation, fastp is 2-5 times faster than other FASTQ preprocessing tools such as Trimmomatic or Cutadapt despite performing far more operations than similar tools. Availability and implementation: The open-source code and corresponding instructions are available at https://github.com/OpenGene/fastp.

7,461 citations

Journal Article

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

Brian J. Haas¹, Alexie Papanicolaou², Moran Yassour³,⁴, Manfred Grabherr⁵, Philip D. Blood⁶, Joshua C. Bowden², M. B. Couger⁷, David Eccles⁸, Bo Li⁹, Matthias Lieber¹⁰, Matthew D. MacManes¹¹, Michael Ott², Joshua Orvis, Nathalie Pochet³,¹², Francesco Strozzi¹³, Nathan T. Weeks¹⁴, Rick Westerman¹⁵, Thomas William, Colin N. Dewey⁹, Robert Henschel¹⁶, Richard D. LeDuc¹⁶, Nir Friedman⁴, Aviv Regev³

Broad Institute¹, Commonwealth Scientific and Industrial Research Organisation², Massachusetts Institute of Technology³, Hebrew University of Jerusalem⁴, Science for Life Laboratory⁵, Pittsburgh Supercomputing Center⁶, Oklahoma State University–Stillwater⁷, Griffith University⁸, University of Wisconsin-Madison⁹, Dresden University of Technology¹⁰, California Institute for Quantitative Biosciences¹¹, Flanders Institute for Biotechnology¹², Parco Tecnologico Padano¹³, United States Department of Agriculture¹⁴, Purdue University¹⁵, Indiana University¹⁶

01 Aug 2013-Nature Protocols

TL;DR: This protocol provides a workflow for genome-independent transcriptome analysis leveraging the Trinity platform and presents Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes.

Abstract: De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.

6,369 citations

On Publishing the “Bioinformatics” Special Issue

장병탁, 김삼묘, 허철구

01 Aug 2000

TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.

Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

4,833 citations
