Home
/
Authors
/
Marc R. Wilkins

Author

Marc R. Wilkins

Other affiliations: Geneva College, Swiss Institute of Bioinformatics, University of Cambridge ...read more

Bio: Marc R. Wilkins is an academic researcher from University of New South Wales. The author has contributed to research in topics: Proteome & Methylation. The author has an hindex of 54, co-authored 249 publications receiving 19797 citations. Previous affiliations of Marc R. Wilkins include Geneva College & Swiss Institute of Bioinformatics.

Topics: Proteome, Methylation, Peptide mass fingerprinting, Genome, Gene ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2001
2000
1999
1998
1997
1996
1995
1992

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

Protein identification and analysis tools in the ExPASy server

[...]

Marc R. Wilkins¹, Elisabeth Gasteiger², Amos Marc Bairoch², Jean Emmanuel Sanchez³, Keith L. Williams¹, Ron D. Appel³, Denis Hochstrasser³ - Show less +3 more•Institutions (3)

Macquarie University¹, University of Geneva², Geneva College³

01 Jan 1999-Methods of Molecular Biology

TL;DR: Details are given about protein identification and analysis software that is available through the ExPASy World Wide Web server and the extensive annotation available in the Swiss-Prot database is used.

...read moreread less

Abstract: Protein identification and analysis software performs a central role in the investigation of proteins from two-dimensional (2-D) gels and mass spectrometry. For protein identification, the user matches certain empirically acquired information against a protein database to define a protein as already known or as novel. For protein analysis, information in protein databases can be used to predict certain properties about a protein, which can be useful for its empirical investigation. The two processes are thus complementary. Although there are numerous programs available for those applications, we have developed a set of original tools with a few main goals in mind. Specifically, these are: 1. To utilize the extensive annotation available in the Swiss-Prot database wherever possible, in particular the position-specific annotation in the Swiss-Prot feature tables to take into account posttranslational modifications and protein processing. 2. To develop tools specifically, but not exclusively, applicable to proteins prepared by two dimensional gel electrophoresis and peptide mass fingerprinting experiments. 3. To make all tools available on the World-Wide Web (WWW), and freely usable by the scientific community. In this chapter we give details about protein identification and analysis software that is available through the ExPASy World Wide Web server.

...read moreread less

8,007 citations

Journal Article•DOI•

Progress with Proteome Projects: Why all Proteins Expressed by a Genome Should be Identified and How To Do It

[...]

Marc R. Wilkins¹, Jean-Charles Sanchez², Andrew A. Gooley¹, Ron D. Appel², Ian Humphery-Smith³, Denis F. Hochstrasser², Keith L. Williams¹ - Show less +3 more•Institutions (3)

Macquarie University¹, University of Geneva², University of Sydney³

01 Jan 1996-Biotechnology & Genetic Engineering Reviews

TL;DR: The Progress with Proteome Projects: Why all Proteins Expressed by a Genome Should be Identified and How To Do It as discussed by the authors is an example of such a project.

...read moreread less

Abstract: (1996). Progress with Proteome Projects: Why all Proteins Expressed by a Genome Should be Identified and How To Do It. Biotechnology and Genetic Engineering Reviews: Vol. 13, No. 1, pp. 19-50.

...read moreread less

1,158 citations

Journal Article•DOI•

Progress with gene‐product mapping of the Mollicutes: Mycoplasma genitalium

[...]

Valerie C. Wasinger¹, Stuart J. Cordwell¹, Anne Cerpa-Poljak², Anne Cerpa-Poljak³, Jun X. Yan⁴, Andrew A. Gooley⁴, Marc R. Wilkins⁴, Mark W. Duncan², Ray J. Harris⁵, Keith L. Williams⁴, Ian Humphery-Smith¹ - Show less +7 more•Institutions (5)

University of Sydney¹, Macquarie University², Cooperative Research Centre³, University of New South Wales⁴, University of South Australia⁵

01 Jan 1995-Electrophoresis

TL;DR: A protein map of the smallest known self‐replicating organism, Mycoplasma genitalium, revealed a high proportion of acidic proteins, which allowed proteins to be identified prior to detection of their respective genes via the M. genitalium sequencing initiative.

...read moreread less

Abstract: A protein map of the smallest known self-replicating organism, Mycoplasma genitalium (Class: Mollicutes), revealed a high proportion of acidic proteins. Amino acid composition was used to putatively identify, or provide unique parameters, for 50 gene products separated by two-dimensional gel electrophoresis. A further 19 proteins were subjected to peptide-mass fingerprinting using matrix-assisted laser desorption ionisation-time of flight (MALDI-TOF) mass spectrometry and 4 were subjected to N-terminal Edman degradation. The majority of M. genitalium proteins remain uncharacterised. However, the combined approach of amino acid analysis and peptide-mass fingerprinting allowed gene products to be linked to homologous genes in a variety of organisms. This has allowed proteins to be identified prior to detection of their respective genes via the M. genitalium sequencing initiative. The principle of ‘hierarchical’ analysis for the mass screening of proteins and the analysis of microbial genomes via their protein complement or ‘proteome’ is detailed. Here, characterisation of gene products depends upon the quickest and most economical technologies being employed initially, so as to determine if a large number of proteins are already present in both homologous and heterologous species databases. Initial screening, which lends itself to automation and robotics, can then be followed by more time and cost intensive procedures, when necessary.

...read moreread less

955 citations

Journal Article•DOI•

From Proteins to Proteomes: Large Scale Protein Identification by Two-Dimensional Electrophoresis and Amino Acid Analysis

[...]

Marc R. Wilkins¹, Christian Pasquali², Ron D. Appel², Keli Ou¹, Olivier Golaz², Jean-Charles Sanchez², Jun X. Yan¹, Andrew A. Gooley¹, Graham J. Hughes², Ian Humphery-Smith³, Keith L. Williams¹, Denis F. Hochstrasser² - Show less +8 more•Institutions (3)

Macquarie University¹, University of Geneva², University of Sydney³

01 Jan 1996-Nature Biotechnology

TL;DR: Single protein spots, from polyvinylidene difluoride blots of micropreparative E. coli 2-D gels, were rapidly and economically identified by matching their amino acid composition, estimated pI and molecular weight against all E. bacteria entries in the SWISS-PROT database.

...read moreread less

Abstract: Separation and identification of proteins by two-dimensional (2-D) electrophoresis can be used for protein-based gene expression analysis In this report single protein spots, from polyvinylidene difluoride blots of micropreparative E coli 2-D gels, were rapidly and economically identified by matching their amino acid composition, estimated pI and molecular weight against all E coli entries in the SWISS-PROT database Thirty proteins from an E coli 2-D map were analyzed and identities assigned Three of the proteins were unknown By protein sequencing analysis, 20 of the 27 proteins were correctly identified Importantly, correct identifications showed unambiguous “correct” score patterns While incorrect protein identifications also showed distinctive score patterns, indicating that protein must be identified by other means These techniques allow large-scale screening of the protein complement of simple organisms, or tissues in normal and disease states The computer program described here is accessible via the World Wide Web at URL address (http://expasyhcugech/)

...read moreread less

897 citations

Journal Article•DOI•

The minimum information about a proteomics experiment (MIAPE)

[...]

Chris F. Taylor¹, Chris F. Taylor², Norman W. Paton³, Norman W. Paton¹, Kathryn S. Lilley¹, Kathryn S. Lilley⁴, Pierre-Alain Binz⁵, Pierre-Alain Binz¹, Randall K. Julian¹, Andrew R. Jones³, Andrew R. Jones¹, Weimin Zhu², Weimin Zhu¹, Rolf Apweiler¹, Rolf Apweiler², Ruedi Aebersold⁶, Ruedi Aebersold¹, Eric W. Deutsch⁷, Eric W. Deutsch¹, Michael J. Dunn⁸, Albert J. R. Heck⁹, Alexander Leitner¹⁰, Marcus Macht, Matthias Mann¹¹, Lennart Martens², Lennart Martens¹, Thomas A. Neubert¹², Scott D. Patterson¹³, Peipei Ping¹⁴, Sean L. Seymour¹, Sean L. Seymour¹⁵, Puneet Souda¹⁶, Akira Tsugita, Joël Vandekerckhove¹⁷, Thomas M. Vondriska¹⁴, Julian P. Whitelegge¹⁶, Marc R. Wilkins¹⁸, Ioannnis Xenarios, John R. Yates¹⁹, Henning Hermjakob², Henning Hermjakob¹ - Show less +37 more•Institutions (19)

Wellcome Trust¹, European Bioinformatics Institute², University of Manchester³, University of Cambridge⁴, Swiss Institute of Bioinformatics⁵, École Polytechnique Fédérale de Lausanne⁶, Institute for Systems Biology⁷, University College Dublin⁸, Utrecht University⁹, University of Vienna¹⁰, Max Planck Society¹¹, New York University¹², Amgen¹³, University of California, Los Angeles¹⁴, Applied Biosystems¹⁵, Semel Institute for Neuroscience and Human Behavior¹⁶, Flanders Institute for Biotechnology¹⁷, University of New South Wales¹⁸, Scripps Research Institute¹⁹

01 Aug 2007-Nature Biotechnology

TL;DR: The processes and principles underpinning the development of guidance modules for reporting the use of techniques such as gel electrophoresis and mass spectrometry are described and the ramifications for various interest groups such as experimentalists, funders, publishers and the private sector are discussed.

...read moreread less

Abstract: Both the generation and the analysis of proteomics data are now widespread, and high-throughput approaches are commonplace. Protocols continue to increase in complexity as methods and technologies evolve and diversify. To encourage the standardized collection, integration, storage and dissemination of proteomics data, the Human Proteome Organization's Proteomics Standards Initiative develops guidance modules for reporting the use of techniques such as gel electrophoresis and mass spectrometry. This paper describes the processes and principles underpinning the development of these modules; discusses the ramifications for various interest groups such as experimentalists, funders, publishers and the private sector; addresses the issue of overlap with other reporting guidelines; and highlights the criticality of appropriate tools and resources in enabling 'MIAPE-compliant' reporting.

...read moreread less

703 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The MIQE Guidelines: Minimum Information for Publication of Quantitative Real-Time PCR Experiments

[...]

Stephen A. Bustin¹, Vladimir Benes², Jeremy A. Garson³, Jan Hellemans⁴, Jim F. Huggett³, Mikael Kubista, Reinhold Mueller, Tania Nolan⁵, Michael W. Pfaffl⁶, Gregory L. Shipley⁷, Jo Vandesompele⁴, Carl T. Wittwer⁸, Carl T. Wittwer⁹ - Show less +9 more•Institutions (9)

Queen Mary University of London¹, European Bioinformatics Institute², University College London³, Ghent University Hospital⁴, Sigma-Aldrich⁵, Technische Universität München⁶, University of Texas Health Science Center at Houston⁷, University of Utah⁸, ARUP Laboratories⁹

01 Apr 2009-Clinical Chemistry

TL;DR: The Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guidelines target the reliability of results to help ensure the integrity of the scientific literature, promote consistency between laboratories, and increase experimental transparency.

...read moreread less

Abstract: Background: Currently, a lack of consensus exists on how best to perform and interpret quantitative real-time PCR (qPCR) experiments. The problem is exacerbated by a lack of sufficient experimental detail in many publications, which impedes a reader’s ability to evaluate critically the quality of the results presented or to repeat the experiments. Content: The Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guidelines target the reliability of results to help ensure the integrity of the scientific literature, promote consistency between laboratories, and increase experimental transparency. MIQE is a set of guidelines that describe the minimum information necessary for evaluating qPCR experiments. Included is a checklist to accompany the initial submission of a manuscript to the publisher. By providing all relevant experimental conditions and assay characteristics, reviewers can assess the validity of the protocols used. Full disclosure of all reagents, sequences, and analysis methods is necessary to enable other investigators to reproduce results. MIQE details should be published either in abbreviated form or as an online supplement. Summary: Following these guidelines will encourage better experimental practice, allowing more reliable and unequivocal interpretation of qPCR results.

...read moreread less

12,469 citations

Journal Article•DOI•

Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks

[...]

Cole Trapnell¹, Adam Roberts², Loyal A. Goff³, Loyal A. Goff⁴, Loyal A. Goff¹, Geo Pertea⁵, Daehwan Kim⁶, Daehwan Kim⁷, David R. Kelley³, David R. Kelley¹, Harold Pimentel², Steven L. Salzberg⁵, John L. Rinn³, John L. Rinn¹, Lior Pachter² - Show less +11 more•Institutions (7)

Broad Institute¹, University of California, Berkeley², Harvard University³, Massachusetts Institute of Technology⁴, Johns Hopkins University⁵, University of Maryland, College Park⁶, Johns Hopkins University School of Medicine⁷

01 Mar 2012-Nature Protocols

TL;DR: This protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results, which takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.

...read moreread less

Abstract: Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.

...read moreread less

10,913 citations

Journal Article•DOI•

SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling.

[...]

Nicolas Guex, Manuel C. Peitsch

01 Jan 1997-Electrophoresis

TL;DR: An environment for comparative protein modeling is developed that consists of SWISS‐MODEL, a server for automated comparativeprotein modeling and of the SWiss‐PdbViewer, a sequence to structure workbench that provides a large selection of structure analysis and display tools.

...read moreread less

Abstract: Comparative protein modeling is increasingly gaining interest since it is of great assistance during the rational design of mutagenesis experiments. The availability of this method, and the resulting models, has however been restricted by the availability of expensive computer hardware and software. To overcome these limitations, we have developed an environment for comparative protein modeling that consists of SWISS-MODEL, a server for automated comparative protein modeling and of the SWISS-PdbViewer, a sequence to structure workbench. The Swiss-PdbViewer not only acts as a client for SWISS-MODEL, but also provides a large selection of structure analysis and display tools. In addition, we provide the SWISS-MODEL Repository, a database containing more than 3500 automatically generated protein models. By making such tools freely available to the scientific community, we hope to increase the use of protein structures and models in the process of experiment design.

...read moreread less

10,713 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•DOI•

Probability-based protein identification by searching sequence databases using mass spectrometry data.

[...]

David N. Perkins, Darryl J. Pappin, David M. Creasy, John S. Cottrell

01 Dec 1999-Electrophoresis

TL;DR: A new computer program, Mascot, is presented, which integrates all three types of search for protein identification by searching a sequence database using mass spectrometry data, and the scoring algorithm is probability based.

...read moreread less

Abstract: Several algorithms have been described in the literature for protein identification by searching a sequence database using mass spectrometry data. In some approaches, the experimental data are peptide molecular weights from the digestion of a protein by an enzyme. Other approaches use tandem mass spectrometry (MS/MS) data from one or more peptides. Still others combine mass data with amino acid sequence data. We present results from a new computer program, Mascot, which integrates all three types of search. The scoring algorithm is probability based, which has a number of advantages: (i) A simple rule can be used to judge whether a result is significant or not. This is particularly useful in guarding against false positives. (ii) Scores can be compared with those from other types of search, such as sequence homology. (iii) Search parameters can be readily optimised by iteration. The strengths and limitations of probability-based scoring are discussed, particularly in the context of high throughput, fully automated protein identification.

...read moreread less

8,195 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse