Home
/
Authors
/
Matthew Tepel

Author

Matthew Tepel

Bio: Matthew Tepel is an academic researcher from Harvard University. The author has contributed to research in topics: De novo peptide sequencing & Tandem mass spectrometry. The author has an hindex of 3, co-authored 3 publications receiving 472 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry

[...]

Ting Chen¹, Ming-Yang Kao², Matthew Tepel¹, John Rush¹, George M. Church¹ - Show less +1 more•Institutions (2)

Harvard University¹, Yale University²

01 Feb 2000

TL;DR: The de novo peptide sequencing problem is to reconstruct the peptide sequence from a given tandem mass spectral data of k ions by implicitly transforming the spectral data into an NC-spectrum graph G (V, E) where /V/ = 2k + 2, and this approach can be further used to discover a modified amino acid in O(/V//E/) time.

...read moreread less

242 citations

Journal Article•DOI•

A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry.

[...]

Ting Chen¹, Ming-Yang Kao, Matthew Tepel¹, John Rush¹, George M. Church¹ - Show less +1 more•Institutions (1)

Harvard University¹

01 Jan 2001-Journal of Computational Biology

TL;DR: In this paper, the authors proposed a dynamic programming-based method to reconstruct the peptide sequence from a given tandem mass spectral data of k ions by implicitly transforming the spectral data into an NC-spectrum graph G (V, E).

...read moreread less

Abstract: Tandem mass spectrometry fragments a large number of molecules of the same peptide sequence into charged molecules of prefix and suffix peptide subsequences and then measures mass/charge ratios of these ions. The de novo peptide sequencing problem is to reconstruct the peptide sequence from a given tandem mass spectral data of k ions. By implicitly transforming the spectral data into an NC-spectrum graph G (V, E) where /V/ = 2k + 2, we can solve this problem in O(/V//E/) time and O(/V/2) space using dynamic programming. For an ideal noise-free spectrum with only b- and y-ions, we improve the algorithm to O(/V/ + /E/) time and O(/V/) space. Our approach can be further used to discover a modified amino acid in O(/V//E/) time. The algorithms have been implemented and tested on experimental data.

...read moreread less

224 citations

Posted Content•

A Dynamic Programming Approach to De Novo Peptide Sequencing via Tandem Mass Spectrometry

[...]

Ting Chen¹, Ming-Yang Kao, Matthew Tepel¹, John Rush¹, George M. Church¹ - Show less +1 more•Institutions (1)

Harvard University¹

18 Jan 2001-arXiv: Computational Engineering, Finance, and Science

TL;DR: In this paper, the authors proposed to transform the spectral data into an NC-spectrum graph and solve the de novo peptide sequencing problem in O(|V|+|E|) time and space using dynamic programming.

...read moreread less

Abstract: The tandem mass spectrometry fragments a large number of molecules of the same peptide sequence into charged prefix and suffix subsequences, and then measures mass/charge ratios of these ions. The de novo peptide sequencing problem is to reconstruct the peptide sequence from a given tandem mass spectral data of k ions. By implicitly transforming the spectral data into an NC-spectrum graph G=(V,E) where |V|=2k+2, we can solve this problem in O(|V|+|E|) time and O(|V|) space using dynamic programming. Our approach can be further used to discover a modified amino acid in O(|V||E|) time and to analyze data with other types of noise in O(|V||E|) time. Our algorithms have been implemented and tested on actual experimental data.

...read moreread less

15 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

PEAKS: powerful software for peptide de novo sequencing by tandem mass spectrometry

[...]

Bin Ma¹, Kaizhong Zhang¹, Christopher Hendrie, Chengzhi Liang, Ming Li², Amanda Doherty-Kirby¹, Gilles A. Lajoie¹ - Show less +3 more•Institutions (2)

University of Western Ontario¹, University of Waterloo²

30 Oct 2003-Rapid Communications in Mass Spectrometry

TL;DR: A new de novo sequencing software package, PEAKS, is described, to extract amino acid sequence information without the use of databases, using a new model and a new algorithm to efficiently compute the best peptide sequences whose fragment ions can best interpret the peaks in the MS/MS spectrum.

...read moreread less

Abstract: A number of different approaches have been described to identify proteins from tandem mass spectrometry (MS/MS) data. The most common approaches rely on the available databases to match experimental MS/MS data. These methods suffer from several drawbacks and cannot be used for the identification of proteins from unknown genomes. In this communication, we describe a new de novo sequencing software package, PEAKS, to extract amino acid sequence information without the use of databases. PEAKS uses a new model and a new algorithm to efficiently compute the best peptide sequences whose fragment ions can best interpret the peaks in the MS/MS spectrum. The output of the software gives amino acid sequences with confidence scores for the entire sequences, as well as an additional novel positional scoring scheme for portions of the sequences. The performance of PEAKS is compared with Lutefisk, a well-known de novo sequencing software, using quadrupole-time-of-flight (Q-TOF) data obtained for several tryptic peptides from standard proteins.

...read moreread less

1,239 citations

Journal Article•DOI•

Machine learning in bioinformatics

[...]

Pedro Larrañaga¹, Borja Calvo¹, Roberto Santana², Concha Bielza¹, Josu Galdiano¹, Iñaki Inza¹, Jose A. Lozano¹, Rubén Armañanzas¹, Guzmán Santafé¹, Aritz Pérez², Víctor Robles¹ - Show less +7 more•Institutions (2)

University of the Basque Country¹, Technical University of Madrid²

01 Mar 2006-Briefings in Bioinformatics

TL;DR: Modelling methods, such as supervised classification, clustering and probabilistic graphical models for knowledge discovery, as well as deterministic and stochastic heuristics for optimization, are presented.

...read moreread less

Abstract: This article reviews machine learning methods for bioinformatics. It presents modelling methods, such as supervised classification, clustering and probabilistic graphical models for knowledge discovery, as well as deterministic and stochastic heuristics for optimization. Applications in genomics, proteomics, systems biology, evolution and text mining are also shown.

...read moreread less

805 citations

Journal Article•DOI•

De Novo Peptide Sequencing via Tandem Mass Spectrometry

[...]

Vlado Dančík¹, Theresa A. Addona², Karl R. Clauser², James E. Vath², Pavel A. Pevzner³ - Show less +1 more•Institutions (3)

Slovak Academy of Sciences¹, Millennium Pharmaceuticals², University of Southern California³

01 Jan 1999-Journal of Computational Biology

TL;DR: A new algorithm, SHERENGA, is developed for de novo interpretation of MS/MS spectral interpretation that automatically learns fragment ion types and intensity thresholds from a collection of test spectra generated from any type of mass spectrometer.

...read moreread less

Abstract: Peptide sequencing via tandem mass spectrometry (MS/MS) is one of the most powerful tools in proteomics for identifying proteins. Because complete genome sequences are accumulating rapidly, the recent trend in interpretation of MS/MS spectra has been database search. However, de novo MS/MS spectral interpretation remains an open problem typically involving manual interpretation by expert mass spectrometrists. We have developed a new algorithm, SHERENGA, for de novo interpretation that automatically learns fragment ion types and intensity thresholds from a collection of test spectra generated from any type of mass spectrometer. The test data are used to construct optimal path scoring in the graph representations of MS/MS spectra. A ranked list of high scoring paths corresponds to potential peptide sequences. SHERENGA is most useful for interpreting sequences of peptides resulting from unknown proteins and for validating the results of database search algorithms in fully automated, high-throughput peptide s...

...read moreread less

601 citations

Journal Article•DOI•

InsPecT: identification of posttranslationally modified peptides from tandem mass spectra.

[...]

Stephen Tanner¹, Hongjun Shu¹, Ari Frank¹, Ling-Chi Wang¹, Ebrahim Zandi¹, Marc C. Mumby¹, Pavel A. Pevzner¹, Vineet Bafna¹ - Show less +4 more•Institutions (1)

University of Texas Southwestern Medical Center¹

09 Jun 2005-Analytical Chemistry

TL;DR: A tool is described, InsPecT, to identify posttranslational modifications using tandem mass spectrometry data, which identifies modified peptides with better or equivalent accuracy than other database search tools while being 2 orders of magnitude faster than SEQUEST, and substantially faster than X!TANDEM on complex mixtures.

...read moreread less

Abstract: Reliable identification of posttranslational modifications is key to understanding various cellular regulatory processes. We describe a tool, InsPecT, to identify posttranslational modifications using tandem mass spectrometry data. InsPecT constructs database filters that proved to be very successful in genomics searches. Given an MS/MS spectrum S and a database D, a database filter selects a small fraction of database D that is guaranteed (with high probability) to contain a peptide that produced S. InsPecT uses peptide sequence tags as efficient filters that reduce the size of the database by a few orders of magnitude while retaining the correct peptide with very high probability. In addition to filtering, InsPecT also uses novel algorithms for scoring and validating in the presence of modifications, without explicit enumeration of all variants. InsPecT identifies modified peptides with better or equivalent accuracy than other database search tools while being 2 orders of magnitude faster than SEQUEST, ...

...read moreread less

588 citations

Journal Article•DOI•

ProLuCID: an improved SEQUEST-like algorithm with enhanced sensitivity and specificity

[...]

Tao Xu¹, Tao Xu², Sung Kyu Park², John D. Venable², James A. Wohlschlegel², Jolene K. Diedrich², Daniel Cociorva², Bingwen Lu², Lujian Liao², J. Hewel², Xuemei Han², Catherine C. L. Wong², Bryan R. Fonslow², Claire M. Delahunty², Yu Gao², H. Shah², John R. Yates² - Show less +13 more•Institutions (2)

Dow AgroSciences¹, Scripps Research Institute²

03 Nov 2015-Journal of Proteomics

TL;DR: ProLuCID was able to identify as many as 25% more proteins than SEQUEST and is able to take advantage of high resolution MS/MS spectra leading to further improvements in specificity when compared to low resolution tandem MS data.

...read moreread less

420 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54

Collapse