Home
/
Authors
/
Jimin Pei

Author

Jimin Pei

University of Texas Southwestern Medical Center

Other affiliations: Howard Hughes Medical Institute

Bio: Jimin Pei is an academic researcher from University of Texas Southwestern Medical Center. The author has contributed to research in topics: Multiple sequence alignment & Alignment-free sequence analysis. The author has an hindex of 34, co-authored 80 publications receiving 6999 citations. Previous affiliations of Jimin Pei include Howard Hughes Medical Institute.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2015
2014
2013
2012
2011
2009
2008
2007
2006
2005
2004
2003
2001

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Substrate and Functional Diversity of Lysine Acetylation Revealed by a Proteomics Survey

[...]

Sung Chan Kim¹, Robert Sprung¹, Yue Chen¹, Yingda Xu¹, Haydn L. Ball¹, Jimin Pei¹, Tzuling Cheng¹, Yoonjung Kho¹, Hao Xiao, Lin Xiao², Nick V. Grishin¹, Michael A. White¹, Xiang-Jiao Yang², Yingming Zhao¹ - Show less +10 more•Institutions (2)

University of Texas Southwestern Medical Center¹, McGill University²

18 Aug 2006-Molecular Cell

TL;DR: This study reveals previously unappreciated roles for lysine acetylation in the regulation of diverse cellular pathways outside of the nucleus, including many longevity regulators and metabolism enzymes.

...read moreread less

1,422 citations

Journal Article•DOI•

PROMALS3D: a tool for multiple protein sequence and structure alignments.

[...]

Jimin Pei¹, Bong Hyun Kim², Nick V. Grishin•Institutions (2)

Howard Hughes Medical Institute¹, University of Texas Southwestern Medical Center²

01 Apr 2008-Nucleic Acids Research

TL;DR: This work explores the use of 3D structural information to guide sequence alignments constructed by the MSA program PROMALS, and outperforms a number of existing methods for constructing multiple sequence or structural alignments using both reference-dependent and reference-independent evaluation methods.

...read moreread less

Abstract: Although multiple sequence alignments (MSAs) are essential for a wide range of applications from structure modeling to prediction of functional sites, construction of accurate MSAs for distantly related proteins remains a largely unsolved problem. The rapidly increasing database of spatial structures is a valuable source to improve alignment quality. We explore the use of 3D structural information to guide sequence alignments constructed by our MSA program PROMALS. The resulting tool, PROMALS3D, automatically identifies homologs with known 3D structures for the input sequences, derives structural constraints through structure-based alignments and combines them with sequence constraints to construct consistency-based multiple sequence alignments. The output is a consensus alignment that brings together sequence and structural information about input proteins and their homologs. PROMALS3D can also align sequences of multiple input structures, with the output representing a multiple structure-based alignment refined in combination with sequence constraints. The advantage of PROMALS3D is that it gives researchers an easy way to produce high-quality alignments consistent with both sequences and structures of proteins. PROMALS3D outperforms a number of existing methods for constructing multiple sequence or structural alignments using both reference-dependent and reference-independent evaluation methods.

...read moreread less

1,204 citations

Journal Article•DOI•

Lysine Acetylation Is a Highly Abundant and Evolutionarily Conserved Modification in Escherichia Coli

[...]

Junmei Zhang¹, Robert Sprung¹, Jimin Pei¹, Xiaohong Tan², Sung Chan Kim³, Heng Zhu⁴, Chuan-Fa Liu², Nick V. Grishin¹, Yingming Zhao¹ - Show less +5 more•Institutions (4)

University of Texas Southwestern Medical Center¹, Nanyang Technological University², Hallym University³, Johns Hopkins University School of Medicine⁴

01 Feb 2009-Molecular & Cellular Proteomics

TL;DR: The first global screening of lysine acetylation is reported, identifying 138 modification sites in 91 proteins from Escherichia coli, showing an intimate link of this modification to energy metabolism and implying that functions oflysineacetylation beyond regulation of gene expression are evolutionarily conserved from bacteria to mammals.

...read moreread less

448 citations

Journal Article•DOI•

AL2CO: calculation of positional conservation in a protein sequence alignment

[...]

Jimin Pei¹, Nick V. Grishin•Institutions (1)

University of Texas Southwestern Medical Center¹

01 Aug 2001-Bioinformatics

TL;DR: A program to calculate a conservation index at each position in a multiple sequence alignment using several methods suggests that conservation indices should be a valuable tool of alignment quality assessment and might be used as an objective function for refinement of multiple alignments.

...read moreread less

Abstract: MOTIVATION Amino acid sequence alignments are widely used in the analysis of protein structure, function and evolutionary relationships. Proteins within a superfamily usually share the same fold and possess related functions. These structural and functional constraints are reflected in the alignment conservation patterns. Positions of functional and/or structural importance tend to be more conserved. Conserved positions are usually clustered in distinct motifs surrounded by sequence segments of low conservation. Poorly conserved regions might also arise from the imperfections in multiple alignment algorithms and thus indicate possible alignment errors. Quantification of conservation by attributing a conservation index to each aligned position makes motif detection more convenient. Mapping these conservation indices onto a protein spatial structure helps to visualize spatial conservation features of the molecule and to predict functionally and/or structurally important sites. Analysis of conservation indices could be a useful tool in detection of potentially misaligned regions and will aid in improvement of multiple alignments. RESULTS We developed a program to calculate a conservation index at each position in a multiple sequence alignment using several methods. Namely, amino acid frequencies at each position are estimated and the conservation index is calculated from these frequencies. We utilize both unweighted frequencies and frequencies weighted using two different strategies. Three conceptually different approaches (entropy-based, variance-based and matrix score-based) are implemented in the algorithm to define the conservation index. Calculating conservation indices for 35522 positions in 284 alignments from SMART database we demonstrate that different methods result in highly correlated (correlation coefficient more than 0.85) conservation indices. Conservation indices show statistically significant correlation between sequentially adjacent positions i and i + j, where j < 13, and averaging of the indices over the window of three positions is optimal for motif detection. Positions with gaps display substantially lower conservation properties. We compare conservation properties of the SMART alignments or FSSP structural alignments to those of the ClustalW alignments. The results suggest that conservation indices should be a valuable tool of alignment quality assessment and might be used as an objective function for refinement of multiple alignments. AVAILABILITY The C code of the AL2CO program and its pre-compiled versions for several platforms as well as the details of the analysis are freely available at ftp://iole.swmed.edu/pub/al2co/.

...read moreread less

427 citations

Journal Article•DOI•

PROMALS: towards accurate multiple sequence alignments of distantly related proteins.

[...]

Jimin Pei¹, Nick V. Grishin•Institutions (1)

University of Texas Southwestern Medical Center¹

01 Apr 2007-Bioinformatics

TL;DR: This work developed PROMALS, a multiple alignment method that shows promising results for protein homologs with sequence identity below 10%, aligning close to half of the amino acid residues correctly on average, about three times more accurate than traditional pairwise sequence alignment methods.

...read moreread less

Abstract: Motivation: Accurate multiple sequence alignments are essential in protein structure modeling, functional prediction and efficient planning of experiments. Although the alignment problem has attracted considerable attention, preparation of high-quality alignments for distantly related sequences remains a difficult task. Results: We developed PROMALS, a multiple alignment method that shows promising results for protein homologs with sequence identity below 10%, aligning close to half of the amino acid residues correctly on average. This is about three times more accurate than traditional pairwise sequence alignment methods. PROMALS algorithm derives its strength from several sources: (i) sequence database searches to retrieve additional homologs; (ii) accurate secondary structure prediction; (iii) a hidden Markov model that uses a novel combined scoring of amino acids and secondary structures; (iv) probabilistic consistency-based scoring applied to progressive alignment of profiles. Compared to the best alignment methods that do not use secondary structure prediction and database searches (e.g. MUMMALS, ProbCons and MAFFT), PROMALS is up to 30% more accurate, with improvement being most prominent for highly divergent homologs. Compared to SPEM and HHalign, which also employ database searches and secondary structure prediction, PROMALS shows an accuracy improvement of several percent. Availability: The PROMALS web server is available at:

...read moreread less

346 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

다중혈관 관상동맥 환자에서 y-문합을 이용하여 양쪽 내흉동맥만을 사용한 우회술의 조기 성적

[...]

성기익, 이영탁, 박계현, 전태국, 박표원, 한일용, 장윤희 - Show less +3 more

01 Mar 2003-The Korean Journal of Thoracic and Cardiovascular Surgery

28,685 citations

Journal Article•DOI•

MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

[...]

Kazutaka Katoh¹, Daron M. Standley¹•Institutions (1)

Osaka University¹

01 Apr 2013-Molecular Biology and Evolution

TL;DR: This version of MAFFT has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update.

...read moreread less

Abstract: We report a major update of the MAFFT multiple sequence alignment program. This version has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update. This report shows actual examples to explain how these features work, alone and in combination. Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations. We discuss how to avoid misalignments, and our ongoing efforts to overcome such limitations.

...read moreread less

27,771 citations

“Bioinformatics” 특집을 내면서

[...]

장병탁, 김삼묘, 허철구

01 Aug 2000

TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.

...read moreread less

Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

...read moreread less

4,833 citations

Journal Article•DOI•

Deciphering key features in protein structures with the new ENDscript server

[...]

Xavier Robert¹, Patrice Gouet¹•Institutions (1)

University of Lyon¹

01 Jul 2014-Nucleic Acids Research

TL;DR: This major upgrade has been fully re-engineered to enhance speed, accuracy and usability with interactive 3D visualization of ENDscript 2 and ESPript 3 to handle a large number of data with reduced computation time.

...read moreread less

Abstract: ENDscript 2 is a friendly Web server for extracting and rendering a comprehensive analysis of primary to quaternary protein structure information in an automated way. This major upgrade has been fully re-engineered to enhance speed, accuracy and usability with interactive 3D visualization. It takes advantage of the new version 3 of ESPript, our well-known sequence alignment renderer, improved to handle a large number of data with reduced computation time. From a single PDB entry or file, ENDscript produces high quality figures displaying multiple sequence alignment of proteins homologous to the query, colored according to residue conservation. Furthermore, the experimental secondary structure elements and a detailed set of relevant biophysical and structural data are depicted. All this information and more are now mapped on interactive 3D PyMOL representations. Thanks to its adaptive and rigorous algorithm, beginner to expert users can modify settings to fine-tune ENDscript to their needs. ENDscript has also been upgraded as an open platform for the visualization of multiple biochemical and structural data coming from external biotool Web servers, with both 2D and 3D representations. ENDscript 2 and ESPript 3 are freely available at http://endscript.ibcp.fr and http://espript.ibcp.fr, respectively.

...read moreread less

4,722 citations

Journal Article•DOI•

Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions

[...]

Chunaram Choudhary¹, Chanchal Kumar¹, Florian Gnad¹, Michael L. Nielsen¹, Michael Rehman¹, Tobias C. Walther¹, Jesper V. Olsen¹, Matthias Mann¹ - Show less +4 more•Institutions (1)

Max Planck Society¹

14 Aug 2009-Science

TL;DR: A proteomic-scale analysis of protein acetylation suggests that it is an important biological regulatory mechanism and the regulatory scope of lysine acetylations is broad and comparable with that of other major posttranslational modifications.

...read moreread less

Abstract: Lysine acetylation is a reversible posttranslational modification of proteins and plays a key role in regulating gene expression. Technological limitations have so far prevented a global analysis of lysine acetylation's cellular roles. We used high-resolution mass spectrometry to identify 3600 lysine acetylation sites on 1750 proteins and quantified acetylation changes in response to the deacetylase inhibitors suberoylanilide hydroxamic acid and MS-275. Lysine acetylation preferentially targets large macromolecular complexes involved in diverse cellular processes, such as chromatin remodeling, cell cycle, splicing, nuclear transport, and actin nucleation. Acetylation impaired phosphorylation-dependent interactions of 14-3-3 and regulated the yeast cyclin-dependent kinase Cdc28. Our data demonstrate that the regulatory scope of lysine acetylation is broad and comparable with that of other major posttranslational modifications.

...read moreread less

3,787 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse