Home
/
Authors
/
Minkyung Baek

Author

Minkyung Baek

Other affiliations: Seoul National University, UPRRP College of Natural Sciences

Bio: Minkyung Baek is an academic researcher from University of Washington. The author has contributed to research in topics: Biology & Medicine. The author has an hindex of 12, co-authored 34 publications receiving 770 citations. Previous affiliations of Minkyung Baek include Seoul National University & UPRRP College of Natural Sciences.

Topics: Biology, Medicine, CASP, Protein design, Protein structure prediction ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Accurate prediction of protein structures and interactions using a three-track neural network

[...]

Minkyung Baek¹, Frank DiMaio¹, Ivan Anishchenko¹, Justas Dauparas¹, Sergey Ovchinnikov², Gyu Rie Lee¹, Jue Wang¹, Qian Cong³, Lisa N. Kinch³, R. Dustin Schaeffer³, Claudia Millán⁴, Hahnbeom Park¹, Carson Adams¹, Caleb R. Glassman⁵, Andy DeGiovanni⁶, Jose Henrique Pereira⁶, Andria V. Rodrigues⁶, Alberdina A. van Dijk⁷, Ana C. Ebrecht⁷, Diederik J. Opperman⁸, Theo Sagmeister⁹, Christoph Buhlheller⁹, Christoph Buhlheller¹⁰, Tea Pavkov-Keller⁹, Manoj K. Rathinaswamy¹¹, Udit Dalwadi¹², Calvin K. Yip¹², John E. Burke¹¹, K. Christopher Garcia, Nick V. Grishin³, Paul D. Adams¹³, Paul D. Adams⁶, Randy J. Read⁴, David Baker¹ - Show less +30 more•Institutions (13)

University of Washington¹, Harvard University², University of Texas Southwestern Medical Center³, University of Cambridge⁴, Stanford University⁵, Lawrence Berkeley National Laboratory⁶, North-West University⁷, University of the Free State⁸, University of Graz⁹, Medical University of Graz¹⁰, University of Victoria¹¹, University of British Columbia¹², University of California, Berkeley¹³

20 Aug 2021-Science

TL;DR: In this article, a three-track network is proposed to combine information at the one-dimensional (1D) sequence level, the 2D distance map level, and the 3D coordinate level.

...read moreread less

Abstract: DeepMind presented notably accurate predictions at the recent 14th Critical Assessment of Structure Prediction (CASP14) conference. We explored network architectures that incorporate related ideas and obtained the best performance with a three-track network in which information at the one-dimensional (1D) sequence level, the 2D distance map level, and the 3D coordinate level is successively transformed and integrated. The three-track network produces structure predictions with accuracies approaching those of DeepMind in CASP14, enables the rapid solution of challenging x-ray crystallography and cryo-electron microscopy structure modeling problems, and provides insights into the functions of proteins of currently unknown structure. The network also enables rapid generation of accurate protein-protein complex models from sequence information alone, short-circuiting traditional approaches that require modeling of individual subunits followed by docking. We make the method available to the scientific community to speed biological research.

...read moreread less

1,907 citations

Journal Article•DOI•

Computed structures of core eukaryotic protein complexes.

[...]

Ian R. Humphreys¹, Jimin Pei², Minkyung Baek¹, Aditya Krishnakumar¹, Ivan Anishchenko¹, Sergey Ovchinnikov³, Jing Zhang², Travis J. Ness⁴, Sudeep Banjade⁵, Saket R. Bagde⁵, Viktoriya G. Stancheva⁶, Xiao-Han Li⁶, Kaixian Liu⁷, Zhi Zheng⁷, Zhi Zheng⁸, Daniel J. Barrero⁹, Upasana Roy¹⁰, Jochen Kuper¹¹, Israel S. Fernández¹², Barnabas Szakal, Dana Branzei, Josep Rizo², Caroline Kisker¹¹, Eric C. Greene¹⁰, Sue Biggins⁹, Scott Keeney⁸, Scott Keeney⁷, Elizabeth A. Miller⁶, J. Christopher Fromme⁵, Tamara L. Hendrickson⁴, Qian Cong², David Baker¹ - Show less +28 more•Institutions (12)

University of Washington¹, University of Texas Southwestern Medical Center², Harvard University³, Wayne State University⁴, Cornell University⁵, Laboratory of Molecular Biology⁶, Memorial Sloan Kettering Cancer Center⁷, Kettering University⁸, Fred Hutchinson Cancer Research Center⁹, Columbia University¹⁰, University of Würzburg¹¹, St. Jude Children's Research Hospital¹²

11 Nov 2021-Science

TL;DR: The structures of many eukaryotic protein complexes are unknown, and there are likely many protein-protein interactions not yet identified as mentioned in this paper, but these structures play critical roles in biology.

...read moreread less

Abstract: Protein-protein interactions play critical roles in biology, but the structures of many eukaryotic protein complexes are unknown, and there are likely many interactions not yet identified. We take ...

...read moreread less

215 citations

Journal Article•DOI•

Prediction of homoprotein and heteroprotein complexes by protein docking and template-based modeling: A CASP-CAPRI experiment.

[...]

Marc F. Lensink, Sameer Velankar¹, Andriy Kryshtafovych, Shen You Huang², Dina Schneidman-Duhovny, Andrej Sali³, Joan Segura⁴, Narcis Fernandez-Fuentes⁵, Shruthi Viswanath⁶, Ron Elber⁶, Sergei Grudinin⁷, Petr Popov⁷, Emilie Neveu⁷, Hasup Lee, Minkyung Baek, Sangwoo Park, Lim Heo, Gyu Rie Lee, Chaok Seok, Sanbo Qin⁸, Huan-Xiang Zhou⁸, David W. Ritchie⁹, Bernard Maigret¹⁰, Marie-Dominique Devignes¹⁰, Anisah W. Ghoorah¹¹, Mieczyslaw Torchala¹², Raphael A. G. Chaleil¹², Paul A. Bates¹², Efrat Ben-Zeev¹³, Miriam Eisenstein¹³, Surendra S. Negi¹⁴, Zhiping Weng¹⁵, Thom Vreven¹⁵, Brian G. Pierce¹⁵, Tyler M. Borrman¹⁵, Jinchao Yu¹⁶, Françoise Ochsenbein¹⁶, Raphael Guerois¹⁶, Anna Vangone, João P. G. L. M. Rodrigues, Gydo C. P. van Zundert, Mehdi Nellen, Li C. Xue, Ezgi Karaca, Adrien S. J. Melquiond, Koen M. Visscher, Panagiotis L. Kastritis, Alexandre M. J. J. Bonvin, Xianjin Xu, Liming Qiu, Chengfei Yan, Jilong Li, Zhiwei Ma, Jianlin Cheng, Xiaoqin Zou, Yang Shen¹⁷, Lenna X. Peterson¹⁸, Hyung Rae Kim¹⁸, Amit Roy¹⁸, Amit Roy¹⁹, Xusi Han¹⁸, Juan Esquivel-Rodríguez¹⁸, Daisuke Kihara¹⁸, Xiaofeng Yu²⁰, Neil J. Bruce²⁰, Jonathan C. Fuller²⁰, Rebecca C. Wade²¹, Ivan Anishchenko²², Petras J. Kundrotas²², Ilya A. Vakser²², Kenichiro Imai²³, Kazunori D. Yamada²³, Toshiyuki Oda²³, Tsukasa Nakamura²⁴, Kentaro Tomii²³, Chiara Pallara, Miguel Romero-Durana, Brian Jiménez-García, Iain H. Moal, Juan Fernández-Recio, Jong Young Joung²⁵, Jong Yun Kim²⁵, Keehyoung Joo²⁵, Jooyoung Lee²⁶, Jooyoung Lee²⁵, Dima Kozakov²⁷, Sandor Vajda²⁷, Scott E. Mottarella²⁷, David R. Hall²⁷, Dmitri Beglov²⁷, Artem B. Mamonov²⁷, Bing Xia²⁷, Tanggis Bohnuud²⁷, Carlos A. Del Carpio²⁸, Carlos A. Del Carpio²⁹, Eichiro Ichiishi³⁰, Nicholas A. Marze, Daisuke Kuroda, Shourya S. Roy Burman, Jeffrey J. Gray³¹, Edrisse Chermak³², Luigi Cavallo³², Romina Oliva³³, Andrey Tovchigrechko³⁴, Shoshana J. Wodak - Show less +101 more•Institutions (34)

Wellcome Trust¹, University of Missouri², California Institute for Quantitative Biosciences³, Spanish National Research Council⁴, Aberystwyth University⁵, University of Texas at Austin⁶, University of Grenoble⁷, Florida State University⁸, French Institute for Research in Computer Science and Automation⁹, Centre national de la recherche scientifique¹⁰, University of Mauritius¹¹, Francis Crick Institute¹², Weizmann Institute of Science¹³, University of Texas Medical Branch¹⁴, University of Massachusetts Amherst¹⁵, Université Paris-Saclay¹⁶, Toyota Technological Institute at Chicago¹⁷, Purdue University¹⁸, Rocky Mountain Laboratories¹⁹, Heidelberg Institute for Theoretical Studies²⁰, Interdisciplinary Center for Scientific Computing²¹, University of Kansas²², National Institute of Advanced Industrial Science and Technology²³, University of Tokyo²⁴, Protein Sciences²⁵, Korea Institute for Advanced Study²⁶, Boston University²⁷, Pacific Institute²⁸, Kyoto Institute of Technology²⁹, International University of Health and Welfare³⁰, Johns Hopkins University³¹, King Abdullah University of Science and Technology³², University of Naples Federico II³³, J. Craig Venter Institute³⁴

01 Jun 2016-Proteins

TL;DR: Results show that the prediction of homodimer assemblies by homology modeling techniques and docking calculations is quite successful for targets featuring large enough subunit interfaces to represent stable associations, and that docking procedures tend to perform better than standard homology modeled techniques.

...read moreread less

Abstract: We present the results for CAPRI Round 30, the first joint CASP-CAPRI experiment, which brought together experts from the protein structure prediction and protein-protein docking communities. The Round comprised 25 targets from amongst those submitted for the CASP11 prediction experiment of 2014. The targets included mostly homodimers, a few homotetramers, and two heterodimers, and comprised protein chains that could readily be modeled using templates from the Protein Data Bank. On average 24 CAPRI groups and 7 CASP groups submitted docking predictions for each target, and 12 CAPRI groups per target participated in the CAPRI scoring experiment. In total more than 9500 models were assessed against the 3D structures of the corresponding target complexes. Results show that the prediction of homodimer assemblies by homology modeling techniques and docking calculations is quite successful for targets featuring large enough subunit interfaces to represent stable associations. Targets with ambiguous or inaccurate oligomeric state assignments, often featuring crystal contact-sized interfaces, represented a confounding factor. For those, a much poorer prediction performance was achieved, while nonetheless often providing helpful clues on the correct oligomeric state of the protein. The prediction performance was very poor for genuine tetrameric targets, where the inaccuracy of the homology-built subunit models and the smaller pair-wise interfaces severely limited the ability to derive the correct assembly mode. Our analysis also shows that docking procedures tend to perform better than standard homology modeling techniques and that highly accurate models of the protein components are not always required to identify their association modes with acceptable accuracy. Proteins 2016; 84(Suppl 1):323-348. © 2016 Wiley Periodicals, Inc.

...read moreread less

139 citations

Journal Article•DOI•

Improved protein structure refinement guided by deep learning based accuracy estimation.

[...]

Naozumi Hiranuma¹, Hahnbeom Park¹, Minkyung Baek¹, Ivan Anishchenko¹, Justas Dauparas¹, David Baker¹, David Baker² - Show less +3 more•Institutions (2)

University of Washington¹, Howard Hughes Medical Institute²

26 Feb 2021-Nature Communications

TL;DR: DeepAccNet as discussed by the authors uses 3D convolutions to evaluate local atomic environments followed by 2D convolution to provide their global contexts and outperforms other methods that similarly predict the accuracy of protein structure models.

...read moreread less

Abstract: We develop a deep learning framework (DeepAccNet) that estimates per-residue accuracy and residue-residue distance signed error in protein models and uses these predictions to guide Rosetta protein structure refinement. The network uses 3D convolutions to evaluate local atomic environments followed by 2D convolutions to provide their global contexts and outperforms other methods that similarly predict the accuracy of protein structure models. Overall accuracy predictions for X-ray and cryoEM structures in the PDB correlate with their resolution, and the network should be broadly useful for assessing the accuracy of both predicted structure models and experimentally determined structures and identifying specific regions likely to be in error. Incorporation of the accuracy predictions at multiple stages in the Rosetta refinement protocol considerably increased the accuracy of the resulting protein structure models, illustrating how deep learning can improve search for global energy minima of biomolecules. Here the authors present DeepAccNet, a deep learning framework that estimates per-residue accuracy and residue-residue distance signed error in protein models, which are used to guide Rosetta protein structure refinement. Benchmarking suggests an improvement of accuracy prediction and refinement compared to other related state of the art methods.

...read moreread less

130 citations

Journal Article•DOI•

Scaffolding protein functional sites using deep learning

[...]

Jue Wang, Sidney Lisanza, David Juergens, Doug Tischer, Joseph L. Watson, Karla M Castro, Robert J. Ragotte, Amijai Saragovi, Lukas F. Milles, Minkyung Baek, Ivan Anishchenko, Wei Yang, Derrick R. Hicks, Marc Expòsit, Thomas Schlichthaerle, Jung Ho Chun, Justas Dauparas, N. Bennett, Basile I. M. Wicky, Andrew G. Muenks, Frank DiMaio, Bruno E. Correia, Sergey Ovchinnikov, David Baker - Show less +20 more

21 Jul 2022-Science

TL;DR: Wang et al. as mentioned in this paper proposed two deep learning methods to design proteins that contain prespecified functional sites, which can enable the scaffolding of desired functional residues within a well-folded designed protein.

...read moreread less

Abstract: The binding and catalytic functions of proteins are generally mediated by a small number of functional residues held in place by the overall protein structure. Here, we describe deep learning approaches for scaffolding such functional sites without needing to prespecify the fold or secondary structure of the scaffold. The first approach, “constrained hallucination,” optimizes sequences such that their predicted structures contain the desired functional site. The second approach, “inpainting,” starts from the functional site and fills in additional sequence and structure to create a viable protein scaffold in a single forward pass through a specifically trained RoseTTAFold network. We use these two methods to design candidate immunogens, receptor traps, metalloproteins, enzymes, and protein-binding proteins and validate the designs using a combination of in silico and experimental tests. Description Designing around function Protein design has had success in finding sequences that fold into a desired conformation, but designing functional proteins remains challenging. Wang et al. describe two deep-learning methods to design proteins that contain prespecified functional sites. In the first, they found sequences predicted to fold into stable structures that contain the functional site. In the second, they retrained a structure prediction network to recover the sequence and full structure of a protein given only the functional site. The authors demonstrate their methods by designing proteins containing a variety of functional motifs. —VV Deep-learning methods enable the scaffolding of desired functional residues within a well-folded designed protein.

...read moreread less

118 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse

Cited by

PDF

Open Access

More filters

Integrative Genomics Viewer

[...]

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander¹, Eric S. Lander², Gad Getz¹, Jill P. Mesirov¹ - Show less +4 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

01 Jan 2011

TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

...read moreread less

Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

...read moreread less

2,187 citations

Journal Article•DOI•

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.

[...]

Mihaly Varadi¹, Stephen Anyango¹, Mandar Deshpande¹, Sreenath Nair¹, Cindy Natassia¹, Galabina Yordanova¹, David Yu Yuan¹, Oana Stroe¹, Gemma Wood¹, Agata Laydon, Augustin Žídek, Tim Green, Kathryn Tunyasuvunakool, Stig Petersen, John M. Jumper, Ellen Clancy, Richard E. Green, Ankur Vora, Mira Lutfi, Michael Figurnov, Andrew Cowie, Nicole Hobbs, Pushmeet Kohli, Gerard J. Kleywegt¹, Ewan Birney¹, Demis Hassabis, Sameer Velankar¹ - Show less +23 more•Institutions (1)

European Bioinformatics Institute¹

17 Nov 2021-Nucleic Acids Research

TL;DR: The AlphaFold Protein Structure Database (AlphaFold DB, https://alphafold.ebi.ac.uk) is an openly accessible, extensive database of high-accuracy protein-structure predictions.

...read moreread less

Abstract: The AlphaFold Protein Structure Database (AlphaFold DB, https://alphafold.ebi.ac.uk) is an openly accessible, extensive database of high-accuracy protein-structure predictions. Powered by AlphaFold v2.0 of DeepMind, it has enabled an unprecedented expansion of the structural coverage of the known protein-sequence space. AlphaFold DB provides programmatic access to and interactive visualization of predicted atomic coordinates, per-residue and pairwise model-confidence estimates and predicted aligned errors. The initial release of AlphaFold DB contains over 360,000 predicted structures across 21 model-organism proteomes, which will soon be expanded to cover most of the (over 100 million) representative sequences from the UniRef90 data set.

...read moreread less

2,008 citations

Journal Article•DOI•

Accurate prediction of protein structures and interactions using a three-track neural network

[...]

Minkyung Baek¹, Frank DiMaio¹, Ivan Anishchenko¹, Justas Dauparas¹, Sergey Ovchinnikov², Gyu Rie Lee¹, Jue Wang¹, Qian Cong³, Lisa N. Kinch³, R. Dustin Schaeffer³, Claudia Millán⁴, Hahnbeom Park¹, Carson Adams¹, Caleb R. Glassman⁵, Andy DeGiovanni⁶, Jose Henrique Pereira⁶, Andria V. Rodrigues⁶, Alberdina A. van Dijk⁷, Ana C. Ebrecht⁷, Diederik J. Opperman⁸, Theo Sagmeister⁹, Christoph Buhlheller¹⁰, Christoph Buhlheller⁹, Tea Pavkov-Keller⁹, Manoj K. Rathinaswamy¹¹, Udit Dalwadi¹², Calvin K. Yip¹², John E. Burke¹¹, K. Christopher Garcia, Nick V. Grishin³, Paul D. Adams¹³, Paul D. Adams⁶, Randy J. Read⁴, David Baker¹ - Show less +30 more•Institutions (13)

20 Aug 2021-Science

TL;DR: In this article, a three-track network is proposed to combine information at the one-dimensional (1D) sequence level, the 2D distance map level, and the 3D coordinate level.

...read moreread less

1,907 citations

Journal Article•DOI•

The ClusPro web server for protein-protein docking.

[...]

Dima Kozakov¹, David R. Hall, Bing Xia², Kathryn A. Porter², Dzmitry Padhorny¹, Christine Yueh², Dmitri Beglov², Sandor Vajda² - Show less +4 more•Institutions (2)

Stony Brook University¹, Boston University²

01 Feb 2017-Nature Protocols

TL;DR: This protocol describes the use of the various options, the construction of auxiliary restraints files, the selection of the energy parameters, and the analysis of the results of the ClusPro server.

...read moreread less

Abstract: The ClusPro server (https://cluspro.org) is a widely used tool for protein-protein docking. The server provides a simple home page for basic use, requiring only two files in Protein Data Bank (PDB) format. However, ClusPro also offers a number of advanced options to modify the search; these include the removal of unstructured protein regions, application of attraction or repulsion, accounting for pairwise distance restraints, construction of homo-multimers, consideration of small-angle X-ray scattering (SAXS) data, and location of heparin-binding sites. Six different energy functions can be used, depending on the type of protein. Docking with each energy parameter set results in ten models defined by centers of highly populated clusters of low-energy docked structures. This protocol describes the use of the various options, the construction of auxiliary restraints files, the selection of the energy parameters, and the analysis of the results. Although the server is heavily used, runs are generally completed in <4 h.

...read moreread less

1,699 citations

Journal Article•DOI•

ColabFold: making protein folding accessible to all

[...]

Milot Mirdita¹, Tatiana Valdez Bubnova², Oi Wah Liew³•Institutions (3)

Seoul National University¹, Harvard University², University of Göttingen³

30 May 2022-Nature Methods

TL;DR: ColabFold as discussed by the authors combines the fast homology search of MMseqs2 with AlphaFold2 or RoseTTAFold for protein folding and achieves 40-60fold faster search and optimized model utilization.

...read moreread less

Abstract: ColabFold offers accelerated prediction of protein structures and complexes by combining the fast homology search of MMseqs2 with AlphaFold2 or RoseTTAFold. ColabFold's 40-60-fold faster search and optimized model utilization enables prediction of close to 1,000 structures per day on a server with one graphics processing unit. Coupled with Google Colaboratory, ColabFold becomes a free and accessible platform for protein folding. ColabFold is open-source software available at https://github.com/sokrypton/ColabFold and its novel environmental databases are available at https://colabfold.mmseqs.com .

...read moreread less

1,553 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse