C
Christian Dallago
Researcher at Technische Universität München
Publications - 44
Citations - 2090
Christian Dallago is an academic researcher from Technische Universität München. The author has contributed to research in topics: Computer science & Biology. The author has an hindex of 11, co-authored 35 publications receiving 616 citations. Previous affiliations of Christian Dallago include Harvard University.
Papers
More filters
Posted ContentDOI
ProtTrans: Towards Cracking the Language of Life’s Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar,Michael Heinzinger,Christian Dallago,Ghalia Rihawi,Yu Wang,Llion Jones,Tom Gibbs,Tamas Feher,Christoph Angerer,Debsindhu Bhowmik,Burkhard Rost +10 more
TL;DR: In this paper, the authors trained two auto-regressive language models (Transformer-XL and XLNet) on 80 billion amino acids from 200 million protein sequences (UniRef100) and one auto-encoder model on 393 billion amino acid from 2.1 billion protein sequences taken from the Big Fat Database (BFD).
Journal ArticleDOI
ProtTrans: Towards Cracking the Language of Lifes Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar,Michael Heinzinger,Christian Dallago,Ghalia Rehawi,Wang Yu,Llion Jones,Tom Gibbs,Tamas Feher,Christoph Angerer,Martin Steinegger,Debsindhu Bhowmik,Burkhard Rost +11 more
TL;DR: In this paper, the authors trained two auto-regressive models (Transformer-XL, XLNet) and four auto-encoder models (BERT, Albert, Electra, T5) on data from UniRef and BFD containing up to 393 billion amino acids.
Journal ArticleDOI
Modeling aspects of the language of life through transfer-learning protein sequences
Michael Heinzinger,Ahmed Elnaggar,Yu Wang,Christian Dallago,Dmitrii Nechaev,Florian Matthes,Burkhard Rost +6 more
TL;DR: Transfer-learning succeeded to extract information from unlabeled sequence databases relevant for various protein prediction tasks and modeled the language of life, namely the principles underlying protein sequences better than any features suggested by textbooks and prediction methods.
Journal ArticleDOI
Pathway Commons 2019 Update: integration, analysis and exploration of pathway data.
Igor Rodchenkov,Özgün Babur,Augustin Luna,Bulent Arman Aksoy,Jeffrey V. Wong,Dylan Fong,Max Franz,Metin Can Siper,Manfred Cheung,Michael Wrana,Harsh Mistry,Logan Mosier,Jonah Dlin,Qizhi Wen,Caitlin O'Callaghan,Wanxin Li,Geoffrey Elder,Peter T. Smith,Christian Dallago,Christian Dallago,Ethan Cerami,Benjamin Gross,Ugur Dogrusoz,Emek Demir,Gary D. Bader,Chris Sander +25 more
TL;DR: Pathway Commons as mentioned in this paper is an integrated resource of publicly available information about biological pathways including biochemical reactions, assembly of biomolecular complexes, transport and catalysis events and physical interactions involving proteins, DNA, RNA, and small molecules (e.g. metabolites and drug compounds).
Journal ArticleDOI
The EVcouplings Python framework for coevolutionary sequence analysis
Thomas A. Hopf,Anna G. Green,Benjamin Schubert,Sophia Mersmann,Charlotta P I Schärfe,Charlotta P I Schärfe,John Ingraham,Agnes Toth-Petroczy,Kelly P Brock,Adam J. Riesselman,Perry Palmedo,Perry Palmedo,Chan Kang,Robert L. Sheridan,Eli J. Draizen,Christian Dallago,Christian Dallago,Chris Sander,Debora S. Marks +18 more
TL;DR: The EVcouplings framework is presented, a fully integrated open-source application and Python package for coevolutionary analysis that enables generation of sequence alignments, calculation and evaluation of evolutionary couplings, and de novo prediction of structure and mutation effects.