Characterizing disease-associated human proteins without available protein structures or homologues

doi:10.1101/2021.11.17.468998

Posted Content•DOI•

Characterizing disease-associated human proteins without available protein structures or homologues

Neeladri Sen, Ivan Anishchenko¹, Nicola Bordin², Ian Sillitoe², Sameer Velankar³, David Baker¹, Christine A. Orengo² - Show less +3 more•Institutions (3)

University of Washington¹, University College London², European Bioinformatics Institute³

19 Nov 2021-bioRxiv (Cold Spring Harbor Laboratory)-

TL;DR: In this article, the authors used deep learning techniques such as RoseTTAFold and AlphaFold to predict the structure of human proteins even in the absence of structural homologues.

read less

Abstract: Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques such as RoseTTAFold and AlphaFold, we can predict the structure of these proteins even in the absence of structural homologues. We modeled and extracted the domains from 553 disease-associated human proteins. We noticed that the model quality was higher and the RMSD lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein-protein interfaces, conserved residues and destabilising effects caused by residue mutations in these predicted structures. We then explored whether the disease-associated mutations were in the proximity of these predicted functional sites or if they destabilized the protein structure based on ddG calculations. We could explain 80% of these disease-associated mutations based on proximity to functional sites or structural destabilization. Usage of models from the two state-of-the-art techniques provide better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models.

...read moreread less

Characterizing disease-associated human proteins without available protein structures or homologues

Citations

References