scispace - formally typeset
Search or ask a question

Showing papers by "Mikael Bodén published in 2013"


Journal ArticleDOI
TL;DR: The comprehensive HLA class II allele coverage along with the high specificity in identifying immunodominant CD4+ T-cell epitopes makes Predivac a valuable tool to aid epitope-based vaccine design in the context of a genetically heterogeneous human population.
Abstract: CD4+ T-cell epitopes play a crucial role in eliciting vigorous protective immune responses during peptide (epitope)-based vaccination The prediction of these epitopes focuses on the peptide binding process by MHC class II proteins The ability to account for MHC class II polymorphism is critical for epitope-based vaccine design tools, as different allelic variants can have different peptide repertoires In addition, the specificity of CD4+ T-cells is often directed to a very limited set of immunodominant peptides in pathogen proteins The ability to predict what epitopes are most likely to dominate an immune response remains a challenge We developed the computational tool Predivac to predict CD4+ T-cell epitopes Predivac can make predictions for 95% of all MHC class II protein variants (allotypes), a substantial advance over other available methods Predivac bases its prediction on the concept of specificity-determining residues The performance of the method was assessed both for high-affinity HLA class II peptide binding and CD4+ T-cell epitope prediction In terms of epitope prediction, Predivac outperformed three available pan-specific approaches (delivering the highest specificity) A central finding was the high accuracy delivered by the method in the identification of immunodominant and promiscuous CD4+ T-cell epitopes, which play an essential role in epitope-based vaccine design The comprehensive HLA class II allele coverage along with the high specificity in identifying immunodominant CD4+ T-cell epitopes makes Predivac a valuable tool to aid epitope-based vaccine design in the context of a genetically heterogeneous human populationThe tool is available at: http://predivacbiosciuqeduau/

59 citations


Journal ArticleDOI
TL;DR: The results reveal the molecular basis of a number of features of the classical nuclear transport pathway specific to plants, including two NLSs, previously described as plant specific, that bind to and are functional with plant, mammalian, and yeast importin-α proteins but interact with rice import in-α more strongly.
Abstract: In the classical nucleocytoplasmic import pathway, nuclear localization signals (NLSs) in cargo proteins are recognized by the import receptor importin-α Importin-α has two separate NLS binding sites (the major and the minor site), both of which recognize positively charged amino acid clusters in NLSs Little is known about the molecular basis of the unique features of the classical nuclear import pathway in plants We determined the crystal structure of rice (Oryza sativa) importin-α1a at 2-A resolution The structure reveals that the autoinhibitory mechanism mediated by the importin-β binding domain of importin-α operates in plants, with NLS-mimicking sequences binding to both minor and major NLS binding sites Consistent with yeast and mammalian proteins, rice importin-α binds the prototypical NLS from simian virus 40 large T-antigen preferentially at the major NLS binding site We show that two NLSs, previously described as plant specific, bind to and are functional with plant, mammalian, and yeast importin-α proteins but interact with rice importin-α more strongly The crystal structures of their complexes with rice importin-α show that they bind to the minor NLS binding site By contrast, the crystal structures of their complexes with mouse (Mus musculus) importin-α show preferential binding to the major NLS binding site Our results reveal the molecular basis of a number of features of the classical nuclear transport pathway specific to plants

58 citations


Journal ArticleDOI
01 Nov 2013-Traffic
TL;DR: crystallographic and biochemical methods are used to investigate the binding between importin‐α and predicted ‘minor site‐specific’ NLSs: four peptide library‐derived peptides, and the NLS from mouse RNA helicase II/Guα, to suggest the sequence RXXKR[K/X][F/Y/W]XXAF as the optimal minor NLS‐binding site‐ specific motif, which may help identify novel proteins with atypical NLS
Abstract: Nuclear localization signals (NLSs) contain one or two clusters of basic residues and are recognized by the import receptor importin-α. There are two NLS-binding sites (major and minor) on importin-α and the major NLS-binding site is considered to be the primary binding site. Here, we used crystallographic and biochemical methods to investigate the binding between importin-α and predicted ‘minor site-specific’ NLSs: four peptide library-derived peptides, and the NLS from mouse RNA helicase II/Guα. The crystal structures reveal that these atypical NLSs indeed preferentially bind to the minor NLS-binding site. Unlike previously characterized NLSs, the C-terminal residues of these NLSs form an α-helical turn, stabilized by internal H-bond and cation-π interactions between the aromatic residues from the NLSs and the positively charged residues from importin-α. This helical turn sterically hinders binding at the major NLS-binding site, explaining the minor-site preference. Our data suggest the sequence RXXKR[K/X][F/Y/W]XXAF as the optimal minor NLS-binding site-specific motif, which may help identify novel proteins with atypical NLSs.

44 citations


Journal ArticleDOI
TL;DR: DLocalMotif combines three scoring functions, measuring degrees of motif over-representation, entropy and spatial confinement, specifically designed to discriminatively exploit the availability of negative data, and is shown to outperform current methods that use only a subset of these motif characteristics.
Abstract: Motivation: Local motifs are patterns of DNA or protein sequences that occur within a sequence interval relative to a biologically defined anchor or landmark. Current protein motif discovery methods do not adequately consider such constraints to identify biologically significant motifs that are only weakly over-represented but spatially confined. Using negatives, i.e. sequences known to not contain a local motif, can further increase the specificity of their discovery. Results: This article introduces the method DLocalMotif that makes use of positional information and negative data for local motif discovery in protein sequences. DLocalMotif combines three scoring functions, measuring degrees of motif over-representation, entropy and spatial confinement, specifically designed to discriminatively exploit the availability of negative data. The method is shown to outperform current methods that use only a subset of these motif characteristics. We apply the method to several biological datasets. The analysis of peroxisomal targeting signals uncovers several novel motifs that occur immediately upstream of the dominant peroxisomal targeting signal-1 signal. The analysis of proline-tyrosine nuclear localization signals uncovers multiple novel motifs that overlap with C2H2 zinc finger domains. We also evaluate the method on classical nuclear localization signals and endoplasmic reticulum retention signals and find that DLocalMotif successfully recovers biologically relevant sequence properties. Availability: http://bioinf.scmb.uq.edu.au/dlocalmotif/

17 citations


Journal ArticleDOI
TL;DR: This study determined the structure of importin-α1a from rice, and made 2 observations that suggest an increased utilization of the minor NLS-binding site in this protein, and showed that NLSs of the “plant-specific” class preferentially bind to the minors of the rice protein.
Abstract: The specific recognition between the import receptor importin-α and the nuclear localization signals (NLSs) is crucial to ensure the selective transport of cargoes into the nucleus. NLSs contain 1 or 2 clusters of positively charged amino acids, which usually bind to the major (monopartite NLSs) or both minor and major NLS-binding sites (bipartite NLSs). In our recent study, we determined the structure of importin-α1a from rice (Oryza sativa), and made 2 observations that suggest an increased utilization of the minor NLS-binding site in this protein. First, unlike the mammalian protein, both the major and minor NLS-binding sites are auto-inhibited in the unliganded rice protein. Second, we showed that NLSs of the “plant-specific” class preferentially bind to the minor NLS-binding site of rice importin-α. Here, we show that a distinct group of “minor site-specific” NLSs also bind to the minor site of the rice protein. We further show a greater enrichment of proteins containing these “plant-specific” and “m...

15 citations


Journal ArticleDOI
TL;DR: These findings establish similarities in tri-nucleotide repeats across species at the level of protein functionality and protein sequence and suggest that unknown repeat diseases are yet to be uncovered in other species.
Abstract: Among repetitive genomic sequence, the class of tri-nucleotide repeats has received much attention due to their association with human diseases. Tri-nucleotide repeat diseases are caused by excessive sequence length variability; diseases such as Huntington’s disease and Fragile X syndrome are tied to an increase in the number of repeat units in a tract. Motivated by the recent discovery of a tri-nucleotide repeat associated genetic defect in Arabidopsis thaliana, this study takes a cross-species approach to investigating these repeat tracts, with the goal of using commonalities between species to identify potential disease-related properties. We find that statistical enrichment in regulatory function associations for coding region repeats – previously observed in human – is consistent across multiple organisms. By distinguishing between homo-amino acid tracts that are encoded by tri-nucleotide repeats, and those encoded by varying codons, we show that amino acid repeats – not tri-nucleotide repeats – fully explain these regulatory associations. Using this same separation between repeat- and non-repeat-encoded homo-amino acid tracts, we show that poly-glutamine tracts are disproportionately encoded by tri-nucleotide repeats, and those tracts that are encoded by tri-nucleotide repeats are also significantly longer; these results are consistent across multiple species. These findings establish similarities in tri-nucleotide repeats across species at the level of protein functionality and protein sequence. The tendency of tri-nucleotide repeats to encode longer poly-glutamine tracts indicates a link with the poly-glutamine repeat diseases. The cross-species nature of this tendency suggests that unknown repeat diseases are yet to be uncovered in other species. Future discoveries of new non-human repeat associated defects may provide the breadth of information needed to unravel the mechanisms that underpin this class of human disease.

8 citations