scispace - formally typeset
Search or ask a question

Showing papers by "Chantal Abergel published in 2004"


Journal ArticleDOI
19 Nov 2004-Science
TL;DR: The size and complexity of the Mimivirus genome challenge the established frontier between viruses and parasitic cellular organisms and this new sequence data might help shed a new light on the origin of DNA viruses and their role in the early evolution of eukaryotes.
Abstract: We recently reported the discovery and preliminary characterization of Mimivirus, the largest known virus, with a 400-nanometer particle size comparable to mycoplasma. Mimivirus is a double-stranded DNA virus growing in amoebae. We now present its 1,181,404–base pair genome sequence, consisting of 1262 putative open reading frames, 10% of which exhibit a similarity to proteins of known functions. In addition to exceptional genome size, Mimivirus exhibits many features that distinguish it from other nucleocytoplasmic large DNA viruses. The most unexpected is the presence of numerous genes encoding central protein-translation components, including four amino-acyl transfer RNA synthetases, peptide release factor 1, translation elongation factor EF-TU, and translation initiation factor 1. The genome also exhibits six tRNAs. Other notable features include the presence of both type I and type II topoisomerases, components of all DNA repair pathways, many polysaccharide synthesis enzymes, and one intein-containing gene. The size and complexity of the Mimivirus genome challenge the established frontier between viruses and parasitic cellular organisms. This new sequence data might help shed a new light on the origin of DNA viruses and their role in the early evolution of eukaryotes.

927 citations


Journal ArticleDOI
TL;DR: 3DCoffee, a novel method for combining protein sequences and structures in order to generate high-quality multiple sequence alignments, is developed and it is found that combining TCoffee with the threading program Fugue makes it possible to improve the accuracy of the HOMSTRAD dataset.

344 citations


Journal ArticleDOI
TL;DR: 3DCoffee@igs is a web-based tool dedicated to the computation of high-quality multiple sequence alignments (MSAs) and makes it possible to mix protein sequences and structures in order to increase the accuracy of the alignments.
Abstract: This paper presents 3DCoffee@igs, a web-based tool dedicated to the computation of high-quality multiple sequence alignments (MSAs). 3D-Coffee makes it possible to mix protein sequences and structures in order to increase the accuracy of the alignments. Structures can be either provided as PDB identifiers or directly uploaded into the server. Given a set of sequences and structures, pairs of structures are aligned with SAP while sequence-structure pairs are aligned with Fugue. The resulting collection of pairwise alignments is then combined into an MSA with the T-Coffee algorithm. The server and its documentation are available from http://igs-server.cnrs-mrs.fr/Tcoffee/.

168 citations


Journal ArticleDOI
TL;DR: Pre-refined homology models produced by CaspR significantly reduce the time-consuming refinement process in situations where standard MR yields a solution, and are expected to have a significant impact on the throughput of large-scale structural genomics projects.
Abstract: Molecular replacement (MR) is the method of choice for X-ray crystallography structure determination when structural homologues are available in the Protein Data Bank (PDB). Although the success rate of MR decreases sharply when the sequence similarity between template and target proteins drops below 35% identical residues, it has been found that screening for MR solutions with a large number of different homology models may still produce a suitable solution where the original template failed. Here we present the web tool CaspR, implementing such a strategy in an automated manner. On input of experimental diffraction data, of the corresponding target sequence and of one or several potential templates, CaspR executes an optimized molecular replacement procedure using a combination of well-established stand-alone software tools. The protocol of model building and screening begins with the generation of multiple structure-sequence alignments produced with T-COFFEE, followed by homology model building using MODELLER, molecular replacement with AMoRe and model refinement based on CNS. As a result, CaspR provides a progress report in the form of hierarchically organized summary sheets that describe the different stages of the computation with an increasing level of detail. For the 10 highest-scoring potential solutions, pre-refined structures are made available for download in PDB format. Results already obtained with CaspR and reported on the web server suggest that such a strategy significantly increases the fraction of protein structures which may be solved by MR. Moreover, even in situations where standard MR yields a solution, pre-refined homology models produced by CaspR significantly reduce the time-consuming refinement process. We expect this automated procedure to have a significant impact on the throughput of large-scale structural genomics projects. CaspR is freely available at http://igs-server.cnrs-mrs.fr/Caspr/.

85 citations


Journal ArticleDOI
TL;DR: In three independent cases (i.e. proteins and crystallization conditions) are reported of spectacular diffraction-pattern improvement through a simple crystal-handling procedure that was discovered serendipitously.
Abstract: Succeeding in getting a protein to crystallize is not always the final hurdle in the determination of its three-dimensional structure. A relatively frequent and particularly vexing situation is the production of macroscopically well formed crystals that exhibit no suitable diffraction pattern. In this paper, three independent cases (i.e. proteins and crystallization conditions) are reported of spectacular diffraction-pattern improvement through a simple crystal-handling procedure that was discovered serendipitously. The procedure basically consists of removing a non-diffracting frozen crystal from the X-ray beam, plunging it into a soaking solution made of the original crystallization solution supplemented with a traditional cryoprotectant and then letting it dry in the evaporating sitting drop for some time (15 min to several hours). The treated crystals are then remounted and exhibit a huge improvement in their diffraction intensity and resolution. In all three cases presented here, the crystal quality shifted from unusable to perfectly suitable for structure determination. In addition to being a 'last resort' procedure for experimentalists struggling with non-diffracting crystals, this puzzling effect constitutes one more challenging problem for theoretical protein crystallographers.

35 citations


Journal ArticleDOI
TL;DR: A major extension of Phydbac, that extends both the concept and the functionality of the original web-service, and integrates chromosomal proximity and gene fusion analyses as two additional non-similarity-based indicators for inferring pairwise gene functional relationships.
Abstract: Phydbac (phylogenomic display of bacterial genes) implemented a method of phylogenomic profiling using a distance measure based on normalized BLAST scores. This method was able to increase the predictive power of phylogenomic profiling by about 25% when compared to the classical approach based on Hamming distances. Here we present a major extension of Phydbac (named here Phydbac2), that extends both the concept and the functionality of the original web-service. While phylogenomic profiles remain the central focus of Phydbac2, it now integrates chromosomal proximity and gene fusion analyses as two additional non-similarity-based indicators for inferring pairwise gene functional relationships. Moreover, all presently available (January 2004) fully sequenced bacterial genomes and those of three lower eukaryotes are now included in the profiling process, thus increasing the initial number of reference genomes (71 in Phydbac) to 150 in Phydbac2. Using the KEGG metabolic pathway database as a benchmark, we show that the predictive power of Phydbac2 is improved by 27% over the previous version. This gain is accounted for on one hand, by the increased number of reference genomes (11%) and on the other hand, as a result of including chromosomal proximity into the distance measure (16%). The expanded functionality of Phydbac2 now allows the user to query more than 50 different genomes, including at least one member of each major bacterial group, most major pathogens and potential bio-terrorism agents. The search for co-evolving genes based on consensus profiles from multiple organisms, the display of Phydbac2 profiles side by side with COG information, the inclusion of KEGG metabolic pathway maps the production of chromosomal proximity maps, and the possibility of collecting and processing results from different Phydbac queries in a common shopping cart are the main new features of Phydbac2. The Phydbac2 web server is available at http://igs-server.cnrs-mrs.fr/phydbac/.

29 citations