Author

Chris Sander

Bio: Chris Sander is an academic researcher from Harvard University. The author has contributed to research in topics: Large Hadron Collider & Protein structure. The author has an h-index of 178 and has co-authored 713 publications receiving 233,287 citations. Previous affiliations of Chris Sander include Purdue University & University of Leeds.


Papers
Journal ArticleDOI
TL;DR: The inferred relationships were consistent with sequence-based predictions and published data from miRNA perturbation experiments, and miRNAs with recurrent target relationships were frequently regulated by genetic and epigenetic alterations across the studied cancer types.
Abstract: Little is known about the extent to which individual microRNAs (miRNAs) regulate common processes of tumor biology across diverse cancer types. Using molecular profiles of >3,000 tumors from 11 human cancer types in The Cancer Genome Atlas, we systematically analyzed expression of miRNAs and mRNAs across cancer types to infer recurrent cancer-associated miRNA-target relationships. As we expected, the inferred relationships were consistent with sequence-based predictions and published data from miRNA perturbation experiments. Notably, miRNAs with recurrent target relationships were frequently regulated by genetic and epigenetic alterations across the studied cancer types. We also identify new examples of miRNAs that coordinately regulate cancer pathways, including the miR-29 family, which recurrently regulates active DNA demethylation pathway members TET1 and TDG. The online resource http://cancerminer.org allows exploration and prioritization of miRNA-target interactions that potentially regulate tumorigenesis.

195 citations
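
As a rough illustration of the expression-based inference described in the abstract above, the sketch below counts, for each miRNA-mRNA pair, the number of cancer types in which the two expression profiles are strongly anticorrelated. It is a minimal sketch under stated assumptions, not the authors' method: plain Pearson correlation, the threshold of -0.5, the function names, and the toy identifiers miR-demo, GENE1 and GENE2 are all hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def recurrent_pairs(profiles, threshold=-0.5, min_types=2):
    """Count cancer types in which each miRNA-mRNA pair is anticorrelated.

    profiles maps a cancer type to (mirna_expr, mrna_expr); each of those maps
    a name to a list of expression values, one per tumor (hypothetical layout).
    Only pairs seen in at least min_types cancer types are returned.
    """
    counts = {}
    for mirna_expr, mrna_expr in profiles.values():
        for mirna, m_values in mirna_expr.items():
            for gene, g_values in mrna_expr.items():
                if pearson(m_values, g_values) < threshold:
                    counts[(mirna, gene)] = counts.get((mirna, gene), 0) + 1
    return {pair: n for pair, n in counts.items() if n >= min_types}


# Toy data, invented purely for illustration.
toy_profiles = {
    "type_A": ({"miR-demo": [1, 2, 3, 4]},
               {"GENE1": [4, 3, 2, 1], "GENE2": [1, 2, 3, 4]}),
    "type_B": ({"miR-demo": [2, 4, 6, 8]},
               {"GENE1": [9, 7, 5, 1], "GENE2": [1, 3, 2, 4]}),
}
result = recurrent_pairs(toy_profiles)
print(result)
```

In the toy data only the miR-demo/GENE1 pair is anticorrelated in both cancer types, so it is the only pair reported as recurrent.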

Journal ArticleDOI
TL;DR: Atomic solvation preference is recommended for use as a diagnostic tool in model building based on sequence similarity, in folding simulations and in protein design and is computationally fast compared to methods based on surface area calculations.

193 citations

Journal ArticleDOI
TL;DR: The overall disease spectrum mainly reflects the mutability of the genetic code, and the analysis corroborates earlier results that the probability of a nonsynonymous mutation causing a genetic disease increases monotonically with an increase in the degree of evolutionary conservation of the mutation site and a decrease in the solvent-accessibility of the site.
Abstract: Background: Nonsynonymous mutations in the coding regions of human genes are responsible for phenotypic differences between humans and for susceptibility to genetic disease. Computational methods were recently used to predict deleterious effects of nonsynonymous human mutations and polymorphisms. Here we focus on understanding the amino-acid mutation spectrum of human genetic disease. We compare the disease spectrum to the spectra of mutual amino-acid mutation frequencies, non-disease polymorphisms in human genes, and substitutions fixed between species.

190 citations

Journal ArticleDOI
01 Nov 1979-Nature
TL;DR: In this paper, a novel definition of β-sheet structure in terms of the tertiary structure juxtaposition of strands was proposed, and secondary structure preferences for the 20 amino acids were reported separately for antiparallel and parallel arrangements of strands.
Abstract: A β-strand is a particular type of extended sequence of amino acid residues, an element of secondary structure of proteins. β-sheets are an assembly of strands, often bringing together parts of the protein which are separated along the backbone. As such, β-sheets are an element of tertiary structure. Parallel (βP) and antiparallel (βA) arrangements of strands in a sheet differ in the hydrogen bond pattern between strands, as shown schematically in Fig. 1, and in the type of chain connectivity they allow: short reverse turn connections for βA and longer crossover connections for βP (refs 1–3). Most present secondary structure prediction methods (for reviews see refs 4–6) use a four-state distinction of secondary structure: α-helix, β-strand or extended, reverse turn, and ‘random coil’ (everything else). With a data base of 30–40 different protein structures, the conformational preferences for all amino acid residues in these four states seem to have converged (ref. 7). However, the steadily increasing data base of structurally known proteins makes a refinement of the four-state description feasible. Although more refined classifications of conformational states based on finer subdivisions of (φ,Ψ)-space have been made (refs 8, 9), we prefer making distinctions based on structural environment. Using a novel definition of β-sheet structure in terms of the tertiary structure juxtaposition of strands, we have analysed residue contacts in known β-sheets and report here secondary structure preferences for the 20 amino acids, separately for antiparallel and parallel arrangements of strands. The distinction between the two arrangements results in strikingly different and sharpened sets of preference parameters, including some of the largest values reported so far for any substructure. These results point the way towards a basic improvement of secondary structure predictions by further distinction of secondary structure elements according to tertiary structure environment. Beyond secondary structure prediction, the different preferences for βA and βP may aid in predicting the tertiary interaction between strands.

189 citations
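
The abstract above does not spell out how its preference parameters are computed. A common convention for such conformational preferences, assumed here, is the ratio between how often a residue type occurs in a state and how often any residue occurs in that state. The sketch below applies that ratio to invented toy observations; the state labels bA, bP and other, and the function name, are hypothetical.

```python
from collections import Counter


def preferences(observations):
    """Return {(residue, state): preference} from (residue, state) pairs.

    preference = P(state | residue) / P(state); values above 1 mean the
    residue is found in that state more often than an average residue.
    """
    total = len(observations)
    by_residue = Counter(residue for residue, _ in observations)
    by_state = Counter(state for _, state in observations)
    by_pair = Counter(observations)
    return {
        (residue, state): (by_pair[(residue, state)] / by_residue[residue])
        / (by_state[state] / total)
        for residue in by_residue
        for state in by_state
    }


# Toy observations, invented for illustration (not data from the paper).
toy = (
    [("V", "bA")] * 2 + [("V", "bP")] + [("V", "other")]
    + [("G", "bA")] + [("G", "other")] * 3
)
prefs = preferences(toy)
for key in sorted(prefs):
    print(key, round(prefs[key], 3))
```

Keeping bA and bP as separate states, rather than pooling them into one β state, is what lets the two sets of preferences differ.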

Journal ArticleDOI
TL;DR: Analysis of outlier cases can facilitate identification of potential biomarkers for targeted agents, and two genes are implicated as candidates for further study in this class of drugs.
Abstract: Purpose: Rapalogs are allosteric mTOR inhibitors and approved agents for advanced kidney cancer. Reports of clonal heterogeneity in this disease challenge the concept of targeted monotherapy, yet a small subset of patients derives extended benefit. Our aim was to analyze such outliers and explore the genomic background of extreme rapalog sensitivity in the context of intratumor heterogeneity. Experimental Design: We analyzed archived tumor tissue of 5 patients with renal cell carcinoma, who previously achieved durable disease control with rapalogs (median duration, 28 months). DNA was extracted from spatially separate areas of primary tumors and metastases. Custom target capture and ultradeep sequencing was used to identify alterations across 230 target genes. Whole-exome sequence analysis was added to investigate genes beyond this original target list. Results: Five long-term responders contributed 14 specimens to explore clonal heterogeneity. Genomic alterations with activating effect on mTOR signaling were detected in 11 of 14 specimens, offering plausible explanation for exceptional treatment response through alterations in two genes (TSC1 and MTOR). In two subjects, distinct yet functionally convergent alterations activated the mTOR pathway in spatially separate sites. In 1 patient, concurrent genomic events occurred in two separate pathway components across different tumor regions. Conclusions: Analysis of outlier cases can facilitate identification of potential biomarkers for targeted agents, and we implicate two genes as candidates for further study in this class of drugs. The previously reported phenomenon of clonal convergence can occur within a targetable pathway which might have implications for biomarker development beyond this disease and this class of agents. Clin Cancer Res; 20(7); 1955–64. ©2014 AACR.

187 citations


Cited by
Journal ArticleDOI
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Abstract: The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.

70,111 citations
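
To make the idea of a position-specific score matrix in the abstract above concrete, the sketch below builds log-odds scores per alignment column and uses them to score a segment. It is a deliberately simplified illustration, not the PSI-BLAST procedure: the uniform background frequencies, the fixed pseudocount, the absence of sequence weighting, and the function names are all assumptions.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(alignment, pseudocount=1.0):
    """Build one {amino_acid: log2-odds score} dict per alignment column.

    alignment is a list of equal-length strings. Gap characters are not
    counted as residues, but every row still counts toward the column total.
    """
    background = 1.0 / len(AMINO_ACIDS)  # assumed uniform background
    n_seqs = len(alignment)
    pssm = []
    for column in zip(*alignment):
        scores = {}
        for aa in AMINO_ACIDS:
            freq = (column.count(aa) + pseudocount * background) / (n_seqs + pseudocount)
            scores[aa] = math.log2(freq / background)
        pssm.append(scores)
    return pssm


def score_segment(pssm, segment):
    """Sum the column scores for an ungapped segment as long as the matrix."""
    return sum(column[aa] for column, aa in zip(pssm, segment))


# Toy alignment, invented for illustration.
pssm = build_pssm(["ACD", "ACE", "ACD"])
print(round(score_segment(pssm, "ACD"), 2), round(score_segment(pssm, "WWW"), 2))
```

Residues that dominate a column score positively and unseen residues score negatively, so a segment matching the alignment outscores an unrelated one.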

Journal ArticleDOI
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Abstract: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved for the alignment of divergent protein sequences. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening up of new gaps at these positions. These modifications are incorporated into a new program, CLUSTAL W, which is freely available.

63,427 citations
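
The first modification in the abstract above, down-weighting near-duplicate sequences, can be illustrated with a tree-based scheme in which each branch length is shared equally among the sequences below it. The abstract does not give the formula, so this is a sketch under that assumption; the nested-tuple tree layout, the function names and the toy branch lengths are hypothetical, and any normalization step is omitted.

```python
def leaf_names(node):
    """Return the names of all leaves below a (label, branch_length) node."""
    label, _ = node
    if isinstance(label, str):
        return [label]
    return [name for child in label for name in leaf_names(child)]


def tree_weights(node, inherited=0.0, weights=None):
    """Weight each leaf by the branch lengths on its path from the root.

    Every branch length is divided by the number of leaves sharing that
    branch, so sequences in a tight cluster split their common history.
    A leaf is (name, length); an internal node is ([children], length).
    """
    if weights is None:
        weights = {}
    label, length = node
    share = inherited + length / len(leaf_names(node))
    if isinstance(label, str):
        weights[label] = share
    else:
        for child in label:
            tree_weights(child, share, weights)
    return weights


# Toy guide tree: seqA and seqB are near-duplicates, seqC is divergent.
toy_tree = ([([("seqA", 0.1), ("seqB", 0.1)], 0.4), ("seqC", 0.6)], 0.0)
weights = tree_weights(toy_tree)
print({name: round(w, 3) for name, w in weights.items()})
```

In the toy tree the two near-duplicates each receive a weight of 0.3, while the divergent sequence receives 0.6.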

Journal ArticleDOI
TL;DR: CLUSTAL X is a new windows interface for the widely-used progressive multiple sequence alignment program CLUSTAL W, providing an integrated system for performing multiple sequence and profile alignments and analysing the results.
Abstract: CLUSTAL X is a new windows interface for the widely-used progressive multiple sequence alignment program CLUSTAL W. The new system is easy to use, providing an integrated system for performing multiple sequence and profile alignments and analysing the results. CLUSTAL X displays the sequence alignment in a window on the screen. A versatile sequence colouring scheme allows the user to highlight conserved features in the alignment. Pull-down menus provide all the options required for traditional multiple sequence and profile alignment. New features include: the ability to cut-and-paste sequences to change the order of the alignment, selection of a subset of the sequences to be realigned, and selection of a sub-range of the alignment to be realigned and inserted back into the original alignment. Alignment quality analysis can be performed and low-scoring segments or exceptional residues can be highlighted. Quality analysis and realignment of selected residue ranges provide the user with a powerful tool to improve and refine difficult alignments and to trap errors in input sequences. CLUSTAL X has been compiled on SUN Solaris, IRIX5.3 on Silicon Graphics, Digital UNIX on DECstations, Microsoft Windows (32 bit) for PCs, Linux ELF for x86 PCs, and Macintosh PowerMac.

38,522 citations

Journal ArticleDOI
TL;DR: MUSCLE is a new computer program for creating multiple alignments of protein sequences that includes fast distance estimation using kmer counting, progressive alignment using a new profile function the authors call the log-expectation score, and refinement using tree-dependent restricted partitioning.
Abstract: We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the log-expectation score, and refinement using tree-dependent restricted partitioning. The speed and accuracy of MUSCLE are compared with T-Coffee, MAFFT and CLUSTALW on four test sets of reference alignments: BAliBASE, SABmark, SMART and a new benchmark, PREFAB. MUSCLE achieves the highest, or joint highest, rank in accuracy on each of these sets. Without refinement, MUSCLE achieves average accuracy statistically indistinguishable from T-Coffee and MAFFT, and is the fastest of the tested methods for large numbers of sequences, aligning 5000 sequences of average length 350 in 7 min on a current desktop computer. The MUSCLE program, source code and PREFAB test data are freely available at http://www.drive5.com/muscle.

37,524 citations
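
The kmer-counting distance estimate mentioned in the abstract above can be sketched without aligning the sequences at all: count the kmers the two sequences share and turn that fraction into a distance. The version below is a simplified illustration rather than the MUSCLE implementation; the choice of k = 3, the uncompressed alphabet, the exact normalization and the function names are assumptions.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping substring of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_distance(a, b, k=3):
    """Return 1 minus the fraction of kmers shared by sequences a and b.

    The shared count is normalized by the number of kmers in the shorter
    sequence, so identical sequences give 0.0 and unrelated ones give 1.0.
    """
    shortest = min(len(a), len(b))
    if shortest < k:
        raise ValueError("both sequences must contain at least k residues")
    counts_a = kmer_counts(a, k)
    counts_b = kmer_counts(b, k)
    shared = sum(min(n, counts_b[kmer]) for kmer, n in counts_a.items())
    return 1.0 - shared / (shortest - k + 1)


# Toy sequences, invented for illustration.
print(kmer_distance("ACDEFG", "ACDEFG"))
print(kmer_distance("ACDEFG", "ACDEYG"))
print(kmer_distance("ACDEFG", "WWWWWW"))
```

Because each sequence is scanned once and no alignment is computed, this kind of estimate is cheap enough to apply to every pair before a guide tree is built.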

Journal ArticleDOI
TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Abstract: Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.

35,225 citations