scispace - formally typeset
Search or ask a question
Author

Chris Sander

Bio: Chris Sander is an academic researcher from Harvard University. The author has contributed to research in topics: Large Hadron Collider & Protein structure. The author has an hindex of 178, co-authored 713 publications receiving 233287 citations. Previous affiliations of Chris Sander include Purdue University & University of Leeds.


Papers
More filters
Journal ArticleDOI
TL;DR: The current status and future outlook of macromolecular structure databases and information handling, with particular reference to European databases, are reviewed, issues concerning the efficiency with which data are represented, validated, archived and accessed as discussed by the authors.

16 citations

Journal ArticleDOI
TL;DR: This work has engineered the Kinase 1 and 2 motifs into a protein that has the CMBF and no nucleotide binding activity, the chemotactic protein from Escherichia coli, CheY, which demonstrates that the native structure of the P-loop requires external interactions with the rest of the protein.

16 citations

Proceedings ArticleDOI
TL;DR: The cBioPortal for cancer genomics software has the potential to facilitate the use of genomic data in clinical decision making, and as the sequencing of tumor samples has entered clinical practice, the features are expanding so that it can be used for precision medicine at cancer centers.
Abstract: The cBioPortal for Cancer Genomics provides intuitive visualization and analysis of complex cancer genomics data. The public site (http://cbioportal.org/) is accessed by more than 1,500 researchers per day, and there are now dozens of local instances of the software that host private data sets at cancer centers around the globe. We have recently released the software under an open source license, making it free to use and modify by anybody. The software and detailed documentation are available at https://github.com/cBioPortal/cbioportal. We are now establishing a multi-institutional software development network, which will coordinate and drive the future development of the software and associated data pipelines. This group will focus on four main areas: 1. New analysis and visualization features, including: a. Improved support for cross-cancer queries and cohort comparisons. b. Enhanced clinical decision support for precision oncology, including an improved patient view with knowledge base integration, patient timelines and improved tools for visualizing tumor evolution. 2. New data pipelines, including support for new genomic data types and streamlined pipelines for TCGA and the International Cancer Genome Consortium (ICGC). 3. Software architecture and performance improvements. 4. Community engagement: Documentation, user support, and training. This coordinated effort will help to further establish the cBioPortal as the software of choice in cancer genomics research, both in academia and the pharmaceutical industry. Furthermore, as the sequencing of tumor samples has entered clinical practice, we are expanding the features of the software so that it can be used for precision medicine at cancer centers. In particular, clean, web-accessible, interactive clinical reports integrating multiple sources of genome variation and clinical annotation over time has potential to improve clinical action beyond current text-based molecular reports. By making complex genomic data easily interpretable and linking it to information about drugs and clinical trials, the cBioPortal software has the potential to facilitate the use of genomic data in clinical decision making. Citation Format: Jianjiong Gao, James Lindsay, Stuart Watt, Istemi Bahceci, Pieter Lukasse, Adam Abeshouse, Hsiao-Wei Chen, Ino de Bruijn, Benjamin Gross, Dong Li, Ritika Kundra, Zachary Heins, Jorge Reis-Filho, Onur Sumer, Yichao Sun, Jiaojiao Wang, Qingguo Wang, Hongxin Zhang, Priti Kumari, M. Furkan Sahin, Sander de Ridder, Fedde Schaeffer, Kees van Bochove, Ugur Dogrusoz, Trevor Pugh, Chris Sander, Ethan Cerami, Nikolaus Schultz. The cBioPortal for cancer genomics and its application in precision oncology. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr 5277.

16 citations

Posted ContentDOI
01 Sep 2015-bioRxiv
TL;DR: A novel mRNA-based T cell infiltration score (TIS) and profile infiltration levels in 19 tumor types are defined and it is found that clear cell renal cell carcinoma (ccRCC) is the high for TIS and among the highest for the correlation between TISand APM expression, despite a modest mutation burden.
Abstract: Infiltrating T cells in the tumor microenvironment have crucial roles in the competing processes of pro-tumor and anti-tumor immune response. However, the infiltration level of distinct T cell subsets and the signals that draw them into a tumor, such as the expression of antigen presenting machinery (APM) genes, remain poorly characterized across human cancers. Here, we define a novel mRNA-based T cell infiltration score (TIS) and profile infiltration levels in 19 tumor types. We find that clear cell renal cell carcinoma (ccRCC) is the highest for TIS and among the highest for the correlation between TIS and APM expression, despite a modest mutation burden. This finding is contrary to the expectation that immune infiltration and mutation burden are linked. To further characterize the immune infiltration in ccRCC, we use RNA-seq data to computationally infer the infiltration levels of 24 immune cell types in a discovery cohort of 415 ccRCC patients and validate our findings in an independent cohort of 101 ccRCC patients. We find three clusters of tumors that are primarily separated by levels of T cell infiltration and APM gene expression. In ccRCC, the levels of Th17 cells and the ratio of CD8+ T/Treg levels are associated with improved survival whereas the levels of Th2 cells and Tregs are associated with negative clinical outcome. Our analysis illustrates the utility of computational immune cell decomposition for solid tumors, and the potential of this method to guide clinical decision-making.

16 citations

Patent
09 May 2013
TL;DR: In this paper, a method for predicting the three-dimensional structure of a protein from its sequence was proposed, which is based on a covariance matrix between all pairs of sequence positions in the multiple sequence alignment.
Abstract: The present invention relates to a method for predicting three-dimensional structure of a protein from its sequence. Three-dimensional structure may be determined by: (a) generating a multiple sequence alignment for a candidate protein having a known sequence; (b) identifying a covariance matrix between all pairs of sequence positions in the multiple sequence alignment; (c) inverting the covariance matrix and identifying predicted evolutionary constraints using a statistical model of the candidate protein; and (d) simulating folding of an extended chain structure of the candidate protein using the predicted constraints.

15 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Abstract: The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSIBLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.

70,111 citations

Journal ArticleDOI
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Abstract: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved for the alignment of divergent protein sequences. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening up of new gaps at these positions. These modifications are incorporated into a new program, CLUSTAL W which is freely available.

63,427 citations

Journal ArticleDOI
TL;DR: ClUSTAL X is a new windows interface for the widely-used progressive multiple sequence alignment program CLUSTAL W, providing an integrated system for performing multiple sequence and profile alignments and analysing the results.
Abstract: CLUSTAL X is a new windows interface for the widely-used progressive multiple sequence alignment program CLUSTAL W. The new system is easy to use, providing an integrated system for performing multiple sequence and profile alignments and analysing the results. CLUSTAL X displays the sequence alignment in a window on the screen. A versatile sequence colouring scheme allows the user to highlight conserved features in the alignment. Pull-down menus provide all the options required for traditional multiple sequence and profile alignment. New features include: the ability to cut-and-paste sequences to change the order of the alignment, selection of a subset of the sequences to be realigned, and selection of a sub-range of the alignment to be realigned and inserted back into the original alignment. Alignment quality analysis can be performed and low-scoring segments or exceptional residues can be highlighted. Quality analysis and realignment of selected residue ranges provide the user with a powerful tool to improve and refine difficult alignments and to trap errors in input sequences. CLUSTAL X has been compiled on SUN Solaris, IRIX5.3 on Silicon Graphics, Digital UNIX on DECstations, Microsoft Windows (32 bit) for PCs, Linux ELF for x86 PCs, and Macintosh PowerMac.

38,522 citations

Journal ArticleDOI
TL;DR: MUSCLE is a new computer program for creating multiple alignments of protein sequences that includes fast distance estimation using kmer counting, progressive alignment using a new profile function the authors call the log-expectation score, and refinement using tree-dependent restricted partitioning.
Abstract: We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the logexpectation score, and refinement using treedependent restricted partitioning. The speed and accuracy of MUSCLE are compared with T-Coffee, MAFFT and CLUSTALW on four test sets of reference alignments: BAliBASE, SABmark, SMART and a new benchmark, PREFAB. MUSCLE achieves the highest, or joint highest, rank in accuracy on each of these sets. Without refinement, MUSCLE achieves average accuracy statistically indistinguishable from T-Coffee and MAFFT, and is the fastest of the tested methods for large numbers of sequences, aligning 5000 sequences of average length 350 in 7 min on a current desktop computer. The MUSCLE program, source code and PREFAB test data are freely available at http://www.drive5. com/muscle.

37,524 citations

Journal ArticleDOI
TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Abstract: Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.

35,225 citations