Author

Carlos Bustamante

Bio: Carlos Bustamante is an academic researcher from Stanford University. The author has contributed to research on topics including Population and Optical tweezers. The author has an h-index of 161 and has co-authored 770 publications receiving 106,053 citations. Previous affiliations of Carlos Bustamante include Lawrence Berkeley National Laboratory and the University of California.


Papers
Journal Article
TL;DR: This work developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, LRT, GERP, SiPhy, phyloP, and phastCons.
Abstract: The vast majority of coding variants are rare, and assessment of the contribution of rare variants to complex traits is hampered by low statistical power and limited functional data. Improved methods for predicting the pathogenicity of rare coding variants are needed to facilitate the discovery of disease variants from exome sequencing studies. We developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP, SiPhy, phyloP, and phastCons. REVEL was trained with recently discovered pathogenic and rare neutral missense variants, excluding those previously used to train its constituent tools. When applied to two independent test sets, REVEL had the best overall performance (p < 10^-12) as compared to any individual tool and seven ensemble methods: MetaSVM, MetaLR, KGGSeq, Condel, CADD, DANN, and Eigen. Importantly, REVEL also had the best performance for distinguishing pathogenic from rare neutral variants with allele frequencies

1,295 citations

Journal Article
23 Jan 2003-Nature
TL;DR: The basic features of DNA were elucidated during the half-century following the discovery of the double helix, but it is only during the past decade that researchers have been able to manipulate single molecules of DNA to make direct measurements of its mechanical properties.
Abstract: The basic features of DNA were elucidated during the half-century following the discovery of the double helix. But it is only during the past decade that researchers have been able to manipulate single molecules of DNA to make direct measurements of its mechanical properties. These studies have illuminated the nature of interactions between DNA and proteins, the constraints within which the cellular machinery operates, and the forces created by DNA-dependent motors.

1,254 citations

Journal Article
TL;DR: This work establishes an open-source translational research platform for genome-wide association studies in rice that directly links molecular variation in genes and metabolic pathways with the germplasm resources needed to accelerate varietal development and crop improvement.
Abstract: Asian rice, Oryza sativa, is a cultivated, inbreeding species that feeds over half of the world's population. Understanding the genetic basis of diverse physiological, developmental, and morphological traits provides the basis for improving yield, quality and sustainability of rice. Here we show the results of a genome-wide association study based on genotyping 44,100 SNP variants across 413 diverse accessions of O. sativa collected from 82 countries that were systematically phenotyped for 34 traits. Using cross-population-based mapping strategies, we identified dozens of common variants influencing numerous complex traits. Significant heterogeneity was observed in the genetic architecture associated with subpopulation structure and response to environment. This work establishes an open-source translational research platform for genome-wide association studies in rice that directly links molecular variation in genes and metabolic pathways with the germplasm resources needed to accelerate varietal development and crop improvement.

1,170 citations

Journal Article
16 May 1997-Science
TL;DR: Scaling the molecular data up to sarcomeric dimensions reproduced many features of the passive force versus extension curve of muscle fibers. Force hysteresis arises from a difference between the unfolding and refolding kinetics of the molecule relative to the stretch and release rates in the experiments.
Abstract: Titin, a giant filamentous polypeptide, is believed to play a fundamental role in maintaining sarcomeric structural integrity and developing what is known as passive force in muscle. Measurements of the force required to stretch a single molecule revealed that titin behaves as a highly nonlinear entropic spring. The molecule unfolds in a high-force transition beginning at 20 to 30 piconewtons and refolds in a low-force transition at approximately 2.5 piconewtons. A fraction of the molecule (5 to 40 percent) remains permanently unfolded, behaving as a wormlike chain with a persistence length (a measure of the chain's bending rigidity) of 20 angstroms. Force hysteresis arises from a difference between the unfolding and refolding kinetics of the molecule relative to the stretch and release rates in the experiments, respectively. Scaling the molecular data up to sarcomeric dimensions reproduced many features of the passive force versus extension curve of muscle fibers.

1,159 citations
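
The titin abstract above describes the permanently unfolded fraction as a wormlike chain with a persistence length of 20 angstroms. As a rough illustration of what that means, the sketch below evaluates the widely used Marko-Siggia interpolation formula for a wormlike chain. This is an assumption for illustration: the abstract does not say which expression the authors fitted. The contour length of 100 nm and the thermal energy of 4.11 pN·nm (about 298 K) are also hypothetical values, not taken from the paper.

```python
def wlc_force(extension_nm, contour_length_nm, persistence_length_nm, kT_pn_nm=4.11):
    """Force (pN) of a wormlike chain via the Marko-Siggia interpolation formula.

    F = (kT / P) * [ 1 / (4 (1 - x/L)^2) - 1/4 + x/L ]
    kT_pn_nm defaults to ~4.11 pN*nm (assumed room temperature).
    """
    x = extension_nm / contour_length_nm  # fractional extension
    if not 0.0 <= x < 1.0:
        raise ValueError("extension must satisfy 0 <= extension < contour length")
    return (kT_pn_nm / persistence_length_nm) * (0.25 / (1.0 - x) ** 2 - 0.25 + x)


# Persistence length of 20 angstroms = 2 nm (from the abstract);
# the 100 nm contour length is a hypothetical value.
for ext in (10.0, 50.0, 90.0):
    print(f"extension {ext:5.1f} nm -> force {wlc_force(ext, 100.0, 2.0):7.2f} pN")
```

The force rises steeply as the extension approaches the contour length, which is the highly nonlinear entropic-spring behavior the abstract refers to.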

Journal Article
07 Jun 2002-Science
TL;DR: The implementation and test of Jarzynski's equality provides the first example of its use as a bridge between the statistical mechanics of equilibrium and nonequilibrium systems, and extends the thermodynamic analysis of single molecule manipulation data beyond the context of equilibrium experiments.
Abstract: Recent advances in statistical mechanical theory can be used to solve a fundamental problem in experimental thermodynamics. In 1997, Jarzynski proved an equality relating the irreversible work to the equilibrium free energy difference, ΔG. This remarkable theoretical result states that it is possible to obtain equilibrium thermodynamic parameters from processes carried out arbitrarily far from equilibrium. We test Jarzynski's equality by mechanically stretching a single molecule of RNA reversibly and irreversibly between two conformations. Application of this equality to the irreversible work trajectories recovers the ΔG profile of the stretching process to within k_BT/2 (half the thermal energy) of its best independent estimate, the mean work of reversible stretching. The implementation and test of Jarzynski's equality provides the first example of its use as a bridge between the statistical mechanics of equilibrium and nonequilibrium systems. This work also extends the thermodynamic analysis of single molecule manipulation data beyond the context of equilibrium experiments.

1,086 citations
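
Jarzynski's equality, referred to in the abstract above, is usually written as ⟨exp(-W/k_BT)⟩ = exp(-ΔG/k_BT), where the average runs over repeated nonequilibrium trajectories. The sketch below shows how a free energy difference could be estimated from a list of work values. It is a minimal illustration, not the authors' analysis: the work values are synthetic Gaussian samples, and the mean, spread, and sample count are hypothetical. For a Gaussian work distribution the exact answer is ΔG = mean - variance/(2 k_BT), which gives a check on the estimator.

```python
import math
import random


def jarzynski_delta_g(work_values, kT=1.0):
    """Estimate dG = -kT * ln( mean( exp(-W / kT) ) ).

    Uses the log-sum-exp trick so large work values do not underflow.
    """
    scaled = [-w / kT for w in work_values]
    m = max(scaled)
    log_mean = m + math.log(sum(math.exp(s - m) for s in scaled) / len(scaled))
    return -kT * log_mean


# Synthetic irreversible work values in units of kT (hypothetical numbers).
rng = random.Random(0)
mu, sigma = 5.0, 1.0
work = [rng.gauss(mu, sigma) for _ in range(20000)]

estimate = jarzynski_delta_g(work)
mean_work = sum(work) / len(work)
print(f"mean work          : {mean_work:.3f} kT")
print(f"Jarzynski estimate : {estimate:.3f} kT")
print(f"Gaussian reference : {mu - sigma**2 / 2:.3f} kT")
```

The estimate falls below the mean work, consistent with the second law: the average work of an irreversible process exceeds ΔG, and the difference is the dissipated work.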


Cited by
28 Jul 2005
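TL;DR: Plasmodium falciparum erythrocyte surface protein 1 (PfPMP1) interacts with one or more receptors on infected erythrocytes, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion.
Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte surface protein 1 (PfPMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion. The var gene family encodes about 60 members per haploid genome, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.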

18,940 citations

Journal Article
TL;DR: NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. It scales to hundreds of processors on high-end parallel platforms and tens of processors on low-cost commodity clusters, and also runs on individual desktop and laptop computers.
Abstract: NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD scales to hundreds of processors on high-end parallel platforms, as well as tens of processors on low-cost commodity clusters, and also runs on individual desktop and laptop computers. NAMD works with AMBER and CHARMM potential functions, parameters, and file formats. This article, directed to novices as well as experts, first introduces concepts and methods used in the NAMD program, describing the classical molecular dynamics force field, equations of motion, and integration methods along with the efficient electrostatics evaluation algorithms employed and temperature and pressure controls used. Features for steering the simulation across barriers and for calculating both alchemical and conformational free energy differences are presented. The motivations for and a roadmap to the internal design of NAMD, implemented in C++ and based on Charm++ parallel objects, are outlined. The factors affecting the serial and parallel performance of a simulation are discussed. Finally, typical NAMD use is illustrated with representative applications to a small, a medium, and a large biomolecular system, highlighting particular features of NAMD, for example, the Tcl scripting language. The article also provides a list of the key features of NAMD and discusses the benefits of combining NAMD with the molecular graphics/sequence analysis software VMD and the grid computing/collaboratory software BioCoRE. NAMD is distributed free of charge with source code at www.ks.uiuc.edu.

14,558 citations

Journal Article
Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin (+514 more; 90 institutions)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article
Fumio Tajima
30 Oct 1989-Genomics
TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

11,521 citations