scispace - formally typeset
Search or ask a question
Author

Carlos Bustamante

Bio: Carlos Bustamante is an academic researcher from Stanford University. The author has contributed to research in topics: Population & Optical tweezers. The author has an hindex of 161, co-authored 770 publications receiving 106053 citations. Previous affiliations of Carlos Bustamante include Lawrence Berkeley National Laboratory & University of California.


Papers
More filters
01 Oct 2010
TL;DR: The pilot phase of the 1000 Genomes Project is presented, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms, and the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants are described.

599 citations

Journal ArticleDOI
Genevieve L. Wojcik1, Mariaelisa Graff2, Katherine K. Nishimura3, Ran Tao4, Jeffrey Haessler3, Christopher R. Gignoux1, Christopher R. Gignoux5, Heather M. Highland2, Yesha Patel6, Elena P. Sorokin1, Christy L. Avery2, Gillian M. Belbin7, Stephanie A. Bien3, Iona Cheng8, Sinead Cullina7, Chani J. Hodonsky2, Yao Hu3, Laura M. Huckins7, Janina M. Jeff7, Anne E. Justice2, Jonathan M. Kocarnik3, Unhee Lim9, Bridget M Lin2, Yingchang Lu7, Sarah C. Nelson10, Sungshim L. Park6, Hannah Poisner7, Michael Preuss7, Melissa A. Richard11, Claudia Schurmann12, Claudia Schurmann7, Veronica Wendy Setiawan6, Alexandra Sockell1, Karan Vahi6, Marie Verbanck7, Abhishek Vishnu7, Ryan W. Walker7, Kristin L. Young2, Niha Zubair3, Victor Acuña-Alonso, José Luis Ambite6, Kathleen C. Barnes5, Eric Boerwinkle11, Erwin P. Bottinger7, Erwin P. Bottinger12, Carlos Bustamante1, Christian Caberto9, Samuel Canizales-Quinteros, Matthew P. Conomos10, Ewa Deelman6, Ron Do7, Kimberly F. Doheny13, Lindsay Fernández-Rhodes2, Lindsay Fernández-Rhodes14, Myriam Fornage11, Benyam Hailu15, Gerardo Heiss2, Brenna M. Henn16, Lucia A. Hindorff15, Rebecca D. Jackson17, Cecelia A. Laurie10, Cathy C. Laurie10, Yuqing Li8, Yuqing Li18, Danyu Lin2, Andrés Moreno-Estrada, Girish N. Nadkarni7, Paul Norman5, Loreall Pooler6, Alexander P. Reiner10, Jane Romm13, Chiara Sabatti1, Karla Sandoval, Xin Sheng6, Eli A. Stahl7, Daniel O. Stram6, Timothy A. Thornton10, Christina L. Wassel19, Lynne R. Wilkens9, Cheryl A. Winkler, Sachi Yoneyama2, Steven Buyske20, Christopher A. Haiman6, Charles Kooperberg3, Loic Le Marchand9, Ruth J. F. Loos7, Tara C. Matise20, Kari E. North2, Ulrike Peters3, Eimear E. Kenny7, Christopher S. Carlson3 
27 Jun 2019-Nature
TL;DR: The value of diverse, multi-ethnic participants in large-scale genomic studies is demonstrated and evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications are shown.
Abstract: Genome-wide association studies (GWAS) have laid the foundation for investigations into the biology of complex traits, drug development and clinical guidelines. However, the majority of discovery efforts are based on data from populations of European ancestry1-3. In light of the differential genetic architecture that is known to exist between populations, bias in representation can exacerbate existing disease and healthcare disparities. Critical variants may be missed if they have a low frequency or are completely absent in European populations, especially as the field shifts its attention towards rare variants, which are more likely to be population-specific4-10. Additionally, effect sizes and their derived risk prediction scores derived in one population may not accurately extrapolate to other populations11,12. Here we demonstrate the value of diverse, multi-ethnic participants in large-scale genomic studies. The Population Architecture using Genomics and Epidemiology (PAGE) study conducted a GWAS of 26 clinical and behavioural phenotypes in 49,839 non-European individuals. Using strategies tailored for analysis of multi-ethnic and admixed populations, we describe a framework for analysing diverse populations, identify 27 novel loci and 38 secondary signals at known loci, as well as replicate 1,444 GWAS catalogue associations across these traits. Our data show evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications. In the United States-where minority populations have a disproportionately higher burden of chronic conditions13-the lack of representation of diverse populations in genetic research will result in inequitable access to precision medicine for those with the highest burden of disease. We strongly advocate for continued, large genome-wide efforts in diverse populations to maximize genetic discovery and reduce health disparities.

591 citations

Journal ArticleDOI
TL;DR: The most comprehensive examination of mammalian PSGs to date, using the six high-coverage genome assemblies now available for eutherian mammals, indicates that PSGs are expressed at significantly lower levels, and in a more tissue-specific manner, than non-PSGs.
Abstract: Genome-wide scans for positively selected genes (PSGs) in mammals have provided insight into the dynamics of genome evolution, the genetic basis of differences between species, and the functions of individual genes. However, previous scans have been limited in power and accuracy owing to small numbers of available genomes. Here we present the most comprehensive examination of mammalian PSGs to date, using the six high-coverage genome assemblies now available for eutherian mammals. The increased phylogenetic depth of this dataset results in substantially improved statistical power, and permits several new lineage- and clade-specific tests to be applied. Of approximately 16,500 human genes with high-confidence orthologs in at least two other species, 400 genes showed significant evidence of positive selection (FDR<0.05), according to a standard likelihood ratio test. An additional 144 genes showed evidence of positive selection on particular lineages or clades. As in previous studies, the identified PSGs were enriched for roles in defense/immunity, chemosensory perception, and reproduction, but enrichments were also evident for more specific functions, such as complement-mediated immunity and taste perception. Several pathways were strongly enriched for PSGs, suggesting possible co-evolution of interacting genes. A novel Bayesian analysis of the possible "selection histories" of each gene indicated that most PSGs have switched multiple times between positive selection and nonselection, suggesting that positive selection is often episodic. A detailed analysis of Affymetrix exon array data indicated that PSGs are expressed at significantly lower levels, and in a more tissue-specific manner, than non-PSGs. Genes that are specifically expressed in the spleen, testes, liver, and breast are significantly enriched for PSGs, but no evidence was found for an enrichment for PSGs among brain-specific genes. This study provides additional evidence for widespread positive selection in mammalian evolution and new genome-wide insights into the functional implications of positive selection.

563 citations

Journal ArticleDOI
Devin P. Locke1, LaDeana W. Hillier1, Wesley C. Warren1, Kim C. Worley2, Lynne V. Nazareth2, Donna M. Muzny2, Shiaw-Pyng Yang1, Zhengyuan Wang1, Asif T. Chinwalla1, Patrick Minx1, Makedonka Mitreva1, Lisa Cook1, Kim D. Delehaunty1, Catrina Fronick1, Heather Schmidt1, Lucinda Fulton1, Robert S. Fulton1, Joanne O. Nelson1, Vincent Magrini1, Craig Pohl1, Tina Graves1, Chris Markovic1, Andy Cree2, Huyen Dinh2, Jennifer Hume2, Christie Kovar2, Gerald R. Fowler2, Gerton Lunter3, Gerton Lunter4, Stephen Meader3, Andreas Heger3, Chris P. Ponting3, Tomas Marques-Bonet5, Tomas Marques-Bonet6, Can Alkan5, Lin Chen5, Ze Cheng5, Jeffrey M. Kidd5, Evan E. Eichler5, Evan E. Eichler7, Simon D. M. White8, Stephen M. J. Searle8, Albert J. Vilella9, Yuan Chen9, Paul Flicek9, Jian Ma10, Jian Ma11, Brian J. Raney11, Bernard B. Suh11, Richard Burhans12, Javier Herrero9, David Haussler11, Rui Faria6, Rui Faria13, Olga Fernando14, Olga Fernando6, Fleur Darré6, Domènec Farré6, Elodie Gazave6, Meritxell Oliva6, Arcadi Navarro6, Roberta Roberto15, Oronzo Capozzi15, Nicoletta Archidiacono15, Giuliano Della Valle16, Stefania Purgato16, Mariano Rocchi15, Miriam K. Konkel17, Jerilyn A. Walker17, Brygg Ullmer17, Mark A. Batzer17, Arian F.A. Smit18, Robert Hubley18, Claudio Casola19, Daniel R. Schrider19, Matthew W. Hahn19, Víctor Quesada20, Xose S. Puente20, Gonzalo R. Ordóñez20, Carlos López-Otín20, Tomas Vinar21, Brona Brejova21, Aakrosh Ratan12, Robert S. Harris12, Webb Miller12, Carolin Kosiol, Heather A. Lawson1, Vikas Taliwal22, André L. Martins22, Adam Siepel22, Arindam RoyChoudhury23, Xin Ma22, Jeremiah D. Degenhardt22, Carlos Bustamante24, Ryan N. Gutenkunst25, Thomas Mailund26, Julien Y. Dutheil26, Asger Hobolth26, Mikkel H. Schierup26, Oliver A. Ryder, Yuko Yoshinaga27, Pieter J. de Jong27, George M. Weinstock1, Jeffrey Rogers2, Elaine R. Mardis1, Richard A. Gibbs2, Richard K. Wilson1 
27 Jan 2011-Nature
TL;DR: The orang-utan species, Pongo abelii and Pongo pygmaeus, are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution and a primate polymorphic neocentromere, found in both Pongo species are described.
Abstract: 'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.

555 citations

Journal ArticleDOI
17 Jul 2003-Nature
TL;DR: Tests of the linearity of DNA's twist elasticity, direct measurements of the torsional modulus, characterization of torque-induced structural transitions, and the establishment of a framework for future assays of torque and twist generation by DNA-dependent enzymes are reported.
Abstract: Knowledge of the elastic properties of DNA is required to understand the structural dynamics of cellular processes such as replication and transcription. Measurements of force and extension on single molecules of DNA 1‐3 have allowed direct determination of the molecule’s mechanical properties, provided rigorous tests of theories of polymer elasticity 4 , revealed unforeseen structural transitions induced by mechanical stresses 3,5‐7 , and established an experimental and conceptual framework for mechanical assays of enzymes that act on DNA 8 . However, a complete description of DNA mechanics must also consider the effects of torque, a quantity that has hitherto not been directly measured in micromanipulation experiments. We have measured torque as a function of twist for stretched DNA—torsional strain in over- or underwound molecules was used to power the rotation of submicrometre beads serving as calibrated loads. Here we report tests of the linearity of DNA’s twist elasticity, direct measurements of the torsional modulus (finding a value ,40% higher than generally accepted), characterization of torque-induced structural transitions, and the establishment of a framework for future assays of torque and twist generation by DNA-dependent enzymes. We also show that cooperative structural transitions in DNA can be exploited to construct constanttorque wind-up motors and force‐torque converters. Previous investigations of the force‐extension behaviour of supercoiled DNA have found that, under low tension, DNA behaves like an isotropic flexible rod 2,9 : as turns are added to the molecule, its

545 citations


Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
TL;DR: NAMD as discussed by the authors is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems that scales to hundreds of processors on high-end parallel platforms, as well as tens of processors in low-cost commodity clusters, and also runs on individual desktop and laptop computers.
Abstract: NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD scales to hundreds of processors on high-end parallel platforms, as well as tens of processors on low-cost commodity clusters, and also runs on individual desktop and laptop computers. NAMD works with AMBER and CHARMM potential functions, parameters, and file formats. This article, directed to novices as well as experts, first introduces concepts and methods used in the NAMD program, describing the classical molecular dynamics force field, equations of motion, and integration methods along with the efficient electrostatics evaluation algorithms employed and temperature and pressure controls used. Features for steering the simulation across barriers and for calculating both alchemical and conformational free energy differences are presented. The motivations for and a roadmap to the internal design of NAMD, implemented in C++ and based on Charm++ parallel objects, are outlined. The factors affecting the serial and parallel performance of a simulation are discussed. Finally, typical NAMD use is illustrated with representative applications to a small, a medium, and a large biomolecular system, highlighting particular features of NAMD, for example, the Tcl scripting language. The article also provides a list of the key features of NAMD and discusses the benefits of combining NAMD with the molecular graphics/sequence analysis software VMD and the grid computing/collaboratory software BioCoRE. NAMD is distributed free of charge with source code at www.ks.uiuc.edu.

14,558 citations

Journal ArticleDOI
Adam Auton1, Gonçalo R. Abecasis2, David Altshuler3, Richard Durbin4  +514 moreInstitutions (90)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article
Fumio Tajima1
30 Oct 1989-Genomics
TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

11,521 citations