scispace - formally typeset
Search or ask a question
Author

Carlos Bustamante

Bio: Carlos Bustamante is an academic researcher from Stanford University. The author has contributed to research in topics: Population & Optical tweezers. The author has an hindex of 161, co-authored 770 publications receiving 106053 citations. Previous affiliations of Carlos Bustamante include Lawrence Berkeley National Laboratory & University of California.


Papers
More filters
Posted ContentDOI
15 Oct 2015-bioRxiv
TL;DR: It is shown that relatedness patterns track north- and west-bound routes followed during the Great Migration, suggesting that admixture occurred predominantly in the South prior to the Civil War and that ancestry-biased migration is responsible for regional differences in ancestry.
Abstract: Genetic studies of African-Americans identify functional variants, elucidate historical and genealogical mysteries, and reveal basic biology. However, African-Americans have been under-represented in genetic studies, and little is known about nation-wide patterns of genomic diversity in the population. Here, we present a comprehensive assessment of African-American genomic diversity using genotype data from nationally and regionally representative cohorts. We find higher African ancestry in southern United States compared to the North and West. We show that relatedness patterns track north- and west-bound routes followed during the Great Migration, suggesting that admixture occurred predominantly in the South prior to the Civil War and that ancestry-biased migration is responsible for regional differences in ancestry. Rare genetic traits among African-Americans can therefore be shared over long geographic distances along the Great Migration routes, yet their distribution over short distances remains highly structured. This study clarifies the role of re- cent demography in shaping African-American genomic diversity.

36 citations

Journal ArticleDOI
TL;DR: This work eliminated the main source of noise of most high-resolution dual-trap optical tweezers and developed both a single-molecule assay and a self-learning algorithm to uncover the full trajectories of such a motor: RNA polymerase.
Abstract: In recent years, highly stable optical tweezers systems have enabled the characterization of the dynamics of molecular motors at very high resolution. However, the motion of many motors with angstrom-scale dynamics cannot be consistently resolved due to poor signal-to-noise ratio. Using an acousto-optic deflector to generate a "time-shared" dual-optical trap, we decreased low-frequency noise by more than one order of magnitude compared with conventional dual-trap optical tweezers. Using this instrument, we implemented a protocol that synthesizes single base-pair trajectories, which are used to test a Large State Space Hidden Markov Model algorithm to recover their individual steps. We then used this algorithm on real transcription data obtained in the same instrument to fully uncover the molecular trajectories of Escherichia coli RNA polymerase. We applied this procedure to reveal the effect of pyrophosphate on the distribution of dwell times between consecutive polymerase steps.

35 citations

Journal ArticleDOI
TL;DR: This work uses high-coverage whole-genome sequencing of a conditional mismatch repair mutant line of diploid yeast to identify mutations that accumulated after 160 generations of growth and suggests that specific mutation hot spots can contribute disproportionately to the genetic variation that is introduced into populations.

35 citations

Journal ArticleDOI
07 May 2015-eLife
TL;DR: The data suggest that SpoIIIE assembles a coaxially paired channel for each chromosome arm comprised of one hexamer in each cell to maintain membrane fission during DNA translocation.
Abstract: SpoIIIE is a membrane-anchored DNA translocase that localizes to the septal midpoint to mediate chromosome translocation and membrane fission during Bacillus subtilis sporulation. Here we use cell-specific protein degradation and quantitative photoactivated localization microscopy in strains with a thick sporulation septum to investigate the architecture and function of the SpoIIIE DNA translocation complex in vivo. We were able to visualize SpoIIIE complexes with approximately equal numbers of molecules in the mother cell and the forespore. Cell-specific protein degradation showed that only the mother cell complex is required to translocate DNA into the forespore, whereas degradation in either cell reverses membrane fission. Our data suggest that SpoIIIE assembles a coaxially paired channel for each chromosome arm comprised of one hexamer in each cell to maintain membrane fission during DNA translocation. We show that SpoIIIE can operate, in principle, as a bi-directional motor that exports DNA.

35 citations

Journal ArticleDOI
TL;DR: It is shown that exome capture of saliva-derived DNA yields sufficient non-human sequences to characterize oral microbial communities, including detection of bacteria linked to oral disease (e.g. Prevotella melaninogenica).
Abstract: Targeted capture of genomic regions reduces sequencing cost while generating higher coverage by allowing biomedical researchers to focus on specific loci of interest, such as exons. Targeted capture also has the potential to facilitate the generation of genomic data from DNA collected via saliva or buccal cells. DNA samples derived from these cell types tend to have a lower human DNA yield, may be degraded from age and/or have contamination from bacteria or other ambient oral microbiota. However, thousands of samples have been previously collected from these cell types, and saliva collection has the advantage that it is a non-invasive and appropriate for a wide variety of research. We demonstrate successful enrichment and sequencing of 15 South African KhoeSan exomes and 2 full genomes with samples initially derived from saliva. The expanded exome dataset enables us to characterize genetic diversity free from ascertainment bias for multiple KhoeSan populations, including new exome data from six HGDP Namibian San, revealing substantial population structure across the Kalahari Desert region. Additionally, we discover and independently verify thirty-one previously unknown KIR alleles using methods we developed to accurately map and call the highly polymorphic HLA and KIR loci from exome capture data. Finally, we show that exome capture of saliva-derived DNA yields sufficient non-human sequences to characterize oral microbial communities, including detection of bacteria linked to oral disease (e.g. Prevotella melaninogenica). For comparison, two samples were sequenced using standard full genome library preparation without exome capture and we found no systematic bias of metagenomic information between exome-captured and non-captured data. DNA from human saliva samples, collected and extracted using standard procedures, can be used to successfully sequence high quality human exomes, and metagenomic data can be derived from non-human reads. We find that individuals from the Kalahari carry a higher oral pathogenic microbial load than samples surveyed in the Human Microbiome Project. Additionally, rare variants present in the exomes suggest strong population structure across different KhoeSan populations.

35 citations


Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
TL;DR: NAMD as discussed by the authors is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems that scales to hundreds of processors on high-end parallel platforms, as well as tens of processors in low-cost commodity clusters, and also runs on individual desktop and laptop computers.
Abstract: NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD scales to hundreds of processors on high-end parallel platforms, as well as tens of processors on low-cost commodity clusters, and also runs on individual desktop and laptop computers. NAMD works with AMBER and CHARMM potential functions, parameters, and file formats. This article, directed to novices as well as experts, first introduces concepts and methods used in the NAMD program, describing the classical molecular dynamics force field, equations of motion, and integration methods along with the efficient electrostatics evaluation algorithms employed and temperature and pressure controls used. Features for steering the simulation across barriers and for calculating both alchemical and conformational free energy differences are presented. The motivations for and a roadmap to the internal design of NAMD, implemented in C++ and based on Charm++ parallel objects, are outlined. The factors affecting the serial and parallel performance of a simulation are discussed. Finally, typical NAMD use is illustrated with representative applications to a small, a medium, and a large biomolecular system, highlighting particular features of NAMD, for example, the Tcl scripting language. The article also provides a list of the key features of NAMD and discusses the benefits of combining NAMD with the molecular graphics/sequence analysis software VMD and the grid computing/collaboratory software BioCoRE. NAMD is distributed free of charge with source code at www.ks.uiuc.edu.

14,558 citations

Journal ArticleDOI
Adam Auton1, Gonçalo R. Abecasis2, David Altshuler3, Richard Durbin4  +514 moreInstitutions (90)
01 Oct 2015-Nature
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article
Fumio Tajima1
30 Oct 1989-Genomics
TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

11,521 citations