Author
Nicholas F. Parrish
Other affiliations: Kyoto University, Emory University, Vanderbilt University ...read more
Bio: Nicholas F. Parrish is an academic researcher from University of Pennsylvania. The author has contributed to research in topics: Genome & Human genome. The author has an hindex of 21, co-authored 38 publications receiving 14681 citations. Previous affiliations of Nicholas F. Parrish include Kyoto University & Emory University.
Topics: Genome, Human genome, Virus, Viral replication, Medicine
Papers
More filters
••
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
12,661 citations
01 Oct 2015
TL;DR: The 1000 Genomes Project as mentioned in this paper provided a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and reported the completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole genome sequencing, deep exome sequencing and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
3,247 citations
••
University of Washington1, University of Maryland, Baltimore2, Broad Institute3, Harvard University4, Mayo Clinic5, Yale University6, Washington University in St. Louis7, University of Michigan8, University of Texas Health Science Center at Houston9, Louisiana State University10, University of North Carolina at Charlotte11, Wellcome Trust12, University of Texas MD Anderson Cancer Center13, Boston College14, Yeshiva University15, Bilkent University16, University of California, San Diego17, National Institutes of Health18, Leiden University19, Baylor College of Medicine20, Cornell University21, Utrecht University22, University of Oxford23, Icahn School of Medicine at Mount Sinai24, Kyoto University25, Virginia Commonwealth University26, Heidelberg University27, Ewha Womans University28
TL;DR: In this paper, the authors describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which are constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations.
Abstract: Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.
1,971 citations
••
TL;DR: Viral properties associated with mucosal HIV-1 transmission and a limited set of rapidly evolving adaptive mutations driven primarily, but not exclusively, by early cytotoxic T cell responses are revealed.
Abstract: Identification of full-length transmitted HIV-1 genomes could be instrumental in HIV-1 pathogenesis, microbicide, and vaccine research by enabling the direct analysis of those viruses actually responsible for productive clinical infection. We show in 12 acutely infected subjects (9 clade B and 3 clade C) that complete HIV-1 genomes of transmitted/founder viruses can be inferred by single genome amplification and sequencing of plasma virion RNA. This allowed for the molecular cloning and biological analysis of transmitted/founder viruses and a comprehensive genome-wide assessment of the genetic imprint left on the evolving virus quasispecies by a composite of host selection pressures. Transmitted viruses encoded intact canonical genes ( gag-pol-vif-vpr-tat-rev-vpu-env-nef ) and replicated efficiently in primary human CD4+ T lymphocytes but much less so in monocyte-derived macrophages. Transmitted viruses were CD4 and CCR5 tropic and demonstrated concealment of coreceptor binding surfaces of the envelope bridging sheet and variable loop 3. 2 mo after infection, transmitted/founder viruses in three subjects were nearly completely replaced by viruses differing at two to five highly selected genomic loci; by 12–20 mo, viruses exhibited concentrated mutations at 17–34 discrete locations. These findings reveal viral properties associated with mucosal HIV-1 transmission and a limited set of rapidly evolving adaptive mutations driven primarily, but not exclusively, by early cytotoxic T cell responses.
777 citations
••
TL;DR: TF viruses are enriched for higher Env content, enhanced cell-free infectivity, improved dendritic cell interaction, and relative IFN-α resistance, which should be considered in the development and testing of AIDS vaccines.
Abstract: Defining the virus–host interactions responsible for HIV-1 transmission, including the phenotypic requirements of viruses capable of establishing de novo infections, could be important for AIDS vaccine development. Previous analyses have failed to identify phenotypic properties other than chemokine receptor 5 (CCR5) and CD4+ T-cell tropism that are preferentially associated with viral transmission. However, most of these studies were limited to examining envelope (Env) function in the context of pseudoviruses. Here, we generated infectious molecular clones of transmitted founder (TF; n = 27) and chronic control (CC; n = 14) viruses of subtypes B (n = 18) and C (n = 23) and compared their phenotypic properties in assays specifically designed to probe the earliest stages of HIV-1 infection. We found that TF virions were 1.7-fold more infectious (P = 0.049) and contained 1.9-fold more Env per particle (P = 0.048) compared with CC viruses. TF viruses were also captured by monocyte-derived dendritic cells 1.7-fold more efficiently (P = 0.035) and more readily transferred to CD4+ T cells (P = 0.025). In primary CD4+ T cells, TF and CC viruses replicated with comparable kinetics; however, when propagated in the presence of IFN-α, TF viruses replicated to higher titers than CC viruses. This difference was significant for subtype B (P = 0.000013) but not subtype C (P = 0.53) viruses, possibly reflecting demographic differences of the respective patient cohorts. Together, these data indicate that TF viruses are enriched for higher Env content, enhanced cell-free infectivity, improved dendritic cell interaction, and relative IFN-α resistance. These viral properties, which likely act in concert, should be considered in the development and testing of AIDS vaccines.
384 citations
Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。
18,940 citations
••
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
12,661 citations
01 Jun 2012
TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).
Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.
10,124 citations
••
Harvard University1, Broad Institute2, Boston Children's Hospital3, University of Washington4, University of Arizona5, Cardiff University6, Google7, Icahn School of Medicine at Mount Sinai8, Samsung Medical Center9, Vertex Pharmaceuticals10, University of Michigan11, University of Cambridge12, State University of New York Upstate Medical University13, Karolinska Institutet14, University of Eastern Finland15, Wellcome Trust Centre for Human Genetics16, University of Oxford17, Cedars-Sinai Medical Center18, University of Ottawa19, University of Pennsylvania20, University of North Carolina at Chapel Hill21, University of Helsinki22, University of California, San Diego23, University of Mississippi Medical Center24
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
8,758 citations
01 Aug 2000
TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.
Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.
4,833 citations