scispace - formally typeset
Search or ask a question

Showing papers by "Howard Hughes Medical Institute published in 2011"


Journal ArticleDOI
TL;DR: This work sequences a diverse array of 25 environmental samples and three known “mock communities” at a depth averaging 3.1 million reads per sample to demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of metaanalysis of many studies from the literature.
Abstract: The ongoing revolution in high-throughput sequencing continues to democratize the ability of small groups of investigators to map the microbial component of the biosphere. In particular, the coevolution of new sequencing platforms and new software tools allows data acquisition and analysis on an unprecedented scale. Here we report the next stage in this coevolutionary arms race, using the Illumina GAIIx platform to sequence a diverse array of 25 environmental samples and three known “mock communities” at a depth averaging 3.1 million reads per sample. We demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of metaanalysis of many studies from the literature (notably, the saline/nonsaline split in environmental samples and the split between host-associated and free-living communities). We also demonstrate that 2,000 Illumina single-end reads are sufficient to recapture the same relationships among samples that we observe with the full dataset. The results thus open up the possibility of conducting large-scale studies analyzing thousands of samples simultaneously to survey microbial communities at an unprecedented spatial and temporal resolution.

6,767 citations


Journal ArticleDOI
TL;DR: In this paper, the organization of networks in the human cerebrum was explored using resting-state functional connectivity MRI data from 1,000 subjects and a clustering approach was employed to identify and replicate networks of functionally coupled regions across the cerebral cortex.
Abstract: Information processing in the cerebral cortex involves interactions among distributed areas. Anatomical connectivity suggests that certain areas form local hierarchical relations such as within the visual system. Other connectivity patterns, particularly among association areas, suggest the presence of large-scale circuits without clear hierarchical relations. In this study the organization of networks in the human cerebrum was explored using resting-state functional connectivity MRI. Data from 1,000 subjects were registered using surface-based alignment. A clustering approach was employed to identify and replicate networks of functionally coupled regions across the cerebral cortex. The results revealed local networks confined to sensory and motor cortices as well as distributed networks of association regions. Within the sensory and motor cortices, functional connectivity followed topographic representations across adjacent areas. In association cortex, the connectivity patterns often showed abrupt transitions between network boundaries. Focused analyses were performed to better understand properties of network connectivity. A canonical sensory-motor pathway involving primary visual area, putative middle temporal area complex (MT+), lateral intraparietal area, and frontal eye field was analyzed to explore how interactions might arise within and between networks. Results showed that adjacent regions of the MT+ complex demonstrate differential connectivity consistent with a hierarchical pathway that spans networks. The functional connectivity of parietal and prefrontal association cortices was next explored. Distinct connectivity profiles of neighboring regions suggest they participate in distributed networks that, while showing evidence for interactions, are embedded within largely parallel, interdigitated circuits. We conclude by discussing the organization of these large-scale cerebral networks in relation to monkey anatomy and their potential evolutionary expansion in humans to support cognition.

6,284 citations


Journal ArticleDOI
05 May 2011-Nature
TL;DR: This study presents a general framework for deciphering cis-regulatory connections and their roles in disease, and maps nine chromatin marks across nine cell types to systematically characterize regulatory elements, their cell-type specificities and their functional interactions.
Abstract: Chromatin profiling has emerged as a powerful means of genome annotation and detection of regulatory activity. The approach is especially well suited to the characterization of non-coding portions of the genome, which critically contribute to cellular phenotypes yet remain largely uncharted. Here we map nine chromatin marks across nine cell types to systematically characterize regulatory elements, their cell-type specificities and their functional interactions. Focusing on cell-type-specific patterns of promoters and enhancers, we define multicell activity profiles for chromatin state, gene expression, regulatory motif enrichment and regulator expression. We use correlations between these profiles to link enhancers to putative target genes, and predict the cell-type-specific activators and repressors that modulate them. The resulting annotations and regulatory predictions have implications for the interpretation of genome-wide association studies. Top-scoring disease single nucleotide polymorphisms are frequently positioned within enhancer elements specifically active in relevant cell types, and in some cases affect a motif instance for a predicted regulator, thus suggesting a mechanism for the association. Our study presents a general framework for deciphering cis-regulatory connections and their roles in disease.

2,646 citations


Journal ArticleDOI
TL;DR: It is confirmed with actual sequence data that UniFrac values can be influenced by the number of sequences/sample, and sequence jackknifing is recommended to avoid this issue.
Abstract: UniFrac is a β-diversity measure that uses phylogenetic information to compare environmental samples. UniFrac, coupled with standard multivariate statistical techniques including principal coordinates analysis (PCoA), identifies factors explaining differences among microbial communities. A recent simulation study concluded that UniFrac is unsuitable as a distance metric and should not be used for multivariate analysis (Schloss, 2008). We counter this argument by reassessing the data that led to this conclusion and by providing a mathematical proof showing that UniFrac is a distance metric. However, we confirm with actual sequence data that UniFrac values can be influenced by the number of sequences/sample, and recommend sequence jackknifing (that is, determining how often the cluster results are recovered using random subsets of the data) to avoid this issue.

2,022 citations


Journal ArticleDOI
28 Apr 2011-Nature
TL;DR: It is shown that different viruses are targeted by unique sets of ISGs, and that each viral species is susceptible to multiple antiviral genes, which together encompass a range of inhibitory activities.
Abstract: The type I interferon response protects cells against invading viral pathogens. The cellular factors that mediate this defence are the products of interferon-stimulated genes (ISGs). Although hundreds of ISGs have been identified since their discovery more than 25 years ago, only a few have been characterized with respect to antiviral activity. For most ISG products, little is known about their antiviral potential, their target specificity and their mechanisms of action. Using an overexpression screening approach, here we show that different viruses are targeted by unique sets of ISGs. We find that each viral species is susceptible to multiple antiviral genes, which together encompass a range of inhibitory activities. To conduct the screen, more than 380 human ISGs were tested for their ability to inhibit the replication of several important human and animal viruses, including hepatitis C virus, yellow fever virus, West Nile virus, chikungunya virus, Venezuelan equine encephalitis virus and human immunodeficiency virus type-1. Broadly acting effectors included IRF1, C6orf150 (also known as MB21D1), HPSE, RIG-I (also known as DDX58), MDA5 (also known as IFIH1) and IFITM3, whereas more targeted antiviral specificity was observed with DDX60, IFI44L, IFI6, IFITM2, MAP3K14, MOV10, NAMPT (also known as PBEF1), OASL, RTP4, TREX1 and UNC84B (also known as SUN2). Combined expression of pairs of ISGs showed additive antiviral effects similar to those of moderate type I interferon doses. Mechanistic studies uncovered a common theme of translational inhibition for numerous effectors. Several ISGs, including ADAR, FAM46C, LY6E and MCOLN2, enhanced the replication of certain viruses, highlighting another layer of complexity in the highly pleiotropic type I interferon system.

1,926 citations


Journal ArticleDOI
22 Jul 2011-Cell
TL;DR: A brain polyribosome-programmed translation system is developed, revealing that FMRP reversibly stalls ribosomes specifically on its target mRNAs and suggests multiple targets for clinical intervention in FXS and ASD.

1,861 citations


Journal ArticleDOI
TL;DR: The evidence linking lncRNAs to diverse human diseases is reviewed and fundamental concepts in lncRNA biology that still need to be clarified are highlighted to provide a robust framework for lnc RNA genetics.

1,747 citations


Journal ArticleDOI
27 Oct 2011-Nature
TL;DR: The results establish small-molecule inhibition of Brd4 as a promising therapeutic strategy in AML and, potentially, other cancers, and highlight the utility of RNA interference screening for revealing epigenetic vulnerabilities that can be exploited for direct pharmacological intervention.
Abstract: Epigenetic pathways can regulate gene expression by controlling and interpreting chromatin modifications. Cancer cells are characterized by altered epigenetic landscapes, and commonly exploit the chromatin regulatory machinery to enforce oncogenic gene expression programs. Although chromatin alterations are, in principle, reversible and often amenable to drug intervention, the promise of targeting such pathways therapeutically has been limited by an incomplete understanding of cancer-specific dependencies on epigenetic regulators. Here we describe a non-biased approach to probe epigenetic vulnerabilities in acute myeloid leukaemia (AML), an aggressive haematopoietic malignancy that is often associated with aberrant chromatin states. By screening a custom library of small hairpin RNAs (shRNAs) targeting known chromatin regulators in a genetically defined AML mouse model, we identify the protein bromodomain-containing 4 (Brd4) as being critically required for disease maintenance. Suppression of Brd4 using shRNAs or the small-molecule inhibitor JQ1 led to robust antileukaemic effects in vitro and in vivo, accompanied by terminal myeloid differentiation and elimination of leukaemia stem cells. Similar sensitivities were observed in a variety of human AML cell lines and primary patient samples, revealing that JQ1 has broad activity in diverse AML subtypes. The effects of Brd4 suppression are, at least in part, due to its role in sustaining Myc expression to promote aberrant self-renewal, which implicates JQ1 as a pharmacological means to suppress MYC in cancer. Our results establish small-molecule inhibition of Brd4 as a promising therapeutic strategy in AML and, potentially, other cancers, and highlight the utility of RNA interference (RNAi) screening for revealing epigenetic vulnerabilities that can be exploited for direct pharmacological intervention.

1,737 citations


Journal ArticleDOI
27 May 2011-Cell
TL;DR: It is shown that deficiency of NLRP6 in mouse colonic epithelial cells results in reduced IL-18 levels and altered fecal microbiota characterized by expanded representation of the bacterial phyla Bacteroidetes (Prevotellaceae) and TM7.

1,704 citations


Journal ArticleDOI
TL;DR: The development of currently available single-component optogenetic tools is outlined and the application of various optogenetics tools in diverse model organisms is summarized.
Abstract: Genetically encoded, single-component optogenetic tools have made a significant impact on neuroscience, enabling specific modulation of selected cells within complex neural tissues. As the optogenetic toolbox contents grow and diversify, the opportunities for neuroscience continue to grow. In this review, we outline the development of currently available single-component optogenetic tools and summarize the application of various optogenetic tools in diverse model organisms.

1,658 citations


Journal Article
TL;DR: This study presents a general framework for deciphering cis-regulatory connections and their roles in disease, and defines multi-cell activity profiles for chromatin state, gene expression, regulatory motif enrichment, and regulator expression.
Abstract: Chromatin profiling has emerged as a powerful means of genome annotation and detection of regulatory activity. The approach is especially well suited to the characterization of non-coding portions of the genome, which critically contribute to cellular phenotypes yet remain largely uncharted. Here we map nine chromatin marks across nine cell types to systematically characterize regulatory elements, their cell-type specificities and their functional interactions. Focusing on cell-type-specific patterns of promoters and enhancers, we define multicell activity profiles for chromatin state, gene expression, regulatory motif enrichment and regulator expression. We use correlations between these profiles to link enhancers to putative target genes, and predict the cell-type-specific activators and repressors that modulate them. The resulting annotations and regulatory predictions have implications for the interpretation of genome-wide association studies. Top-scoring disease single nucleotide polymorphisms are frequently positioned within enhancer elements specifically active in relevant cell types, and in some cases affect a motif instance for a predicted regulator, thus suggesting a mechanism for the association. Our study presents a general framework for deciphering cis-regulatory connections and their roles in disease.

Journal ArticleDOI
20 May 2011-Science
TL;DR: The value of characterizing vertebrate gut microbiomes to understand host evolutionary histories at a supraorganismal level is illustrated by shotgun sequencing of microbial community DNA and targeted sequencing of bacterial 16S ribosomal RNA genes.
Abstract: Coevolution of mammals and their gut microbiota has profoundly affected their radiation into myriad habitats. We used shotgun sequencing of microbial community DNA and targeted sequencing of bacterial 16S ribosomal RNA genes to gain an understanding of how microbial communities adapt to extremes of diet. We sampled fecal DNA from 33 mammalian species and 18 humans who kept detailed diet records, and we found that the adaptation of the microbiota to diet is similar across different mammalian lineages. Functional repertoires of microbiome genes, such as those encoding carbohydrate-active enzymes and proteases, can be predicted from bacterial species assemblages. These results illustrate the value of characterizing vertebrate gut microbiomes to understand host evolutionary histories at a supraorganismal level.

Journal ArticleDOI
TL;DR: The most common building blocks and techniques used to implement these circuits, and an overview of a wide range of neuromorphic silicon neurons, which implement different computational models, ranging from biophysically realistic and conductance-based Hodgkin–Huxley models to bi-dimensional generalized adaptive integrate and fire models.
Abstract: Hardware implementations of spiking neurons can be extremely useful for a large variety of applications, ranging from high-speed modeling of large-scale neural systems to real-time behaving systems, to bidirectional brain-machine interfaces. The specific circuit solutions used to implement silicon neurons depend on the application requirements. In this paper we describe the most common building blocks and techniques used to implement these circuits, and present an overview of a wide range of neuromorphic silicon neurons, which implement different computational models, ranging from biophysically realistic and conductance-based Hodgkin-Huxley models to bi-dimensional generalized adaptive integrate and fire models. We compare the different design methodologies used for each silicon neuron design described, and demonstrate their features with experimental results, measured from a wide range of fabricated VLSI chips.

Journal ArticleDOI
TL;DR: An overview of the project and the resources it is generating and the application of ENCODE data to interpret the human genome are provided.
Abstract: The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.

Journal ArticleDOI
24 Mar 2011-Nature
TL;DR: 111,195 new elements are identified, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches.
Abstract: Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development.

Journal ArticleDOI
11 Nov 2011-Cell
TL;DR: A growing body of work about the connections between apoptosis, stem cells, and cancer is explored, focusing on how apoptotic cells release a variety of signals to communicate with their cellular environment, including factors that promote cell division, tissue regeneration, and wound healing.

Journal ArticleDOI
TL;DR: It is argued that the long-term goal should be routine, cost-effective and high quality de novo assembly of human genomes to comprehensively assess all classes of structural variation.
Abstract: Comparisons of human genomes show that more base pairs are altered as a result of structural variation — including copy number variation — than as a result of point mutations. Here we review advances and challenges in the discovery and genotyping of structural variation. The recent application of massively parallel sequencing methods has complemented microarray-based methods and has led to an exponential increase in the discovery of smaller structural-variation events. Some global discovery biases remain, but the integration of experimental and computational approaches is proving fruitful for accurate characterization of the copy, content and structure of variable regions. We argue that the long-term goal should be routine, cost-effective and high quality de novo assembly of human genomes to comprehensively assess all classes of structural variation.

Journal ArticleDOI
24 Mar 2011-Nature
TL;DR: The massively parallel sequencing of 38 tumour genomes and their comparison to matched normal DNAs indicates that cancer genome sequencing of large collections of samples will yield new insights into cancer not anticipated by existing knowledge.
Abstract: Multiple myeloma is an incurable malignancy of plasma cells, and its pathogenesis is poorly understood. Here we report the massively parallel sequencing of 38 tumour genomes and their comparison to matched normal DNAs. Several new and unexpected oncogenic mechanisms were suggested by the pattern of somatic mutation across the data set. These include the mutation of genes involved in protein translation (seen in nearly half of the patients), genes involved in histone methylation, and genes involved in blood coagulation. In addition, a broader than anticipated role of NF-κB signalling was indicated by mutations in 11 members of the NF-κB pathway. Of potential immediate clinical relevance, activating mutations of the kinase BRAF were observed in 4% of patients, suggesting the evaluation of BRAF inhibitors in multiple myeloma clinical trials. These results indicate that cancer genome sequencing of large collections of samples will yield new insights into cancer not anticipated by existing knowledge.

Journal ArticleDOI
29 Apr 2011-PLOS ONE
TL;DR: OSLOM (Order Statistics Local Optimization Method), the first method capable to detect clusters in networks accounting for edge directions, edge weights, overlapping communities, hierarchies and community dynamics, is presented.
Abstract: Community structure is one of the main structural features of networks, revealing both their internal organization and the similarity of their elementary units. Despite the large variety of methods proposed to detect communities in graphs, there is a big need for multi-purpose techniques, able to handle different types of datasets and the subtleties of community structure. In this paper we present OSLOM (Order Statistics Local Optimization Method), the first method capable to detect clusters in networks accounting for edge directions, edge weights, overlapping communities, hierarchies and community dynamics. It is based on the local optimization of a fitness function expressing the statistical significance of clusters with respect to random fluctuations, which is estimated with tools of Extreme and Order Statistics. OSLOM can be used alone or as a refinement procedure of partitions/covers delivered by other techniques. We have also implemented sequential algorithms combining OSLOM with other fast techniques, so that the community structure of very large networks can be uncovered. Our method has a comparable performance as the best existing algorithms on artificial benchmark graphs. Several applications on real networks are shown as well. OSLOM is implemented in a freely available software (http://www.oslom.org), and we believe it will be a valuable tool in the analysis of networks.

Journal ArticleDOI
03 Mar 2011-Nature
TL;DR: It is shown that 22 human induced pluripotent stem (hiPS) cell lines reprogrammed using five different methods each contained an average of five protein-coding point mutations in the regions sampled, and that hiPS cells acquire genetic modifications in addition to epigenetic modifications.
Abstract: Defined transcription factors can induce epigenetic reprogramming of adult mammalian cells into induced pluripotent stem cells. Although DNA factors are integrated during some reprogramming methods, it is unknown whether the genome remains unchanged at the single nucleotide level. Here we show that 22 human induced pluripotent stem (hiPS) cell lines reprogrammed using five different methods each contained an average of five protein-coding point mutations in the regions sampled (an estimated six protein-coding point mutations per exome). The majority of these mutations were non-synonymous, nonsense or splice variants, and were enriched in genes mutated or having causative effects in cancers. At least half of these reprogramming-associated mutations pre-existed in fibroblast progenitors at low frequencies, whereas the rest occurred during or after reprogramming. Thus, hiPS cells acquire genetic modifications in addition to epigenetic modifications. Extensive genetic screening should become a standard procedure to ensure hiPS cell safety before clinical use.

Journal ArticleDOI
09 Jun 2011-Neuron
TL;DR: A genome-wide analysis of rare copy-number variation in 1124 autism spectrum disorder families, each comprised of a single proband, unaffected parents, and, in most kindreds, an unaffected sibling, finds significant association of ASD with de novo duplications of 7q11.23, where the reciprocal deletion causes Williams-Beuren syndrome.

Journal ArticleDOI
10 Feb 2011-Nature
TL;DR: In this paper, the authors presented the complete sequence of seven primary human prostate cancers and their paired normal counterparts and revealed previously unknown balanced rearrangements, at which multiple intra-and inter-chromosomal loci exchange their breakpoint arms without any loss of genetic material.
Abstract: Prostate cancer is the second most common cause of male cancer deaths in the United States. However, the full range of prostate cancer genomic alterations is incompletely characterized. Here we present the complete sequence of seven primary human prostate cancers and their paired normal counterparts. Several tumours contained complex chains of balanced (that is, ‘copy-neutral’) rearrangements that occurred within or adjacent to known cancer genes. Rearrangement breakpoints were enriched near open chromatin, androgen receptor and ERG DNA binding sites in the setting of the ETS gene fusion TMPRSS2–ERG, but inversely correlated with these regions in tumours lacking ETS fusions. This observation suggests a link between chromatin or transcriptional regulation and the genesis of genomic aberrations. Three tumours contained rearrangements that disrupted CADM2, and four harboured events disrupting either PTEN (unbalanced events), a prostate tumour suppressor, or MAGI2 (balanced events), a PTEN interacting protein not previously implicated in prostate tumorigenesis. Thus, genomic rearrangements may arise from transcriptional or chromatin aberrancies and engage prostate tumorigenic mechanisms. Prostate cancer is a common cause of male cancer-related deaths. Complete genome sequencing of seven 'high-risk' primary prostate cancers and their paired normal counterparts now reveals previously unknown balanced rearrangements, at which multiple intra- and inter-chromosomal loci exchange their breakpoint arms without any loss of genetic material. The anomalies seem to arise through errors in transcription or abnormal chromatin structure, and genes affected include the known prostate tumour suppressor PTEN as well as MAG12, a gene not previously implicated in prostate tumorigenesis. Prostate cancer is a common cause of male cancer-related deaths. Complete sequencing of prostate cancer genomes now reveals previously unknown balanced rearrangements. Single-nucleotide resolution afforded by sequencing indicates that complex rearrangements may arise from transcriptional or chromatin aberrancies and engage prostate tumorigenic mechanisms.

Journal ArticleDOI
TL;DR: This analysis quantitatively characterized the switching properties of 26 organic dyes and directly related these properties to the quality of super-resolution images, providing guidelines for characterization ofsuper-resolution probes and a resource for selecting probes based on performance.
Abstract: One approach to super-resolution fluorescence imaging uses sequential activation and localization of individual fluorophores to achieve high spatial resolution. Essential to this technique is the choice of fluorescent probes; the properties of the probes, including photons per switching event, on-off duty cycle, photostability and number of switching cycles, largely dictate the quality of super-resolution images. Although many probes have been reported, a systematic characterization of the properties of these probes and their impact on super-resolution image quality has been described in only a few cases. Here we quantitatively characterized the switching properties of 26 organic dyes and directly related these properties to the quality of super-resolution images. This analysis provides guidelines for characterization of super-resolution probes and a resource for selecting probes based on performance. Our evaluation identified several photoswitchable dyes with good to excellent performance in four independent spectral ranges, with which we demonstrated low-cross-talk, four-color super-resolution imaging.

Journal ArticleDOI
TL;DR: An approach that can substantially increase the sensitivity of massively parallel sequencing instruments for the identification of rare variants and the utility of this approach for determining the fidelity of a polymerase, the accuracy of oligonucleotides synthesized in vitro, and the prevalence of mutations in the nuclear and mitochondrial genomes of normal cells is described.
Abstract: The identification of mutations that are present in a small fraction of DNA templates is essential for progress in several areas of biomedical research. Although massively parallel sequencing instruments are in principle well suited to this task, the error rates in such instruments are generally too high to allow confident identification of rare variants. We here describe an approach that can substantially increase the sensitivity of massively parallel sequencing instruments for this purpose. The keys to this approach, called the Safe-Sequencing System (“Safe-SeqS”), are (i) assignment of a unique identifier (UID) to each template molecule, (ii) amplification of each uniquely tagged template molecule to create UID families, and (iii) redundant sequencing of the amplification products. PCR fragments with the same UID are considered mutant (“supermutants”) only if ≥95% of them contain the identical mutation. We illustrate the utility of this approach for determining the fidelity of a polymerase, the accuracy of oligonucleotides synthesized in vitro, and the prevalence of mutations in the nuclear and mitochondrial genomes of normal cells.

01 Mar 2011
TL;DR: In this paper, a massively parallel sequencing of 38 tumour genomes and their comparison to matched normal DNAs was reported, and several new and unexpected oncogenic mechanisms were suggested by the pattern of somatic mutation across the data set.
Abstract: Multiple myeloma is an incurable malignancy of plasma cells, and its pathogenesis is poorly understood. Here we report the massively parallel sequencing of 38 tumour genomes and their comparison to matched normal DNAs. Several new and unexpected oncogenic mechanisms were suggested by the pattern of somatic mutation across the data set. These include the mutation of genes involved in protein translation (seen in nearly half of the patients), genes involved in histone methylation, and genes involved in blood coagulation. In addition, a broader than anticipated role of NF-κB signalling was indicated by mutations in 11 members of the NF-κB pathway. Of potential immediate clinical relevance, activating mutations of the kinase BRAF were observed in 4% of patients, suggesting the evaluation of BRAF inhibitors in multiple myeloma clinical trials. These results indicate that cancer genome sequencing of large collections of samples will yield new insights into cancer not anticipated by existing knowledge.

Journal ArticleDOI
TL;DR: SourceTracker, a Bayesian approach to estimate the proportion of contaminants in a given community that come from possible source environments, is presented, and microbial surveys from neonatal intensive care units, offices and molecular biology laboratories are applied.
Abstract: Contamination is a critical issue in high-throughput metagenomic studies, yet progress toward a comprehensive solution has been limited. We present SourceTracker, a Bayesian approach to estimate the proportion of contaminants in a given community that come from possible source environments. We applied SourceTracker to microbial surveys from neonatal intensive care units (NICUs), offices and molecular biology laboratories, and provide a database of known contaminants for future testing.

Journal ArticleDOI
TL;DR: The results show that trio-based exome sequencing is a powerful approach for identifying new candidate genes for ASDs and suggest that de novo mutations may contribute substantially to the genetic etiology of ASDs.
Abstract: Evidence for the etiology of autism spectrum disorders (ASDs) has consistently pointed to a strong genetic component complicated by substantial locus heterogeneity. We sequenced the exomes of 20 individuals with sporadic ASD (cases) and their parents, reasoning that these families would be enriched for de novo mutations of major effect. We identified 21 de novo mutations, 11 of which were protein altering. Protein-altering mutations were significantly enriched for changes at highly conserved residues. We identified potentially causative de novo events in 4 out of 20 probands, particularly among more severely affected individuals, in FOXP1, GRIN2B, SCN1A and LAMC3. In the FOXP1 mutation carrier, we also observed a rare inherited CNTNAP2 missense variant, and we provide functional support for a multi-hit model for disease risk. Our results show that trio-based exome sequencing is a powerful approach for identifying new candidate genes for ASDs and suggest that de novo mutations may contribute substantially to the genetic etiology of ASDs.

Journal ArticleDOI
TL;DR: A selection of recent studies that have probed histone modifications and successive layers of chromatin structure in mammalian genomes, the patterns that have been identified and future directions for research are reviewed.
Abstract: A succession of technological advances over the past decade have enabled researchers to chart maps of histone modifications and related chromatin structures with increasing accuracy, comprehensiveness and throughput. The resulting data sets highlight the interplay between chromatin and genome function, dynamic variations in chromatin structure across cellular conditions, and emerging roles for large-scale domains and higher-ordered chromatin organization. Here we review a selection of recent studies that have probed histone modifications and successive layers of chromatin structure in mammalian genomes, the patterns that have been identified and future directions for research.

Journal ArticleDOI
16 Sep 2011-Science
TL;DR: Anti-HIV broadly neutralizing antibodies with similar specificities and modes of binding were found in multiple HIV-infected individuals, and cloned 576 new HIV antibodies from four unrelated individuals to determine whether they are part of a larger group of related molecules.
Abstract: Passive transfer of broadly neutralizing HIV antibodies can prevent infection, which suggests that vaccines that elicit such antibodies would be protective. Thus far, however, few broadly neutralizing HIV antibodies that occur naturally have been characterized. To determine whether these antibodies are part of a larger group of related molecules, we cloned 576 new HIV antibodies from four unrelated individuals. All four individuals produced expanded clones of potent broadly neutralizing CD4-binding-site antibodies that mimic binding to CD4. Despite extensive hypermutation, the new antibodies shared a consensus sequence of 68 immunoglobulin H (IgH) chain amino acids and arise independently from two related IgH genes. Comparison of the crystal structure of one of the antibodies to the broadly neutralizing antibody VRC01 revealed conservation of the contacts to the HIV spike.

Journal ArticleDOI
23 Dec 2011-Cell
TL;DR: This study provides a roadmap for identification and analysis of lincRNAs in model organisms and shows that linc RNAs play crucial biological roles during embryonic development with functionality conserved despite limited sequence conservation.