scispace - formally typeset
Search or ask a question

Showing papers in "PLOS Genetics in 2019"


Journal ArticleDOI
TL;DR: This work confirms drugs with genetically supported targets were more likely to be successful in Phases II and III and finds the use of human genetic evidence increases approval by greater than two-fold, and, for Mendelian associations, the positive association holds prospectively.
Abstract: Despite strong vetting for disease activity, only 10% of candidate new molecular entities in early stage clinical trials are eventually approved. Analyzing historical pipeline data, Nelson et al. 2015 (Nat. Genet.) concluded pipeline drug targets with human genetic evidence of disease association are twice as likely to lead to approved drugs. Taking advantage of recent clinical development advances and rapid growth in GWAS datasets, we extend the original work using updated data, test whether genetic evidence predicts future successes and introduce statistical models adjusting for target and indication-level properties. Our work confirms drugs with genetically supported targets were more likely to be successful in Phases II and III. When causal genes are clear (Mendelian traits and GWAS associations linked to coding variants), we find the use of human genetic evidence increases approval by greater than two-fold, and, for Mendelian associations, the positive association holds prospectively. Our findings suggest investments into genomics and genetics are likely to be beneficial to companies deploying this strategy.

308 citations


Journal ArticleDOI
TL;DR: This work proposes an efficient statistical method (MultiXcan) that leverages the substantial sharing of eQTLs across tissues and contexts to improve the ability to identify potential target genes and develops a robust implementation of the summary-based extension.
Abstract: Integration of genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) studies is needed to improve our understanding of the biological mechanisms underlying GWAS hits, and our ability to identify therapeutic targets. Gene-level association methods such as PrediXcan can prioritize candidate targets. However, limited eQTL sample sizes and absence of relevant developmental and disease context restrict our ability to detect associations. Here we propose an efficient statistical method (MultiXcan) that leverages the substantial sharing of eQTLs across tissues and contexts to improve our ability to identify potential target genes. MultiXcan integrates evidence across multiple panels using multivariate regression, which naturally takes into account the correlation structure. We apply our method to simulated and real traits from the UK Biobank and show that, in realistic settings, we can detect a larger set of significantly associated genes than using each panel separately. To improve applicability, we developed a summary result-based extension called S-MultiXcan, which we show yields highly concordant results with the individual level version when LD is well matched. Our multivariate model-based approach allowed us to use the individual level results as a gold standard to calibrate and develop a robust implementation of the summary-based extension. Results from our analysis as well as software and necessary resources to apply our method are publicly available.

209 citations


Journal ArticleDOI
TL;DR: Most studies of de novo genes to date have thus focused on young genes, typically taxonomically-restricted genes that are present in a single species or lineage, including socalled orphan genes, defined as genes that lack any identifiable homolog.
Abstract: De novo gene birth is the process by which new genes evolve from DNA sequences that were ancestrally non-genic. De novo genes represent a subset of novel genes, and may be proteincoding or instead act as RNA genes [1]. The processes that govern de novo gene birth (Fig 1A) are not well understood, though several models exist that describe possible mechanisms by which de novo gene birth may occur. Although de novo gene birth may have occurred at any point in an organism’s evolutionary history, ancient de novo gene birth events are difficult to detect. Most studies of de novo genes to date have thus focused on young genes, typically taxonomically-restricted genes (TRGs) that are present in a single species or lineage, including socalled orphan genes, defined as genes that lack any identifiable homolog. It is important to note, however, that not all orphan genes arise de novo, and instead may emerge through fairly well-characterized mechanisms such as gene duplication (including retroposition) or horizontal gene transfer followed by sequence divergence, or by gene fission/fusion [2, 3] (Fig 2) Though de novo gene birth was once viewed as a highly unlikely occurrence [4], there are now several unequivocal examples of the phenomenon that have been described. It furthermore has been advanced that de novo gene birth plays a major role in the generation of evolutionary innovation [5, 6].

203 citations


Journal ArticleDOI
Madeline H. Kowalski1, Huijun Qian1, Ziyi Hou2, Jonathan D. Rosen1, Amanda L. Tapia1, Yue Shan1, Deepti Jain3, Maria Argos4, Donna K. Arnett5, Christy L. Avery1, Kathleen C. Barnes6, Lewis C. Becker7, Stephanie A. Bien8, Joshua C. Bis3, John Blangero9, Eric Boerwinkle10, Donald W. Bowden11, Steve Buyske12, Jianwen Cai1, Michael H. Cho2, Michael H. Cho13, Seung Hoan Choi14, Hélène Choquet15, L. Adrienne Cupples16, Mary Cushman17, Michelle Daya6, Paul S. de Vries10, Patrick T. Ellinor14, Patrick T. Ellinor2, Nauder Faraday7, Myriam Fornage10, Stacey Gabriel14, Santhi K. Ganesh18, Misa Graff1, Namrata Gupta14, Jiang He19, Susan R. Heckbert3, Susan R. Heckbert15, Bertha Hidalgo20, Chani J. Hodonsky1, Marguerite R. Irvin20, Andrew D. Johnson, Eric Jorgenson15, Robert C. Kaplan21, Sharon L.R. Kardia18, Tanika N. Kelly19, Charles Kooperberg8, Jessica Lasky-Su13, Jessica Lasky-Su2, Ruth J. F. Loos22, Steven A. Lubitz2, Steven A. Lubitz14, Rasika A. Mathias7, Caitlin P. McHugh3, Courtney G. Montgomery23, Jee-Young Moon21, Alanna C. Morrison10, Nicholette D. Palmer11, Nathan Pankratz24, George Papanicolaou, Juan M. Peralta9, Patricia A. Peyser18, Stephen S. Rich25, Jerome I. Rotter26, Edwin K. Silverman13, Edwin K. Silverman2, Jennifer A. Smith18, Nicholas L. Smith15, Nicholas L. Smith3, Nicholas L. Smith27, Kent D. Taylor26, Timothy A. Thornton3, Hemant K. Tiwari20, Russell P. Tracy17, Tao Wang21, Scott T. Weiss2, Scott T. Weiss13, Lu-Chen Weng14, Kerri L. Wiggins3, James G. Wilson28, Lisa R. Yanek7, Sebastian Zöllner18, Kari E. North1, Paul L. Auer29, Laura M. Raffield1, Alex P. Reiner3, Yun Li1 
TL;DR: It is demonstrated that using TOPMed sequencing data as the imputation reference panel improves genotypes into admixed African and Hispanic/Latino samples with genome-wide genotyping array data, which subsequently enhanced gene-mapping power for complex traits.
Abstract: Most genome-wide association and fine-mapping studies to date have been conducted in individuals of European descent, and genetic studies of populations of Hispanic/Latino and African ancestry are limited. In addition, these populations have more complex linkage disequilibrium structure. In order to better define the genetic architecture of these understudied populations, we leveraged >100,000 phased sequences available from deep-coverage whole genome sequencing through the multi-ethnic NHLBI Trans-Omics for Precision Medicine (TOPMed) program to impute genotypes into admixed African and Hispanic/Latino samples with genome-wide genotyping array data. We demonstrated that using TOPMed sequencing data as the imputation reference panel improves genotype imputation quality in these populations, which subsequently enhanced gene-mapping power for complex traits. For rare variants with minor allele frequency (MAF) 86%. Subsequent association analyses of TOPMed reference panel-imputed genotype data with hematological traits (hemoglobin (HGB), hematocrit (HCT), and white blood cell count (WBC)) in ~21,600 African-ancestry and ~21,700 Hispanic/Latino individuals identified associations with two rare variants in the HBB gene (rs33930165 with higher WBC [p = 8.8x10-15] in African populations, rs11549407 with lower HGB [p = 1.5x10-12] and HCT [p = 8.8x10-10] in Hispanics/Latinos). By comparison, neither variant would have been genome-wide significant if either 1000 Genomes Project Phase 3 or Haplotype Reference Consortium reference panels had been used for imputation. Our findings highlight the utility of the TOPMed imputation reference panel for identification of novel rare variant associations not previously detected in similarly sized genome-wide studies of under-represented African and Hispanic/Latino populations.

181 citations


Journal ArticleDOI
TL;DR: The data indicate that MDR clones pose the greatest risk, because they are more likely to acquire virulence genes than hypervirulent clones are to acquire resistance genes.
Abstract: Klebsiella pneumoniae has emerged as an important cause of two distinct public health threats: multi-drug resistant (MDR) healthcare-associated infections and drug susceptible community-acquired invasive infections. These pathotypes are generally associated with two distinct subsets of K. pneumoniae lineages or ‘clones’ that are distinguished by the presence of acquired resistance genes and several key virulence loci. Genomic evolutionary analyses of the most notorious MDR and invasive community-associated (‘hypervirulent’) clones indicate differences in terms of chromosomal recombination dynamics and capsule polysaccharide diversity, but it remains unclear if these differences represent generalised trends. Here we leverage a collection of >2200 K. pneumoniae genomes to identify 28 common clones (n ≥ 10 genomes each), and perform the first genomic evolutionary comparison. Eight MDR and 6 hypervirulent clones were identified on the basis of acquired resistance and virulence gene prevalence. Chromosomal recombination, surface polysaccharide locus diversity, pan-genome, plasmid and phage dynamics were characterised and compared. The data showed that MDR clones were highly diverse, with frequent chromosomal recombination generating extensive surface polysaccharide locus diversity. Additional pan-genome diversity was driven by frequent acquisition/loss of both plasmids and phage. In contrast, chromosomal recombination was rare in the hypervirulent clones, which also showed a significant reduction in pan-genome diversity, largely driven by a reduction in plasmid diversity. Hence the data indicate that hypervirulent clones may be subject to some sort of constraint for horizontal gene transfer that does not apply to the MDR clones. Our findings are relevant for understanding the risk of emergence of individual K. pneumoniae strains carrying both virulence and acquired resistance genes, which have been increasingly reported and cause highly virulent infections that are extremely difficult to treat. Specifically, our data indicate that MDR clones pose the greatest risk, because they are more likely to acquire virulence genes than hypervirulent clones are to acquire resistance genes.

175 citations


Journal ArticleDOI
TL;DR: Uniform manifold approximation and projection (UMAP), a non-linear dimension reduction tool, is applied to three well-studied genotype datasets and discover overlooked subpopulations within the American Hispanic population, fine-scale relationships between geography, genotypes, and phenotypes in the UK population, and cryptic structure in the Thousand Genomes Project data.
Abstract: Human populations feature both discrete and continuous patterns of variation. Current analysis approaches struggle to jointly identify these patterns because of modelling assumptions, mathematical constraints, or numerical challenges. Here we apply uniform manifold approximation and projection (UMAP), a non-linear dimension reduction tool, to three well-studied genotype datasets and discover overlooked subpopulations within the American Hispanic population, fine-scale relationships between geography, genotypes, and phenotypes in the UK population, and cryptic structure in the Thousand Genomes Project data. This approach is well-suited to the influx of large and diverse data and opens new lines of inquiry in population-scale datasets.

157 citations


Journal ArticleDOI
TL;DR: The data demonstrate that representations of plant mitochondrial genomes as simple, circular molecules are not accurate descriptions of their true nature and that in reality plant mitochondrial DNA is a complex, dynamic mixture of forms.
Abstract: Plant mitochondrial genomes are usually assembled and displayed as circular maps based on the widely-held view across the broad community of life scientists that circular genome-sized molecules are the primary form of plant mitochondrial DNA, despite the understanding by plant mitochondrial researchers that this is an inaccurate and outdated concept. Many plant mitochondrial genomes have one or more pairs of large repeats that can act as sites for inter- or intramolecular recombination, leading to multiple alternative arrangements (isoforms). Most mitochondrial genomes have been assembled using methods unable to capture the complete spectrum of isoforms within a species, leading to an incomplete inference of their structure and recombinational activity. To document and investigate underlying reasons for structural diversity in plant mitochondrial DNA, we used long-read (PacBio) and short-read (Illumina) sequencing data to assemble and compare mitochondrial genomes of domesticated (Lactuca sativa) and wild (L. saligna and L. serriola) lettuce species. We characterized a comprehensive, complex set of isoforms within each species and compared genome structures between species. Physical analysis of L. sativa mtDNA molecules by fluorescence microscopy revealed a variety of linear, branched, and circular structures. The mitochondrial genomes for L. sativa and L. serriola were identical in sequence and arrangement and differed substantially from L. saligna, indicating that the mitochondrial genome structure did not change during domestication. From the isoforms in our data, we infer that recombination occurs at repeats of all sizes at variable frequencies. The differences in genome structure between L. saligna and the two other Lactuca species can be largely explained by rare recombination events that rearranged the structure. Our data demonstrate that representations of plant mitochondrial genomes as simple, circular molecules are not accurate descriptions of their true nature and that in reality plant mitochondrial DNA is a complex, dynamic mixture of forms.

149 citations


Journal ArticleDOI
TL;DR: The advent of whole genome sequence data on tens of thousands of people is poised to resolve the problem of missing heritability, but here I want to sound a note of caution: more sequence data does not mean methodological problems go away.
Abstract: The problem of missing heritability, that is to say the gap between heritability estimates from genotype data and heritability estimates from twin data, has been a source of debate for about a decade [1]. It might appear that the advent of whole genome sequence data on tens of thousands of people is poised to resolve the issue, but here I want to sound a note of caution: more sequence data does not mean methodological problems go away. . . Heritability measures the overall importance of genetic inheritance in shaping differences between individuals and is defined as the fraction of trait variation in a population due to genetic inheritance [2]. The advent of twin studies[3] made it possible to estimate heritability by comparing the phenotypic similarity of identical (monozygotic) twins to non-identical (dizygotic) twins: since monozygotic twins are genetically identical, whereas non-identical twins are only half identical on average, greater similarity of identical over non-identical twins is evidence for a contribution of genetic variation to trait variation. However, the twin design makes several assumptions, most importantly that there is no greater environmental similarity of identical over non-identical twins. Whether twin studies have overestimated heritability for human traits, especially social and behavioural traits, remains controversial [4]. The dawn of the genome-wide association study (GWAS) era, around the year 2007, brought with it the question: can we identify specific genetic variations that explain the heritability estimated from twin studies? The small sample sizes of early GWAS meant they had power to identify only common genetic variants with relatively strong effects, and the amount of trait variation that these variants explained was typically only a small fraction of the heritability estimated by twin studies. For height, by 2010 around 40 variants had been identified that collectively explained around 5% of the variation in height, compared to a twin heritability of around 80% [5]. This gap became labelled ’the problem of missing heritability’ and has stimulated heated debate ever since [1]. Many different explanations for the ’missing heritability’ have been proposed [6]. I will focus on two: 1) that complex traits are highly polygenic and affected by many rare variants; 2) that twin studies have overestimated heritability. Note that both of these explanations could contribute to explaining the ‘missing heritability’. The idea behind 1) was that GWAS were not sufficiently powerful to detect the many genetic variants with weak effects on a trait like height, and the genotyping array technologies were not capturing the rare genetic variants that may explain a substantial fraction of the heritability [1,5,6]. The idea behind 2) was that twin studies were overestimating heritability, perhaps due to genetic interactions [7], gene-environment interactions[8], or violation of twin studies assumptions about the environment [4], so that less heritability was in fact missing. The deepest solution to the missing heritability problem would involve identifying all of the causal genetic variants and measuring how much trait variation they explain. An intermediate step towards this solution is to show how much variation we could hope to explain from all

147 citations


Journal ArticleDOI
TL;DR: This review focuses on recent studies characterizing the interaction partners of Xist RNA, and how they have affected the field by addressing long-standing controversies or by giving rise to new research perspectives that are currently being explored.
Abstract: In mammals, dosage compensation of sex chromosomal genes between females (XX) and males (XY) is achieved through X-chromosome inactivation (XCI). The X-linked X-inactive-specific transcript (Xist) long noncoding RNA is indispensable for XCI and initiates the process early during development by spreading in cis across the X chromosome from which it is transcribed. During XCI, Xist RNA triggers gene silencing, recruits a plethora of chromatin modifying factors, and drives a major structural reorganization of the X chromosome. Here, we review our knowledge of the multitude of epigenetic events orchestrated by Xist RNA to allow female mammals to survive through embryonic development by establishing and maintaining proper dosage compensation. In particular, we focus on recent studies characterizing the interaction partners of Xist RNA, and we discuss how they have affected the field by addressing long-standing controversies or by giving rise to new research perspectives that are currently being explored. This review is dedicated to the memory of Denise Barlow, pioneer of genomic imprinting and functional long noncoding RNAs (lncRNAs), whose work has revolutionized the epigenetics field and continues to inspire generations of scientists.

141 citations


Journal ArticleDOI
TL;DR: It is suggested that blocking or reversing mitochondrial damage may provide a potential therapeutic approach to these devastating diseases and uncovered a previously unknown role of LonP1 in regulating mitochondrial TDP-43 levels, but also advanced the understanding of the pathogenic mechanisms for T DP-43 proteinopathy.
Abstract: Mutations in or dys-regulation of the TDP-43 gene have been associated with TDP-43 proteinopathy, a spectrum of neurodegenerative diseases including Frontotemporal Lobar Degeneration (FTLD) and Amyotrophic Lateral Sclerosis (ALS). The underlying molecular and cellular defects, however, remain unclear. Here, we report a systematic study combining analyses of patient brain samples with cellular and animal models for TDP-43 proteinopathy. Electron microscopy (EM) analyses of patient samples revealed prominent mitochondrial impairment, including abnormal cristae and a loss of cristae; these ultrastructural changes were consistently observed in both cellular and animal models of TDP-43 proteinopathy. In these models, increased TDP-43 expression induced mitochondrial dysfunction, including decreased mitochondrial membrane potential and elevated production of reactive oxygen species (ROS). TDP-43 expression suppressed mitochondrial complex I activity and reduced mitochondrial ATP synthesis. Importantly, TDP-43 activated the mitochondrial unfolded protein response (UPRmt) in both cellular and animal models. Down-regulating mitochondrial protease LonP1 increased mitochondrial TDP-43 levels and exacerbated TDP-43-induced mitochondrial damage as well as neurodegeneration. Together, our results demonstrate that TDP-43 induced mitochondrial impairment is a critical aspect in TDP-43 proteinopathy. Our work has not only uncovered a previously unknown role of LonP1 in regulating mitochondrial TDP-43 levels, but also advanced our understanding of the pathogenic mechanisms for TDP-43 proteinopathy. Our study suggests that blocking or reversing mitochondrial damage may provide a potential therapeutic approach to these devastating diseases.

139 citations


Journal ArticleDOI
TL;DR: The proposition that chronic pain involves a strong nervous system component with implications for the understanding of the physiology of chronic pain is supported, as well as the future development of novel treatment approaches.
Abstract: Chronic pain is highly prevalent worldwide and represents a significant socioeconomic and public health burden. Several aspects of chronic pain, for example back pain and a severity-related phenotype ‘chronic pain grade’, have been shown previously to be complex heritable traits with a polygenic component. Additional pain-related phenotypes capturing aspects of an individual’s overall sensitivity to experiencing and reporting chronic pain have also been suggested as a focus for investigation. We made use of a measure of the number of sites of chronic pain in individuals within the UK general population. This measure, termed Multisite Chronic Pain (MCP), is a complex trait and its genetic architecture has not previously been investigated. To address this, we carried out a large-scale genome-wide association study (GWAS) of MCP in ~380,000 UK Biobank participants. Our findings were consistent with MCP having a significant polygenic component, with a Single Nucleotide Polymorphism (SNP) heritability of 10.2%. In total 76 independent lead SNPs at 39 risk loci were associated with MCP. Additional gene-level association analyses identified neurogenesis, synaptic plasticity, nervous system development, cell-cycle progression and apoptosis genes as enriched for genetic association with MCP. Genetic correlations were observed between MCP and a range of psychiatric, autoimmune and anthropometric traits, including major depressive disorder (MDD), asthma and Body Mass Index (BMI). Furthermore, in Mendelian randomisation (MR) analyses a causal effect of MCP on MDD was observed. Additionally, a polygenic risk score (PRS) for MCP was found to significantly predict chronic widespread pain (pain all over the body), indicating the existence of genetic variants contributing to both of these pain phenotypes. Overall, our findings support the proposition that chronic pain involves a strong nervous system component with implications for our understanding of the physiology of chronic pain. These discoveries may also inform the future development of novel treatment approaches.

Journal ArticleDOI
TL;DR: A novel multi-gene molecular toolkit is developed and applied to eDNA isolated from bulk plankton samples collected over a five-year period from a single site, which demonstrates the ability of the toolkit to audit community composition.
Abstract: Marine ecosystems are changing rapidly as the oceans warm and become more acidic. The physical factors and the changes to ocean chemistry that they drive can all be measured with great precision. Changes in the biological composition of communities in different ocean regions are far more challenging to measure because most biological monitoring methods focus on a limited taxonomic or size range. Environmental DNA (eDNA) analysis has the potential to solve this problem in biological oceanography, as it is capable of identifying a huge phylogenetic range of organisms to species level. Here we develop and apply a novel multi-gene molecular toolkit to eDNA isolated from bulk plankton samples collected over a five-year period from a single site. This temporal scale and level of detail is unprecedented in eDNA studies. We identified consistent seasonal assemblages of zooplankton species, which demonstrates the ability of our toolkit to audit community composition. We were also able to detect clear departures from the regular seasonal patterns that occurred during an extreme marine heatwave. The integration of eDNA analyses with existing biotic and abiotic surveys delivers a powerful new long-term approach to monitoring the health of our world's oceans in the context of a rapidly changing climate.

Journal ArticleDOI
TL;DR: A key analytical result is an expression for the joint distribution of mutant alleles at the end of the adaptive phase that characterizes the polygenic pattern of adaptation at the underlying genotype when phenotypic adaptation has been accomplished.
Abstract: Evolutionary theory has produced two conflicting paradigms for the adaptation of a polygenic trait. While population genetics views adaptation as a sequence of selective sweeps at single loci underlying the trait, quantitative genetics posits a collective response, where phenotypic adaptation results from subtle allele frequency shifts at many loci. Yet, a synthesis of these views is largely missing and the population genetic factors that favor each scenario are not well understood. Here, we study the architecture of adaptation of a binary polygenic trait (such as resistance) with negative epistasis among the loci of its basis. The genetic structure of this trait allows for a full range of potential architectures of adaptation, ranging from sweeps to small frequency shifts. By combining computer simulations and a newly devised analytical framework based on Yule branching processes, we gain a detailed understanding of the adaptation dynamics for this trait. Our key analytical result is an expression for the joint distribution of mutant alleles at the end of the adaptive phase. This distribution characterizes the polygenic pattern of adaptation at the underlying genotype when phenotypic adaptation has been accomplished. We find that a single compound parameter, the population-scaled background mutation rate Θbg, explains the main differences among these patterns. For a focal locus, Θbg measures the mutation rate at all redundant loci in its genetic background that offer alternative ways for adaptation. For adaptation starting from mutation-selection-drift balance, we observe different patterns in three parameter regions. Adaptation proceeds by sweeps for small Θbg ≲ 0.1, while small polygenic allele frequency shifts require large Θbg ≳ 100. In the large intermediate regime, we observe a heterogeneous pattern of partial sweeps at several interacting loci.

Journal ArticleDOI
TL;DR: The results demonstrate that the KAI2 signalling pathway is an important new regulator of root hair and root development in Arabidopsis and lay an important basis for research into a molecular understanding of how very similar and partially overlapping hormone signalling pathways regulate different phenotypic outputs.
Abstract: Karrikins are smoke-derived compounds presumed to mimic endogenous signalling molecules (KAI2-ligand, KL), whose signalling pathway is closely related to that of strigolactones (SLs), important regulators of plant development. Both karrikins/KLs and SLs are perceived by closely related α/β hydrolase receptors (KAI2 and D14 respectively), and signalling through both receptors requires the F-box protein MAX2. Furthermore, both pathways trigger proteasome-mediated degradation of related SMAX1-LIKE (SMXL) proteins, to influence development. It has previously been suggested in multiple studies that SLs are important regulators of root and root hair development in Arabidopsis, but these conclusions are based on phenotypes observed in the non-specific max2 mutants and by use of racemic-GR24, a mixture of stereoisomers that activates both D14 and KAI2 signalling pathways. Here, we demonstrate that the majority of the effects on Arabidopsis root development previously attributed to SL signalling are actually mediated by the KAI2 signalling pathway. Using mutants defective in SL or KL synthesis and/or perception, we show that KAI2-mediated signalling alone regulates root hair density and root hair length as well as root skewing, straightness and diameter, while both KAI2 and D14 pathways regulate lateral root density and epidermal cell length. We test the key hypothesis that KAI2 signals by a non-canonical receptor-target mechanism in the context of root development. Our results provide no evidence for this, and we instead show that all effects of KAI2 in the root can be explained by canonical SMAX1/SMXL2 activity. However, we do find evidence for non-canonical GR24 ligand-receptor interactions in D14/KAI2-mediated root hair development. Overall, our results demonstrate that the KAI2 signalling pathway is an important new regulator of root hair and root development in Arabidopsis and lay an important basis for research into a molecular understanding of how very similar and partially overlapping hormone signalling pathways regulate different phenotypic outputs.

Journal ArticleDOI
TL;DR: It is illustrated that the strength of reference bias is negatively correlated with fragment length, which has the potential to cause minor but significant differences in the results of downstream analyses such as population allele sharing, heterozygosity estimates and estimates of archaic ancestry.
Abstract: Haploid high quality reference genomes are an important resource in genomic research projects. A consequence is that DNA fragments carrying the reference allele will be more likely to map successfully, or receive higher quality scores. This reference bias can have effects on downstream population genomic analysis when heterozygous sites are falsely considered homozygous for the reference allele. In palaeogenomic studies of human populations, mapping against the human reference genome is used to identify endogenous human sequences. Ancient DNA studies usually operate with low sequencing coverages and fragmentation of DNA molecules causes a large proportion of the sequenced fragments to be shorter than 50 bp-reducing the amount of accepted mismatches, and increasing the probability of multiple matching sites in the genome. These ancient DNA specific properties are potentially exacerbating the impact of reference bias on downstream analyses, especially since most studies of ancient human populations use pseudo-haploid data, i.e. they randomly sample only one sequencing read per site. We show that reference bias is pervasive in published ancient DNA sequence data of prehistoric humans with some differences between individual genomic regions. We illustrate that the strength of reference bias is negatively correlated with fragment length. Most genomic regions we investigated show little to no mapping bias but even a small proportion of sites with bias can impact analyses of those particular loci or slightly skew genome-wide estimates. Therefore, reference bias has the potential to cause minor but significant differences in the results of downstream analyses such as population allele sharing, heterozygosity estimates and estimates of archaic ancestry. These spurious results highlight how important it is to be aware of these technical artifacts and that we need strategies to mitigate the effect. Therefore, we suggest some post-mapping filtering strategies to resolve reference bias which help to reduce its impact substantially.

Journal ArticleDOI
TL;DR: It is shown that mitochondrial fusion is necessary for proliferating MEFs to recover from mtDNA depletion and for the marked increase of mtDNA copy number during postnatal heart development, and OMM fusion to replication and distribution of mt DNA.
Abstract: Mitochondrial dynamics is an essential physiological process controlling mitochondrial content mixing and mobility to ensure proper function and localization of mitochondria at intracellular sites of high-energy demand. Intriguingly, for yet unknown reasons, severe impairment of mitochondrial fusion drastically affects mtDNA copy number. To decipher the link between mitochondrial dynamics and mtDNA maintenance, we studied mouse embryonic fibroblasts (MEFs) and mouse cardiomyocytes with disruption of mitochondrial fusion. Super-resolution microscopy revealed that loss of outer mitochondrial membrane (OMM) fusion, but not inner mitochondrial membrane (IMM) fusion, leads to nucleoid clustering. Remarkably, fluorescence in situ hybridization (FISH), bromouridine labeling in MEFs and assessment of mitochondrial transcription in tissue homogenates revealed that abolished OMM fusion does not affect transcription. Furthermore, the profound mtDNA depletion in mouse hearts lacking OMM fusion is not caused by defective integrity or increased mutagenesis of mtDNA, but instead we show that mitochondrial fusion is necessary to maintain the stoichiometry of the protein components of the mtDNA replisome. OMM fusion is necessary for proliferating MEFs to recover from mtDNA depletion and for the marked increase of mtDNA copy number during postnatal heart development. Our findings thus link OMM fusion to replication and distribution of mtDNA.

Journal ArticleDOI
TL;DR: An unexpectedly low level of differentiation is revealed between a pair of sex chromosomes harboring an old MSD gene in a wild teleost fish population, and this study highlights both the pivotal role of genes from the amh pathway in sex determination, as well as the importance of gene duplication as a mechanism driving the turnover of sex chromosome in this clade.
Abstract: Teleost fishes, thanks to their rapid evolution of sex determination mechanisms, provide remarkable opportunities to study the formation of sex chromosomes and the mechanisms driving the birth of new master sex determining (MSD) genes. However, the evolutionary interplay between the sex chromosomes and the MSD genes they harbor is rather unexplored. We characterized a male-specific duplicate of the anti-Mullerian hormone (amh) as the MSD gene in Northern Pike (Esox lucius), using genomic and expression evidence as well as by loss-of-function and gain-of-function experiments. Using RAD-Sequencing from a family panel, we identified Linkage Group (LG) 24 as the sex chromosome and positioned the sex locus in its sub-telomeric region. Furthermore, we demonstrated that this MSD originated from an ancient duplication of the autosomal amh gene, which was subsequently translocated to LG24. Using sex-specific pooled genome sequencing and a new male genome sequence assembled using Nanopore long reads, we also characterized the differentiation of the X and Y chromosomes, revealing a small male-specific insertion containing the MSD gene and a limited region with reduced recombination. Our study reveals an unexpectedly low level of differentiation between a pair of sex chromosomes harboring an old MSD gene in a wild teleost fish population, and highlights both the pivotal role of genes from the amh pathway in sex determination, as well as the importance of gene duplication as a mechanism driving the turnover of sex chromosomes in this clade.

Journal ArticleDOI
TL;DR: OsFIP directly mediates the m6A methylation of a set of threonine protease and NTPase mRNAs and is essential for their expression and/or splicing, which in turn regulates the progress of sporogenesis.
Abstract: N6-Methyladenosine (m6A) RNA methylation plays important roles during development in different species. However, knowledge of m6A RNA methylation in monocots remains limited. In this study, we reported that OsFIP and OsMTA2 are the components of m6A RNA methyltransferase complex in rice and uncovered a previously unknown function of m6A RNA methylation in regulation of plant sporogenesis. Importantly, OsFIP is essential for rice male gametogenesis. Knocking out of OsFIP results in early degeneration of microspores at the vacuolated pollen stage and simultaneously causes abnormal meiosis in prophase I. We further analyzed the profile of rice m6A modification during sporogenesis in both WT and OsFIP loss-of-function plants, and identified a rice panicle specific m6A modification motif "UGWAMH". Interestingly, we found that OsFIP directly mediates the m6A methylation of a set of threonine protease and NTPase mRNAs and is essential for their expression and/or splicing, which in turn regulates the progress of sporogenesis. Our findings revealed for the first time that OsFIP plays an indispensable role in plant early sporogenesis. This study also provides evidence for the different functions of the m6A RNA methyltransferase complex between rice and Arabidopsis.

Journal ArticleDOI
TL;DR: This study demonstrates that HCV infection induces genome-wide epigenetic changes by performing chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) for histone post-translational modifications that are epigenetic markers for active and repressed chromatin.
Abstract: The increasing worldwide prevalence of Hepatocellular carcinoma (HCC), characterized by resistance to conventional chemotherapy, poor prognosis and eventually mortality, place it as a prime target for new modes of prevention and treatment. Hepatitis C Virus (HCV) is the predominant risk factor for HCC in the US and Europe. Multiple epidemiological studies showed that sustained virological responses (SVR) following treatment with the powerful direct acting antivirals (DAAs), which have replaced interferon-based regimes, do not eliminate tumor development. We aimed to identify an HCV-specific pathogenic mechanism that persists post SVR following DAAs treatment. We demonstrate that HCV infection induces genome-wide epigenetic changes by performing chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) for histone post-translational modifications that are epigenetic markers for active and repressed chromatin. The changes in histone modifications correlate with reprogramed host gene expression and alter signaling pathways known to be associated with HCV life cycle and HCC. These epigenetic alterations require the presence of HCV RNA or/and expression of the viral proteins in the cells. Importantly, the epigenetic changes induced following infection persist as an "epigenetic signature" after virus eradication by DAAs treatment, as detected using in vitro HCV infection models. These observations led to the identification of an 8 gene signature that is associated with HCC development and demonstrate persistent epigenetic alterations in HCV infected and post SVR liver biopsy samples. The epigenetic signature was reverted in vitro by drugs that inhibit epigenetic modifying enzyme and by the EGFR inhibitor, Erlotinib. This epigenetic “scarring” of the genome, persisting following HCV eradication, suggest a novel mechanism for the persistent pathogenesis of HCV after its eradication by DAAs. Our study offers new avenues for prevention of the persistent oncogenic effects of chronic hepatitis infections using specific drugs to revert the epigenetic changes to the genome.

Journal ArticleDOI
TL;DR: Sex differences exist in the effects of obesity traits on risk of type 2 diabetes, chronic obstructive pulmonary disease, and renal failure, which may have downstream implications for public health.
Abstract: Obesity traits are causally implicated with risk of cardiometabolic diseases. It remains unclear whether there are similar causal effects of obesity traits on other non-communicable diseases. Also, it is largely unexplored whether there are any sex-specific differences in the causal effects of obesity traits on cardiometabolic diseases and other leading causes of death. We constructed sex-specific genetic risk scores (GRS) for three obesity traits; body mass index (BMI), waist-hip ratio (WHR), and WHR adjusted for BMI, including 565, 324, and 337 genetic variants, respectively. These GRSs were then used as instrumental variables to assess associations between the obesity traits and leading causes of mortality in the UK Biobank using Mendelian randomization. We also investigated associations with potential mediators, including smoking, glycemic and blood pressure traits. Sex-differences were subsequently assessed by Cochran's Q-test (Phet). A Mendelian randomization analysis of 228,466 women and 195,041 men showed that obesity causes coronary artery disease, stroke (particularly ischemic), chronic obstructive pulmonary disease, lung cancer, type 2 and 1 diabetes mellitus, non-alcoholic fatty liver disease, chronic liver disease, and acute and chronic renal failure. Higher BMI led to higher risk of type 2 diabetes in women than in men (Phet = 1.4×10-5). Waist-hip-ratio led to a higher risk of chronic obstructive pulmonary disease (Phet = 3.7×10-6) and higher risk of chronic renal failure (Phet = 1.0×10-4) in men than women. Obesity traits have an etiological role in the majority of the leading global causes of death. Sex differences exist in the effects of obesity traits on risk of type 2 diabetes, chronic obstructive pulmonary disease, and renal failure, which may have downstream implications for public health.

Journal ArticleDOI
TL;DR: The transcriptional programs that are essential for neural stem and progenitor cell (NSPC) function are investigated and a program of autophagy genes under the control of the transcription factor FOXO3 is uncovered.
Abstract: Maintenance of a healthy proteome is essential for cellular homeostasis and loss of proteostasis is associated with tissue dysfunction and neurodegenerative disease. The mechanisms that support proteostasis in healthy cells and how they become defective during aging or in disease states are not fully understood. Here, we investigate the transcriptional programs that are essential for neural stem and progenitor cell (NSPC) function and uncover a program of autophagy genes under the control of the transcription factor FOXO3. Using genomic approaches, we observe that FOXO3 directly binds a network of target genes in adult NSPCs that are involved in autophagy, and find that FOXO3 functionally regulates induction of autophagy in these cells. Interestingly, in the absence of FOXO activity, aggregates accumulate in NSPCs, and this effect is reversed by TOR (target of rapamycin) inhibition. Surprisingly, enhancing FOXO3 causes nucleation of protein aggregates, but does not increase their degradation. The work presented here identifies a genomic network under the direct control of a key transcriptional regulator of aging that is critical for maintaining a healthy mammalian stem cell pool to support lifelong neurogenesis.

Journal ArticleDOI
TL;DR: Current advents of high-resolution microscopy, single-molecule analysis and molecular structure determination of the components are expected to reveal the total structure and function of the bacterial nucleoid.
Abstract: How genomes are organized within cells and how the 3D architecture of a genome influences cellular functions are significant questions in biology. A bacterial genomic DNA resides inside cells in a highly condensed and functionally organized form called nucleoid (nucleus-like structure without a nuclear membrane). The Escherichia coli chromosome or nucleoid is composed of the genomic DNA, RNA, and protein. The nucleoid forms by condensation and functional arrangement of a single chromosomal DNA with the help of chromosomal architectural proteins and RNA molecules as well as DNA supercoiling. Although a high-resolution structure of a bacterial nucleoid is yet to come, five decades of research has established the following salient features of the E. coli nucleoid elaborated below: 1) The chromosomal DNA is on the average a negatively supercoiled molecule that is folded as plectonemic loops, which are confined into many independent topological domains due to supercoiling diffusion barriers; 2) The loops spatially organize into megabase size regions called macrodomains, which are defined by more frequent physical interactions among DNA sites within the same macrodomain than between different macrodomains; 3) The condensed and spatially organized DNA takes the form of a helical ellipsoid radially confined in the cell; and 4) The DNA in the chromosome appears to have a condition-dependent 3-D structure that is linked to gene expression so that the nucleoid architecture and gene transcription are tightly interdependent, influencing each other reciprocally. Current advents of high-resolution microscopy, single-molecule analysis and molecular structure determination of the components are expected to reveal the total structure and function of the bacterial nucleoid.

Journal ArticleDOI
TL;DR: New insights are provided into the structural impact of inserting foreign fragments into plant genomes and the utility of state-of-the-art long-range sequencing technologies to rapidly identify unanticipated genomic changes are demonstrated.
Abstract: The bacterium Agrobacterium tumefaciens has been the workhorse in plant genome engineering. Customized replacement of native tumor-inducing (Ti) plasmid elements enabled insertion of a sequence of interest called Transfer-DNA (T-DNA) into any plant genome. Although these transfer mechanisms are well understood, detailed understanding of structure and epigenomic status of insertion events was limited by current technologies. Here we applied two single-molecule technologies and analyzed Arabidopsis thaliana lines from three widely used T-DNA insertion collections (SALK, SAIL and WISC). Optical maps for four randomly selected T-DNA lines revealed between one and seven insertions/rearrangements, and the length of individual insertions from 27 to 236 kilobases. De novo nanopore sequencing-based assemblies for two segregating lines partially resolved T-DNA structures and revealed multiple translocations and exchange of chromosome arm ends. For the current TAIR10 reference genome, nanopore contigs corrected 83% of non-centromeric misassemblies. The unprecedented contiguous nucleotide-level resolution enabled an in-depth study of the epigenome at T-DNA insertion sites. SALK_059379 line T-DNA insertions were enriched for 24nt small interfering RNAs (siRNA) and dense cytosine DNA methylation, resulting in transgene silencing via the RNA-directed DNA methylation pathway. In contrast, SAIL_232 line T-DNA insertions are predominantly targeted by 21/22nt siRNAs, with DNA methylation and silencing limited to a reporter, but not the resistance gene. Additionally, we profiled the H3K4me3, H3K27me3 and H2A.Z chromatin environments around T-DNA insertions using ChIP-seq in SALK_059379, SAIL_232 and five additional T-DNA lines. We discovered various effect s ranging from complete loss of chromatin marks to the de novo incorporation of H2A.Z and trimethylation of H3K4 and H3K27 around the T-DNA integration sites. This study provides new insights into the structural impact of inserting foreign fragments into plant genomes and demonstrates the utility of state-of-the-art long-range sequencing technologies to rapidly identify unanticipated genomic changes.

Journal ArticleDOI
TL;DR: While repeated releases of high-ratio (10:1) non-drive constructs could achieve full introduction, results from the 1:1 release ratios across all experimental designs favor the use of gene drive, both for efficiency and anticipated cost of the control programs.
Abstract: Small laboratory cage trials of non-drive and gene-drive strains of the Asian malaria vector mosquito, Anopheles stephensi, were used to investigate release ratios and other strain properties for their impact on transgene spread during simulated population modification. We evaluated the effects of transgenes on survival, male contributions to next-generation populations, female reproductive success and the impact of accumulation of gene drive-resistant genomic target sites resulting from nonhomologous end-joining (NHEJ) mutagenesis during Cas9, guide RNA-mediated cleavage. Experiments with a non-drive, autosomally-linked malaria-resistance gene cassette showed 'full introduction' (100% of the insects have at least one copy of the transgene) within 8 weeks (≤ 3 generations) following weekly releases of 10:1 transgenic:wild-type males in an overlapping generation trial design. Male release ratios of 1:1 resulted in cages where mosquitoes with at least one copy of the transgene fluctuated around 50%. In comparison, two of three cages in which the malaria-resistance genes were linked to a gene-drive system in an overlapping generation, single 1:1 release reached full introduction in 6-8 generations with a third cage at ~80% within the same time. Release ratios of 0.1:1 failed to establish the transgenes. A non-overlapping generation, single-release trial of the same gene-drive strain resulted in two of three cages reaching 100% introduction within 6-12 generations following a 1:1 transgenic:wild-type male release. Two of three cages with 0.33:1 transgenic:wild-type male single releases achieved full introduction in 13-16 generations. All populations exhibiting full introduction went extinct within three generations due to a significant load on females having disruptions of both copies of the target gene, kynurenine hydroxylase. While repeated releases of high-ratio (10:1) non-drive constructs could achieve full introduction, results from the 1:1 release ratios across all experimental designs favor the use of gene drive, both for efficiency and anticipated cost of the control programs.

Journal ArticleDOI
TL;DR: D-PLA is an anti-bacterial metabolite found in high concentrations in LAB-fermented food such as Sauerkraut and it is demonstrated that D-PLA from such alimentary sources is well absorbed from the human gut leading to high plasma and urine levels and triggers pertussis toxin-sensitive migration of primary human monocytes in an HCA3-dependent manner.
Abstract: The interplay of microbiota and the human host is physiologically crucial in health and diseases. The beneficial effects of lactic acid bacteria (LAB), permanently colonizing the human intestine or transiently obtained from food, have been extensively reported. However, the molecular understanding of how LAB modulate human physiology is still limited. G protein-coupled receptors for hydroxycarboxylic acids (HCAR) are regulators of immune functions and energy homeostasis under changing metabolic and dietary conditions. Most mammals have two HCAR (HCA1, HCA2) but humans and other hominids contain a third member (HCA3) in their genomes. A plausible hypothesis why HCA3 function was advantageous in hominid evolution was lacking. Here, we used a combination of evolutionary, analytical and functional methods to unravel the role of HCA3 in vitro and in vivo. The functional studies included different pharmacological assays, analyses of human monocytes and pharmacokinetic measurements in human. We report the discovery of the interaction of D-phenyllactic acid (D-PLA) and the human host through highly potent activation of HCA3. D-PLA is an anti-bacterial metabolite found in high concentrations in LAB-fermented food such as Sauerkraut. We demonstrate that D-PLA from such alimentary sources is well absorbed from the human gut leading to high plasma and urine levels and triggers pertussis toxin-sensitive migration of primary human monocytes in an HCA3-dependent manner. We provide evolutionary, analytical and functional evidence supporting the hypothesis that HCA3 was consolidated in hominids as a new signaling system for LAB-derived metabolites.

Journal ArticleDOI
TL;DR: The heritability of thinness is characterised which is comparable to that of severe obesity, although with incomplete genetic overlap, and the ability to replicate loci found with much larger sample sizes demonstrates the value of clinical extremes and suggests that characterisation of the genetics ofthinness may provide a more nuanced understanding of the genetic architecture of body weight regulation.
Abstract: The variation in weight within a shared environment is largely attributable to genetic factors. Whilst many genes/loci confer susceptibility to obesity, little is known about the genetic architecture of healthy thinness. Here, we characterise the heritability of thinness which we found was comparable to that of severe obesity (h2 = 28.07 vs 32.33% respectively), although with incomplete genetic overlap (r = -0.49, 95% CI [-0.17, -0.82], p = 0.003). In a genome-wide association analysis of thinness (n = 1,471) vs severe obesity (n = 1,456), we identified 10 loci previously associated with obesity, and demonstrate enrichment for established BMI-associated loci (pbinomial = 3.05x10-5). Simulation analyses showed that different association results between the extremes were likely in agreement with additive effects across the BMI distribution, suggesting different effects on thinness and obesity could be due to their different degrees of extremeness. In further analyses, we detected a novel obesity and BMI-associated locus at PKHD1 (rs2784243, obese vs. thin p = 5.99x10-6, obese vs. controls p = 2.13x10-6 pBMI = 2.3x10-13), associations at loci recently discovered with much larger sample sizes (e.g. FAM150B and PRDM6-CEP120), and novel variants driving associations at previously established signals (e.g. rs205262 at the SNRPC/C6orf106 locus and rs112446794 at the PRDM6-CEP120 locus). Our ability to replicate loci found with much larger sample sizes demonstrates the value of clinical extremes and suggest that characterisation of the genetics of thinness may provide a more nuanced understanding of the genetic architecture of body weight regulation and may inform the identification of potential anti-obesity targets.

Journal ArticleDOI
TL;DR: A novel mechanism by which OBP 4 antagonistically controls XTH9 expression and the OBP4-XTH9 module elaborately sustains LR development in response to nitrate treatment is proposed.
Abstract: Plant root system architecture in response to nitrate availability represents a notable example to study developmental plasticity, but the underlying mechanism remains largely unknown. Xyloglucan endotransglucosylases (XTHs) play a critical role in cell wall biosynthesis. Here we assessed the gene expression of XTH1-11 belonging to group I of XTHs in lateral root (LR) primordia and found that XTH9 was highly expressed. Correspondingly, an xth9 mutant displayed less LR, while overexpressing XTH9 presented more LR, suggesting the potential function of XTH9 in controlling LR development. XTH9 gene mutation obviously alters the properties of the cell wall. Furthermore, nitrogen signals stimulated the expression of XTH9 to promote LRs. Genetic analysis revealed that the function of XTH9 was dependent on auxin-mediated ARF7/19 and downstream AFB3 in response to nitrogen signals. In addition, we identified another transcription factor, OBP4, that was also induced by nitrogen treatment, but the induction was much slower than that of XTH9. In contrast to XTH9, overexpressing OBP4 caused fewer LRs while OBP4 knockdown with OBP4-RNAi or an artificial miRNA silenced amiOBP4 line produced more LR. We further found OBP4 bound to the promoter of XTH9 to suppress XTH9 expression. In agreement with this, both OBP4-RNAi and crossed OBP4-RNAi & 35S::XTH9 lines led to more LR, but OBP4-RNAi & xth9 produced less LR, similar to xth9. Based on these findings we propose a novel mechanism by which OBP4 antagonistically controls XTH9 expression and the OBP4-XTH9 module elaborately sustains LR development in response to nitrate treatment.

Journal ArticleDOI
TL;DR: A mechanism whereby an lncRNA works at different cellular compartments to regulate the pro-apoptotic gene PYCard at both the epigenetic and translational levels, contributing to the PYCARD-regulated apoptosis is revealed and new light is shed on the role of discretely distributed lncRNAs in diverse biological processes.
Abstract: Long noncoding RNAs (lncRNAs) participate in various biological processes such as apoptosis. The function of lncRNAs is closely correlated with their localization within the cell. While regulatory potential of many lncRNAs has been revealed at specific subcellular location, the biological significance of discrete distribution of an lncRNA in different cellular compartments remains largely unexplored. Here, we identified an lncRNA antisense to the pro-apoptotic gene PYCARD, named PYCARD-AS1, which exhibits a dual nuclear and cytoplasmic distribution and is required for the PYCARD silencing in breast cancer cells. The PYCARD-regulated apoptosis is controlled by PYCARD-AS1; moreover, PYCARD-AS1 regulates apoptosis in a PYCARD-dependent manner, indicating that PYCARD is a critical downstream target of PYCARD-AS1. Mechanistically, PYCARD-AS1 can localize to the PYCARD promoter, where it facilitates DNA methylation and H3K9me2 modification by recruiting the chromatin-suppressor proteins DNMT1 and G9a. Moreover, PYCARD-AS1 and PYCARD mRNA can interact with each other via their 5′ overlapping region, leading to inhibition of ribosome assembly in the cytoplasm for PYCARD translation. This study reveals a mechanism whereby an lncRNA works at different cellular compartments to regulate the pro-apoptotic gene PYCARD at both the epigenetic and translational levels, contributing to the PYCARD-regulated apoptosis, and also sheds new light on the role of discretely distributed lncRNAs in diverse biological processes.

Journal ArticleDOI
TL;DR: It is indicated that dispensable mini-chromosomes and core chromosomes undergo divergent evolutionary trajectories, and mini- Chromosome and core chromosome ends are coupled as a mobile, fast-evolving effector compartment in the wheat pathogen genome.
Abstract: Newly emerged wheat blast disease is a serious threat to global wheat production. Wheat blast is caused by a distinct, exceptionally diverse lineage of the fungus causing rice blast disease. Through sequencing a recent field isolate, we report a reference genome that includes seven core chromosomes and mini-chromosome sequences that harbor effector genes normally found on ends of core chromosomes in other strains. No mini-chromosomes were observed in an early field strain, and at least two from another isolate each contain different effector genes and core chromosome end sequences. The mini-chromosome is enriched in transposons occurring most frequently at core chromosome ends. Additionally, transposons in mini-chromosomes lack the characteristic signature for inactivation by repeat-induced point (RIP) mutation genome defenses. Our results, collectively, indicate that dispensable mini-chromosomes and core chromosomes undergo divergent evolutionary trajectories, and mini-chromosomes and core chromosome ends are coupled as a mobile, fast-evolving effector compartment in the wheat pathogen genome.

Journal ArticleDOI
TL;DR: Novel SNPs and genes associated with epigenetic age acceleration are identified, differences in the genetic architecture of Horvath-based and Hannum-based epigenetic ageing measures are highlighted, and different trajectories of age-related decline are highlighted.
Abstract: 'Epigenetic age acceleration' is a valuable biomarker of ageing, predictive of morbidity and mortality, but for which the underlying biological mechanisms are not well established. Two commonly used measures, derived from DNA methylation, are Horvath-based (Horvath-EAA) and Hannum-based (Hannum-EAA) epigenetic age acceleration. We conducted genome-wide association studies of Horvath-EAA and Hannum-EAA in 13,493 unrelated individuals of European ancestry, to elucidate genetic determinants of differential epigenetic ageing. We identified ten independent SNPs associated with Horvath-EAA, five of which are novel. We also report 21 Horvath-EAA-associated genes including several involved in metabolism (NHLRC, TPMT) and immune system pathways (TRIM59, EDARADD). GWAS of Hannum-EAA identified one associated variant (rs1005277), and implicated 12 genes including several involved in innate immune system pathways (UBE2D3, MANBA, TRIM46), with metabolic functions (UBE2D3, MANBA), or linked to lifespan regulation (CISD2). Both measures had nominal inverse genetic correlations with father’s age at death, a rough proxy for lifespan. Nominally significant genetic correlations between Hannum-EAA and lifestyle factors including smoking behaviours and education support the hypothesis that Hannum-based epigenetic ageing is sensitive to variations in environment, whereas Horvath-EAA is a more stable cellular ageing process. We identified novel SNPs and genes associated with epigenetic age acceleration, and highlighted differences in the genetic architecture of Horvath-based and Hannum-based epigenetic ageing measures. Understanding the biological mechanisms underlying individual differences in the rate of epigenetic ageing could help explain different trajectories of age-related decline.