scispace - formally typeset
Search or ask a question

Showing papers in "PLOS Genetics in 2021"


Journal ArticleDOI
TL;DR: In this paper, the authors evaluated the predictive utility of polygenic scoring methods within a reference-standardized framework, which uses a common set of variants and reference-based estimates of linkage disequilibrium and allele frequencies to construct scores.
Abstract: The predictive utility of polygenic scores is increasing, and many polygenic scoring methods are available, but it is unclear which method performs best. This study evaluates the predictive utility of polygenic scoring methods within a reference-standardized framework, which uses a common set of variants and reference-based estimates of linkage disequilibrium and allele frequencies to construct scores. Eight polygenic score methods were tested: p-value thresholding and clumping (pT+clump), SBLUP, lassosum, LDpred1, LDpred2, PRScs, DBSLMM and SBayesR, evaluating their performance to predict outcomes in UK Biobank and the Twins Early Development Study (TEDS). Strategies to identify optimal p-value thresholds and shrinkage parameters were compared, including 10-fold cross validation, pseudovalidation and infinitesimal models (with no validation sample), and multi-polygenic score elastic net models. LDpred2, lassosum and PRScs performed strongly using 10-fold cross-validation to identify the most predictive p-value threshold or shrinkage parameter, giving a relative improvement of 16-18% over pT+clump in the correlation between observed and predicted outcome values. Using pseudovalidation, the best methods were PRScs, DBSLMM and SBayesR. PRScs pseudovalidation was only 3% worse than the best polygenic score identified by 10-fold cross validation. Elastic net models containing polygenic scores based on a range of parameters consistently improved prediction over any single polygenic score. Within a reference-standardized framework, the best polygenic prediction was achieved using LDpred2, lassosum and PRScs, modeling multiple polygenic scores derived using multiple parameters. This study will help researchers performing polygenic score studies to select the most powerful and predictive analysis methods.

73 citations


Journal ArticleDOI
TL;DR: In this article, the Sum of Single Effects (SuSiE) regression framework was used for fine-mapping genetic signals for use with coloc, which can be used to optimise accuracy of colocalization analyses when multiple causal variants exist.
Abstract: In genome-wide association studies (GWAS) it is now common to search for, and find, multiple causal variants located in close proximity. It has also become standard to ask whether different traits share the same causal variants, but one of the popular methods to answer this question, coloc, makes the simplifying assumption that only a single causal variant exists for any given trait in any genomic region. Here, we examine the potential of the recently proposed Sum of Single Effects (SuSiE) regression framework, which can be used for fine-mapping genetic signals, for use with coloc. SuSiE is a novel approach that allows evidence for association at multiple causal variants to be evaluated simultaneously, whilst separating the statistical support for each variant conditional on the causal signal being considered. We show this results in more accurate coloc inference than other proposals to adapt coloc for multiple causal variants based on conditioning. We therefore recommend that coloc be used in combination with SuSiE to optimise accuracy of colocalisation analyses when multiple causal variants exist.

63 citations


Journal ArticleDOI
TL;DR: In this paper, the authors show that regulation of nuclease expression is as important as the choice of target site when developing a robust homing-based gene drive for population suppression.
Abstract: Homing-based gene drives use a germline source of nuclease to copy themselves at specific target sites in a genome and bias their inheritance. Such gene drives can be designed to spread and deliberately suppress populations of malaria mosquitoes by impairing female fertility. However, strong unintended fitness costs of the drive and a propensity to generate resistant mutations can limit a gene drive's potential to spread. Alternative germline regulatory sequences in the drive element confer improved fecundity of carrier individuals and reduced propensity for target site resistance. This is explained by reduced rates of end-joining repair of DNA breaks from parentally deposited nuclease in the embryo, which can produce heritable mutations that reduce gene drive penetrance. We tracked the generation and selection of resistant mutations over the course of a gene drive invasion of a population. Improved gene drives show faster invasion dynamics, increased suppressive effect and later onset of target site resistance. Our results show that regulation of nuclease expression is as important as the choice of target site when developing a robust homing-based gene drive for population suppression.

53 citations


Journal ArticleDOI
Xingxing Li1, Bo Yu1, Qi Wu1, Qian Min1, Rongfeng Zeng1, Zizhao Xie1, Junli Huang1 
TL;DR: In this article, a stress-responsive MADS-box TF OsMADS23 from rice conferring the osmotic stress tolerance in plants was reported, and it was found that the OsNCED2-knockout mutants had lower ABA levels and exhibited higher sensitivity to drought and oxidative stress than wild type.
Abstract: Some of MADS-box transcription factors (TFs) have been shown to play essential roles in the adaptation of plant to abiotic stress. Still, the mechanisms that MADS-box proteins regulate plant stress response are not fully understood. Here, a stress-responsive MADS-box TF OsMADS23 from rice conferring the osmotic stress tolerance in plants is reported. Overexpression of OsMADS23 remarkably enhanced, but knockout of the gene greatly reduced the drought and salt tolerance in rice plants. Further, OsMADS23 was shown to promote the biosynthesis of endogenous ABA and proline by activating the transcription of target genes OsNCED2, OsNCED3, OsNCED4 and OsP5CR that are key components for ABA and proline biosynthesis, respectively. Then, the convincing evidence showed that the OsNCED2-knockout mutants had lower ABA levels and exhibited higher sensitivity to drought and oxidative stress than wild type, which is similar to osmads23 mutant. Interestingly, the SnRK2-type protein kinase SAPK9 was found to physically interact with and phosphorylate OsMADS23, and thus increase its stability and transcriptional activity. Furthermore, the activation of OsMADS23 by SAPK9-mediated phosphorylation is dependent on ABA in plants. Collectively, these findings establish a mechanism that OsMADS23 functions as a positive regulator in response to osmotic stress by regulating ABA biosynthesis, and provide a new strategy for improving drought and salt tolerance in rice.

48 citations


Journal ArticleDOI
TL;DR: In this article, the authors found that the long day-induced drop in the concentration of the steroid hormone ecdysone is essential for the preparation of photoperiodic reproductive diapause in Colaphellus bowringi, an economically important cabbage beetle.
Abstract: Diapause, a programmed developmental arrest primarily induced by seasonal environmental changes, is very common in the animal kingdom, and found in vertebrates and invertebrates alike. Diapause provides an adaptive advantage to animals, as it increases the odds of surviving adverse conditions. In insects, individuals perceive photoperiodic cues and modify endocrine signaling to direct reproductive diapause traits, such as ovary arrest and increased fat accumulation. However, it remains unclear as to which endocrine factors are involved in this process and how they regulate the onset of reproductive diapause. Here, we found that the long day-mediated drop in the concentration of the steroid hormone ecdysone is essential for the preparation of photoperiodic reproductive diapause in Colaphellus bowringi, an economically important cabbage beetle. The diapause-inducing long-day condition reduced the expression of ecdysone biosynthetic genes, explaining the drop in the titer of 20-hydroxyecdysone (20E, the active form of ecdysone) in female adults. Application of exogenous 20E induced vitellogenesis and ovarian development but reduced fat accumulation in the diapause-destined females. Knocking down the ecdysone receptor (EcR) in females destined for reproduction blocked reproductive development and induced diapause traits. RNA-seq and hormone measurements indicated that 20E stimulates the production of juvenile hormone (JH), a key endocrine factor in reproductive diapause. To verify this, we depleted three ecdysone biosynthetic enzymes via RNAi, which confirmed that 20E is critical for JH biosynthesis and reproductive diapause. Importantly, impairing Met function, a component of the JH intracellular receptor, partially blocked the 20E-regulated reproductive diapause preparation, indicating that 20E regulates reproductive diapause in both JH-dependent and -independent manners. Finally, we found that 20E deficiency decreased ecdysis-triggering hormone signaling and reduced JH production, thereby inducing diapause. Together, these results suggest that 20E signaling is a pivotal regulator that coordinates reproductive plasticity in response to environmental inputs.

46 citations


Journal ArticleDOI
TL;DR: In this paper, the role of the allelic content in determining the long-term fate of the inversion was quantified and the authors highlighted the dynamic features of inversions by showing how the non-adaptive evolution of allele content can play a major role in the fate of inversion.
Abstract: Chromosomal inversions contribute widely to adaptation and speciation, yet they present a unique evolutionary puzzle as both their allelic content and frequency evolve in a feedback loop. In this simulation study, we quantified the role of the allelic content in determining the long-term fate of the inversion. Recessive deleterious mutations accumulated on both arrangements with most of them being private to a given arrangement. This led to increasing overdominance, allowing for the maintenance of the inversion polymorphism and generating strong non-adaptive divergence between arrangements. The accumulation of mutations was mitigated by gene conversion but nevertheless led to the fitness decline of at least one homokaryotype under all considered conditions. Surprisingly, this fitness degradation could be permanently halted by the branching of an arrangement into multiple highly divergent haplotypes. Our results highlight the dynamic features of inversions by showing how the non-adaptive evolution of allelic content can play a major role in the fate of the inversion.

45 citations


Journal ArticleDOI
TL;DR: In this paper, deep generative adversarial networks (GANs) and restricted Boltzmann machines (RBMs) are used to learn the complex distributions of real genomic datasets and generate novel high-quality artificial genomes (AGs) with none to little privacy loss.
Abstract: Generative models have shown breakthroughs in a wide spectrum of domains due to recent advancements in machine learning algorithms and increased computational power. Despite these impressive achievements, the ability of generative models to create realistic synthetic data is still under-exploited in genetics and absent from population genetics. Yet a known limitation in the field is the reduced access to many genetic databases due to concerns about violations of individual privacy, although they would provide a rich resource for data mining and integration towards advancing genetic studies. In this study, we demonstrated that deep generative adversarial networks (GANs) and restricted Boltzmann machines (RBMs) can be trained to learn the complex distributions of real genomic datasets and generate novel high-quality artificial genomes (AGs) with none to little privacy loss. We show that our generated AGs replicate characteristics of the source dataset such as allele frequencies, linkage disequilibrium, pairwise haplotype distances and population structure. Moreover, they can also inherit complex features such as signals of selection. To illustrate the promising outcomes of our method, we showed that imputation quality for low frequency alleles can be improved by data augmentation to reference panels with AGs and that the RBM latent space provides a relevant encoding of the data, hence allowing further exploration of the reference dataset and features for solving supervised tasks. Generative models and AGs have the potential to become valuable assets in genetic studies by providing a rich yet compact representation of existing genomes and high-quality, easy-access and anonymous alternatives for private databases.

44 citations


Journal ArticleDOI
TL;DR: Sigminer as mentioned in this paper is a user-friendly open-source bioinformatics tool for copy number signature extraction, analysis, and visualization, which has been applied in prostate cancer (PC), which is particularly driven by complex genome alterations.
Abstract: Genome alteration signatures reflect recurring patterns caused by distinct endogenous or exogenous mutational events during the evolution of cancer. Signatures of single base substitution (SBS) have been extensively studied in different types of cancer. Copy number alterations are important drivers for the progression of multiple cancer. However, practical tools for studying the signatures of copy number alterations are still lacking. Here, a user-friendly open source bioinformatics tool “sigminer” has been constructed for copy number signature extraction, analysis and visualization. This tool has been applied in prostate cancer (PC), which is particularly driven by complex genome alterations. Five copy number signatures are identified from human PC genome with this tool. The underlying mutational processes for each copy number signature have been illustrated. Sample clustering based on copy number signature exposure reveals considerable heterogeneity of PC, and copy number signatures show improved PC clinical outcome association when compared with SBS signatures. This copy number signature analysis in PC provides distinct insight into the etiology of PC, and potential biomarkers for PC stratification and prognosis.

40 citations


Journal ArticleDOI
TL;DR: In this paper, the authors used genome-wide capture of nuclear gene sequences, plus skimming of organellar sequences, to investigate the phylogenomics of monkeyflowers in Mimulus section Erythranthe (27 accessions from seven species).
Abstract: Inferences about past processes of adaptation and speciation require a gene-scale and genome-wide understanding of the evolutionary history of diverging taxa. In this study, we use genome-wide capture of nuclear gene sequences, plus skimming of organellar sequences, to investigate the phylogenomics of monkeyflowers in Mimulus section Erythranthe (27 accessions from seven species). Taxa within Erythranthe, particularly the parapatric and putatively sister species M. lewisii (bee-pollinated) and M. cardinalis (hummingbird-pollinated), have been a model system for investigating the ecological genetics of speciation and adaptation for over five decades. Across >8000 nuclear loci, multiple methods resolve a predominant species tree in which M. cardinalis groups with other hummingbird-pollinated taxa (37% of gene trees), rather than being sister to M. lewisii (32% of gene trees). We independently corroborate a single evolution of hummingbird pollination syndrome in Erythranthe by demonstrating functional redundancy in genetic complementation tests of floral traits in hybrids; together, these analyses overturn a textbook case of pollination-syndrome convergence. Strong asymmetries in allele sharing (Patterson’s D-statistic and related tests) indicate that gene tree discordance reflects ancient and recent introgression rather than incomplete lineage sorting. Consistent with abundant introgression blurring the history of divergence, low-recombination and adaptation-associated regions support the new species tree, while high-recombination regions generate phylogenetic evidence for sister status for M. lewisii and M. cardinalis. Population-level sampling of core taxa also revealed two instances of chloroplast capture, with Sierran M. lewisii and Southern Californian M. parishii each carrying organelle genomes nested within respective sympatric M. cardinalis clades. A recent organellar transfer from M. cardinalis, an outcrosser where selfish cytonuclear dynamics are more likely, may account for the unexpected cytoplasmic male sterility effects of selfer M. parishii organelles in hybrids with M. lewisii. Overall, our phylogenomic results reveal extensive reticulation throughout the evolutionary history of a classic monkeyflower radiation, suggesting that natural selection (re-)assembles and maintains species-diagnostic traits and barriers in the face of gene flow. Our findings further underline the challenges, even in reproductively isolated species, in distinguishing re-use of adaptive alleles from true convergence and emphasize the value of a phylogenomic framework for reconstructing the evolutionary genetics of adaptation and speciation.

40 citations


Journal ArticleDOI
TL;DR: This article analyzed the behavior of these estimators in the presence of arbitrarily-complex population structures, which results in an improved estimation framework specifically designed for arbitrary population structures and established a framework for assessing bias and consistency of genome-wide estimators, characterizing biases and estimation challenges unobserved under their originally-assumed models of structure.
Abstract: FST and kinship are key parameters often estimated in modern population genetics studies in order to quantitatively characterize structure and relatedness. Kinship matrices have also become a fundamental quantity used in genome-wide association studies and heritability estimation. The most frequently-used estimators of FST and kinship are method-of-moments estimators whose accuracies depend strongly on the existence of simple underlying forms of structure, such as the independent subpopulations model of non-overlapping, independently evolving subpopulations. However, modern data sets have revealed that these simple models of structure likely do not hold in many populations, including humans. In this work, we analyze the behavior of these estimators in the presence of arbitrarily-complex population structures, which results in an improved estimation framework specifically designed for arbitrary population structures. After generalizing the definition of FST to arbitrary population structures and establishing a framework for assessing bias and consistency of genome-wide estimators, we calculate the accuracy of existing FST and kinship estimators under arbitrary population structures, characterizing biases and estimation challenges unobserved under their originally-assumed models of structure. We then present our new approach, which consistently estimates kinship and FST when the minimum kinship value in the dataset is estimated consistently. We illustrate our results using simulated genotypes from an admixture model, constructing a one-dimensional geographic scenario that departs nontrivially from the independent subpopulations model. Our simulations reveal the potential for severe biases in estimates of existing approaches that are overcome by our new framework. This work may significantly improve future analyses that rely on accurate kinship and FST estimates.

39 citations


Journal ArticleDOI
TL;DR: The first large-scale genome-wide association study of inner retinal morphology using phenotypes derived from OCT images of 31,434 UK Biobank participants was presented in this paper.
Abstract: Optical Coherence Tomography (OCT) enables non-invasive imaging of the retina and is used to diagnose and manage ophthalmic diseases including glaucoma. We present the first large-scale genome-wide association study of inner retinal morphology using phenotypes derived from OCT images of 31,434 UK Biobank participants. We identify 46 loci associated with thickness of the retinal nerve fibre layer or ganglion cell inner plexiform layer. Only one of these loci has been associated with glaucoma, and despite its clear role as a biomarker for the disease, Mendelian randomisation does not support inner retinal thickness being on the same genetic causal pathway as glaucoma. We extracted overall retinal thickness at the fovea, representative of foveal hypoplasia, with which three of the 46 SNPs were associated. We additionally associate these three loci with visual acuity. In contrast to the Mendelian causes of severe foveal hypoplasia, our results suggest a spectrum of foveal hypoplasia, in part genetically determined, with consequences on visual function.

Journal ArticleDOI
TL;DR: In this paper, the role of Runx1 in different cell populations with regards to BMP and Wnt signaling pathways and in the interacting network underlying bone homeostasis as well as adipogenesis was investigated.
Abstract: Runx1 is highly expressed in osteoblasts, however, its function in osteogenesis is unclear. We generated mesenchymal progenitor-specific (Runx1f/fTwist2-Cre) and osteoblast-specific (Runx1f/fCol1α1-Cre) conditional knockout (Runx1 CKO) mice. The mutant CKO mice with normal skeletal development displayed a severe osteoporosis phenotype at postnatal and adult stages. Runx1 CKO resulted in decreased osteogenesis and increased adipogenesis. RNA-sequencing analysis, Western blot, and qPCR validation of Runx1 CKO samples showed that Runx1 regulates BMP signaling pathway and Wnt/β-catenin signaling pathway. ChIP assay revealed direct binding of Runx1 to the promoter regions of Bmp7, Alk3, and Atf4, and promoter mapping demonstrated that Runx1 upregulates their promoter activity through the binding regions. Bmp7 overexpression rescued Alk3, Runx2, and Atf4 expression in Runx1-deficient BMSCs. Runx2 expression was decreased while Runx1 was not changed in Alk3 deficient osteoblasts. Atf4 overexpression in Runx1-deficient BMSCs did not rescue expression of Runx1, Bmp7, and Alk3. Smad1/5/8 activity was vitally reduced in Runx1 CKO cells, indicating Runx1 positively regulates the Bmp7/Alk3/Smad1/5/8/Runx2/ATF4 signaling pathway. Notably, Runx1 overexpression in Runx2-/- osteoblasts rescued expression of Atf4, OCN, and ALP to compensate Runx2 function. Runx1 CKO mice at various osteoblast differentiation stages reduced Wnt signaling and caused high expression of C/ebpα and Pparγ and largely increased adipogenesis. Co-culture of Runx1-deficient and wild-type cells demonstrated that Runx1 regulates osteoblast−adipocyte lineage commitment both cell-autonomously and non-autonomously. Notably, Runx1 overexpression rescued bone loss in OVX-induced osteoporosis. This study focused on the role of Runx1 in different cell populations with regards to BMP and Wnt signaling pathways and in the interacting network underlying bone homeostasis as well as adipogenesis, and has provided new insight and advancement of knowledge in skeletal development. Collectively, Runx1 maintains adult bone homeostasis from bone loss though up-regulating Bmp7/Alk3/Smad1/5/8/Runx2/ATF4 and WNT/β-Catenin signaling pathways, and targeting Runx1 potentially leads to novel therapeutics for osteoporosis.

Journal ArticleDOI
TL;DR: In this article, the authors used a proportional hazards model within an interval-based censoring methodology to estimate age-varying individual variant contributions to genetic relative risk for 24 common diseases within the British ancestry subset of UK Biobank, applying a Bayesian clustering approach to group variants by their relative risk profile over age and permutation tests for age dependency and multiplicity of profiles.
Abstract: Inherited genetic variation contributes to individual risk for many complex diseases and is increasingly being used for predictive patient stratification. Previous work has shown that genetic factors are not equally relevant to human traits across age and other contexts, though the reasons for such variation are not clear. Here, we introduce methods to infer the form of the longitudinal relationship between genetic relative risk for disease and age and to test whether all genetic risk factors behave similarly. We use a proportional hazards model within an interval-based censoring methodology to estimate age-varying individual variant contributions to genetic relative risk for 24 common diseases within the British ancestry subset of UK Biobank, applying a Bayesian clustering approach to group variants by their relative risk profile over age and permutation tests for age dependency and multiplicity of profiles. We find evidence for age-varying relative risk profiles in nine diseases, including hypertension, skin cancer, atherosclerotic heart disease, hypothyroidism and calculus of gallbladder, several of which show evidence, albeit weak, for multiple distinct profiles of genetic relative risk. The predominant pattern shows genetic risk factors having the greatest relative impact on risk of early disease, with a monotonic decrease over time, at least for the majority of variants, although the magnitude and form of the decrease varies among diseases. As a consequence, for diseases where genetic relative risk decreases over age, genetic risk factors have stronger explanatory power among younger populations, compared to older ones. We show that these patterns cannot be explained by a simple model involving the presence of unobserved covariates such as environmental factors. We discuss possible models that can explain our observations and the implications for genetic risk prediction.

Journal ArticleDOI
TL;DR: In this paper, the authors used histone modification at H3-Lys27 underlies transcriptional silencing ex planta, and that exchange for an active chemical modification contributes to transcription of in planta induced genes.
Abstract: Transcriptional dynamic in response to environmental and developmental cues are fundamental to biology, yet many mechanistic aspects are poorly understood. One such example is fungal plant pathogens, which use secreted proteins and small molecules, termed effectors, to suppress host immunity and promote colonization. Effectors are highly expressed in planta but remain transcriptionally repressed ex planta, but our mechanistic understanding of these transcriptional dynamics remains limited. We tested the hypothesis that repressive histone modification at H3-Lys27 underlies transcriptional silencing ex planta, and that exchange for an active chemical modification contributes to transcription of in planta induced genes. Using genetics, chromatin immunoprecipitation and sequencing and RNA-sequencing, we determined that H3K27me3 provides significant local transcriptional repression. We detail how regions that lose H3K27me3 gain H3K27ac, and these changes are associated with increased transcription. Importantly, we observed that many in planta induced genes were marked by H3K27me3 during axenic growth, and detail how altered H3K27 modification influences transcription. ChIP-qPCR during in planta growth suggests that H3K27 modifications are generally stable, but can undergo dynamics at specific genomic locations. Our results support the hypothesis that dynamic histone modifications at H3K27 contributes to fungal genome regulation and specifically contributes to regulation of genes important during host infection.

Journal ArticleDOI
TL;DR: In this paper, the authors used a Drosophila model of α-synuclein neurotoxicity with widespread and robust pathology, and found that human α-nuclein expression impairs autophagolysosomal flux in aging adult neurons Genetic destabilization of the actin cytoskeleton rescues F-actin accumulation, promotes autophagosome clearance, normalizes the autophagsome system, and rescues neurotoxicity.
Abstract: Vesicular trafficking defects, particularly those in the autophagolysosomal system, have been strongly implicated in the pathogenesis of Parkinson's disease and related α-synucleinopathies However, mechanisms mediating dysfunction of membrane trafficking remain incompletely understood Using a Drosophila model of α-synuclein neurotoxicity with widespread and robust pathology, we find that human α-synuclein expression impairs autophagic flux in aging adult neurons Genetic destabilization of the actin cytoskeleton rescues F-actin accumulation, promotes autophagosome clearance, normalizes the autophagolysosomal system, and rescues neurotoxicity in α-synuclein transgenic animals through an Arp2/3 dependent mechanism Similarly, mitophagosomes accumulate in human α-synuclein-expressing neurons, and reversal of excessive actin stabilization promotes both clearance of these abnormal mitochondria-containing organelles and rescue of mitochondrial dysfunction These results suggest that Arp2/3 dependent actin cytoskeleton stabilization mediates autophagic and mitophagic dysfunction and implicate failure of autophagosome maturation as a pathological mechanism in Parkinson's disease and related α-synucleinopathies

Journal ArticleDOI
TL;DR: In this paper, the authors uncovered the mechanisms of overexpression of the P450 gene, CYP321A8 in a major pest insect, Spodoptera exigua that is resistant to multiple insecticides.
Abstract: The evolution of insect resistance to insecticides is frequently associated with overexpression of one or more cytochrome P450 enzyme genes. Although overexpression of CYP450 genes is a well-known mechanism of insecticide resistance, the underlying regulatory mechanisms are poorly understood. Here we uncovered the mechanisms of overexpression of the P450 gene, CYP321A8 in a major pest insect, Spodoptera exigua that is resistant to multiple insecticides. CYP321A8 confers resistance to organophosphate (chlorpyrifos) and pyrethroid (cypermethrin and deltamethrin) insecticides in this insect. Constitutive upregulation of transcription factors CncC/Maf are partially responsible for upregulated expression of CYP321A8 in the resistant strain. Reporter gene assays and site-directed mutagenesis analyses demonstrated that CncC/Maf enhanced the expression of CYP321A8 by binding to specific sites in the promoter. Additional cis-regulatory elements resulting from a mutation in the CYP321A8 promoter in the resistant strain facilitates the binding of the orphan nuclear receptor, Knirps, and enhances the promoter activity. These results demonstrate that two independent mechanisms; overexpression of transcription factors and mutations in the promoter region resulting in a new cis-regulatory element that facilitates binding of the orphan nuclear receptor are involved in overexpression of CYP321A8 in insecticide-resistant S. exigua.

Journal ArticleDOI
TL;DR: In this article, the effect of distal-SNPs or other molecular effects underlying the SNP-gene association is considered and an integrative approach to transcriptome-wide imputation and association studies is proposed to detect gene-trait associations.
Abstract: Traditional predictive models for transcriptome-wide association studies (TWAS) consider only single nucleotide polymorphisms (SNPs) local to genes of interest and perform parameter shrinkage with a regularization process. These approaches ignore the effect of distal-SNPs or other molecular effects underlying the SNP-gene association. Here, we outline multi-omics strategies for transcriptome imputation from germline genetics to allow more powerful testing of gene-trait associations by prioritizing distal-SNPs to the gene of interest. In one extension, we identify mediating biomarkers (CpG sites, microRNAs, and transcription factors) highly associated with gene expression and train predictive models for these mediators using their local SNPs. Imputed values for mediators are then incorporated into the final predictive model of gene expression, along with local SNPs. In the second extension, we assess distal-eQTLs (SNPs associated with genes not in a local window around it) for their mediation effect through mediating biomarkers local to these distal-eSNPs. Distal-eSNPs with large indirect mediation effects are then included in the transcriptomic prediction model with the local SNPs around the gene of interest. Using simulations and real data from ROS/MAP brain tissue and TCGA breast tumors, we show considerable gains of percent variance explained (1-2% additive increase) of gene expression and TWAS power to detect gene-trait associations. This integrative approach to transcriptome-wide imputation and association studies aids in identifying the complex interactions underlying genetic regulation within a tissue and important risk genes for various traits and disorders.

Journal ArticleDOI
TL;DR: In this paper, a high-quality chromosome-level assembly of the beet armyworm genome and use bulked segregant analysis (BSA) to identify the locus of avermectin resistance, which mapped on 15-16 Mbp of chromosome 17.
Abstract: The evolution of insecticide resistance represents a global constraint to agricultural production. Because of the extreme genetic diversity found in insects and the large numbers of genes involved in insecticide detoxification, better tools are needed to quickly identify and validate the involvement of putative resistance genes for improved monitoring, management, and countering of field-evolved insecticide resistance. The avermectins, emamectin benzoate (EB) and abamectin are relatively new pesticides with reduced environmental risk that target a wide number of insect pests, including the beet armyworm, Spodoptera exigua, an important global pest of many crops. Unfortunately, field resistance to avermectins recently evolved in the beet armyworm, threatening the sustainable use of this class of insecticides. Here, we report a high-quality chromosome-level assembly of the beet armyworm genome and use bulked segregant analysis (BSA) to identify the locus of avermectin resistance, which mapped on 15-16 Mbp of chromosome 17. Knockout of the CYP9A186 gene that maps within this region by CRISPR/Cas9 gene editing fully restored EB susceptibility, implicating this gene in avermectin resistance. Heterologous expression and in vitro functional assays further confirm that a natural substitution (F116V) found in the substrate recognition site 1 (SRS1) of the CYP9A186 protein results in enhanced metabolism of EB and abamectin. Hence, the combined approach of coupling gene editing with BSA allows for the rapid identification of metabolic resistance genes responsible for insecticide resistance, which is critical for effective monitoring and adaptive management of insecticide resistance.

Journal ArticleDOI
TL;DR: In this paper, the fertility status in different axonemal dynein preassembly mutant males (DNAAF2/ KTU, DNAAF4/ DYX1C1,DNAAF6/ PIH1D3, DLRC6/ZMYND10, CFAP300/C11orf70 and LRRC6) was reported.
Abstract: Axonemal protein complexes, such as outer (ODA) and inner (IDA) dynein arms, are responsible for the generation and regulation of flagellar and ciliary beating. Studies in various ciliated model organisms have shown that axonemal dynein arms are first assembled in the cell cytoplasm and then delivered into axonemes during ciliogenesis. In humans, mutations in genes encoding for factors involved in this process cause structural and functional defects of motile cilia in various organs such as the airways and result in the hereditary disorder primary ciliary dyskinesia (PCD). Despite extensive knowledge about the cytoplasmic assembly of axonemal dynein arms in respiratory cilia, this process is still poorly understood in sperm flagella. To better define its clinical relevance on sperm structure and function, and thus male fertility, further investigations are required. Here we report the fertility status in different axonemal dynein preassembly mutant males (DNAAF2/ KTU, DNAAF4/ DYX1C1, DNAAF6/ PIH1D3, DNAAF7/ZMYND10, CFAP300/C11orf70 and LRRC6). Besides andrological examinations, we functionally and structurally analyzed sperm flagella of affected individuals by high-speed video- and transmission electron microscopy as well as systematically compared the composition of dynein arms in sperm flagella and respiratory cilia by immunofluorescence microscopy. Furthermore, we analyzed the flagellar length in dynein preassembly mutant sperm. We found that the process of axonemal dynein preassembly is also critical in sperm, by identifying defects of ODAs and IDAs in dysmotile sperm of these individuals. Interestingly, these mutant sperm consistently show a complete loss of ODAs, while some respiratory cilia from the same individual can retain ODAs in the proximal ciliary compartment. This agrees with reports of solely one distinct ODA type in sperm, compared to two different ODA types in proximal and distal respiratory ciliary axonemes. Consistent with observations in model organisms, we also determined a significant reduction of sperm flagellar length in these individuals. These findings are relevant to subsequent studies on the function and composition of sperm flagella in PCD patients and non-syndromic infertile males. Our study contributes to a better understanding of the fertility status in PCD-affected males and should help guide genetic and andrological counselling for affected males and their families.

Journal ArticleDOI
TL;DR: The authors found that the majority of mexicana ancestry tracts introgressed into maize over 1000 generations ago and that high introgression peaks are most commonly shared among high-elevation maize populations, consistent with adaptation to the highland environment.
Abstract: While often deleterious, hybridization can also be a key source of genetic variation and pre-adapted haplotypes, enabling rapid evolution and niche expansion. Here we evaluate these opposing selection forces on introgressed ancestry between maize (Zea mays ssp. mays) and its wild teosinte relative, mexicana (Zea mays ssp. mexicana). Introgression from ecologically diverse teosinte may have facilitated maize's global range expansion, in particular to challenging high elevation regions (> 1500 m). We generated low-coverage genome sequencing data for 348 maize and mexicana individuals to evaluate patterns of introgression in 14 sympatric population pairs, spanning the elevational range of mexicana, a teosinte endemic to the mountains of Mexico. While recent hybrids are commonly observed in sympatric populations and mexicana demonstrates fine-scale local adaptation, we find that the majority of mexicana ancestry tracts introgressed into maize over 1000 generations ago. This mexicana ancestry seems to have maintained much of its diversity and likely came from a common ancestral source, rather than contemporary sympatric populations, resulting in relatively low FST between mexicana ancestry tracts sampled from geographically distant maize populations. Introgressed mexicana ancestry in maize is reduced in lower-recombination rate quintiles of the genome and around domestication genes, consistent with pervasive selection against introgression. However, we also find mexicana ancestry increases across the sampled elevational gradient and that high introgression peaks are most commonly shared among high-elevation maize populations, consistent with introgression from mexicana facilitating adaptation to the highland environment. In the other direction, we find patterns consistent with adaptive and clinal introgression of maize ancestry into sympatric mexicana at many loci across the genome, suggesting that maize also contributes to adaptation in mexicana, especially at the lower end of its elevational range. In sympatric maize, in addition to high introgression regions we find many genomic regions where selection for local adaptation maintains steep gradients in introgressed mexicana ancestry across elevation, including at least two inversions: the well-characterized 14 Mb Inv4m on chromosome 4 and a novel 3 Mb inversion Inv9f surrounding the macrohairless1 locus on chromosome 9. Most outlier loci with high mexicana introgression show no signals of sweeps or local sourcing from sympatric populations and so likely represent ancestral introgression sorted by selection, resulting in correlated but distinct outcomes of introgression in different contemporary maize populations.

Journal ArticleDOI
TL;DR: In this article, the authors investigate the family-level dynamics of transposable elements (TEs) in maize and conclude that while the impact of transposition is highly family- and context-dependent, a familylevel understanding of the ecology of TEs in the genome can refine our ability to predict the role of TE copies in generating genetic and phenotypic diversity.
Abstract: Transposable elements (TEs) constitute the majority of flowering plant DNA, reflecting their tremendous success in subverting, avoiding, and surviving the defenses of their host genomes to ensure their selfish replication. More than 85% of the sequence of the maize genome can be ascribed to past transposition, providing a major contribution to the structure of the genome. Evidence from individual loci has informed our understanding of how transposition has shaped the genome, and a number of individual TE insertions have been causally linked to dramatic phenotypic changes. Genome-wide analyses in maize and other taxa have frequently represented TEs as a relatively homogeneous class of fragmentary relics of past transposition, obscuring their evolutionary history and interaction with their host genome. Using an updated annotation of structurally intact TEs in the maize reference genome, we investigate the family-level dynamics of TEs in maize. Integrating a variety of data, from descriptors of individual TEs like coding capacity, expression, and methylation, as well as similar features of the sequence they inserted into, we model the relationship between attributes of the genomic environment and the survival of TE copies and families. In contrast to the wholesale relegation of all TEs to a single category of junk DNA, these differences reveal a diversity of survival strategies of TE families. Together these generate a rich ecology of the genome, with each TE family representing the evolution of a distinct ecological niche. We conclude that while the impact of transposition is highly family- and context-dependent, a family-level understanding of the ecology of TEs in the genome can refine our ability to predict the role of TEs in generating genetic and phenotypic diversity.

Journal ArticleDOI
TL;DR: This article proposed to integrate multiple tissues in the TWAS using sparse canonical correlation analysis (sCCA) and showed that sCCA-TWAS combined with single-tissue TWAS with an aggregate Cauchy association test (ACAT) outperforms traditional single-to-single TWAS.
Abstract: Transcriptome-wide association studies (TWAS) test the association between traits and genetically predicted gene expression levels. The power of a TWAS depends in part on the strength of the correlation between a genetic predictor of gene expression and the causally relevant gene expression values. Consequently, TWAS power can be low when expression quantitative trait locus (eQTL) data used to train the genetic predictors have small sample sizes, or when data from causally relevant tissues are not available. Here, we propose to address these issues by integrating multiple tissues in the TWAS using sparse canonical correlation analysis (sCCA). We show that sCCA-TWAS combined with single-tissue TWAS using an aggregate Cauchy association test (ACAT) outperforms traditional single-tissue TWAS. In empirically motivated simulations, the sCCA+ACAT approach yielded the highest power to detect a gene associated with phenotype, even when expression in the causal tissue was not directly measured, while controlling the Type I error when there is no association between gene expression and phenotype. For example, when gene expression explains 2% of the variability in outcome, and the GWAS sample size is 20,000, the average power difference between the ACAT combined test of sCCA features and single-tissue, versus single-tissue combined with Generalized Berk-Jones (GBJ) method, single-tissue combined with S-MultiXcan, UTMOST, or summarizing cross-tissue expression patterns using Principal Component Analysis (PCA) approaches was 5%, 8%, 5% and 38%, respectively. The gain in power is likely due to sCCA cross-tissue features being more likely to be detectably heritable. When applied to publicly available summary statistics from 10 complex traits, the sCCA+ACAT test was able to increase the number of testable genes and identify on average an additional 400 additional gene-trait associations that single-trait TWAS missed. Our results suggest that aggregating eQTL data across multiple tissues using sCCA can improve the sensitivity of TWAS while controlling for the false positive rate.

Journal ArticleDOI
TL;DR: In this paper, the authors showed that RNA editing at very limited sites, which is mediated by a subtle amount of ADAR1 p150, is sufficient to prevent melanoma differentiation-associated protein 5 (MDA5) activation in the mouse brain.
Abstract: Adenosine deaminase acting on RNA 1 (ADAR1), an enzyme responsible for adenosine-to-inosine RNA editing, is composed of two isoforms: nuclear p110 and cytoplasmic p150. Deletion of Adar1 or Adar1 p150 genes in mice results in embryonic lethality with overexpression of interferon-stimulating genes (ISGs), caused by the aberrant recognition of unedited endogenous transcripts by melanoma differentiation-associated protein 5 (MDA5). However, among numerous RNA editing sites, how many RNA sites require editing, especially by ADAR1 p150, to avoid MDA5 activation and whether ADAR1 p110 contributes to this function remains elusive. In particular, ADAR1 p110 is abundant in the mouse brain where a subtle amount of ADAR1 p150 is expressed, whereas ADAR1 mutations cause Aicardi-Goutieres syndrome, in which the brain is one of the most affected organs accompanied by the elevated expression of ISGs. Therefore, understanding RNA editing-mediated prevention of MDA5 activation in the brain is especially important. Here, we established Adar1 p110-specific knockout mice, in which the upregulated expression of ISGs was not observed. This result suggests that ADAR1 p150-mediated RNA editing is enough to suppress MDA5 activation. Therefore, we further created Adar1 p110/Adar2 double knockout mice to identify ADAR1 p150-mediated editing sites. This analysis demonstrated that although the elevated expression of ISGs was not observed, only less than 2% of editing sites were preserved in the brains of Adar1 p110/Adar2 double knockout mice. Of note, we found that some sites were highly edited, which was comparable to those found in wild-type mice, indicating the presence of ADAR1 p150-specific sites. These data suggest that RNA editing at a very limited sites, which is mediated by a subtle amount of ADAR1 p150, is sufficient to prevents MDA5 activation, at least in the mouse brain.

Journal ArticleDOI
TL;DR: In this article, a two-sample MR using cis-acting brain-derived expression quantitative trait loci (eQTLs) from the Accelerating Medicines Partnership for Alzheimer's Disease consortium (AMP-AD) and the CommonMind Consortium (CMC) meta-analysis study (n = 1,286) was used to predict the effects of 7,137 genes on 12 neurological and psychiatric disorders.
Abstract: Discovering drugs that efficiently treat brain diseases has been challenging. Genetic variants that modulate the expression of potential drug targets can be utilized to assess the efficacy of therapeutic interventions. We therefore employed Mendelian Randomization (MR) on gene expression measured in brain tissue to identify drug targets involved in neurological and psychiatric diseases. We conducted a two-sample MR using cis-acting brain-derived expression quantitative trait loci (eQTLs) from the Accelerating Medicines Partnership for Alzheimer's Disease consortium (AMP-AD) and the CommonMind Consortium (CMC) meta-analysis study (n = 1,286) as genetic instruments to predict the effects of 7,137 genes on 12 neurological and psychiatric disorders. We conducted Bayesian colocalization analysis on the top MR findings (using P<6x10-7 as evidence threshold, Bonferroni-corrected for 80,557 MR tests) to confirm sharing of the same causal variants between gene expression and trait in each genomic region. We then intersected the colocalized genes with known monogenic disease genes recorded in Online Mendelian Inheritance in Man (OMIM) and with genes annotated as drug targets in the Open Targets platform to identify promising drug targets. 80 eQTLs showed MR evidence of a causal effect, from which we prioritised 47 genes based on colocalization with the trait. We causally linked the expression of 23 genes with schizophrenia and a single gene each with anorexia, bipolar disorder and major depressive disorder within the psychiatric diseases and 9 genes with Alzheimer's disease, 6 genes with Parkinson's disease, 4 genes with multiple sclerosis and two genes with amyotrophic lateral sclerosis within the neurological diseases we tested. From these we identified five genes (ACE, GPNMB, KCNQ5, RERE and SUOX) as attractive drug targets that may warrant follow-up in functional studies and clinical trials, demonstrating the value of this study design for discovering drug targets in neuropsychiatric diseases.

Journal ArticleDOI
TL;DR: In this article, a rapid in-vivo phenotypic screen for macrophage-related genes that promote regeneration after spinal injury was conducted, using acute injection of synthetic RNA Oligo CRISPR guide RNAs that were pre-screened for high activity in vivo.
Abstract: Zebrafish exhibit robust regeneration following spinal cord injury, promoted by macrophages that control post-injury inflammation. However, the mechanistic basis of how macrophages regulate regeneration is poorly understood. To address this gap in understanding, we conducted a rapid in vivo phenotypic screen for macrophage-related genes that promote regeneration after spinal injury. We used acute injection of synthetic RNA Oligo CRISPR guide RNAs (sCrRNAs) that were pre-screened for high activity in vivo. Pre-screening of over 350 sCrRNAs allowed us to rapidly identify highly active sCrRNAs (up to half, abbreviated as haCRs) and to effectively target 30 potentially macrophage-related genes. Disruption of 10 of these genes impaired axonal regeneration following spinal cord injury. We selected 5 genes for further analysis and generated stable mutants using haCRs. Four of these mutants (tgfb1a, tgfb3, tnfa, sparc) retained the acute haCR phenotype, validating the approach. Mechanistically, tgfb1a haCR-injected and stable mutant zebrafish fail to resolve post-injury inflammation, indicated by prolonged presence of neutrophils and increased levels of il1b expression. Inhibition of Il-1β rescues the impaired axon regeneration in the tgfb1a mutant. Hence, our rapid and scalable screening approach has identified functional regulators of spinal cord regeneration, but can be applied to any biological function of interest.

Journal ArticleDOI
TL;DR: In this paper, the authors investigated structural variation in four isolates of the blast fungus M. oryzae from different grass hosts and analyzed the sequences of mini-chromosomes in the rice, foxtail millet and goosegrass isolates.
Abstract: Supernumerary mini-chromosomes-a unique type of genomic structural variation-have been implicated in the emergence of virulence traits in plant pathogenic fungi. However, the mechanisms that facilitate the emergence and maintenance of mini-chromosomes across fungi remain poorly understood. In the blast fungus Magnaporthe oryzae (Syn. Pyricularia oryzae), mini-chromosomes have been first described in the early 1990s but, until very recently, have been overlooked in genomic studies. Here we investigated structural variation in four isolates of the blast fungus M. oryzae from different grass hosts and analyzed the sequences of mini-chromosomes in the rice, foxtail millet and goosegrass isolates. The mini-chromosomes of these isolates turned out to be highly diverse with distinct sequence composition. They are enriched in repetitive elements and have lower gene density than core-chromosomes. We identified several virulence-related genes in the mini-chromosome of the rice isolate, including the virulence-related polyketide synthase Ace1 and two variants of the effector gene AVR-Pik. Macrosynteny analyses around these loci revealed structural rearrangements, including inter-chromosomal translocations between core- and mini-chromosomes. Our findings provide evidence that mini-chromosomes emerge from structural rearrangements and segmental duplication of core-chromosomes and might contribute to adaptive evolution of the blast fungus.

Journal ArticleDOI
TL;DR: The Aurora protein kinases are well-established regulators of spindle building and chromosome segregation in mitotic and meiotic cells, and they have been shown to have multiple functions essential to completing meiosis I as discussed by the authors.
Abstract: The Aurora protein kinases are well-established regulators of spindle building and chromosome segregation in mitotic and meiotic cells. In mouse oocytes, there is significant Aurora kinase A (AURKA) compensatory abilities when the other Aurora kinase homologs are deleted. Whether the other homologs, AURKB or AURKC can compensate for loss of AURKA is not known. Using a conditional mouse oocyte knockout model, we demonstrate that this compensation is not reciprocal because female oocyte-specific knockout mice are sterile, and their oocytes fail to complete meiosis I. In determining AURKA-specific functions, we demonstrate that its first meiotic requirement is to activate Polo-like kinase 1 at acentriolar microtubule organizing centers (aMTOCs; meiotic spindle poles). This activation induces fragmentation of the aMTOCs, a step essential for building a bipolar spindle. We also show that AURKA is required for regulating localization of TACC3, another protein required for spindle building. We conclude that AURKA has multiple functions essential to completing MI that are distinct from AURKB and AURKC.

Journal ArticleDOI
TL;DR: In this article, the role of the proinflammatory transcription factor NF-κB in regulating clock function was investigated using a combination of genetic and pharmacological approaches, and it was shown that perturbation of the canonical NFκB subunit RELA in the human U2OS cellular model altered core clock gene expression.
Abstract: In mammals, the circadian clock coordinates cell physiological processes including inflammation. Recent studies suggested a crosstalk between these two pathways. However, the mechanism of how inflammation affects the clock is not well understood. Here, we investigated the role of the proinflammatory transcription factor NF-κB in regulating clock function. Using a combination of genetic and pharmacological approaches, we show that perturbation of the canonical NF-κB subunit RELA in the human U2OS cellular model altered core clock gene expression. While RELA activation shortened period length and dampened amplitude, its inhibition lengthened period length and caused amplitude phenotypes. NF-κB perturbation also altered circadian rhythms in the master suprachiasmatic nucleus (SCN) clock and locomotor activity behavior under different light/dark conditions. We show that RELA, like the clock repressor CRY1, repressed the transcriptional activity of BMAL1/CLOCK at the circadian E-box cis-element. Biochemical and biophysical analysis showed that RELA binds to the transactivation domain of BMAL1. These data support a model in which NF-kB competes with CRY1 and coactivator CBP/p300 for BMAL1 binding to affect circadian transcription. This is further supported by chromatin immunoprecipitation analysis showing that binding of RELA, BMAL1 and CLOCK converges on the E-boxes of clock genes. Taken together, these data support a significant role for NF-κB in directly regulating the circadian clock and highlight mutual regulation between the circadian and inflammatory pathways.

Journal ArticleDOI
TL;DR: In this paper, a comprehensive framework called GRAPPLE is proposed to analyze the causal effect of target risk factors with heterogeneous genetic instruments and identify possible pleiotropic patterns from data.
Abstract: Over a decade of genome-wide association studies (GWAS) have led to the finding of extreme polygenicity of complex traits. The phenomenon that "all genes affect every complex trait" complicates Mendelian Randomization (MR) studies, where natural genetic variations are used as instruments to infer the causal effect of heritable risk factors. We reexamine the assumptions of existing MR methods and show how they need to be clarified to allow for pervasive horizontal pleiotropy and heterogeneous effect sizes. We propose a comprehensive framework GRAPPLE to analyze the causal effect of target risk factors with heterogeneous genetic instruments and identify possible pleiotropic patterns from data. By using GWAS summary statistics, GRAPPLE can efficiently use both strong and weak genetic instruments, detect the existence of multiple pleiotropic pathways, determine the causal direction and perform multivariable MR to adjust for confounding risk factors. With GRAPPLE, we analyze the effect of blood lipids, body mass index, and systolic blood pressure on 25 disease outcomes, gaining new information on their causal relationships and potential pleiotropic pathways involved.

Journal ArticleDOI
TL;DR: In this paper, the authors show that ARID1A loss leads to DNA replication stress associated with R-loops and transcription-replication conflicts in human cells, which correlate with altered transcription and replication dynamics in ARID 1A knockout cells and reduced TOP2A binding at R-loop sites.
Abstract: ARID1A is a core DNA-binding subunit of the BAF chromatin remodeling complex, and is lost in up to 7% of all cancers The frequency of ARID1A loss increases in certain cancer types, such as clear cell ovarian carcinoma where ARID1A protein is lost in about 50% of cases While the impact of ARID1A loss on the function of the BAF chromatin remodeling complexes is likely to drive oncogenic gene expression programs in specific contexts, ARID1A also binds genome stability regulators such as ATR and TOP2 Here we show that ARID1A loss leads to DNA replication stress associated with R-loops and transcription-replication conflicts in human cells These effects correlate with altered transcription and replication dynamics in ARID1A knockout cells and to reduced TOP2A binding at R-loop sites Together this work extends mechanisms of replication stress in ARID1A deficient cells with implications for targeting ARID1A deficient cancers