Author

Bjarni J. Vilhjálmsson

Other affiliations: Harvard University, Austrian Academy of Sciences, Gregor Mendel Institute

Bio: Bjarni J. Vilhjálmsson is an academic researcher from Aarhus University. The author has contributed to research on the topics Genome-wide association study and Population. The author has an h-index of 32 and has co-authored 95 publications receiving 9,401 citations. Previous affiliations of Bjarni J. Vilhjálmsson include Harvard University and the Austrian Academy of Sciences.

Papers published on a yearly basis: chart spanning 2010 to 2023 (per-year counts not recoverable).

Papers

Journal Article

Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines

Susanna Atwell¹, Yu S. Huang, Bjarni J. Vilhjálmsson, Glenda Willems, Matthew W. Horton², Yan Li², Dazhe Meng, Alexander Platt, Aaron M. Tarone, Tina T. Hu, Rong Jiang, N. Wayan Muliyati², Xu Zhang², Muhammad Ali Amer, Ivan Baxter, Benjamin Brachi³, Joanne Chory⁴,⁵, Caroline Dean⁶, Marilyne Debieu⁷, Juliette de Meaux⁷, Joseph R. Ecker⁵, Nathalie Faure³, Joel M. Kniskern², Jonathan D. G. Jones⁸, Todd P. Michael⁵, Adnane Nemri⁸, Fabrice Roux²,³, David E. Salt⁹, Chunlao Tang, Marco Todesco⁷, M. Brian Traw², Detlef Weigel⁷, Paul Marjoram¹, Justin O. Borevitz², Joy Bergelson², Magnus Nordborg¹⁰

University of Southern California¹, University of Chicago², Centre national de la recherche scientifique³, Howard Hughes Medical Institute⁴, Salk Institute for Biological Studies⁵, John Innes Centre⁶, Max Planck Society⁷, Sainsbury Laboratory⁸, Purdue University⁹, Gregor Mendel Institute¹⁰

03 Jun 2010-Nature

TL;DR: This study demonstrates the feasibility of GWA studies in A. thaliana and suggests that the approach will be appropriate for many other organisms, particularly when inbred lines are available.

Abstract: Although pioneered by human geneticists as a potential solution to the challenging problem of finding the genetic basis of common human diseases, genome-wide association (GWA) studies have, owing to advances in genotyping and sequencing technology, become an obvious general approach for studying the genetics of natural variation and traits of agricultural importance. They are particularly useful when inbred lines are available, because once these lines have been genotyped they can be phenotyped multiple times, making it possible (as well as extremely cost effective) to study many different traits in many different environments, while replicating the phenotypic measurements to reduce environmental noise. Here we demonstrate the power of this approach by carrying out a GWA study of 107 phenotypes in Arabidopsis thaliana, a widely distributed, predominantly self-fertilizing model plant known to harbour considerable genetic variation for many adaptively important traits. Our results are dramatically different from those of human GWA studies, in that we identify many common alleles of major effect, but they are also, in many cases, harder to interpret because confounding by complex genetics and population structure make it difficult to distinguish true associations from false. However, a-priori candidates are significantly over-represented among these associations as well, making many of them excellent candidates for follow-up experiments. Our study demonstrates the feasibility of GWA studies in A. thaliana and suggests that the approach will be appropriate for many other organisms.

1,525 citations

Journal Article

Efficient Bayesian mixed-model analysis increases association power in large cohorts

Po-Ru Loh¹, George Tucker¹, Brendan Bulik-Sullivan¹, Bjarni J. Vilhjálmsson¹,², Hilary K. Finucane³, Rany M. Salem⁴, Daniel I. Chasman⁵, Paul M. Ridker⁵, Benjamin M. Neale¹,², Bonnie Berger³, Nick Patterson², Alkes L. Price¹

Harvard University¹, Broad Institute², Massachusetts Institute of Technology³, Boston Children's Hospital⁴, Brigham and Women's Hospital⁵

01 Mar 2015-Nature Genetics

TL;DR: BOLT-LMM is presented, which requires only a small number of O(MN) time iterations and increases power by modeling more realistic, non-infinitesimal genetic architectures via a Bayesian mixture prior on marker effect sizes.

Abstract: Linear mixed models are a powerful statistical tool for identifying genetic associations and avoiding confounding. However, existing methods are computationally intractable in large cohorts and may not optimize power. All existing methods require time cost O(MN²) (where N is the number of samples and M is the number of SNPs) and implicitly assume an infinitesimal genetic architecture in which effect sizes are normally distributed, which can limit power. Here we present a far more efficient mixed-model association method, BOLT-LMM, which requires only a small number of O(MN) time iterations and increases power by modeling more realistic, non-infinitesimal genetic architectures via a Bayesian mixture prior on marker effect sizes. We applied BOLT-LMM to 9 quantitative traits in 23,294 samples from the Women's Genome Health Study (WGHS) and observed significant increases in power, consistent with simulations. Theory and simulations show that the boost in power increases with cohort size, making BOLT-LMM appealing for genome-wide association studies in large cohorts.

1,232 citations

Journal Article

Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores

Bjarni J. Vilhjálmsson, Jian Yang, Hilary K. Finucane, Alexander Gusev, +391 more (14 institutions)

01 Oct 2015-American Journal of Human Genetics

TL;DR: LDpred is introduced, a method that infers the posterior mean effect size of each marker by using a prior on effect sizes and LD information from an external reference panel, and outperforms the approach of pruning followed by thresholding, particularly at large sample sizes.

Abstract: Polygenic risk scores have shown great promise in predicting complex disease risk and will become more accurate as training sample sizes increase. The standard approach for calculating risk scores involves linkage disequilibrium (LD)-based marker pruning and applying a p value threshold to association statistics, but this discards information and can reduce predictive accuracy. We introduce LDpred, a method that infers the posterior mean effect size of each marker by using a prior on effect sizes and LD information from an external reference panel. Theory and simulations show that LDpred outperforms the approach of pruning followed by thresholding, particularly at large sample sizes. Accordingly, predicted R² increased from 20.1% to 25.3% in a large schizophrenia dataset and from 9.8% to 12.0% in a large multiple sclerosis dataset. A similar relative improvement in accuracy was observed for three additional large disease datasets and for non-European schizophrenia samples. The advantage of LDpred over existing methods will grow as sample sizes increase.

1,088 citations

Journal Article

An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations.

Vincent Segura¹,², Bjarni J. Vilhjálmsson¹,³, Alexander Platt¹,³, Arthur Korte¹, Ümit Seren¹, Quan Long¹, Magnus Nordborg¹,³

Austrian Academy of Sciences¹, Institut national de la recherche agronomique², University of Southern California³

01 Jul 2012-Nature Genetics

TL;DR: Simulations suggest that the proposed multi-locus mixed model as a general method for mapping complex traits in structured populations outperforms existing methods in terms of power as well as false discovery rate.

Abstract: Population structure causes genome-wide linkage disequilibrium between unlinked loci, leading to statistical confounding in genome-wide association studies. Mixed models have been shown to handle the confounding effects of a diffuse background of large numbers of loci of small effect well, but they do not always account for loci of larger effect. Here we propose a multi-locus mixed model as a general method for mapping complex traits in structured populations. Simulations suggest that our method outperforms existing methods in terms of power as well as false discovery rate. We apply our method to human and Arabidopsis thaliana data, identifying new associations and evidence for allelic heterogeneity. We also show how a priori knowledge from an A. thaliana linkage mapping study can be integrated into our method using a Bayesian approach. Our implementation is computationally efficient, making the analysis of large data sets (n > 10,000) practicable.

723 citations

Journal Article

The nature of nurture: Effects of parental genotypes.

Augustine Kong¹,²,³, Gudmar Thorleifsson¹, Michael L. Frigge¹, Bjarni J. Vilhjálmsson⁴,⁵, Alexander I. Young¹,³,⁶, Thorgeir E. Thorgeirsson¹, Stefania Benonisdottir¹, Asmundur Oddsson¹, Bjarni V. Halldorsson¹, Gisli Masson¹, Daniel F. Gudbjartsson¹,², Agnar Helgason¹,², Gyda Bjornsdottir¹, Unnur Thorsteinsdottir¹,², Kari Stefansson¹,²

deCODE genetics¹, University of Iceland², University of Oxford³, Aarhus University⁴, Harvard University⁵, Wellcome Trust Centre for Human Genetics⁶

26 Jan 2018-Science

TL;DR: The findings suggest that genetic nurture is ultimately due to genetic variation in the population and is mediated by the environment that parents create for their children.

Abstract: Sequence variants in the parental genomes that are not transmitted to a child (the proband) are often ignored in genetic studies. Here we show that nontransmitted alleles can affect a child through their impacts on the parents and other relatives, a phenomenon we call "genetic nurture." Using results from a meta-analysis of educational attainment, we find that the polygenic score computed for the nontransmitted alleles of 21,637 probands with at least one parent genotyped has an estimated effect on the educational attainment of the proband that is 29.9% (P = 1.6 × 10⁻¹⁴) of that of the transmitted polygenic score. Genetic nurturing effects of this polygenic score extend to other traits. Paternal and maternal polygenic scores have similar effects on educational attainment, but mothers contribute more than fathers to nutrition- and health-related traits.

643 citations

Cited by

Journal Article

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Fumio Tajima¹

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

11,521 citations

Pattern Recognition and Machine Learning

Christopher M. Bishop¹

Microsoft¹

01 Jan 2006

TL;DR: This book covers probability distributions, linear models for regression and classification, neural networks, kernel methods, graphical models, mixture models and EM, approximate inference, sampling methods, continuous latent variables, sequential data, and combining models.

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing (7th Annual SFAF Meeting, 2012)

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes is a new assembler for both single-cell and standard (multicell) assembly that improves on the recently released E+V-SC assembler and on the popular assemblers Velvet and SoapDeNovo (for multicell data).

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.

10,124 citations

Book Chapter

Prospective Cohort Study

Victor R. Preedy, Ronald R. Watson

01 Jan 2010

5,842 citations

Journal Article

The UK Biobank resource with deep phenotyping and genomic data

Clare Bycroft¹, Colin Freeman¹, Desislava Petkova¹,², Gavin Band¹, Lloyd T. Elliott¹, Kevin Sharp¹, Allan Motyer³, Damjan Vukcevic³, Olivier Delaneau⁴,⁵, Jared O'Connell⁶, Adrian Cortes¹,⁷, Samantha Welsh, Alan Young¹, Mark Effingham, Gil McVean¹, Stephen Leslie³, Naomi E. Allen¹, Peter Donnelly¹, Jonathan Marchini¹

University of Oxford¹, Procter & Gamble², University of Melbourne³, Swiss Institute of Bioinformatics⁴, University of Geneva⁵, Illumina⁶, John Radcliffe Hospital⁷

11 Oct 2018-Nature

TL;DR: Deep phenotype and genome-wide genetic data from 500,000 UK Biobank participants are described, including population structure and relatedness in the cohort and imputation that increases the number of testable variants to around 96 million.

Abstract: The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.

4,489 citations
