scispace - formally typeset
Search or ask a question

Showing papers by "Adam Auton published in 2021"


Journal ArticleDOI
04 Mar 2021-Nature
TL;DR: The GenOMICC (Genetics Of Mortality In Critical Care) genome-wide association study in 2244 critically ill Covid-19 patients from 208 UK intensive care units is reported, finding evidence in support of a causal link from low expression of IFNAR2, and high expression of TYK2, to life-threatening disease.
Abstract: Host-mediated lung inflammation is present1, and drives mortality2, in the critical illness caused by coronavirus disease 2019 (COVID-19). Host genetic variants associated with critical illness may identify mechanistic targets for therapeutic development3. Here we report the results of the GenOMICC (Genetics Of Mortality In Critical Care) genome-wide association study in 2,244 critically ill patients with COVID-19 from 208 UK intensive care units. We have identified and replicated the following new genome-wide significant associations: on chromosome 12q24.13 (rs10735079, P = 1.65 × 10−8) in a gene cluster that encodes antiviral restriction enzyme activators (OAS1, OAS2 and OAS3); on chromosome 19p13.2 (rs74956615, P = 2.3 × 10−8) near the gene that encodes tyrosine kinase 2 (TYK2); on chromosome 19p13.3 (rs2109069, P = 3.98 × 10−12) within the gene that encodes dipeptidyl peptidase 9 (DPP9); and on chromosome 21q22.1 (rs2236757, P = 4.99 × 10−8) in the interferon receptor gene IFNAR2. We identified potential targets for repurposing of licensed medications: using Mendelian randomization, we found evidence that low expression of IFNAR2, or high expression of TYK2, are associated with life-threatening disease; and transcriptome-wide association in lung tissue revealed that high expression of the monocyte–macrophage chemotactic receptor CCR2 is associated with severe COVID-19. Our results identify robust genetic signals relating to key host antiviral defence mechanisms and mediators of inflammatory organ damage in COVID-19. Both mechanisms may be amenable to targeted treatment with existing drugs. However, large-scale randomized clinical trials will be essential before any change to clinical practice. A genome-wide association study of critically ill patients with COVID-19 identifies genetic signals that relate to important host antiviral defence mechanisms and mediators of inflammatory organ damage that may be targeted by repurposing drug treatments.

941 citations


Journal ArticleDOI
Douglas P Wightman1, Iris E. Jansen1, Jeanne E. Savage1, Alexey A. Shadrin2, Shahram Bahrami3, Shahram Bahrami2, Dominic Holland4, Arvid Rongve5, Sigrid Børte3, Sigrid Børte6, Sigrid Børte2, Bendik S. Winsvold6, Bendik S. Winsvold3, Ole Kristian Drange6, Amy E Martinsen6, Amy E Martinsen3, Amy E Martinsen2, Anne Heidi Skogholt6, Cristen J. Willer7, Geir Bråthen6, Ingunn Bosnes8, Ingunn Bosnes6, Jonas B. Nielsen9, Jonas B. Nielsen7, Jonas B. Nielsen6, Lars G. Fritsche7, Laurent F. Thomas6, Linda M. Pedersen3, Maiken Elvestad Gabrielsen6, Marianne Bakke Johnsen3, Marianne Bakke Johnsen2, Marianne Bakke Johnsen6, Tore Wergeland Meisingset6, Wei Zhou7, Wei Zhou10, Petroula Proitsi11, Angela Hodges11, Richard Dobson, Latha Velayudhan11, Karl Heilbron, Adam Auton, Julia M. Sealock12, Lea K. Davis12, Nancy L. Pedersen13, Chandra A. Reynolds14, Ida K. Karlsson15, Ida K. Karlsson13, Sigurdur H. Magnusson16, Hreinn Stefansson16, Steinunn Thordardottir, Palmi V. Jonsson17, Jon Snaedal, Anna Zettergren18, Ingmar Skoog18, Ingmar Skoog19, Silke Kern18, Silke Kern19, Margda Waern18, Margda Waern19, Henrik Zetterberg, Kaj Blennow18, Kaj Blennow19, Eystein Stordal6, Eystein Stordal8, Kristian Hveem6, John-Anker Zwart2, John-Anker Zwart3, John-Anker Zwart6, Lavinia Athanasiu3, Lavinia Athanasiu2, Per Selnes20, Ingvild Saltvedt6, Sigrid Botne Sando6, Ingun Ulstein3, Srdjan Djurovic3, Srdjan Djurovic5, Tormod Fladby20, Tormod Fladby2, Dag Aarsland21, Dag Aarsland11, Geir Selbæk3, Geir Selbæk2, Stephan Ripke10, Stephan Ripke22, Stephan Ripke23, Kari Stefansson16, Ole A. Andreassen2, Ole A. Andreassen3, Danielle Posthuma24, Danielle Posthuma1 
TL;DR: This paper identified microglia, immune cells and protein catabolism as relevant genes for late-onset Alzheimer's disease, while identifying and prioritizing previously unidentified genes of potential interest.
Abstract: Late-onset Alzheimer's disease is a prevalent age-related polygenic disease that accounts for 50-70% of dementia cases. Currently, only a fraction of the genetic variants underlying Alzheimer's disease have been identified. Here we show that increased sample sizes allowed identification of seven previously unidentified genetic loci contributing to Alzheimer's disease. This study highlights microglia, immune cells and protein catabolism as relevant to late-onset Alzheimer's disease, while identifying and prioritizing previously unidentified genes of potential interest. We anticipate that these results can be included in larger meta-analyses of Alzheimer's disease to identify further genetic variants that contribute to Alzheimer's pathology.

269 citations


Journal ArticleDOI
TL;DR: In this paper, a study of 1,051,032 23andMe research participants was conducted to identify genetic and nongenetic associations with testing positive for SARS-CoV-2, respiratory symptoms and hospitalization.
Abstract: COVID-19 presents with a wide range of severity, from asymptomatic in some individuals to fatal in others. Based on a study of 1,051,032 23andMe research participants, we report genetic and nongenetic associations with testing positive for SARS-CoV-2, respiratory symptoms and hospitalization. Using trans-ancestry genome-wide association studies, we identified a strong association between blood type and COVID-19 diagnosis, as well as a gene-rich locus on chromosome 3p21.31 that is more strongly associated with outcome severity. Hospitalization risk factors include advancing age, male sex, obesity, lower socioeconomic status, non-European ancestry and preexisting cardiometabolic conditions. While non-European ancestry was a significant risk factor for hospitalization after adjusting for sociodemographics and preexisting health conditions, we did not find evidence that these two primary genetic associations explain risk differences between populations for severe COVID-19 outcomes.

141 citations


Journal ArticleDOI
TL;DR: In this paper, the authors demonstrate that it is possible to estimate comparative biases by performing a genome-wide association study contrasting one subgroup versus another, showing that sex exhibits artifactual heritability in the presence of sex-differential participation bias.
Abstract: Genetic association results are often interpreted with the assumption that study participation does not affect downstream analyses. Understanding the genetic basis of participation bias is challenging since it requires the genotypes of unseen individuals. Here we demonstrate that it is possible to estimate comparative biases by performing a genome-wide association study contrasting one subgroup versus another. For example, we showed that sex exhibits artifactual autosomal heritability in the presence of sex-differential participation bias. By performing a genome-wide association study of sex in approximately 3.3 million males and females, we identified over 158 autosomal loci spuriously associated with sex and highlighted complex traits underpinning differences in study participation between the sexes. For example, the body mass index–increasing allele at FTO was observed at higher frequency in males compared to females (odds ratio = 1.02, P = 4.4 × 10−36). Finally, we demonstrated how these biases can potentially lead to incorrect inferences in downstream analyses and propose a conceptual framework for addressing such biases. Our findings highlight a new challenge that genetic studies may face as sample sizes continue to grow. Genetic analyses identify widespread sex-differential participation bias in population-based studies and show how this bias can lead to incorrect inferences. These findings highlight new challenges for association studies as sample sizes continue to grow.

107 citations


Journal ArticleDOI
15 Apr 2021-Cell
TL;DR: A framework for repurposing data from EHRs in concert with genomic data to explore the demographic ties that can impact disease burdens and demonstrates that fine-scale population structure can impact the prediction of complex disease risk within groups.

55 citations


Journal ArticleDOI
TL;DR: In this paper, a method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy was proposed. But the method was not applied to predict 21 highly heritable traits in the UK Biobank and 23andMe cohorts.
Abstract: Polygenic risk prediction is a widely investigated topic because of its promising clinical applications. Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, including coding, conserved, regulatory, and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank (avg N = 373 K as training data). LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R2 = 0.144; highest R2 = 0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (N = 1107 K) increased prediction R2 to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.

43 citations


Journal ArticleDOI
TL;DR: In this article, the authors developed a primary disease risk score (DRS) that combined all 32 identified genetic and non-genetic risk factors to derive lifetime risk trajectories for the three major types of skin cancers.
Abstract: We trained and validated risk prediction models for the three major types of skin cancer- basal cell carcinoma (BCC), squamous cell carcinoma (SCC), and melanoma-on a cross-sectional and longitudinal dataset of 210,000 consented research participants who responded to an online survey covering personal and family history of skin cancer, skin susceptibility, and UV exposure. We developed a primary disease risk score (DRS) that combined all 32 identified genetic and non-genetic risk factors. Top percentile DRS was associated with an up to 13-fold increase (odds ratio per standard deviation increase >2.5) in the risk of developing skin cancer relative to the middle DRS percentile. To derive lifetime risk trajectories for the three skin cancers, we developed a second and age independent disease score, called DRSA. Using incident cases, we demonstrated that DRSA could be used in early detection programs for identifying high risk asymptotic individuals, and predicting when they are likely to develop skin cancer. High DRSA scores were not only associated with earlier disease diagnosis (by up to 14 years), but also with more severe and recurrent forms of skin cancer.

35 citations


Journal ArticleDOI
TL;DR: In this article, a covariate-adjusted linkage disequilibrium score regression (cov-LDSC) was proposed to estimate SNP-heritability and its enrichment in homogenous and admixed populations with summary statistics and in-sample LD estimates.
Abstract: It is important to study the genetics of complex traits in diverse populations. Here, we introduce covariate-adjusted linkage disequilibrium (LD) score regression (cov-LDSC), a method to estimate SNP-heritability (${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}})$ and its enrichment in homogenous and admixed populations with summary statistics and in-sample LD estimates. In-sample LD can be estimated from a subset of the genome-wide association studies samples, allowing our method to be applied efficiently to very large cohorts. In simulations, we show that unadjusted LDSC underestimates ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$ by 10-60% in admixed populations; in contrast, cov-LDSC is robustly accurate. We apply cov-LDSC to genotyping data from 8124 individuals, mostly of admixed ancestry, from the Slim Initiative in Genomic Medicine for the Americas study, and to approximately 161 000 Latino-ancestry individuals, 47 000 African American-ancestry individuals and 135 000 European-ancestry individuals, as classified by 23andMe. We estimate ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$ and detect heritability enrichment in three quantitative and five dichotomous phenotypes, making this, to our knowledge, the most comprehensive heritability-based analysis of admixed individuals to date. Most traits have high concordance of ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$ and consistent tissue-specific heritability enrichment among different populations. However, for age at menarche, we observe population-specific heritability estimates of ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$. We observe consistent patterns of tissue-specific heritability enrichment across populations; for example, in the limbic system for BMI, the per-standardized-annotation effect size $ \tau $* is 0.16 ± 0.04, 0.28 ± 0.11 and 0.18 ± 0.03 in the Latino-, African American- and European-ancestry populations, respectively. Our approach is a powerful way to analyze genetic data for complex traits from admixed populations.

26 citations


Journal ArticleDOI
TL;DR: In this paper, the Templated positional Burrows-Wheeler transform (TPBWT) is proposed to make fast IBD estimates robust to genotype and phasing errors. But the authors highlight the fragility of most phase-aware IBD inference methods; the accuracy of IBD estimate can be highly sensitive to the quality of haplotype phasing.
Abstract: Estimating the genomic location and length of identical-by-descent (IBD) segments among individuals is a crucial step in many genetic analyses. However, the exponential growth in the size of biobank and direct-to-consumer genetic data sets makes accurate IBD inference a significant computational challenge. Here we present the templated positional Burrows-Wheeler transform (TPBWT) to make fast IBD estimates robust to genotype and phasing errors. Using haplotype data simulated over pedigrees with realistic genotyping and phasing errors, we show that the TPBWT outperforms other state-of-the-art IBD inference algorithms in terms of speed and accuracy. For each phase-aware method, we explore the false positive and false negative rates of inferring IBD by segment length and characterize the types of error commonly found. Our results highlight the fragility of most phased IBD inference methods; the accuracy of IBD estimates can be highly sensitive to the quality of haplotype phasing. Additionally, we compare the performance of the TPBWT against a widely used phase-free IBD inference approach that is robust to phasing errors. We introduce both in-sample and out-of-sample TPBWT-based IBD inference algorithms and demonstrate their computational efficiency on massive-scale data sets with millions of samples. Furthermore, we describe the binary file format for TPBWT-compressed haplotypes that results in fast and efficient out-of-sample IBD computes against very large cohort panels. Finally, we demonstrate the utility of the TPBWT in a brief empirical analysis, exploring geographic patterns of haplotype sharing within Mexico. Hierarchical clustering of IBD shared across regions within Mexico reveals geographically structured haplotype sharing and a strong signal of isolation by distance. Our software implementation of the TPBWT is freely available for noncommercial use in the code repository (https://github.com/23andMe/phasedibd, last accessed January 11, 2021).

16 citations


Posted ContentDOI
22 Aug 2021-medRxiv
TL;DR: This article found 42 independent genome-wide significant loci: 17 are in genes linked to or pleiotropic with cognitive ability/educational attainment; 25 are novel and may be more specifically associated with dyslexia.
Abstract: Reading and writing are crucial for many aspects of modern life but up to 1 in 10 children are affected by dyslexia [1, 2], which can persist into adulthood. Family studies of dyslexia suggest heritability up to 70% [3, 4], yet no convincing genetic markers have been found due to limited study power [5]. Here, we present a genome-wide association study representing a 20-fold increase in sample size from prior work, with 51,800 adults self-reporting a dyslexia diagnosis and 1,087,070 controls. We identified 42 independent genome-wide significant loci: 17 are in genes linked to or pleiotropic with cognitive ability/educational attainment; 25 are novel and may be more specifically associated with dyslexia. Twenty-three loci (12 novel) were validated in independent cohorts of Chinese and European ancestry. We confirmed a similar genetic aetiology of dyslexia between sexes, and found genetic covariance with many traits, including ambidexterity, but not neuroanatomical measures of language-related circuitry. Causal analyses revealed a directional effect of dyslexia on attention deficit hyperactivity disorder and bidirectional effects on socio-educational traits but these relationships require further investigation. Dyslexia polygenic scores explained up to 6% of variance in reading traits in independent cohorts, and might in future enable earlier identification and remediation of dyslexia.

9 citations


Posted ContentDOI
31 May 2021-medRxiv
TL;DR: In this paper, the authors describe the demographic patterns associated with COVID-19 related anosmia, and find the symptom is more often reported in women and younger respondents, and less often by those of East Asian and African American ancestry compared to those of European ancestry.
Abstract: Loss of sense of smell is a characteristic symptom of infection with SARS-CoV-2. However, specific mechanisms linking infection with loss of smell are poorly understood. Using self-reported symptom data from the 23andMe COVID-19 study, we describe the demographic patterns associated with COVID-19 related anosmia, and find the symptom is more often reported in women and younger respondents, and less often by those of East Asian and African American ancestry compared to those of European ancestry. We ran a trans-ethnic genome-wide association study (GWAS) comparing loss of smell or taste (n=47,298) with no loss of smell or taste (n=22,543) among those with a positive SARS-CoV-2 test result. We identified an association (rs7688383) in the vicinity of the UGT2A1 and UGT2A2 genes (OR=1.115, p-value=4x10-15), which have been linked to olfactory function. These results may shed light on the biological mechanisms underlying COVID-19 related anosmia.

DOI
05 Nov 2021
TL;DR: O'Donnell et al. as discussed by the authors constructed a new genome-wide imputation reference panel comprising 2,269 individuals of Sub-Saharan African ancestry, predominantly Atlantic African admixed with varying amounts of European and American ancestry.
Abstract: There is currently a dearth of accessible whole genome sequencing (WGS) data for individuals residing in the Americas with Sub-Saharan African ancestry. We generated whole genome sequencing data at intermediate (15×) coverage for 2,294 individuals with large amounts of Sub-Saharan African ancestry, predominantly Atlantic African admixed with varying amounts of European and American ancestry. We performed extensive comparisons of variant callers, phasing algorithms, and variant filtration on these data to construct a high quality imputation panel containing data from 2,269 unrelated individuals. With the exception of the TOPMed imputation server (which notably cannot be downloaded), our panel substantially outperformed other available panels when imputing African American individuals. The raw sequencing data, variant calls and imputation panel for this cohort are all freely available via dbGaP and should prove an invaluable resource for further study of admixed African genetics. O’Connell et al. construct a new genome-wide imputation reference panel comprising 2,269 individuals of Sub-Saharan African ancestries. They adapt DeepVariant to create best practices for reference panel development and generate a high quality, publicly available resource that will further empower high resolution genome-wide imputation efforts in individuals of African ancestries.

Journal ArticleDOI
TL;DR: In this paper, the authors present a method that infers small pedigrees of close relatives and then assembles them into larger ones, using a branch-and-bound-like approach.
Abstract: Summary Pedigree inference from genotype data is a challenging problem, particularly when pedigrees are sparsely sampled and individuals may be distantly related to their closest genotyped relatives. We present a method that infers small pedigrees of close relatives and then assembles them into larger pedigrees. To assemble large pedigrees, we introduce several formulas and tools including a likelihood for the degree separating two small pedigrees, a generalization of the fast DRUID point estimate of the degree separating two pedigrees, a method for detecting individuals who share background identity-by-descent (IBD) that does not reflect recent common ancestry, and a method for identifying the ancestral branches through which distant relatives are connected. Our method also takes several approaches that help to improve the accuracy and efficiency of pedigree inference. In particular, we incorporate age information directly into the likelihood rather than using ages only for consistency checks and we employ a heuristic branch-and-bound-like approach to more efficiently explore the space of possible pedigrees. Together, these approaches make it possible to construct large pedigrees that are challenging or intractable for current inference methods.

Posted ContentDOI
20 Jan 2021-bioRxiv
TL;DR: Ancestry deconvolution is the task of identifying the ancestral origins of chromosomal segments of admixed individuals as discussed by the authors, which has important applications, from mapping disease genes to identifying loci potentially under natural selection.
Abstract: Ancestry deconvolution is the task of identifying the ancestral origins of chromosomal segments of admixed individuals. It has important applications, from mapping disease genes to identifying loci potentially under natural selection. However, most existing methods are limited to a small number of ancestral populations and are unsuitable for large-scale applications. In this article, we describe Ancestry Composition, a modular pipeline for accurate and efficient ancestry deconvolution. In the first stage, a string-kernel support-vector-machines classifier assigns provisional ancestry labels to short statistically phased genomic segments. In the second stage, an autoregressive pair hidden Markov model corrects phasing errors, smooths local ancestry estimates, and computes confidence scores. Using publicly available datasets and more than 12,000 individuals from the customer database of the personal genetics company, 23andMe, Inc., we have constructed a reference panel containing more than 14,000 unrelated individuals of unadmixed ancestry. We used principal components analysis (PCA) and uniform manifold approximation and projection (UMAP) to identify genetic clusters and define 45 distinct reference populations upon which to train our method. In cross-validation experiments, Ancestry Composition achieves high precision and recall.

Posted ContentDOI
16 Jun 2021-medRxiv
TL;DR: In this paper, a large-scale online collection of self-reported diagnosis data is used for discovery and replication of genetic associations for rare diseases, including Duane retraction syndrome, vestibular schwannoma, and spontaneous pneumothorax.
Abstract: A key challenge in the study of rare disease genetics is assembling large case cohorts for well-powered studies. We demonstrate the use of self-reported diagnosis data to study rare diseases at scale. We performed genome-wide association studies (GWAS) for 33 rare diseases using self-reported diagnosis phenotypes and re-discovered 29 known associations to validate our approach. In addition, we performed the first GWAS for Duane retraction syndrome, vestibular schwannoma and spontaneous pneumothorax, and report novel genome-wide significant associations for these diseases. We replicated these novel associations in non-European populations within the 23andMe, Inc. cohort as well as in the UK Biobank cohort. We also show that mixed model analyses including all ethnicities and related samples increase the power for finding associations in rare diseases. Our results, based on analysis of 19,084 rare disease cases for 33 diseases from 7 populations, show that large-scale online collection of self-reported data is a viable method for discovery and replication of genetic associations for rare diseases. This approach, which is complementary to sequencing-based approaches, will enable the discovery of more novel genetic associations for increasingly rare diseases across multiple ancestries and shed more light on the genetic architecture of rare diseases.

Journal ArticleDOI
26 Mar 2021-Science
TL;DR: Hamer et al. as discussed by the authors argued that the variable "ever versus never had a same-sex partner" does not capture the complexity of human sexuality and reported follow-up analyses showing substantial overlap of the genetic influences on the main variable and on more nuanced measures of sexual behavior, attraction, and identity.
Abstract: Hamer et al argue that the variable "ever versus never had a same-sex partner" does not capture the complexity of human sexuality. We agree and said so in our paper. But Hamer et al neglect to mention that we also reported follow-up analyses showing substantial overlap of the genetic influences on our main variable and on more nuanced measures of sexual behavior, attraction, and identity.

Posted ContentDOI
08 Apr 2021-bioRxiv
TL;DR: Bonsai as discussed by the authors infers small pedigrees of close relatives and then assembles them into a larger pedigree, using a branch-and-bound-like approach to more efficiently explore the space of possible families.
Abstract: 1.AO_SCPLOWBSTRACTC_SCPLOWPedigree inference from genotype data is a challenging problem, particularly when pedigrees are sparsely sampled and individuals may be distantly related to their closest genotyped relatives. We present a new method that infers small pedigrees of close relatives and then assembles them into larger pedigrees. To assemble large pedigrees, we introduce several new formulas and tools including a new likelihood for the degree separating two small pedigrees, a method for detecting individuals who share background identity-by-descent (IBD) that does not reflect recent common ancestry, and a method for identifying the ancestral branches through which distant relatives are connected. Our method also takes several new approaches that help to improve the accuracy and efficiency of pedigree inference. In particular, we incorporate age information directly into the likelihood rather than using ages only for consistency checks and we employ a heuristic branch-and-bound-like approach to more efficiently explore the space of possible pedigrees. Together, these approaches make it possible to construct large pedigrees that are challenging or intractable for current inference methods. The new method, Bonsai, is available at https://github.com/23andMe/bonsaitree.

Patent
21 Jan 2021
TL;DR: In this article, a probabilistic hidden Markov model (HMM) is used to correct genotyping errors and phase switch errors to make fast and accurate phase aware IBD estimates.
Abstract: The disclosed embodiments concern methods, apparatus, systems and computer program products for estimating IBD segments. Some implementations use a templated positional Burrows-Wheeler transform (PBWT) technique and a phase switch error heuristic to correct genotyping errors and phase switch errors to make fast and accurate phase aware IBD estimates. In some implementations a templated PBWT technique and a probabilistic hidden Markov model (HMM) are used to correct genotyping errors and phase switch errors.

Patent
18 Mar 2021
TL;DR: In this paper, a probabilistic relationship model is used to obtain various likelihoods of various potential relationships based on pairwise IBD data and pairwise age data and display pedigree graphs with various features that are informative and easy to understand.
Abstract: The disclosed embodiments concern methods, apparatus, systems and computer program products for determining and displaying pedigrees based on IBD data. Some implementations use a probabilistic relationship model to obtain various likelihoods of various potential relationships based on pairwise IBD data and pairwise age data. Some implementations build large pedigrees by combining smaller pedigrees. Some implementations display pedigree graphs with various features that are informative and easy to understand.