Home
/
Authors
/
T. Daniel Andrews

Author

T. Daniel Andrews

Other affiliations: National Institute of Genetics, Wellcome Trust Sanger Institute, European Bioinformatics Institute

Bio: T. Daniel Andrews is an academic researcher from Australian National University. The author has contributed to research in topics: Population & Exome. The author has an hindex of 23, co-authored 34 publications receiving 10994 citations. Previous affiliations of T. Daniel Andrews include National Institute of Genetics & Wellcome Trust Sanger Institute.

Topics: Population, Exome, Copy-number variation, Exome sequencing, Genome ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Global variation in copy number in the human genome

[...]

Richard Redon¹, Shumpei Ishikawa², Karen R. Fitch³, Lars Feuk⁴, George H. Perry⁵, T. Daniel Andrews¹, Heike Fiegler¹, Michael H. Shapero³, Andrew R. Carson⁴, Wenwei Chen³, Eun Kyung Cho⁶, Stephanie Dallaire⁶, Jennifer L. Freeman⁶, Juan R. González⁷, Mònica Gratacòs⁷, Jing Huang³, Dimitrios Kalaitzopoulos¹, Daisuke Komura², Jeffrey R. MacDonald⁴, Christian R. Marshall⁴, Rui Mei³, Lyndal Montgomery¹, Keunihiro Nishimura², Kohji Okamura⁴, Fan Shen³, Martin J. Somerville⁸, Joelle Tchinda⁶, Armand Valsesia¹, Cara Woodwark¹, Fengtang Yang¹, Junjun Zhang⁴, Tatiana Zerjal¹, Jane Zhang³, Lluís Armengol⁷, Donald F. Conrad⁹, Xavier Estivill⁷, Chris Tyler-Smith¹, Nigel P. Carter¹, Hiroyuki Aburatani², Charles Lee⁶, Keith W. Jones³, Stephen W. Scherer⁴, Matthew E. Hurles¹ - Show less +39 more•Institutions (9)

Wellcome Trust Sanger Institute¹, University of Tokyo², Thermo Fisher Scientific³, University of Toronto⁴, Brigham and Women's Hospital⁵, Harvard University⁶, Pompeu Fabra University⁷, University of Alberta⁸, University of Chicago⁹

23 Nov 2006-Nature

TL;DR: A first-generation CNV map of the human genome is constructed through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia, underscoring the importance of CNV in genetic diversity and evolution and the utility of this resource for genetic disease studies.

...read moreread less

Abstract: Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.

...read moreread less

4,275 citations

Journal Article•DOI•

Origins and functional impact of copy number variation in the human genome

[...]

Donald F. Conrad¹, Dalila Pinto², Richard Redon³, Richard Redon¹, Lars Feuk², Lars Feuk⁴, Omer Gokcumen⁵, Yujun Zhang¹, Jan Aerts¹, T. Daniel Andrews¹, Chris P. Barnes¹, Peter J. Campbell¹, Tomas W Fitzgerald¹, Min Hu¹, Chun Hwa Ihm⁵, K. Kristiansson¹, Daniel G. MacArthur¹, Jeffrey R. MacDonald², Ifejinelo Onyiah¹, Andy Wing Chun Pang², Samuel Robson¹, Kathy Stirrups¹, Armand Valsesia¹, Klaudia Walter¹, John Wei², Chris Tyler-Smith¹, Nigel P. Carter¹, Charles Lee⁵, Stephen W. Scherer⁶, Stephen W. Scherer², Matthew E. Hurles¹ - Show less +27 more•Institutions (6)

Wellcome Trust Sanger Institute¹, The Centre for Applied Genomics², French Institute of Health and Medical Research³, Uppsala University⁴, Brigham and Women's Hospital⁵, University of Toronto⁶

01 Apr 2010-Nature

TL;DR: It is concluded that the heritability void left by genome-wide association studies will not be accounted for by common CNVs, and 30 loci with CNVs that are candidates for influencing disease susceptibility are identified.

...read moreread less

Abstract: Structural variations of DNA greater than 1 kilobase in size account for most bases that vary among human genomes, but are still relatively under-ascertained. Here we use tiling oligonucleotide microarrays, comprising 42 million probes, to generate a comprehensive map of 11,700 copy number variations (CNVs) greater than 443 base pairs, of which most (8,599) have been validated independently. For 4,978 of these CNVs, we generated reference genotypes from 450 individuals of European, African or East Asian ancestry. The predominant mutational mechanisms differ among CNV size classes. Retrotransposition has duplicated and inserted some coding and non-coding DNA segments randomly around the genome. Furthermore, by correlation with known trait-associated single nucleotide polymorphisms (SNPs), we identified 30 loci with CNVs that are candidates for influencing disease susceptibility. Despite this, having assessed the completeness of our map and the patterns of linkage disequilibrium between CNVs and SNPs, we conclude that, for complex traits, the heritability void left by genome-wide association studies will not be accounted for by common CNVs.

...read moreread less

1,892 citations

Journal Article•DOI•

The DNA sequence of the human X chromosome

[...]

Mark T. Ross¹, Darren Grafham¹, Alison J. Coffey¹, Steven E. Scherer² +279 more•Institutions (15)

17 Mar 2005-Nature

TL;DR: This analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome.

...read moreread less

Abstract: The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence.

...read moreread less

1,102 citations

Journal Article•DOI•

Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls

[...]

Nicholas John Craddock¹, Matthew E. Hurles², Niall Cardin³, Richard D. Pearson³ +232 more•Institutions (34)

01 Apr 2010-Nature

TL;DR: A large, direct genome-wide study of association between CNVs and eight common human diseases concludes that common CNVs that can be typed on existing platforms are unlikely to contribute greatly to the genetic basis ofcommon human diseases.

...read moreread less

Abstract: Copy number variants (CNVs) account for a major proportion of human genetic polymorphism and have been predicted to have an important role in genetic susceptibility to common disease. To address this we undertook a large, direct genome-wide study of association between CNVs and eight common human diseases. Using a purpose-designed array we typed approximately 19,000 individuals into distinct copy-number classes at 3,432 polymorphic CNVs, including an estimated approximately 50% of all common CNVs larger than 500 base pairs. We identified several biological artefacts that lead to false-positive associations, including systematic CNV differences between DNAs derived from blood and cell lines. Association testing and follow-up replication analyses confirmed three loci where CNVs were associated with disease-IRGM for Crohn's disease, HLA for Crohn's disease, rheumatoid arthritis and type 1 diabetes, and TSPAN8 for type 2 diabetes-although in each case the locus had previously been identified in single nucleotide polymorphism (SNP)-based studies, reflecting our observation that most common CNVs that are well-typed on our array are well tagged by SNPs and so have been indirectly explored through SNP studies. We conclude that common CNVs that can be typed on existing platforms are unlikely to contribute greatly to the genetic basis of common human diseases.

...read moreread less

765 citations

Journal Article•DOI•

A high-resolution survey of deletion polymorphism in the human genome

[...]

Donald F. Conrad¹, T. Daniel Andrews², Nigel P. Carter², Matthew E. Hurles², Jonathan K. Pritchard¹ - Show less +1 more•Institutions (2)

University of Chicago¹, Wellcome Trust Sanger Institute²

01 Jan 2006-Nature Genetics

TL;DR: A new method that uses SNP genotype data from parent-offspring trios to identify polymorphic deletions is reported, which will permit the identification of deletion polymorphisms in high-density SNP surveys of trio or other family data.

...read moreread less

Abstract: Recent work has shown that copy number polymorphism is an important class of genetic variation in human genomes. Here we report a new method that uses SNP genotype data from parent-offspring trios to identify polymorphic deletions. We applied this method to data from the International HapMap Project to produce the first high-resolution population surveys of deletion polymorphism. Approximately 100 of these deletions have been experimentally validated using comparative genome hybridization on tiling-resolution oligonucleotide microarrays. Our analysis identifies a total of 586 distinct regions that harbor deletion polymorphisms in one or more of the families. Notably, we estimate that typical individuals are hemizygous for roughly 30-50 deletions larger than 5 kb, totaling around 550-750 kb of euchromatic sequence across their genomes. The detected deletions span a total of 267 known and predicted genes. Overall, however, the deleted regions are relatively gene-poor, consistent with the action of purifying selection against deletions. Deletion polymorphisms may well have an important role in the genetics of complex traits; however, they are not directly observed in most current gene mapping studies. Our new method will permit the identification of deletion polymorphisms in high-density SNP surveys of trio or other family data.

...read moreread less

702 citations

1
2
3
4
…
5
6
7
8

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Model-based Analysis of ChIP-Seq (MACS)

[...]

Yong Zhang¹, Tao Liu¹, Clifford A. Meyer¹, Jérôme Eeckhoute², David S. Johnson, Bradley E. Bernstein¹, Bradley E. Bernstein³, Chad Nusbaum³, Richard M. Myers⁴, Myles Brown², Wei Li⁵, X. Shirley Liu¹ - Show less +8 more•Institutions (5)

Harvard University¹, Brigham and Women's Hospital², Broad Institute³, Stanford University⁴, Baylor College of Medicine⁵

17 Sep 2008-Genome Biology

TL;DR: This work presents Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer, and uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions.

...read moreread less

Abstract: We present Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer. MACS empirically models the shift size of ChIP-Seq tags, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, and is freely available.

...read moreread less

13,008 citations

Journal Article•DOI•

ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data

[...]

Kai Wang¹, Mingyao Li¹, Hakon Hakonarson¹•Institutions (1)

Children's Hospital of Philadelphia¹

01 Sep 2010-Nucleic Acids Research

TL;DR: The ANNOVAR tool to annotate single nucleotide variants and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP is developed.

...read moreread less

Abstract: High-throughput sequencing platforms are generating massive amounts of genetic variation data for diverse genomes, but it remains a challenge to pinpoint a small subset of functionally important variants. To fill these unmet needs, we developed the ANNOVAR tool to annotate single nucleotide variants (SNVs) and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP. ANNOVAR can utilize annotation databases from the UCSC Genome Browser or any annotation data set conforming to Generic Feature Format version 3 (GFF3). We also illustrate a 'variants reduction' protocol on 4.7 million SNVs and indels from a human genome, including two causal mutations for Miller syndrome, a rare recessive disease. Through a stepwise procedure, we excluded variants that are unlikely to be causal, and identified 20 candidate genes including the causal gene. Using a desktop computer, ANNOVAR requires ∼4 min to perform gene-based annotation and ∼15 min to perform variants reduction on 4.7 million variants, making it practical to handle hundreds of human genomes in a day. ANNOVAR is freely available at http://www.openbioinformatics.org/annovar/.

...read moreread less

10,461 citations

Journal Article•DOI•

An integrated map of genetic variation from 1,092 human genomes

[...]

Gonçalo R. Abecasis¹, Adam Auton², Lisa D. Brooks³, Mark A. DePristo⁴, Richard Durbin⁵, Robert E. Handsaker⁶, Robert E. Handsaker⁴, Hyun Min Kang¹, Gabor T. Marth⁷, Gil McVean⁸ - Show less +6 more•Institutions (8)

University of Michigan¹, Yeshiva University², National Institutes of Health³, Broad Institute⁴, Wellcome Trust Sanger Institute⁵, Harvard University⁶, Boston College⁷, University of Oxford⁸

01 Nov 2012-Nature

TL;DR: It is shown that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites.

...read moreread less

Abstract: By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methods to integrate information across several algorithms and diverse data sources, we provide a validated haplotype map of 38 million single nucleotide polymorphisms, 1.4 million short insertions and deletions, and more than 14,000 larger deletions. We show that individuals from different populations carry different profiles of rare and common variants, and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites. This resource, which captures up to 98% of accessible single nucleotide polymorphisms at a frequency of 1% in related populations, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations.

...read moreread less

7,710 citations

Journal Article•DOI•

A Map of Human Genome Variation From Population-Scale Sequencing

[...]

Gonçalo R. Abecasis¹, David Altshuler², David Altshuler³, Adam Auton⁴, Lisa D Brooks⁵, Richard Durbin⁶, Richard A. Gibbs⁷, Matthew E. Hurles⁶, Gil McVean⁴ - Show less +5 more•Institutions (7)

University of Michigan¹, Harvard University², Broad Institute³, University of Oxford⁴, Johns Hopkins University⁵, Wellcome Trust Sanger Institute⁶, Baylor College of Medicine⁷

28 Oct 2010-Nature

TL;DR: The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype as mentioned in this paper, and the results of the pilot phase of the project, designed to develop and compare different strategies for genomewide sequencing with high-throughput platforms.

...read moreread less

Abstract: The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

...read moreread less

7,538 citations

Journal Article•DOI•

The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups

[...]

Christina Curtis¹, Christina Curtis², Sohrab P. Shah³, Suet-Feung Chin¹, Gulisa Turashvili³, Oscar M. Rueda¹, Mark J Dunning, Doug Speed¹, Doug Speed², Andy G. Lynch¹, Shamith A. Samarajiwa¹, Yinyin Yuan¹, Stefan Gräf¹, Gavin Ha³, Gholamreza Haffari³, Ali Bashashati³, Roslin Russell, Steven McKinney³, Anita Langerød⁴, Andrew R. Green⁵, Elena Provenzano¹, Gordon C. Wishart¹, Sarah E Pinder⁶, Peter H. Watson⁷, Peter H. Watson³, Florian Markowetz¹, Leigh C. Murphy⁷, Ian O. Ellis⁵, Arnie Purushotham⁶, Arnie Purushotham⁸, Anne Lise Børresen-Dale⁹, Anne Lise Børresen-Dale⁴, James D. Brenton, Simon Tavaré, Carlos Caldas, Samuel Aparicio³ - Show less +32 more•Institutions (9)

University of Cambridge¹, University of Southern California², University of British Columbia³, Oslo University Hospital⁴, University of Nottingham⁵, King's College London⁶, University of Manitoba⁷, Guy's and St Thomas' NHS Foundation Trust⁸, University of Oslo⁹

21 Jun 2012-Nature

TL;DR: The results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome, and identify novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort.

...read moreread less

Abstract: The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in 40% of genes, with the landscape dominated by cisand trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA–RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome.

...read moreread less

4,722 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse