Richard K. Wilson

Other affiliations: University of Washington, St. Jude Children's Research Hospital, Memorial Sloan Kettering Cancer Center

Bio: Richard K. Wilson is an academic researcher at Nationwide Children's Hospital. The author has contributed to research on the topics Genome and Gene. The author has an h-index of 173 and has co-authored 463 publications receiving 260,000 citations. Previous affiliations of Richard K. Wilson include University of Washington and St. Jude Children's Research Hospital.

Topics: Genome, Gene, Exome sequencing, Genomics, Human genome

Papers published on a yearly basis (years with publications): 1980, 1983-1987, 1990, 1992-2023

Papers

Journal Article

Low pressure DNA shearing: a method for random DNA sequence analysis.

Lawrence A. Schriefer¹, Beth K. Gebauer¹, Lisa Q.Q. Qiu¹, Robert H. Waterston¹, Richard K. Wilson¹

University of Washington¹

01 Jan 1990-Nucleic Acids Research

TL;DR: The results of these initial experiments suggested that low-pressure shearing offered a useful alternative to sonic and enzymatic DNA fragmentation methods, and additional DNA sequencing experiments using subclones produced by the low pressure-shearing method are in progress.

Abstract: Several methods have been described for random fragmentation of DNA. These methods, often used for library preparation and subcloning prior to DNA sequence analysis, include sonic treatment (1, 2), partial digestion by restriction endonucleases (3) and treatment with DNase I in the presence of manganese ions (4). While all of these methods have been used successfully to prepare random DNA fragments for further manipulation and analysis, each has difficulties and limitations. In an effort to minimize template DNA preparation tasks and simplify primer- and PCR-directed DNA closure methods after an initial shotgun sequencing approach, we wished to prepare random subclones containing inserts with an average size of 4 to 6 kilobase pairs (kb). As an alternative approach, several different DNA samples were passed through a small French pressure cell at a variety of low to intermediate pressures (Figure 1b). A lever device was constructed to allow controlled application of low to intermediate pressures to the cell (Figure 1a). The results of these initial experiments suggested that low-pressure shearing offered a useful alternative to sonic and enzymatic DNA fragmentation methods. Subsequently, regions of the Caenorhabditis elegans genome cloned in cosmid vectors were sheared using an application of 250 psi. Shearing experiments with three different C. elegans cosmid clones (insert sizes ca. 35-42 kb) all produced essentially the same results (data not shown). The sheared cosmid DNA fragments were made flush with T4 DNA polymerase in the presence of 100 µM dNTPs (2), and DNA fragments of the desired size range were purified by preparative agarose gel electrophoresis and subcloned in the HincII (SalI) site of the phagemid vector pUC118. To check the efficiency of this subcloning method, 109 of these subclones were examined by standard plasmid mini-prep and agarose gel electrophoresis procedures. 101 (92%) subclones contained an insert of the expected 4 to 6 kb size range. 72 subclone DNAs were sequenced using a linear amplification method with fluorescent dye-labeled primers. Identical subclones were not observed in this analysis, and no sequence-specific shearing hot spots were detected. Additional DNA sequencing experiments using subclones produced by the low pressure-shearing method are in progress in order to determine the complete nucleotide sequence of a 100 kb region in the large cluster of C. elegans chromosome III.


64 citations

Journal Article

BreakFusion: targeted assembly-based identification of gene fusions in whole transcriptome paired-end sequencing data.

Ken Chen¹, John W. Wallis², Cyriac Kandoth², Joelle Kalicki-Veizer², Karen Mungall³, Andrew J. Mungall³, Steven J.M. Jones³, Marco A. Marra³, Timothy J. Ley², Elaine R. Mardis², Richard K. Wilson², John N. Weinstein, Li Ding²

University of Texas MD Anderson Cancer Center¹, Washington University in St. Louis², University of British Columbia³

15 Jul 2012-Bioinformatics

TL;DR: A software package, BreakFusion, is presented that combines the strength of reference alignment followed by read-pair analysis and de novo assembly to achieve a good balance in sensitivity, specificity and computational efficiency.

Abstract: Summary: Despite recent progress, computational tools that identify gene fusions from next-generation whole transcriptome sequencing data are often limited in accuracy and scalability. Here, we present a software package, BreakFusion, that combines the strength of reference alignment followed by read-pair analysis and de novo assembly to achieve a good balance in sensitivity, specificity and computational efficiency. Availability: http://bioinformatics.mdanderson.org/main/BreakFusion Contact: kchen3@mdanderson.org; lding@genome.wustl.edu Supplementary information: Supplementary data are available at Bioinformatics online.


64 citations

Journal Article

Novel venom gene discovery in the platypus

Camilla M. Whittington¹,², Anthony T. Papenfuss³, Devin P. Locke¹, Elaine R. Mardis¹, Richard K. Wilson¹, Sahar Abubucker¹, Makedonka Mitreva¹, Emily S. W. Wong², Arthur Hsu³, Philip W. Kuchel², Katherine Belov², Wesley C. Warren¹

Washington University in St. Louis¹, University of Sydney², Walter and Eliza Hall Institute of Medical Research³

29 Sep 2010-Genome Biology

TL;DR: 83 novel putative platypus venom genes from 13 toxin families, which are homologous to known toxins from a wide range of vertebrates and invertebrates are identified, providing insight into the evolution of mammalian venom.

Abstract: To date, few peptides in the complex mixture of platypus venom have been identified and sequenced, in part due to the limited amounts of platypus venom available to study. We have constructed and sequenced a cDNA library from an active platypus venom gland to identify the remaining components. We identified 83 novel putative platypus venom genes from 13 toxin families, which are homologous to known toxins from a wide range of vertebrates (fish, reptiles, insectivores) and invertebrates (spiders, sea anemones, starfish). A number of these are expressed in tissues other than the venom gland, and at least three of these families (those with homology to toxins from distant invertebrates) may play non-toxin roles. Thus, further functional testing is required to confirm venom activity. However, the presence of similar putative toxins in such widely divergent species provides further evidence for the hypothesis that there are certain protein families that are selected preferentially during evolution to become venom peptides. We have also used homology with known proteins to speculate on the contributions of each venom component to the symptoms of platypus envenomation. This study represents a step towards fully characterizing the first mammal venom transcriptome. We have found similarities between putative platypus toxins and those of a number of unrelated species, providing insight into the evolution of mammalian venom.

63 citations

Journal Article

CMDS: A population-based method for identifying recurrent DNA copy number aberrations in cancer from high-resolution data

Qunyuan Zhang¹, Li Ding¹, David E. Larson¹, Daniel C. Koboldt¹, Michael D. McLellan¹, Ken Chen, Xiaoqi Shi¹, Aldi T. Kraja¹, Elaine R. Mardis¹, Richard K. Wilson¹, Ingrid B. Borecki¹, Michael A. Province¹

Washington University in St. Louis¹

15 Feb 2010-Bioinformatics

TL;DR: CMDS provides a fast, powerful and easily implemented tool for the RCNA analysis of large-scale data from cancer genomes and is statistically powerful, computationally efficient and particularly suitable for high-resolution and large-population studies.

Abstract: Motivation: DNA copy number aberration (CNA) is a hallmark of genomic abnormality in tumor cells. Recurrent CNA (RCNA) occurs in multiple cancer samples across the same chromosomal region and has greater implication in tumorigenesis. Current commonly used methods for RCNA identification require CNA calling for individual samples before cross-sample analysis. This two-step strategy may result in a heavy computational burden, as well as a loss of the overall statistical power due to segmentation and discretization of individual sample's data. We propose a population-based approach for RCNA detection with no need of single-sample analysis, which is statistically powerful, computationally efficient and particularly suitable for high-resolution and large-population studies. Results: Our approach, correlation matrix diagonal segmentation (CMDS), identifies RCNAs based on a between-chromosomal-site correlation analysis. Directly using the raw intensity ratio data from all samples and adopting a diagonal transformation strategy, CMDS substantially reduces computational burden and can obtain results very quickly from large datasets. Our simulation indicates that the statistical power of CMDS is higher than that of single-sample CNA calling based two-step approaches. We applied CMDS to two real datasets of lung cancer and brain cancer from Affymetrix and Illumina array platforms, respectively, and successfully identified known regions of CNA associated with EGFR, KRAS and other important oncogenes. CMDS provides a fast, powerful and easily implemented tool for the RCNA analysis of large-scale data from cancer genomes. Availability: The R and C programs implementing our method are available at https://dsgweb.wustl.edu/qunyuan/software/cmds. Contact: qunyuan@wustl.edu Supplementary information: Supplementary data are available at Bioinformatics online.


62 citations

Journal Article

Comparative genomic analysis of six Glossina genomes, vectors of African trypanosomes

Geoffrey M. Attardo¹, Adly M. M. Abd-Alla², Alvaro Acosta-Serrano³, James E. Allen⁴, Rosemary Bateta, Joshua B. Benoit⁵, Kostas Bourtzis², Jelle Caers⁶, Guy Caljon⁷, Mikkel B. Christensen⁴, David W. Farrow⁵, Markus Friedrich⁸, Aurélie Hua-Van⁹, Emily C. Jennings⁵, Denis M. Larkin¹⁰, Daniel Lawson¹¹, Michael J. Lehane³, Vasileios Panagiotis Lenis¹², Ernesto Lowy-Gallego⁴, Rosaline W. Macharia¹³, Anna R. Malacrida¹⁴, Heather G. Marco¹⁵, Daniel K. Masiga, Gareth Maslen⁴, Irina Matetovici¹⁶, Richard P. Meisel¹⁷, Irene K. Meki², Veronika Michalkova¹⁸,¹⁹, Wolfgang J. Miller²⁰, Patrick Minx²¹, Paul O. Mireji²², Lino Ometto¹⁴, Andrew G. Parker², Rita V. M. Rio²³, Clair Rose³, Andrew J. Rosendale⁵,²⁴, Omar Rota-Stabelli, Grazia Savini¹⁴, Liliane Schoofs⁶, Francesca Scolari¹⁴, Martin T. Swain²⁵, Peter Takac, Chad Tomlinson²¹, George Tsiamis²⁶, Jan Van Den Abbeele¹⁶, Aurélien Vigneron²⁷, Jingwen Wang²⁸, Wesley C. Warren²¹,²⁹, Robert M. Waterhouse³⁰, Matthew T. Weirauch³¹, Brian L. Weiss²⁷, Richard K. Wilson²¹, Xin Zhao³², Serap Aksoy²⁷

University of California, Davis¹, International Atomic Energy Agency², Liverpool School of Tropical Medicine³, European Bioinformatics Institute⁴, University of Cincinnati⁵, Katholieke Universiteit Leuven⁶, University of Antwerp⁷, Wayne State University⁸, Université Paris-Saclay⁹, Royal Veterinary College¹⁰, Imperial College London¹¹, University of Plymouth¹², University of Nairobi¹³, University of Pavia¹⁴, University of Cape Town¹⁵, Institute of Tropical Medicine Antwerp¹⁶, University of Houston¹⁷, Florida International University¹⁸, Slovak Academy of Sciences¹⁹, Medical University of Vienna²⁰, Washington University in St. Louis²¹, Kenya Medical Research Institute²², West Virginia University²³, Saint Joseph's University²⁴, Aberystwyth University²⁵, University of Patras²⁶, Yale University²⁷, Fudan University²⁸, University of Missouri²⁹, Swiss Institute of Bioinformatics³⁰, Cincinnati Children's Hospital Medical Center³¹, Chinese Academy of Sciences³²

02 Sep 2019-Genome Biology

TL;DR: Comparative genomic analyses validate established evolutionary relationships and sub-genera, provide insight into the evolutionary biology underlying novel adaptations, and are relevant to applied aspects of vector control such as trap design and discovery of novel pest and disease control strategies.

Abstract: Tsetse flies (Glossina sp.) are the vectors of human and animal trypanosomiasis throughout sub-Saharan Africa. Tsetse flies are distinguished from other Diptera by unique adaptations, including lactation and the birthing of live young (obligate viviparity), a vertebrate blood-specific diet by both sexes, and obligate bacterial symbiosis. This work describes the comparative analysis of six Glossina genomes representing three sub-genera: Morsitans (G. morsitans morsitans, G. pallidipes, G. austeni), Palpalis (G. palpalis, G. fuscipes), and Fusca (G. brevipalpis) which represent different habitats, host preferences, and vectorial capacity. Genomic analyses validate established evolutionary relationships and sub-genera. Syntenic analysis of Glossina relative to Drosophila melanogaster shows reduced structural conservation across the sex-linked X chromosome. Sex-linked scaffolds show increased rates of female-specific gene expression and lower evolutionary rates relative to autosome associated genes. Tsetse-specific genes are enriched in protease, odorant-binding, and helicase activities. Lactation-associated genes are conserved across all Glossina species while male seminal proteins are rapidly evolving. Olfactory and gustatory genes are reduced across the genus relative to other insects. Vision-associated Rhodopsin genes show conservation of motion detection/tracking functions and variance in the Rhodopsin detecting colors in the blue wavelength ranges. Expanded genomic discoveries reveal the genetics underlying Glossina biology and provide a rich body of knowledge for basic science and disease control. They also provide insight into the evolutionary biology underlying novel adaptations and are relevant to applied aspects of vector control such as trap design and discovery of novel pest and disease control strategies.

60 citations

Cited by

Journal Article

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Stephen F. Altschul¹, Thomas L. Madden, Alejandro A. Schäffer¹, Jinghui Zhang, Zheng Zhang², Webb Miller², David J. Lipman

National Institutes of Health¹, Pennsylvania State University²

01 Sep 1997-Nucleic Acids Research

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.

Abstract: The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.


70,111 citations

Journal Article

Initial sequencing and analysis of the human genome.

Eric S. Lander, Lauren Linton, Bruce W. Birren, Chad Nusbaum, and 245 more authors (29 institutions)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal Article

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

20,557 citations

Journal Article

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

Ben Langmead¹, Cole Trapnell¹, Mihai Pop¹, Steven L. Salzberg¹

University of Maryland, College Park¹

04 Mar 2009-Genome Biology

TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches, and multiple processor cores can be used simultaneously to achieve even greater alignment speeds.

Abstract: Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source (http://bowtie.cbcb.umd.edu).


20,335 citations

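Variation in the switching rate of Plasmodium var genes leads to antigenic variation [in English] / Paul H, Robert P, Christodoulou Z, et al. // Proc Natl Acad Sci U S A

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfEMP1 interacts with one or more receptors on infected erythrocytes, dendritic cells and the placenta, and plays a key role in adhesion and immune evasion.

Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells and the placenta, and plays a key role in adhesion and immune evasion. The var gene family has about 60 members per haploid genome, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.
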

18,940 citations
