Home
/
Authors
/
Wing-Kin Sung

Author

Wing-Kin Sung

Other affiliations: University of Hong Kong, Yale University, Huazhong Agricultural University ...read more

Bio: Wing-Kin Sung is an academic researcher from National University of Singapore. The author has contributed to research in topics: Gene & Chromatin immunoprecipitation. The author has an hindex of 64, co-authored 327 publications receiving 26116 citations. Previous affiliations of Wing-Kin Sung include University of Hong Kong & Yale University.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1994

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genome-wide survey of recurrent HBV integration in hepatocellular carcinoma

[...]

Wing-Kin Sung¹, Hancheng Zheng², Shuyu Li³, Ronghua Chen⁴, Xiao Liu², Yingrui Li², Nikki P. Lee¹, Wah Heng Lee⁵, Pramila N. Ariyaratne⁵, Chandana Tennakoon⁶, Fabianus Hendriyan Mulawadi⁵, Kwong F. Wong, Angela M. Liu, Ronnie T.P. Poon¹, Sheung Tat Fan¹, KL Chan¹, Zhuolin Gong², Yujie Hu², Zhao Lin², Guan Wang², Qinghui Zhang², Thomas D. Barber³, Wen-Chi Chou³, Amit Aggarwal³, Ke Hao⁴, Wei Zhou⁴, Chunsheng Zhang⁴, James S. Hardwick⁷, James S. Hardwick⁴, Carolyn A. Buser⁴, Jiangchun Xu⁸, Zhengyan Kan⁸, Hongyue Dai⁴, Mao Mao⁸, Mao Mao⁷, Christoph Reinhard³, Jun Wang², Jun Wang⁹, John M. Luk - Show less +35 more•Institutions (9)

University of Hong Kong¹, Beijing Genomics Institute², Eli Lilly and Company³, Merck & Co.⁴, Genome Institute of Singapore⁵, National University of Singapore⁶, Wilmington University⁷, Pfizer⁸, University of Copenhagen⁹

01 Jul 2012-Nature Genetics

TL;DR: Evidence is reported that suggests that the number of HBV integrations is associated with patient survival and copy-number variations were significantly increased at HBV breakpoint locations where chromosomal instability was likely induced.

...read moreread less

Abstract: To survey hepatitis B virus (HBV) integration in liver cancer genomes, we conducted massively parallel sequencing of 81 HBV-positive and 7 HBV-negative hepatocellular carcinomas (HCCs) and adjacent normal tissues. We found that HBV integration is observed more frequently in the tumors (86.4%) than in adjacent liver tissues (30.7%). Copy-number variations (CNVs) were significantly increased at HBV breakpoint locations where chromosomal instability was likely induced. Approximately 40% of HBV breakpoints within the HBV genome were located within a 1,800-bp region where the viral enhancer, X gene and core gene are located. We also identified recurrent HBV integration events (in ≥4 HCCs) that were validated by RNA sequencing (RNA-seq) and Sanger sequencing at the known and putative cancer-related TERT, MLL4 and CCNE1 genes, which showed upregulated gene expression in tumor versus normal tissue. We also report evidence that suggests that the number of HBV integrations is associated with patient survival.

...read moreread less

772 citations

Journal Article•DOI•

CTCF-mediated functional chromatin interactome in pluripotent cells

[...]

Lusy Handoko¹, Han Xu¹, Guoliang Li¹, Chew Yee Ngan¹, Elaine G.Y. Chew¹, Marie Schnapp¹, Charlie Wah Heng Lee¹, Chaopeng Ye¹, Joanne Lim Hui Ping¹, Fabianus Hendriyan Mulawadi¹, Eleanor Wong¹, Eleanor Wong², Jianpeng Sheng³, Yubo Zhang¹, Thompson Poh¹, Chee Seng Chan¹, Galih Kunarso², Atif Shahab¹, Guillaume Bourque¹, Valere Cacheux-Rataboul¹, Wing-Kin Sung¹, Wing-Kin Sung², Yijun Ruan¹, Chia-Lin Wei⁴, Chia-Lin Wei², Chia-Lin Wei¹ - Show less +22 more•Institutions (4)

Genome Institute of Singapore¹, National University of Singapore², Nanyang Technological University³, Joint Genome Institute⁴

01 Jul 2011-Nature Genetics

TL;DR: Five distinct chromatin domains are uncovered that suggest potential new models of CTCF function in chromatin organization and transcriptional control, and demarcate chromatin-nuclear membrane attachments and influence proper gene expression through extensive cross-talk between promoters and regulatory elements.

...read moreread less

Abstract: Mammalian genomes are viewed as functional organizations that orchestrate spatial and temporal gene regulation. CTCF, the most characterized insulator-binding protein, has been implicated as a key genome organizer. However, little is known about CTCF-associated higher-order chromatin structures at a global scale. Here we applied chromatin interaction analysis by paired-end tag (ChIA-PET) sequencing to elucidate the CTCF-chromatin interactome in pluripotent cells. From this analysis, we identified 1,480 cis- and 336 trans-interacting loci with high reproducibility and precision. Associating these chromatin interaction loci with their underlying epigenetic states, promoter activities, enhancer binding and nuclear lamina occupancy, we uncovered five distinct chromatin domains that suggest potential new models of CTCF function in chromatin organization and transcriptional control. Specifically, CTCF interactions demarcate chromatin-nuclear membrane attachments and influence proper gene expression through extensive cross-talk between promoters and regulatory elements. This highly complex nuclear organization offers insights toward the unifying principles that govern genome plasticity and function.

...read moreread less

642 citations

Journal Article•DOI•

Whole-genome mapping of histone H3 Lys4 and 27 trimethylations reveals distinct genomic compartments in human embryonic stem cells.

[...]

Xiao Dong Zhao¹, Xu Han¹, Joon Lin Chew², Joon Lin Chew¹, Jun Liu¹, Kuo Ping Chiu¹, Andre Choo², Yuriy L. Orlov¹, Wing-Kin Sung², Wing-Kin Sung¹, Atif Shahab¹, Vladimir A. Kuznetsov¹, Guillaume Bourque¹, Steve Oh², Yijun Ruan¹, Huck-Hui Ng¹, Huck-Hui Ng², Chia-Lin Wei¹ - Show less +14 more•Institutions (2)

Genome Institute of Singapore¹, National University of Singapore²

13 Sep 2007-Cell Stem Cell

TL;DR: These global histone methylation maps provide an epigenetic framework that enables the discovery of novel transcriptional networks and delineation of different genetic compartments of the pluripotent cell genome.

...read moreread less

597 citations

Journal Article•DOI•

Assemblathon 1: A competitive assessment of de novo short read assembly methods

[...]

Dent Earl¹, Keith Bradnam², John St. John, Aaron E. Darling², Dawei Lin², Joseph Fass², Hung On Ken Yu², Vince Buffalo², Daniel R. Zerbino¹, Mark Diekhans, Ngan Nguyen, Pramila N. Ariyaratne³, Wing-Kin Sung⁴, Wing-Kin Sung³, Zemin Ning⁵, Matthias Haimel⁶, Jared T. Simpson⁵, Nuno A. Fonseca⁷, Inanc Birol, T. Roderick Docking, Isaac Ho⁸, Daniel S. Rokhsar⁸, Rayan Chikhi, Dominique Lavenier⁹, Dominique Lavenier¹⁰, Guillaume Chapuis, Delphine Naquin¹⁰, Delphine Naquin⁹, Nicolas Maillet⁹, Nicolas Maillet¹⁰, Michael C. Schatz¹¹, David R. Kelley¹², Adam M. Phillippy, Sergey Koren, Shiaw-Pyng Yang¹³, Wei Wu¹³, Wen-Chi Chou, Anuj Srivastava, Timothy I. Shaw, J. Graham Ruby¹⁴, J. Graham Ruby¹⁵, Peter Skewes-Cox¹⁴, Peter Skewes-Cox¹⁵, Miguel Betegon¹⁴, Miguel Betegon¹⁵, Michelle Dimon¹⁵, Michelle Dimon¹⁴, Victor V. Solovyev¹⁶, Igor Seledtsov, Petr Kosarev, Denis Vorobyev, Ricardo H. Ramirez-Gonzalez, Richard M. Leggett¹⁷, Dan MacLean¹⁷, Fangfang Xia, Ruibang Luo¹⁸, Zhenyu Li¹⁸, Yinlong Xie¹⁸, Binghang Liu¹⁸, Sante Gnerre¹⁹, Iain MacCallum¹⁹, Dariusz Przybylski¹⁹, Filipe J. Ribeiro¹⁹, Shuangye Yin¹⁹, Ted Sharpe¹⁹, Giles Hall¹⁹, Paul J. Kersey⁶, Richard Durbin⁵, Shaun D. Jackman, Jarrod Chapman⁸, Xiaoqiu Huang, Joseph L. DeRisi¹⁴, Mario Caccamo, Yingrui Li¹⁸, David B. Jaffe¹⁹, Richard E. Green¹, David Haussler¹⁴, Ian F Korf, Benedict Paten¹⁴ - Show less +75 more•Institutions (19)

University of California, Santa Cruz¹, University of California, Davis², Agency for Science, Technology and Research³, National University of Singapore⁴, Wellcome Trust Sanger Institute⁵, European Bioinformatics Institute⁶, University of Porto⁷, Joint Genome Institute⁸, Centre national de la recherche scientifique⁹, French Institute for Research in Computer Science and Automation¹⁰, Cold Spring Harbor Laboratory¹¹, University of Maryland, College Park¹², Monsanto¹³, Howard Hughes Medical Institute¹⁴, University of California, San Francisco¹⁵, Royal Holloway, University of London¹⁶, Sainsbury Laboratory¹⁷, Beijing Genomics Institute¹⁸, Broad Institute¹⁹

16 Sep 2011-Genome Research

TL;DR: The Assemblathon 1 competition is described, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies, and it is established that it is possible to assemble the genome to a high level of coverage and accuracy.

...read moreread less

Abstract: Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome A total of 41 assemblies from 17 different groups were received Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://wwwassemblathonorg/

...read moreread less

548 citations

Journal Article•DOI•

Exploiting indirect neighbours and topological weight to predict protein function from protein--protein interactions

[...]

Hon Nian Chua¹, Wing-Kin Sung¹, Limsoon Wong¹•Institutions (1)

National University of Singapore¹

01 Jul 2006-Bioinformatics

TL;DR: An algorithm is developed that predicts the functions of a protein in two steps by estimating its functional similarity with the protein using the local topology of the interaction network as well as the reliability of experimental sources and scoring each function based on its weighted frequency in these neighbours.

...read moreread less

Abstract: Motivation: Most approaches in predicting protein function from protein--protein interaction data utilize the observation that a protein often share functions with proteins that interacts with it (its level-1 neighbours). However, proteins that interact with the same proteins (i.e. level-2 neighbours) may also have a greater likelihood of sharing similar physical or biochemical characteristics. We speculate that functional similarity between a protein and its neighbours from the two different levels arise from two distinct forms of functional association, and a protein is likely to share functions with its level-1 and/or level-2 neighbours. We are interested in finding out how significant is functional association between level-2 neighbours and how they can be exploited for protein function prediction. Results: We made a statistical study on recent interaction data and observed that functional association between level-2 neighbours is clearly observable. A substantial number of proteins are observed to share functions with level-2 neighbours but not with level-1 neighbours. We develop an algorithm that predicts the functions of a protein in two steps: (1) assign a weight to each of its level-1 and level-2 neighbours by estimating its functional similarity with the protein using the local topology of the interaction network as well as the reliability of experimental sources and (2) scoring each function based on its weighted frequency in these neighbours. Using leave-one-out cross validation, we compare the performance of our method against that of several other existing approaches and show that our method performs relatively well. Contact: g0306417@nus.edu.sg

...read moreread less

539 citations

1
2
3
4
5
…
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast and accurate short read alignment with Burrows–Wheeler transform

[...]

Heng Li¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

...read moreread less

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

...read moreread less

43,862 citations

Journal Article•DOI•

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

[...]

Ben Langmead¹, Cole Trapnell¹, Mihai Pop¹, Steven L. Salzberg¹•Institutions (1)

University of Maryland, College Park¹

04 Mar 2009-Genome Biology

TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.

...read moreread less

Abstract: Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source http://bowtie.cbcb.umd.edu.

...read moreread less

20,335 citations

Journal Article•DOI•

An integrated encyclopedia of DNA elements in the human genome

[...]

Principal investigators¹, Nhgri groups², Data production leads³, Lead analysts³•Institutions (3)

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

13,548 citations

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

Journal Article•DOI•

Model-based Analysis of ChIP-Seq (MACS)

[...]

Yong Zhang¹, Tao Liu¹, Clifford A. Meyer¹, Jérôme Eeckhoute², David S. Johnson, Bradley E. Bernstein¹, Bradley E. Bernstein³, Chad Nusbaum³, Richard M. Myers⁴, Myles Brown², Wei Li⁵, X. Shirley Liu¹ - Show less +8 more•Institutions (5)

Harvard University¹, Brigham and Women's Hospital², Broad Institute³, Stanford University⁴, Baylor College of Medicine⁵

17 Sep 2008-Genome Biology

TL;DR: This work presents Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer, and uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions.

...read moreread less

Abstract: We present Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer. MACS empirically models the shift size of ChIP-Seq tags, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, and is freely available.

...read moreread less

13,008 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse