Showing papers by "Wing-Kin Sung published in 2017"

PDF

Open Access

Journal Article•DOI•

Erratum: An intrinsic mechanism controls reactivation of neural stem cells by spindle matrix proteins.

[...]

Song Li¹, Chwee Tat Koe¹, Su Ting Tay¹, Angie Lay Keng Tan¹, Shenli Zhang¹, Yingjie Zhang¹, Patrick Tan, Wing-Kin Sung¹, Wing-Kin Sung², Hongyan Wang¹ - Show less +6 more•Institutions (2)

National University of Singapore¹, Genome Institute of Singapore²

25 Jul 2017-Nature Communications

TL;DR: In Drosophila larval brain, Chro promotes neural stem cell reactivation and prevents activated NSCs from entering quiescence, and that Chro carries out such a role by regulating the expression of key transcription factors in the nucleus.

...read moreread less

Abstract: The switch between quiescence and proliferation is central for neurogenesis and its alteration is linked to neurodevelopmental disorders such as microcephaly. However, intrinsic mechanisms that reactivate Drosophila larval neural stem cells (NSCs) to exit from quiescence are not well established. Here we show that the spindle matrix complex containing Chromator (Chro) functions as a key intrinsic regulator of NSC reactivation downstream of extrinsic insulin/insulin-like growth factor signalling. Chro also prevents NSCs from re-entering quiescence at later stages. NSC-specific in vivo profiling has identified many downstream targets of Chro, including a temporal transcription factor Grainy head (Grh) and a neural stem cell quiescence-inducing factor Prospero (Pros). We show that spindle matrix proteins promote the expression of Grh and repress that of Pros in NSCs to govern their reactivation. Our data demonstrate that nuclear Chro critically regulates gene expression in NSCs at the transition from quiescence to proliferation.The spindle matrix proteins, including Chro, are known to regulate mitotic spindle assembly in the cytoplasm. Here the authors show that in Drosophila larval brain, Chro promotes neural stem cell (NSC) reactivation and prevents activated NSCs from entering quiescence, and that Chro carries out such a role by regulating the expression of key transcription factors in the nucleus.

...read moreread less

24 citations

Journal Article•DOI•

BATVI: Fast, sensitive and accurate detection of virus integrations.

[...]

Chandana Tennakoon¹, Chandana Tennakoon², Wing-Kin Sung³, Wing-Kin Sung²•Institutions (3)

United Arab Emirates University¹, Genome Institute of Singapore², National University of Singapore³

14 Mar 2017-BMC Bioinformatics

TL;DR: The performance of BatVI was compared with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients and it was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%.

...read moreread less

Abstract: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow and are not very sensitive. This paper introduces a new method BatVI to predict viral integrations. Our method uses a fast screening method to filter out chimeric reads containing possible viral integrations. Next, sensitive alignments of these candidate chimeric reads are called by BLAST. Chimeric reads that are co-localized in the human genome are clustered. Finally, by assembling the chimeric reads in each cluster, high confident virus integration sites are extracted. We compared the performance of BatVI with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients. BatVI ran an order of magnitude faster and was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%. For the liver cancer datasets, BatVI uncovered novel integrations to two important genes TERT and MLL4, which were missed by previous studies. Through gene expression data, we verified the correctness of these additional integrations. BatVI can be downloaded from http://biogpu.ddns.comp.nus.edu.sg/~ksung/batvi/index.html .

...read moreread less

18 citations

Journal Article•DOI•

Serine peptidase inhibitor Kazal type 1 (SPINK1) as novel downstream effector of the cadherin-17/β-catenin axis in hepatocellular carcinoma

[...]

Felix H. Shek¹, Felix H. Shek², Ruibang Luo², Brian Y.H. Lam³, Wing-Kin Sung⁴, Wing-Kin Sung⁵, Tak-Wah Lam², John M. Luk², Ming Sum Leung², Kin Tak Chan², Hector K. Wang², Chung Man Chan², Chung Man Chan¹, Ronnie T.P. Poon², Nikki P. Lee², Nikki P. Lee¹ - Show less +12 more•Institutions (5)

Zhejiang University¹, University of Hong Kong², University of Cambridge³, Genome Institute of Singapore⁴, National University of Singapore⁵

19 Jun 2017-Cellular Oncology

TL;DR: The current data substantiate knowledge on the role of CDH17 in the biology of HCC and suggest that components of the CDh17/β-catenin axis may serve as therapeutic targets in CDH 17 over-expressing HCC patients.

...read moreread less

Abstract: Hepatocellular carcinoma (HCC) is the most common type of liver cancer worldwide. Previously, we reported that cadherin-17 (CDH17) and its related CDH17/β-catenin axis may be responsible for inducing HCC in a subset of patients exhibiting CDH17 over-expression. Here we aimed at obtaining a better understanding of the CDH17-related HCC biology and to obtain further indications for the design of targeted therapies in CDH17 over-expressing HCC patients. We found that SPINK1 acts as a downstream effector of the CDH17/β-catenin axis in HCC. In addition, we found that SPINK1 expression exhibited a positive correlation with CDH17 expression in human HCCs and was over-expressed in up to 70% of the tumors. We identified SPINK1 as a downstream effector of the CDH17/β-catenin axis using a spectrum of in vitro assays, including gene expression modulation and inhibitor assays, bioinformatics analyses and luciferase reporter assays. These in vitro results were validated in primary human HCCs, including the observation that alteration in β-catenin expression (a core component of the CDH17/β-catenin axis) in tumors affects SPINK1 serum levels in HCC patients. Similar to CDH17, SPINK1 expression in HCC cells was found to be associated with specific tumor-related properties via activating the c-Raf/MEK/ERK pathway. Our current data substantiate our knowledge on the role of CDH17 in the biology of HCC and suggest that components of the CDH17/β-catenin axis may serve as therapeutic targets in CDH17 over-expressing HCC patients.

...read moreread less

12 citations

Book Chapter•DOI•

Faster Algorithms for 1-Mappability of a Sequence

[...]

Mai Alzamel¹, Panagiotis Charalampopoulos¹, Costas S. Iliopoulos¹, Solon P. Pissis¹, Jakub Radoszewski², Jakub Radoszewski¹, Wing-Kin Sung³ - Show less +3 more•Institutions (3)

King's College London¹, University of Warsaw², National University of Singapore³

16 Dec 2017

TL;DR: The fastest known algorithm for the k-mappability problem with k = 1 requires time complexity of at most O(n) and space complexity of O(log n) as mentioned in this paper.

...read moreread less

Abstract: In the k-mappability problem, we are given a string x of length n and integers m and k, and we are asked to count, for each length-m factor y of x, the number of other factors of length m of x that are at Hamming distance at most k from y. We focus here on the version of the problem where $k=1$. The fastest known algorithm for $k=1$ requires time $\mathcal {O}(mn \log n/\log \log n)$ and space $\mathcal {O}(n)$. We present two new algorithms that require worst-case time $\mathcal {O}(mn)$ and $\mathcal {O}(n \log n \log \log n)$, respectively, and space $\mathcal {O}(n)$, thus greatly improving the state of the art. Moreover, we present another algorithm that requires average-case time and space $\mathcal {O}(n)$ for integer alphabets of size $\sigma $ if $m=\varOmega (\log _\sigma n)$. Notably, we show that this algorithm is generalizable for arbitrary k, requiring average-case time $\mathcal {O}(kn)$ and space $\mathcal {O}(n)$ if $m=\varOmega (k\log _\sigma n)$.

...read moreread less

6 citations

Journal Article•DOI•

On finding the Adams consensus tree

[...]

Jesper Jansson¹, Zhaoxian Li², Wing-Kin Sung², Wing-Kin Sung³•Institutions (3)

Hong Kong Polytechnic University¹, National University of Singapore², Genome Institute of Singapore³

01 Oct 2017-Information & Computation

TL;DR: A fast algorithm for finding the Adams consensus tree of a set of conflicting phylogenetic trees with identical leaf labels is presented, which relies on an extension of the wavelet tree-based technique of Bose et al. for orthogonal range counting on a grid.

...read moreread less

Abstract: This article presents a fast algorithm for finding the Adams consensus tree of a set of conflicting phylogenetic trees with identical leaf labels. Its worst-case running time is O ( k n log ⁡ n ) , where k is the number of input trees and n is the size of the leaf label set; in comparison, the original algorithm of Adams has a worst-case running time of O ( k n 2 ) . To achieve subquadratic running time, the centroid path decomposition technique is applied in a novel way that traverses the input trees by following a centroid path in each of them in unison. For k = 2 , an even faster algorithm running in O ( n ⋅ log ⁡ n log ⁡ log ⁡ n ) time is provided, which relies on an extension of the wavelet tree-based technique of Bose et al. for orthogonal range counting on a grid. Our extended wavelet tree data structure also supports truncated range maximum/minimum queries efficiently.

...read moreread less

6 citations

Book•

Algorithms for Next-Generation Sequencing

[...]

Wing-Kin Sung¹•Institutions (1)

National University of Singapore¹

24 May 2017

TL;DR: Algorithms for Next-Generation Sequencing (ALGS) as discussed by the authors is a tool for students and researchers in bioinformatics and computational biology, biologists seeking to process and manage the data generated by next-generation sequencing, and as a textbook or a self-study resource.

...read moreread less

Abstract: Advances in sequencing technology have allowed scientists to study the human genome in greater depth and on a larger scale than ever before – as many as hundreds of millions of short reads in the course of a few days. But what are the best ways to deal with this flood of data? Algorithms for Next-Generation Sequencing is an invaluable tool for students and researchers in bioinformatics and computational biology, biologists seeking to process and manage the data generated by next-generation sequencing, and as a textbook or a self-study resource. In addition to offering an in-depth description of the algorithms for processing sequencing data, it also presents useful case studies describing the applications of this technology.

...read moreread less

5 citations

Book Chapter•DOI•

Determining the consistency of resolved triplets and fan triplets

[...]

Jesper Jansson¹, Jesper Jansson², Andrzej Lingas³, Ramesh Rajaby⁴, Wing-Kin Sung⁴, Wing-Kin Sung⁵ - Show less +2 more•Institutions (5)

Kyoto University¹, Hong Kong Polytechnic University², Lund University³, National University of Singapore⁴, Genome Institute of Singapore⁵

03 May 2017

TL;DR: In this paper, a detailed characterization of how the computational complexity of the consistency problem changes under various restrictions is presented, and the main result is an efficient algorithm for dense inputs satisfying ''R^{-} = \emptyset'' whose running time is linear in the size of the input and therefore optimal.

...read moreread less

Abstract: The $\mathcal {R}^{+-} \mathcal {F}^{+-}$ Consistency problem takes as input two sets $R^{+}$ and $R^{-}$ of resolved triplets and two sets $F^{+}$ and $F^{-}$ of fan triplets, and asks for a distinctly leaf-labeled tree that contains all elements in $R^{+} \cup F^{+}$ and no elements in $R^{-} \cup F^{-}$ as embedded subtrees, if such a tree exists. This paper presents a detailed characterization of how the computational complexity of the problem changes under various restrictions. Our main result is an efficient algorithm for dense inputs satisfying $R^{-} = \emptyset $ whose running time is linear in the size of the input and therefore optimal.

...read moreread less

4 citations

Book Chapter•DOI•

An Efficient Algorithm for the Rooted Triplet Distance Between Galled Trees

[...]

Jesper Jansson¹, Jesper Jansson², Ramesh Rajaby³, Wing-Kin Sung³, Wing-Kin Sung⁴ - Show less +1 more•Institutions (4)

Kyoto University¹, Hong Kong Polytechnic University², National University of Singapore³, Genome Institute of Singapore⁴

05 Jun 2017

TL;DR: The fastest known algorithm for computing the rooted triplet distance between two input galled trees runs in O(n 2.687 ) time, where n is the cardinality of the leaf label set as discussed by the authors.

...read moreread less

Abstract: The previously fastest algorithm for computing the rooted triplet distance between two input galled trees (i.e., phylogenetic networks whose cycles are vertex-disjoint) runs in $O(n^{2.687})$ time, where n is the cardinality of the leaf label set. Here, we present an $O(n \log n)$-time solution. Our strategy is to transform the input so that the answer can be obtained by applying an existing $O(n \log n)$-time algorithm for the simpler case of two phylogenetic trees a constant number of times.

...read moreread less

2 citations

Book Chapter•DOI•

Computing Asymmetric Median Tree of Two Trees via Better Bipartite Matching Algorithm

[...]

Ramesh Rajaby¹, Wing-Kin Sung¹, Wing-Kin Sung²•Institutions (2)

National University of Singapore¹, Genome Institute of Singapore²

17 Jul 2017

TL;DR: This work has shown that the Hopcroft–Karp algorithm can find a maximum bipartite matching of a bipartites graph G in \(O(\sqrt{n} m) time where n and m are the number of nodes and edges, respectively, in the bipartITE graph G.

...read moreread less

Abstract: Maximum bipartite matching is a fundamental problem in computer science with many applications. The HopcroftKarp algorithm can find a maximum bipartite matching of a bipartite graph G in $O(\sqrt{n} m)$ time where n and m are the number of nodes and edges, respectively, in the bipartite graph G. However, when G is dense (i.e., $m=O(n^2)$), the Hopcroft–Karp algorithm runs in $O(n^{2.5})$ time.

...read moreread less

2 citations

Posted Content•

A Faster Construction of Phylogenetic Consensus Trees.

[...]

Paweł Gawrychowski, Gad M. Landau, Wing-Kin Sung, Oren Weimann

30 May 2017

TL;DR: This paper focuses on two of the most well-known and widely used oconsensus tree methods: the greedy consensus tree and the frequency difference consensus tree, and improves these running times to Õpknq and Õ pknq respectively.

...read moreread less

Abstract: A consensus tree is a phylogenetic tree that captures the similarity between a set of conflicting phylogenetic trees. The problem of computing a consensus tree is a major step in phylogenetic tree reconstruction. It also finds applications in predicting a species tree from a set of gene trees. This paper focuses on two of the most well-known and widely used oconsensus tree methods: the greedy consensus tree and the frequency difference consensus tree. Given k conflicting trees each with n leaves, the previous fastest algorithms for these problems were Opknq for the greedy consensus tree [J. ACM 2016] and Õpmintkn, knuq for the frequency difference consensus tree [ACM TCBB 2016]. We improve these running times to Õpknq and Õpknq respectively.

...read moreread less

2 citations

Posted Content•

A Faster Construction of Greedy Consensus Trees

[...]

Paweł Gawrychowski¹, Gad M. Landau², Wing-Kin Sung³, Oren Weimann⁴•Institutions (4)

University of Warsaw¹, Tel Aviv University², National University of Singapore³, University of Haifa⁴

30 May 2017-arXiv: Data Structures and Algorithms

TL;DR: In this paper, the authors improved the running time of the greedy consensus tree and the frequency difference consensus tree to O(k n−1.5) and O((k n −2, k^2n) ), respectively, by computing a consensus tree from a set of conflicting phylogenetic trees.

...read moreread less

Abstract: A consensus tree is a phylogenetic tree that captures the similarity between a set of conflicting phylogenetic trees. The problem of computing a consensus tree is a major step in phylogenetic tree reconstruction. It also finds applications in predicting a species tree from a set of gene trees. This paper focuses on two of the most well-known and widely used oconsensus tree methods: the greedy consensus tree and the frequency difference consensus tree. Given $k$ conflicting trees each with $n$ leaves, the previous fastest algorithms for these problems were $O(k n^2)$ for the greedy consensus tree [J. ACM 2016] and $\tilde O(\min \{ k n^2, k^2n\})$ for the frequency difference consensus tree [ACM TCBB 2016]. We improve these running times to $\tilde O(k n^{1.5})$ and $\tilde O(k n)$ respectively.

...read moreread less

Posted Content•

Faster algorithms for 1-mappability of a sequence

[...]

Mai Alzamel¹, Panagiotis Charalampopoulos¹, Costas S. Iliopoulos¹, Solon P. Pissis¹, Jakub Radoszewski¹, Wing-Kin Sung² - Show less +2 more•Institutions (2)

King's College London¹, National University of Singapore²

11 May 2017-arXiv: Data Structures and Algorithms

TL;DR: Two new algorithms that require worst-case time and space for integer alphabets of size $m=\varOmega (\log _\sigma n)$ are presented, thus greatly improving the state of the art.

...read moreread less

Abstract: In the k-mappability problem, we are given a string x of length n and integers m and k, and we are asked to count, for each length-m factor y of x, the number of other factors of length m of x that are at Hamming distance at most k from y. We focus here on the version of the problem where k = 1. The fastest known algorithm for k = 1 requires time O(mn log n/ log log n) and space O(n). We present two algorithms that require worst-case time O(mn) and O(n log^2 n), respectively, and space O(n), thus greatly improving the state of the art. Moreover, we present an algorithm that requires average-case time and space O(n) for integer alphabets if m = {\Omega}(log n/ log {\sigma}), where {\sigma} is the alphabet size.

...read moreread less