scispace - formally typeset
Search or ask a question
Institution

The Chinese University of Hong Kong

EducationHong Kong, China
About: The Chinese University of Hong Kong is a education organization based out in Hong Kong, China. It is known for research contribution in the topics: Population & Computer science. The organization has 43411 authors who have published 93672 publications receiving 3066651 citations.
Topics: Population, Computer science, Cancer, Medicine, China


Papers
More filters
Journal ArticleDOI
TL;DR: The United Nations declared 2016 as the International Year of Pulses (grain legumes) under the banner ‘nutritious seeds for a sustainable future’, but the current lack of coordinated focus on grain legumes has compromised human health, nutritional security and sustainable food production.
Abstract: The United Nations declared 2016 as the International Year of Pulses (grain legumes) under the banner ‘nutritious seeds for a sustainable future’. A second green revolution is required to ensure food and nutritional security in the face of global climate change. Grain legumes provide an unparalleled solution to this problem because of their inherent capacity for symbiotic atmospheric nitrogen fixation, which provides economically sustainable advantages for farming. In addition, a legume-rich diet has health benefits for humans and livestock alike. However, grain legumes form only a minor part of most current human diets, and legume crops are greatly under-used. Food security and soil fertility could be significantly improved by greater grain legume usage and increased improvement of a range of grain legumes. The current lack of coordinated focus on grain legumes has compromised human health, nutritional security and sustainable food production.

547 citations

Journal ArticleDOI
TL;DR: This article proposes new filtering techniques by exploiting the token ordering information and drastically reduce the candidate sizes and hence improve the efficiency of existing algorithms to find a pair of records such that their similarities are no less than a given threshold.
Abstract: With the increasing amount of data and the need to integrate data from multiple data sources, one of the challenging issues is to identify near-duplicate records efficiently. In this article, we focus on efficient algorithms to find a pair of records such that their similarities are no less than a given threshold. Several existing algorithms rely on the prefix filtering principle to avoid computing similarity values for all possible pairs of records. We propose new filtering techniques by exploiting the token ordering information; they are integrated into the existing methods and drastically reduce the candidate sizes and hence improve the efficiency. We have also studied the implementation of our proposed algorithm in stand-alone and RDBMS-based settings. Experimental results show our proposed algorithms can outperform previous algorithms on several real datasets.

546 citations

Journal ArticleDOI
TL;DR: A detailed analysis of the size profiles of plasma DNA in 90 patients with hepatocellular carcinoma, 67 with chronic hepatitis B, 36 with hepatitis B-associated cirrhosis, and 32 healthy controls using massively parallel sequencing to achieve plasma DNA size measurement at single-base resolution and in a genome-wide manner improved understanding of thesize profile of tumor-derived circulating cell-free DNA.
Abstract: The analysis of tumor-derived circulating cell-free DNA opens up new possibilities for performing liquid biopsies for the assessment of solid tumors. Although its clinical potential has been increasingly recognized, many aspects of the biological characteristics of tumor-derived cell-free DNA remain unclear. With respect to the size profile of such plasma DNA molecules, a number of studies reported the finding of increased integrity of tumor-derived plasma DNA, whereas others found evidence to suggest that plasma DNA molecules released by tumors might be shorter. Here, we performed a detailed analysis of the size profiles of plasma DNA in 90 patients with hepatocellular carcinoma, 67 with chronic hepatitis B, 36 with hepatitis B-associated cirrhosis, and 32 healthy controls. We used massively parallel sequencing to achieve plasma DNA size measurement at single-base resolution and in a genome-wide manner. Tumor-derived plasma DNA molecules were further identified with the use of chromosome arm-level z-score analysis (CAZA), which facilitated the studying of their specific size profiles. We showed that populations of aberrantly short and long DNA molecules existed in the plasma of patients with hepatocellular carcinoma. The short ones preferentially carried the tumor-associated copy number aberrations. We further showed that there were elevated amounts of plasma mitochondrial DNA in the plasma of hepatocellular carcinoma patients. Such molecules were much shorter than the nuclear DNA in plasma. These results have improved our understanding of the size profile of tumor-derived circulating cell-free DNA and might further enhance our ability to use plasma DNA as a molecular diagnostic tool.

546 citations

Journal ArticleDOI
17 Dec 2015
TL;DR: No effective medical interventions exist that completely reverse the disease other than lifestyle changes, dietary alterations and, possibly, bariatric surgery, however, several strategies that target pathophysiological processes such as an oversupply of fatty acids to the liver, cell injury and inflammation are currently under investigation.
Abstract: Nonalcoholic fatty liver disease (NAFLD) is a disorder characterized by excess accumulation of fat in hepatocytes (nonalcoholic fatty liver (NAFL)); in up to 40% of individuals, there are additional findings of portal and lobular inflammation and hepatocyte injury (which characterize nonalcoholic steatohepatitis (NASH)). A subset of patients will develop progressive fibrosis, which can progress to cirrhosis. Hepatocellular carcinoma and cardiovascular complications are life-threatening co-morbidities of both NAFL and NASH. NAFLD is closely associated with insulin resistance; obesity and metabolic syndrome are common underlying factors. As a consequence, the prevalence of NAFLD is estimated to be 10-40% in adults worldwide, and it is the most common liver disease in children and adolescents in developed countries. Mechanistic insights into fat accumulation, subsequent hepatocyte injury, the role of the immune system and fibrosis as well as the role of the gut microbiota are unfolding. Furthermore, genetic and epigenetic factors might explain the considerable interindividual variation in disease phenotype, severity and progression. To date, no effective medical interventions exist that completely reverse the disease other than lifestyle changes, dietary alterations and, possibly, bariatric surgery. However, several strategies that target pathophysiological processes such as an oversupply of fatty acids to the liver, cell injury and inflammation are currently under investigation. Diagnosis of NAFLD can be established by imaging, but detection of the lesions of NASH still depend on the gold-standard but invasive liver biopsy. Several non-invasive strategies are being evaluated to replace or complement biopsies, especially for follow-up monitoring.

546 citations

Journal ArticleDOI
TL;DR: Various conditions connecting the communication data rate with the rate of change of the underlying dynamics are established for the existence of stable and asymptotically convergent coder-estimator schemes.
Abstract: In this paper, we investigate a state estimation problem involving finite communication capacity constraints. Unlike classical estimation problems where the observation is a continuous process corrupted by additive noises, there is a constraint that the observations must be coded and transmitted over a digital communication channel with finite capacity. This problem is formulated mathematically, and some convergence properties are defined. Moreover, the concept of a finitely recursive coder-estimator sequence is introduced. A new upper bound for the average estimation error is derived for a large class of random variables. Convergence properties of some coder-estimator algorithms are analyzed. Various conditions connecting the communication data rate with the rate of change of the underlying dynamics are established for the existence of stable and asymptotically convergent coder-estimator schemes.

545 citations


Authors

Showing all 43993 results

NameH-indexPapersCitations
Michael Marmot1931147170338
Jing Wang1844046202769
Jiaguo Yu178730113300
Yang Yang1712644153049
Mark Gerstein168751149578
Gang Chen1673372149819
Jun Wang1661093141621
Jean Louis Vincent1611667163721
Wei Zheng1511929120209
Rui Zhang1512625107917
Ben Zhong Tang1492007116294
Kypros H. Nicolaides147130287091
Thomas S. Huang1461299101564
Galen D. Stucky144958101796
Joseph J.Y. Sung142124092035
Network Information
Related Institutions (5)
University of Toronto
294.9K papers, 13.5M citations

92% related

University of California, San Diego
204.5K papers, 12.3M citations

92% related

University of Pittsburgh
201K papers, 9.6M citations

92% related

University of Michigan
342.3K papers, 17.6M citations

92% related

University of Minnesota
257.9K papers, 11.9M citations

91% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
2023212
2022904
20217,888
20207,245
20195,968
20185,372