scispace - formally typeset
Search or ask a question
Institution

Peking University

EducationBeijing, Beijing, China
About: Peking University is a education organization based out in Beijing, Beijing, China. It is known for research contribution in the topics: Population & Cancer. The organization has 143902 authors who have published 181007 publications receiving 4103666 citations. The organization is also known as: PKU & Beida.
Topics: Population, Cancer, Medicine, Catalysis, China


Papers
More filters
Journal ArticleDOI
04 Oct 2012-Nature
TL;DR: MGWAS analysis showed that patients with type 2 diabetes were characterized by a moderate degree of gut microbial dysbiosis, a decrease in the abundance of some universal butyrate-producing bacteria and an increase in various opportunistic pathogens, as well as an enrichment of other microbial functions conferring sulphate reduction and oxidative stress resistance.
Abstract: Assessment and characterization of gut microbiota has become a major research area in human disease, including type 2 diabetes, the most prevalent endocrine disease worldwide. To carry out analysis on gut microbial content in patients with type 2 diabetes, we developed a protocol for a metagenome-wide association study (MGWAS) and undertook a two-stage MGWAS based on deep shotgun sequencing of the gut microbial DNA from 345 Chinese individuals. We identified and validated approximately 60,000 type-2-diabetes-associated markers and established the concept of a metagenomic linkage group, enabling taxonomic species-level analyses. MGWAS analysis showed that patients with type 2 diabetes were characterized by a moderate degree of gut microbial dysbiosis, a decrease in the abundance of some universal butyrate-producing bacteria and an increase in various opportunistic pathogens, as well as an enrichment of other microbial functions conferring sulphate reduction and oxidative stress resistance. An analysis of 23 additional individuals demonstrated that these gut microbial markers might be useful for classifying type 2 diabetes.

4,981 citations

Proceedings Article
04 Dec 2017
TL;DR: It is proved that, since the data instances with larger gradients play a more important role in the computation of information gain, GOSS can obtain quite accurate estimation of the information gain with a much smaller data size, and is called LightGBM.
Abstract: Gradient Boosting Decision Tree (GBDT) is a popular machine learning algorithm, and has quite a few effective implementations such as XGBoost and pGBRT. Although many engineering optimizations have been adopted in these implementations, the efficiency and scalability are still unsatisfactory when the feature dimension is high and data size is large. A major reason is that for each feature, they need to scan all the data instances to estimate the information gain of all possible split points, which is very time consuming. To tackle this problem, we propose two novel techniques: Gradient-based One-Side Sampling (GOSS) and Exclusive Feature Bundling (EFB). With GOSS, we exclude a significant proportion of data instances with small gradients, and only use the rest to estimate the information gain. We prove that, since the data instances with larger gradients play a more important role in the computation of information gain, GOSS can obtain quite accurate estimation of the information gain with a much smaller data size. With EFB, we bundle mutually exclusive features (i.e., they rarely take nonzero values simultaneously), to reduce the number of features. We prove that finding the optimal bundling of exclusive features is NP-hard, but a greedy algorithm can achieve quite good approximation ratio (and thus can effectively reduce the number of features without hurting the accuracy of split point determination by much). We call our new GBDT implementation with GOSS and EFB LightGBM. Our experiments on multiple public datasets show that, LightGBM speeds up the training process of conventional GBDT by up to over 20 times while achieving almost the same accuracy.

4,977 citations

Journal ArticleDOI
19 Aug 2011-Science
TL;DR: The total forest sink estimate is equivalent in magnitude to the terrestrial sink deduced from fossil fuel emissions and land-use change sources minus ocean and atmospheric sinks, with tropical estimates having the largest uncertainties.
Abstract: The terrestrial carbon sink has been large in recent decades, but its size and location remain uncertain. Using forest inventory data and long-term ecosystem carbon studies, we estimate a total forest sink of 2.4 ± 0.4 petagrams of carbon per year (Pg C year–1) globally for 1990 to 2007. We also estimate a source of 1.3 ± 0.7 Pg C year–1 from tropical land-use change, consisting of a gross tropical deforestation emission of 2.9 ± 0.5 Pg C year–1 partially compensated by a carbon sink in tropical forest regrowth of 1.6 ± 0.5 Pg C year–1. Together, the fluxes comprise a net global forest sink of 1.1 ± 0.8 Pg C year–1, with tropical estimates having the largest uncertainties. Our total forest sink estimate is equivalent in magnitude to the terrestrial sink deduced from fossil fuel emissions and land-use change sources minus ocean and atmospheric sinks.

4,948 citations

Journal ArticleDOI
TL;DR: Detailed estimates of dementia prevalence for each world region are believed to constitute the best currently available basis for policymaking, planning, and allocation of health and welfare resources.

4,891 citations

Journal ArticleDOI
TL;DR: In this paper, the authors provided an assessment of black-carbon climate forcing that is comprehensive in its inclusion of all known and relevant processes and that is quantitative in providing best estimates and uncertainties of the main forcing terms: direct solar absorption; influence on liquid, mixed phase, and ice clouds; and deposition on snow and ice.
Abstract: Black carbon aerosol plays a unique and important role in Earth's climate system. Black carbon is a type of carbonaceous material with a unique combination of physical properties. This assessment provides an evaluation of black-carbon climate forcing that is comprehensive in its inclusion of all known and relevant processes and that is quantitative in providing best estimates and uncertainties of the main forcing terms: direct solar absorption; influence on liquid, mixed phase, and ice clouds; and deposition on snow and ice. These effects are calculated with climate models, but when possible, they are evaluated with both microphysical measurements and field observations. Predominant sources are combustion related, namely, fossil fuels for transportation, solid fuels for industrial and residential uses, and open burning of biomass. Total global emissions of black carbon using bottom-up inventory methods are 7500 Gg yr−1 in the year 2000 with an uncertainty range of 2000 to 29000. However, global atmospheric absorption attributable to black carbon is too low in many models and should be increased by a factor of almost 3. After this scaling, the best estimate for the industrial-era (1750 to 2005) direct radiative forcing of atmospheric black carbon is +0.71 W m−2 with 90% uncertainty bounds of (+0.08, +1.27) W m−2. Total direct forcing by all black carbon sources, without subtracting the preindustrial background, is estimated as +0.88 (+0.17, +1.48) W m−2. Direct radiative forcing alone does not capture important rapid adjustment mechanisms. A framework is described and used for quantifying climate forcings, including rapid adjustments. The best estimate of industrial-era climate forcing of black carbon through all forcing mechanisms, including clouds and cryosphere forcing, is +1.1 W m−2 with 90% uncertainty bounds of +0.17 to +2.1 W m−2. Thus, there is a very high probability that black carbon emissions, independent of co-emitted species, have a positive forcing and warm the climate. We estimate that black carbon, with a total climate forcing of +1.1 W m−2, is the second most important human emission in terms of its climate forcing in the present-day atmosphere; only carbon dioxide is estimated to have a greater forcing. Sources that emit black carbon also emit other short-lived species that may either cool or warm climate. Climate forcings from co-emitted species are estimated and used in the framework described herein. When the principal effects of short-lived co-emissions, including cooling agents such as sulfur dioxide, are included in net forcing, energy-related sources (fossil fuel and biofuel) have an industrial-era climate forcing of +0.22 (−0.50 to +1.08) W m−2 during the first year after emission. For a few of these sources, such as diesel engines and possibly residential biofuels, warming is strong enough that eliminating all short-lived emissions from these sources would reduce net climate forcing (i.e., produce cooling). When open burning emissions, which emit high levels of organic matter, are included in the total, the best estimate of net industrial-era climate forcing by all short-lived species from black-carbon-rich sources becomes slightly negative (−0.06 W m−2 with 90% uncertainty bounds of −1.45 to +1.29 W m−2). The uncertainties in net climate forcing from black-carbon-rich sources are substantial, largely due to lack of knowledge about cloud interactions with both black carbon and co-emitted organic carbon. In prioritizing potential black-carbon mitigation actions, non-science factors, such as technical feasibility, costs, policy design, and implementation feasibility play important roles. The major sources of black carbon are presently in different stages with regard to the feasibility for near-term mitigation. This assessment, by evaluating the large number and complexity of the associated physical and radiative processes in black-carbon climate forcing, sets a baseline from which to improve future climate forcing estimates.

4,591 citations


Authors

Showing all 144711 results

NameH-indexPapersCitations
Zhong Lin Wang2452529259003
Yoshua Bengio2021033420313
Jing Wang1844046202769
Richard Peto183683231434
Xiaohui Fan183878168522
H. S. Chen1792401178529
Yang Gao1682047146301
Yang Yang1642704144071
Hua Zhang1631503116769
Rory Collins162489193407
Dongyuan Zhao160872106451
Wei Li1581855124748
Stephen J. O'Brien153106293025
Rui Zhang1512625107917
Kevin J. Gaston15075085635
Network Information
Related Institutions (5)
Shanghai Jiao Tong University
184.6K papers, 3.4M citations

95% related

Chinese Academy of Sciences
634.8K papers, 14.8M citations

93% related

Tsinghua University
200.5K papers, 4.5M citations

93% related

National University of Singapore
165.4K papers, 5.4M citations

92% related

University of Southern California
169.9K papers, 7.8M citations

91% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20241
2023367
20221,554
202117,269
202016,526
201914,464