scispace - formally typeset
Search or ask a question
Author

Xiao Li

Other affiliations: Chinese Academy of Sciences
Bio: Xiao Li is an academic researcher from East China University of Science and Technology. The author has contributed to research in topics: Irradiation & Tobacco smoke. The author has an hindex of 7, co-authored 9 publications receiving 283 citations. Previous affiliations of Xiao Li include Chinese Academy of Sciences.

Papers
More filters
Journal ArticleDOI
TL;DR: Some representative substructures responsible for acute oral toxicity were identified using information gain and substructure frequency analysis methods, which might be very helpful for further study to avoid the toxicity.
Abstract: Chemical acute oral toxicity is an important end point in drug design and environmental risk assessment. However, it is difficult to determine by experiments, and in silico methods are hence developed as an alternative. In this study, a comprehensive data set containing 12 204 diverse compounds with median lethal dose (LD50) was compiled. These chemicals were classified into four categories, namely categories I, II, III and IV, based on the criterion of the U.S. Environmental Protection Agency (EPA). Then several multiclassification models were developed using five machine learning methods, including support vector machine (SVM), C4.5 decision tree (C4.5), random forest (RF), κ-nearest neighbor (kNN), and naive Bayes (NB) algorithms, along with MACCS and FP4 fingerprints. One-against-one (OAO) and binary tree (BT) strategies were employed for SVM multiclassification. Performances were measured by two external validation sets containing 1678 and 375 chemicals, separately. The overall accuracy of the MACCS-...

137 citations

Journal ArticleDOI
TL;DR: The results demonstrated the inverse association between cigarette smoking and the risk of Parkinson's disease and suggested that effective drugs for PD might be developed using chemical substances derived from tobacco or tobacco smoke.

111 citations

Journal ArticleDOI
TL;DR: The results indicated that the best binary and ternary classification models were used to estimate carcinogenicity of tobacco smoke components containing 2251 compounds and would be helpful for prediction of chemical carcinogenicicity.
Abstract: Carcinogenicity is one of the most concerned properties of chemicals to human health, thus it is important to identify chemical carcinogenicity as early as possible. In this study, 829 diverse compounds with rat carcinogenicity were collected from Carcinogenic Potency Database (CPDB). Using six types of fingerprints to represent the molecules, 30 binary and ternary classification models were generated to predict chemical carcinogenicity by five machine learning methods. The models were evaluated by an external validation set containing 87 chemicals from ISSCAN database. The best binary model was developed by MACCS keys and kNN algorithm with predictive accuracy at 83.91 %, while the best ternary model was also generated by MACCS keys and kNN algorithm with overall accuracy at 80.46 %. Furthermore, the best binary and ternary classification models were used to estimate carcinogenicity of tobacco smoke components containing 2251 compounds. 981 ones were predicted as carcinogens by binary classification model, while 110 compounds were predicted as strong carcinogens and 807 ones as weak carcinogens by ternary classification model. The results indicated that our models would be helpful for prediction of chemical carcinogenicity.

37 citations

Journal ArticleDOI
TL;DR: Several structural alerts for characterizing serious EI/EC were identified using the combination of information gain and substructure frequency analysis and indicated that the models were reliable and robust, which can be used to predict the potential seriousness of Ei/EC of compounds.
Abstract: Rapidly and correctly identifying eye irritants or corrosive chemicals is an important issue in health hazard assessment. The purpose of this study is to describe the development of in silico methods for the classification of chemicals into irritants/corrosives or non-irritants/non-corrosives. A total of 5220 chemicals for a serious eye irritation (EI) dataset and 2299 chemicals as an eye corrosion (EC) dataset were collected from available databases and literature. Structure–activity relationship (SAR) models were developed to separately predict serious EI or EC via machine learning methods. According to the overall prediction accuracy, the Pub-SVM model gave the best results for both serious EI (overall classification accuracy CA = 0.946) and EC (CA = 0.959). The sensitivity and specificity of serious EI were 97.3% and 86.7% for the training set, and 96.9% and 82.7% for the external validation set, respectively. Similarly, the sensitivity and specificity of EC were 95.5% and 96.2% for the training set, and 94.9% and 96.2% for the external validation set, respectively. The high specificity and sensitivity indicated that our models were reliable and robust, which can be used to predict the potential seriousness of EI/EC of compounds. Moreover, several structural alerts for characterizing serious EI/EC were identified using the combination of information gain and substructure frequency analysis.

31 citations

Journal ArticleDOI
TL;DR: Two consensus models based on the top‐performing individual models for hERG blockage performed much better than the individual models both on 5‐fold cross validation and external validation, indicating that the predictive power of consensus model II should be stronger than most of the previously reported models.
Abstract: Human ether-a-go-go related gene (hERG) K+ channel plays an important role in cardiac action potential. Blockage of hERG channel may result in long QT syndrome (LQTS), even cause sudden cardiac death. Many drugs have been withdrawn from the market because of the serious hERG-related cardiotoxicity. Therefore, it is quite essential to estimate the chemical blockage of hERG in the early stage of drug discovery. In this study, a diverse set of 3721 compounds with hERG inhibition data was assembled from literature. Then, we make full use of the Online Chemical Modeling Environment (OCHEM), which supplies rich machine learning methods and descriptor sets, to build a series of classification models for hERG blockage. We also generated two consensus models based on the top-performing individual models. The consensus models performed much better than the individual models both on 5-fold cross validation and external validation. Especially, consensus model II yielded the prediction accuracy of 89.5 % and MCC of 0.670 on external validation. This result indicated that the predictive power of consensus model II should be stronger than most of the previously reported models. The 17 top-performing individual models and the consensus models and the data sets used for model development are available at https://ochem.eu/article/103592.

29 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Over the past generation, the global burden of Parkinson's disease has more than doubled as a result of increasing numbers of older people, with potential contributions from longer disease duration and environmental factors.
Abstract: Summary Background Neurological disorders are now the leading source of disability globally, and ageing is increasing the burden of neurodegenerative disorders, including Parkinson's disease. We aimed to determine the global burden of Parkinson's disease between 1990 and 2016 to identify trends and to enable appropriate public health, medical, and scientific responses. Methods Through a systematic analysis of epidemiological studies, we estimated global, regional, and country-specific prevalence and years of life lived with disability for Parkinson's disease from 1990 to 2016. We estimated the proportion of mild, moderate, and severe Parkinson's disease on the basis of studies that used the Hoehn and Yahr scale and assigned disability weights to each level. We jointly modelled prevalence and excess mortality risk in a natural history model to derive estimates of deaths due to Parkinson's disease. Death counts were multiplied by values from the Global Burden of Disease study's standard life expectancy to compute years of life lost. Disability-adjusted life-years (DALYs) were computed as the sum of years lived with disability and years of life lost. We also analysed results based on the Socio-demographic Index, a compound measure of income per capita, education, and fertility. Findings In 2016, 6·1 million (95% uncertainty interval [UI] 5·0–7·3) individuals had Parkinson's disease globally, compared with 2·5 million (2·0–3·0) in 1990. This increase was not solely due to increasing numbers of older people, because age-standardised prevalence rates increased by 21·7% (95% UI 18·1–25·3) over the same period (compared with an increase of 74·3%, 95% UI 69·2–79·6, for crude prevalence rates). Parkinson's disease caused 3·2 million (95% UI 2·6–4·0) DALYs and 211 296 deaths (95% UI 167 771–265 160) in 2016. The male-to-female ratios of age-standardised prevalence rates were similar in 2016 (1·40, 95% UI 1·36–1·43) and 1990 (1·37, 1·34–1·40). From 1990 to 2016, age-standardised prevalence, DALY rates, and death rates increased for all global burden of disease regions except for southern Latin America, eastern Europe, and Oceania. In addition, age-standardised DALY rates generally increased across the Socio-demographic Index. Interpretation Over the past generation, the global burden of Parkinson's disease has more than doubled as a result of increasing numbers of older people, with potential contributions from longer disease duration and environmental factors. Demographic and potentially other factors are poised to increase the future burden of Parkinson's disease substantially. Funding Bill & Melinda Gates Foundation.

1,388 citations

Journal ArticleDOI
TL;DR: The results suggested that the ADMET-score would be a comprehensive index to evaluate chemical drug-likeness, and might be helpful for users to select appropriate drug candidates for further development.
Abstract: Chemical absorption, distribution, metabolism, excretion, and toxicity (ADMET), play key roles in drug discovery and development. A high-quality drug candidate should not only have sufficient efficacy against the therapeutic target, but also show appropriate ADMET properties at a therapeutic dose. A lot of in silico models are hence developed for prediction of chemical ADMET properties. However, it is still not easy to evaluate the drug-likeness of compounds in terms of so many ADMET properties. In this study, we proposed a scoring function named the ADMET-score to evaluate drug-likeness of a compound. The scoring function was defined on the basis of 18 ADMET properties predicted via our web server admetSAR. The weight of each property in the ADMET-score was determined by three parameters: the accuracy rate of the model, the importance of the endpoint in the process of pharmacokinetics, and the usefulness index. The FDA-approved drugs from DrugBank, the small molecules from ChEMBL and the old drugs withdrawn from the market due to safety concerns were used to evaluate the performance of the ADMET-score. The indices of the arithmetic mean and p-value showed that the ADMET-score among the three data sets differed significantly. Furthermore, we learned that there was no obvious linear correlation between the ADMET-score and QED (quantitative estimate of drug-likeness). These results suggested that the ADMET-score would be a comprehensive index to evaluate chemical drug-likeness, and might be helpful for users to select appropriate drug candidates for further development.

230 citations

Journal ArticleDOI
TL;DR: The journey thus far of PD genetics is outlined, highlighting how significant advances have improved knowledge of the genetic basis of PD risk, onset and progression and foresee that genetic discoveries in PD will directly influence the ability to predict disease and aid in defining etiological subtypes, critical steps for the implementation of precision medicine for PD.

213 citations

Journal ArticleDOI
TL;DR: The development of in silico models for some physicochemical parameters, ADME properties and toxicity evaluation, with an emphasis on the modelling approaches thereof, their application in drug discovery, and the potential merits or deficiencies of these models are introduced.
Abstract: In recent decades, in silico absorption, distribution, metabolism, excretion (ADME), and toxicity (T) modelling as a tool for rational drug design has received considerable attention from pharmaceutical scientists, and various ADME/T-related prediction models have been reported. The high-throughput and low-cost nature of these models permits a more streamlined drug development process in which the identification of hits or their structural optimization can be guided based on a parallel investigation of bioavailability and safety, along with activity. However, the effectiveness of these tools is highly dependent on their capacity to cope with needs at different stages, e.g. their use in candidate selection has been limited due to their lack of the required predictability. For some events or endpoints involving more complex mechanisms, the current in silico approaches still need further improvement. In this review, we will briefly introduce the development of in silico models for some physicochemical parameters, ADME properties and toxicity evaluation, with an emphasis on the modelling approaches thereof, their application in drug discovery, and the potential merits or deficiencies of these models. Finally, the outlook for future ADME/T modelling based on big data analysis and systems sciences will be discussed.

198 citations

Journal ArticleDOI
01 Jan 2019-Gut
TL;DR: This nationwide, unselected, cohort study shows a significant association between IBD and later occurrence of PD, which is consistent with recent basic scientific findings of a potential role of GI inflammation in development of parkinsonian disorders.
Abstract: Objective Intestinal inflammation has been suggested to play a role in development of Parkinson’s disease (PD) and multiple system atrophy (MSA). To test the hypothesis that IBD is associated with risk of PD and MSA, we performed a nationwide population-based cohort study. Design The cohort consisted of all individuals diagnosed with IBD in Denmark during 1977–2014 (n=76 477) and non-IBD individuals from the general population, who were comparable in terms of gender, age and vital status (n=7 548 259). All cohort members were followed from IBD diagnosis/index date to occurrence of PD and MSA (according to the Danish National Patient Register). Results Patients with IBD had a 22% increased risk of PD as compared with non-IBD individuals (HR=1.22; 95% CI 1.09 to 1.35). The increased risk was present independently of age at IBD diagnosis, gender or length of follow-up. The overall incidence of MSA was low in our study, and the regression analysis suggested a tendency towards higher risk of developing MSA in patients with IBD as compared with non-IBD individuals (HR=1.41; 95% CI 0.82 to 2.44). Estimates were similar for women and men. The increased risk of parkinsonism was significantly higher among patients with UC (HR=1.35; 95% CI 1.20 to 1.52) and not significantly different among patients with Crohn’s disease (HR=1.12; 95% CI 0.89 to 1.40). Conclusions This nationwide, unselected, cohort study shows a significant association between IBD and later occurrence of PD, which is consistent with recent basic scientific findings of a potential role of GI inflammation in development of parkinsonian disorders.

197 citations