scispace - formally typeset
Search or ask a question
Author

Kaixian Chen

Bio: Kaixian Chen is an academic researcher from Chinese Academy of Sciences. The author has contributed to research in topics: Virtual screening & Docking (molecular). The author has an hindex of 47, co-authored 380 publications receiving 9209 citations. Previous affiliations of Kaixian Chen include Shanghai University & East China University of Science and Technology.


Papers
More filters
Journal ArticleDOI
TL;DR: Different types of PPI networks have been effectively mapped with the proposed method, suggesting that, even with only sequence information, this method could be applied to the exploration of networks for any newly discovered protein with unknown biological relativity.
Abstract: Protein–protein interactions (PPIs) are central to most biological processes. Although efforts have been devoted to the development of methodology for predicting PPIs and protein interaction networks, the application of most existing methods is limited because they need information about protein homology or the interaction marks of the protein partners. In the present work, we propose a method for PPI prediction using only the information of protein sequences. This method was developed based on a learning algorithm-support vector machine combined with a kernel function and a conjoint triad feature for describing amino acids. More than 16,000 diverse PPI pairs were used to construct the universal model. The prediction ability of our approach is better than that of other sequence-based PPI prediction methods because it is able to predict PPI networks. Different types of PPI networks have been effectively mapped with our method, suggesting that, even with only sequence information, this method could be applied to the exploration of networks for any newly discovered protein with unknown biological relativity. In addition, such supplementary experimental information can enhance the prediction ability of the method.

867 citations

Journal ArticleDOI
TL;DR: TarFisDock may be a useful tool for target identification, mechanism study of old drugs and probes discovered from natural products, and a reverse ligand–protein docking program for seeking potential protein targets by screening an appropriate protein database.
Abstract: TarFisDock is a web-based tool for automating the procedure of searching for small molecule-protein interactions over a large repertoire of protein structures. It offers PDTD (potential drug target database), a target database containing 698 protein structures covering 15 therapeutic areas and a reverse ligand-protein docking program. In contrast to conventional ligand-protein docking, reverse ligand-protein docking aims to seek potential protein targets by screening an appropriate protein database. The input file of this web server is the small molecule to be tested, in standard mol2 format; TarFisDock then searches for possible binding proteins for the given small molecule by use of a docking approach. The ligand-protein interaction energy terms of the program DOCK are adopted for ranking the proteins. To test the reliability of the TarFisDock server, we searched the PDTD for putative binding proteins for vitamin E and 4H-tamoxifen. The top 2 and 10% candidates of vitamin E binding proteins identified by TarFisDock respectively cover 30 and 50% of reported targets verified or implicated by experiments; and 30 and 50% of experimentally confirmed targets for 4H-tamoxifen appear amongst the top 2 and 5% of the TarFisDock predicted candidates, respectively. Therefore, TarFisDock may be a useful tool for target identification, mechanism study of old drugs and probes discovered from natural products. TarFisDock and PDTD are available at http://www.dddc.ac.cn/tarfisdock/.

364 citations

Journal ArticleDOI
TL;DR: A new graph neural network architecture called Attentive FP for molecular representation that uses a graph attention mechanism to learn from relevant drug discovery datasets and achieves state-of-the-art predictive performances on a variety of datasets and that what it learns is interpretable.
Abstract: Hunting for chemicals with favorable pharmacological, toxicological, and pharmacokinetic properties remains a formidable challenge for drug discovery. Deep learning provides us with powerful tools to build predictive models that are appropriate for the rising amounts of data, but the gap between what these neural networks learn and what human beings can comprehend is growing. Moreover, this gap may induce distrust and restrict deep learning applications in practice. Here, we introduce a new graph neural network architecture called Attentive FP for molecular representation that uses a graph attention mechanism to learn from relevant drug discovery data sets. We demonstrate that Attentive FP achieves state-of-the-art predictive performances on a variety of data sets and that what it learns is interpretable. The feature visualization for Attentive FP suggests that it automatically learns nonlocal intramolecular interactions from specified tasks, which can help us gain chemical insights directly from data beyond human perception.

359 citations

Journal ArticleDOI
TL;DR: It is suggested that albeit halogenation is a valuable approach for improving ligand bioactivity, more attention should be paid in the future to the application of the halogen bond for ligand ADME/T property optimization.
Abstract: Halogen bond has attracted a great deal of attention in the past years for hit-to-lead-to-candidate optimization aiming at improving drug-target binding affinity. In general, heavy organohalogens (i.e., organochlorines, organobromines, and organoiodines) are capable of forming halogen bonds while organofluorines are not. In order to explore the possible roles that halogen bonds could play beyond improving binding affinity, we performed a detailed database survey and quantum chemistry calculation with close attention paid to (1) the change of the ratio of heavy organohalogens to organofluorines along the drug discovery and development process and (2) the halogen bonds between organohalogens and nonbiopolymers or nontarget biopolymers. Our database survey revealed that (1) an obviously increasing trend of the ratio of heavy organohalogens to organofluorines was observed along the drug discovery and development process, illustrating that more organofluorines are worn and eliminated than heavy organohalogens ...

261 citations

Journal ArticleDOI
TL;DR: PDTD serves as a comprehensive and unique repository of drug targets and in conjunction with TarFisDock, PDTD can be used to identify binding proteins for small molecules and may be a valuable platform for the pharmaceutical researchers.
Abstract: Target identification is important for modern drug discovery. With the advances in the development of molecular docking, potential binding proteins may be discovered by docking a small molecule to a repository of proteins with three-dimensional (3D) structures. To complete this task, a reverse docking program and a drug target database with 3D structures are necessary. To this end, we have developed a web server tool, TarFisDock (Tar get Fis hing Dock ing) http://www.dddc.ac.cn/tarfisdock , which has been used widely by others. Recently, we have constructed a protein target database, P otential D rug T arget D atabase (PDTD), and have integrated PDTD with TarFisDock. This combination aims to assist target identification and validation. PDTD is a web-accessible protein database for in silico target identification. It currently contains >1100 protein entries with 3D structures presented in the Protein Data Bank. The data are extracted from the literatures and several online databases such as TTD, DrugBank and Thomson Pharma. The database covers diverse information of >830 known or potential drug targets, including protein and active sites structures in both PDB and mol2 formats, related diseases, biological functions as well as associated regulating (signaling) pathways. Each target is categorized by both nosology and biochemical function. PDTD supports keyword search function, such as PDB ID, target name, and disease name. Data set generated by PDTD can be viewed with the plug-in of molecular visualization tools and also can be downloaded freely. Remarkably, PDTD is specially designed for target identification. In conjunction with TarFisDock, PDTD can be used to identify binding proteins for small molecules. The results can be downloaded in the form of mol2 file with the binding pose of the probe compound and a list of potential binding targets according to their ranking scores. PDTD serves as a comprehensive and unique repository of drug targets. Integrated with TarFisDock, PDTD is a useful resource to identify binding proteins for active compounds or existing drugs. Its potential applications include in silico drug target identification, virtual screening, and the discovery of the secondary effects of an old drug (i.e. new pharmacological usage) or an existing target (i.e. new pharmacological or toxic relevance), thus it may be a valuable platform for the pharmaceutical researchers. PDTD is available online at http://www.dddc.ac.cn/pdtd/ .

259 citations


Cited by
More filters
Christopher M. Bishop1
01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

01 Aug 2000
TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.
Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

4,833 citations

Journal ArticleDOI
TL;DR: This review covers the literature published in 2014 for marine natural products, with 1116 citations referring to compounds isolated from marine microorganisms and phytoplankton, green, brown and red algae, sponges, cnidarians, bryozoans, molluscs, tunicates, echinoderms, mangroves and other intertidal plants and microorganisms.

4,649 citations

Journal ArticleDOI
11 Jun 2020-Nature
TL;DR: A programme of structure-assisted drug design and high-throughput screening identifies six compounds that inhibit the main protease of SARS-CoV-2, demonstrating the ability of this strategy to isolate drug leads with clinical potential.
Abstract: A new coronavirus, known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is the aetiological agent responsible for the 2019–2020 viral pneumonia outbreak of coronavirus disease 2019 (COVID-19)1–4. Currently, there are no targeted therapeutic agents for the treatment of this disease, and effective treatment options remain very limited. Here we describe the results of a programme that aimed to rapidly discover lead compounds for clinical use, by combining structure-assisted drug design, virtual drug screening and high-throughput screening. This programme focused on identifying drug leads that target main protease (Mpro) of SARS-CoV-2: Mpro is a key enzyme of coronaviruses and has a pivotal role in mediating viral replication and transcription, making it an attractive drug target for SARS-CoV-25,6. We identified a mechanism-based inhibitor (N3) by computer-aided drug design, and then determined the crystal structure of Mpro of SARS-CoV-2 in complex with this compound. Through a combination of structure-based virtual and high-throughput screening, we assayed more than 10,000 compounds—including approved drugs, drug candidates in clinical trials and other pharmacologically active compounds—as inhibitors of Mpro. Six of these compounds inhibited Mpro, showing half-maximal inhibitory concentration values that ranged from 0.67 to 21.4 μM. One of these compounds (ebselen) also exhibited promising antiviral activity in cell-based assays. Our results demonstrate the efficacy of our screening strategy, which can lead to the rapid discovery of drug leads with clinical potential in response to new infectious diseases for which no specific drugs or vaccines are available. A programme of structure-assisted drug design and high-throughput screening identifies six compounds that inhibit the main protease of SARS-CoV-2, demonstrating the ability of this strategy to isolate drug leads with clinical potential.

2,845 citations

Journal ArticleDOI
TL;DR: A number of substructural features which can help to identify compounds that appear as frequent hitters (promiscuous compounds) in many biochemical high throughput screens are described.
Abstract: This report describes a number of substructural features which can help to identify compounds that appear as frequent hitters (promiscuous compounds) in many biochemical high throughput screens. The compounds identified by such substructural features are not recognized by filters commonly used to identify reactive compounds. Even though these substructural features were identified using only one assay detection technology, such compounds have been reported to be active from many different assays. In fact, these compounds are increasingly prevalent in the literature as potential starting points for further exploration, whereas they may not be.

2,791 citations