scispace - formally typeset
Open AccessJournal ArticleDOI

Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research.

Reads0
Chats0
TLDR
The current state of analyzing large-scale compound data in industrial pharmaceutical research is summarized and the impact it has had on the drug discovery process over the last two decades is described, with a specific focus on deep-learning technologies.
Abstract
In recent years, the development of high-throughput screening (HTS) technologies and their establishment in an industrialized environment have given scientists the possibility to test millions of molecules and profile them against a multitude of biological targets in a short period of time, generating data in a much faster pace and with a higher quality than before. Besides the structure activity data from traditional bioassays, more complex assays such as transcriptomics profiling or imaging have also been established as routine profiling experiments thanks to the advancement of Next Generation Sequencing or automated microscopy technologies. In industrial pharmaceutical research, these technologies are typically established in conjunction with automated platforms in order to enable efficient handling of screening collections of thousands to millions of compounds. To exploit the ever-growing amount of data that are generated by these approaches, computational techniques are constantly evolving. In this regard, artificial intelligence technologies such as deep learning and machine learning methods play a key role in cheminformatics and bio-image analytics fields to address activity prediction, scaffold hopping, de novo molecule design, reaction/retrosynthesis predictions, or high content screening analysis. Herein we summarize the current state of analyzing large-scale compound data in industrial pharmaceutical research and describe the impact it has had on the drug discovery process over the last two decades, with a specific focus on deep-learning technologies.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal Article

The human genome.

Sermonti G
- 01 Jan 1988 - 
Journal ArticleDOI

Molecular representations in AI-driven drug discovery: a review and practical guide

TL;DR: This review presents some of the most popular electronic molecular and macromolecular representations used in drug discovery, many of which are based on graph representations, and describes applications of these representations in AI-driven drug discovery.
Journal ArticleDOI

Image-based profiling for drug discovery: due for a machine-learning upgrade?

TL;DR: How the application of machine learning is renewing interest in image-based profiling for all aspects of the drug discovery process, from understanding disease mechanisms to predicting a drug’s activity or mechanism of action is discussed.
Journal ArticleDOI

Assessing microscope image focus quality with deep learning.

TL;DR: In this paper, a deep neural network model was proposed to predict an absolute measure of image focus on a single image in isolation, without any user-specified parameters, and also outputs a measure of prediction certainty, enabling interpretable predictions.
Journal ArticleDOI

SMILES-based deep generative scaffold decorator for de-novo drug design

TL;DR: A new SMILES-based molecular generative architecture that generates molecules from scaffolds and can be trained from any arbitrary molecular set and serves as a data augmentation technique and is readily coupled with randomized SMilES to obtain even better results with small sets.
References
More filters
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Journal ArticleDOI

Support-Vector Networks

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Journal ArticleDOI

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.
Posted Content

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

TL;DR: Faster R-CNN as discussed by the authors proposes a Region Proposal Network (RPN) to generate high-quality region proposals, which are used by Fast R-NN for detection.
Related Papers (5)
Trending Questions (1)
In what ways does AI excel in managing large-scale screening datasets efficiently specially in drug delivery system?

AI, particularly deep learning, excels in drug delivery by predicting compound activity, scaffold hopping, de novo design, and high content screening analysis, enhancing efficiency in managing large-scale screening datasets.