Velodrome: Out-of-Distribution Generalization from Labeled and Unlabeled Gene Expression Data for Drug Response Prediction
read more
Citations
A context-aware deconfounding autoencoder for robust prediction of personalized clinical drug response from cell-line compound screening
Out-of-distribution generalization from labelled and unlabelled gene expression data for drug response prediction
Deep learning methods for drug response prediction in cancer: Predominant and emerging trends
Multi-Dimensional Ability Diagnosis for Machine Learning Algorithms
References
A Survey on Transfer Learning
The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity
The cancer genome atlas pan-cancer analysis project
The Cancer Genome Atlas Pan-Cancer analysis project
Adversarial Discriminative Domain Adaptation
Related Papers (5)
Frequently Asked Questions (13)
Q2. What is the promising aspect of the study?
Especially promising are proteomics data (Ali et al. 2018) and germline variants (Menden et al. 2018), due to their predictive power.
Q3. What methods were designed to take both labeled and unlabeled samples?
Velodrome, PRECISE, and Mean Teacher were designed to take both labeled and unlabeled samples and, therefore, were expected to achieve better performance on patients than DeepAll-ERM and Ridge-ERM.
Q4. What is the description of Velodrome?
To the best of their knowledge, Velodrome is the first method for semi-supervised out-of-distribution generalization from labeled cell lines and unlabeled patients to different preclinical and clinical datasets.
Q5. What is the purpose of transfer learning?
Transfer learning has emerged as a machine learning paradigm for such scenarios (Pan and Yang 2010; Neyshabur, Sedghi, and Zhang 2020), where the authors have access to different datasets from multiple resources (known as source domains) and want to make predictions for a dataset of interest (known as target domain) and it has been employed in different problems (Taroni et al.
Q6. What is the recent method to adjust for the output space discrepancy?
A recent method adjusts for this output space discrepancy and improves the prediction performance (Sharifi-Noghabi et al. 2020), but this method requires access to the target domain during training which violates the assumption of out-of-distribution generalization.
Q7. What was the significance of Velodrome on non-solid tissues?
For that, the authors tested the trained Velodrome models for the studied drugs on samples originated from non-solid tissues in the gCSI cell line dataset and evaluated the performance in terms of Pearson correlation between the predictions and the actual AAC values and reported two-tailed p-value as well.
Q8. What is the relationship between CMPK1 and EGFR?
Prostate cancer progression and lethal outcome have been associated with metabolic signaling pathways and CMPK1 (it mediates the mechanism of action for Gemcitabine) was shown to be highly expressed in prostate cancer patients (Kelly et al. 2016).
Q9. What is the average performance of Velodrome over the studied drugs?
Although the authors observed that the average performance (over the studied drugs) of all methods decreased, Velodrome still achieved the best performance on patients in terms of both AUROC and AUPR, and also the best performance in terms of both Pearson and Spearman correlation on cell lines.
Q10. What is the significance of Velodrome on non-solid tissues?
These(𝑃 > 0. 05) (𝑃 = 4 × 10−3) results suggest that Velodrome is as accurate (and even more accurate in the case of Erlotinib) as a non-solid predictor on these tissues even though it did not utilize them during training.
Q11. What is the version of Velodrome?
Their results on patients demonstrate that on average std (over all drugs for 10 independent runs), the± complete version of Velodrome outperforms its variants which indicates the added value of both alignment and consistency losses (Figure 3-B).
Q12. What is the role of BCL2 in kidney cancer?
BCL2 can also act as an oncoprotein in kidney cancer (Paraf et al. 1995) and therapeutics roles (Adams and Cory 2007; Delbridge et al. 2016).
Q13. What was the correlation between Velodrome and the predicted drugs?
Similar to the Velodrome results, this predictor also achieved significant correlations of 0.34 and 0.39(𝑃 = 10−2) for Erlotinib and Gemcitabine and negative correlations of -0.11(𝑃 = 5 × 10−3) and -0.4 for Docetaxel and Paclitaxel, respectively.