Prediction of cancer cell line-specific synergistic drug combinations based on multi-omics data

View article
Bioinformatics and Genomics

Main article text

 

Introduction

Method

Framework of XDDC

Drug and pathway data screening

Drug features

Multi-omics data processing for cell lines

Cell line-pathway features

NCI-ALMANAC data preprocessing

Drug combination modeling with machine learning and deep learning algorithms

Model adjustment and validation assessment

Drug validation experiments

Result

Exploration of NCI-ALMANAC dataset

Model selection and evaluation

Feature selection

Correction of unbalanced data

Feature importance evaluation

Prediction and validation in unlabeled data

Drug validation experiments

Discussion

Supplemental Information

Exploration and filtering of the NCI-ALMANAC dataset.

(A) The stacked bar chart illustrates the number of drug pairs in different cell lines, taking the threshold of 10 as an example. Green represents the number of drug pairs without synergistic effects according to NCI, yellow represents the number of drug pairs with synergistic effects according to NCI, and red represents the number of untested drug pairs in the validation set by NCI. (B) Positive drug pair counts in cell lines corresponding to the 9 types of cancer when the threshold is set to 10.

DOI: 10.7717/peerj.19078/supp-1

Weight allocation and undersampling method evaluation.

A-B represents the ROC curves of the models after weight adjustment and undersampling, indicating the AUC values. C-D represents the PR curves of the models after processing, indicating the AUPR values. E-F represent the accuracy and precision of each model after the three different methods, where R1 represents weight adjustment, R2 represents undersampling, and R3 represents oversampling.

DOI: 10.7717/peerj.19078/supp-2

Prediction of drug combination networks.

A-E represent the predicted drug combination networks that can act on melanoma, central nervous system cancer(CNS cancer), renal cancer, colon cancer, and prostate cancer-related cell lines.

DOI: 10.7717/peerj.19078/supp-3

Overview of NCI-ALMANAC Training and Testing Sets.

DOI: 10.7717/peerj.19078/supp-4

Evaluation of Combined Input Feature Sets.

DOI: 10.7717/peerj.19078/supp-5

Feature Importance.

DOI: 10.7717/peerj.19078/supp-6

The top ten featured pathways and their annotations in each feature set.

DOI: 10.7717/peerj.19078/supp-7

All positive predicted results.

DOI: 10.7717/peerj.19078/supp-8

Additional Information and Declarations

Competing Interests

Author Contributions

Data Availability

Funding

This work was supported by the Natural Science Foundation of Hainan Province [No. 821QN0894, 621MS041, 824RC514]; National Natural Science Foundation of China [No.32260155]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

1 Citation 538 Views 53 Downloads