Genomic data integration by WON-PARAFAC identifies interpretable factors for predicting drug-sensitivity in vivo

Yongsoo Kim, Tycho Bismeijer, Wilbert Zwart*, Lodewyk F.A. Wessels, Daniel J. Vis

*Corresponding author for this work

Research output: Contribution to journal › Article › Scientific › peer-review


Abstract

Integrative analyses that summarize molecular data and link them to treatment sensitivity are crucial for capturing the biological complexity that is essential to advancing precision medicine. We introduce Weighted Orthogonal Nonnegative parallel factor analysis (WON-PARAFAC), a data integration method that identifies sparse and interpretable factors. WON-PARAFAC summarizes the GDSC1000 cell line compendium in 130 factors. We interpret the factors based on their association with recurrent molecular alterations, pathway enrichment, cancer type, and drug response. Crucially, the cell line-derived factors capture the majority of the relevant biological variation in patient-derived xenograft (PDX) models, strongly suggesting that our factors capture invariant and generalizable aspects of cancer biology. Furthermore, drug response in cell lines translates better and more consistently to PDXs with factor-based predictors than with raw feature-based predictors. WON-PARAFAC efficiently summarizes and integrates multiway high-dimensional genomic data and enhances the translatability of drug response prediction from cell lines to patient-derived xenografts.

Original language: English
Article number: 5034
Number of pages: 12
Journal: Nature Communications
Volume: 10
Issue number: 1
Publication status: Published - 1 Dec 2019
