TY - JOUR
T1 - Metabolomic predictors of phenotypic traits can replace and complement measured clinical variables in population-scale expression profiling studies
AU - BBMRI-NL BIOS consortium
AU - BBMRI-NL Metabolomics consortium
AU - Niehues, Anna
AU - Bizzarri, Daniele
AU - Reinders, Marcel J.T.
AU - Slagboom, P. Eline
AU - van Gool, Alain J.
AU - van den Akker, Erik B.
AU - 't Hoen, Peter A.C.
PY - 2022
Y1 - 2022
N2 - Population-scale expression profiling studies can provide valuable insights into biological and disease-underlying mechanisms. The availability of phenotypic traits is essential for studying clinical effects. Therefore, missing, incomplete, or inaccurate phenotypic information can make analyses challenging and prevent RNA-seq or other omics data to be reused. A possible solution are predictors that infer clinical or behavioral phenotypic traits from molecular data. While such predictors have been developed based on different omics data types and are being applied in various studies, metabolomics-based surrogates are less commonly used than predictors based on DNA methylation profiles.In this study, we inferred 17 traits, including diabetes status and exposure to lipid medication, using previously trained metabolomic predictors. We evaluated whether these metabolomic surrogates can be used as an alternative to reported information for studying the respective phenotypes using expression profiling data of four population cohorts. For the majority of the 17 traits, the metabolomic surrogates performed similarly to the reported phenotypes in terms of effect sizes, number of significant associations, replication rates, and significantly enriched pathways.The application of metabolomics-derived surrogate outcomes opens new possibilities for reuse of multi-omics data sets. In studies where availability of clinical metadata is limited, missing or incomplete information can be complemented by these surrogates, thereby increasing the size of available data sets. Additionally, the availability of such surrogates could be used to correct for potential biological confounding. In the future, it would be interesting to further investigate the use of molecular predictors across different omics types and cohorts.
AB - Population-scale expression profiling studies can provide valuable insights into biological and disease-underlying mechanisms. The availability of phenotypic traits is essential for studying clinical effects. Therefore, missing, incomplete, or inaccurate phenotypic information can make analyses challenging and prevent RNA-seq or other omics data to be reused. A possible solution are predictors that infer clinical or behavioral phenotypic traits from molecular data. While such predictors have been developed based on different omics data types and are being applied in various studies, metabolomics-based surrogates are less commonly used than predictors based on DNA methylation profiles.In this study, we inferred 17 traits, including diabetes status and exposure to lipid medication, using previously trained metabolomic predictors. We evaluated whether these metabolomic surrogates can be used as an alternative to reported information for studying the respective phenotypes using expression profiling data of four population cohorts. For the majority of the 17 traits, the metabolomic surrogates performed similarly to the reported phenotypes in terms of effect sizes, number of significant associations, replication rates, and significantly enriched pathways.The application of metabolomics-derived surrogate outcomes opens new possibilities for reuse of multi-omics data sets. In studies where availability of clinical metadata is limited, missing or incomplete information can be complemented by these surrogates, thereby increasing the size of available data sets. Additionally, the availability of such surrogates could be used to correct for potential biological confounding. In the future, it would be interesting to further investigate the use of molecular predictors across different omics types and cohorts.
KW - Clinical surrogates
KW - Expression profiling
KW - Meta-analysis
KW - Metabolomics
KW - Multi-omics
KW - Population cohort study
KW - Predictors
KW - Surrogate outcomes
KW - Surrogates
KW - Transcriptomics
UR - http://www.scopus.com/inward/record.url?scp=85135224559&partnerID=8YFLogxK
U2 - 10.1186/s12864-022-08771-7
DO - 10.1186/s12864-022-08771-7
M3 - Article
C2 - 35907790
AN - SCOPUS:85135224559
VL - 23
JO - BMC Genomics
JF - BMC Genomics
SN - 1471-2164
IS - 1
M1 - 546
ER -