TY - JOUR
T1 - Predicting protein retention in ion-exchange chromatography using an open source QSPR workflow
AU - Neijenhuis, Tim
AU - Le Bussy, Olivier
AU - Geldhof, Geoffroy
AU - Klijn, Marieke E.
AU - Ottens, Marcel
PY - 2024
Y1 - 2024
N2 - Protein-based biopharmaceuticals require high purity before final formulation to ensure product safety, making process development time consuming. Implementation of computational approaches at the initial stages of process development offers a significant reduction in development efforts. By preselecting process conditions, experimental screening can be limited to only a subset. One such computational selection approach is the application of Quantitative Structure Property Relationship (QSPR) models that describe the properties exploited during purification. This work presents a novel open-source Python tool capable of extracting a range of features from protein 3D models on a local computer allowing total transparency of the calculations. As open-source tool, it also impacts initial investments in constructing a QSPR workflow for protein property prediction for third parties, making it widely applicable within the field of bioprocess development. The focus of current calculated molecular features is projection onto the protein surface by constructing surface grid representations. Linear regression models were trained with the calculated features to predict chromatographic retention times/volumes. Model validation shows a high accuracy for anion and cation exchange chromatography data (cross-validated R2 of 0.87 and 0.95). Hence, these models demonstrate the potential of the use of QSPR to accelerate process design.
AB - Protein-based biopharmaceuticals require high purity before final formulation to ensure product safety, making process development time consuming. Implementation of computational approaches at the initial stages of process development offers a significant reduction in development efforts. By preselecting process conditions, experimental screening can be limited to only a subset. One such computational selection approach is the application of Quantitative Structure Property Relationship (QSPR) models that describe the properties exploited during purification. This work presents a novel open-source Python tool capable of extracting a range of features from protein 3D models on a local computer allowing total transparency of the calculations. As open-source tool, it also impacts initial investments in constructing a QSPR workflow for protein property prediction for third parties, making it widely applicable within the field of bioprocess development. The focus of current calculated molecular features is projection onto the protein surface by constructing surface grid representations. Linear regression models were trained with the calculated features to predict chromatographic retention times/volumes. Model validation shows a high accuracy for anion and cation exchange chromatography data (cross-validated R2 of 0.87 and 0.95). Hence, these models demonstrate the potential of the use of QSPR to accelerate process design.
KW - chromatography
KW - protein features
KW - Quantitative Structure Activity Relationship (QSAR)
KW - Quantitative Structure Property Relationship (QSPR)
KW - retention prediction
UR - http://www.scopus.com/inward/record.url?scp=85187740436&partnerID=8YFLogxK
U2 - 10.1002/biot.202300708
DO - 10.1002/biot.202300708
M3 - Article
C2 - 38479997
AN - SCOPUS:85187740436
SN - 1860-6768
VL - 19
JO - Biotechnology Journal
JF - Biotechnology Journal
IS - 3
M1 - 2300708
ER -