A Multi-Method Comparison of Machine Learning in predicting pharmacokinetic parameters: A simulation study

One important aim in population pharmacokinetics (PK) and pharmacodynamics (PD) is identification and quantification of the relationships between the parameter and covariates to improve the predictive performance of the population PK/PD modeling. Several new mathematical methods have been developed in pharmacokinetics in recent year which indicated that the machine learning-base methods are an appealing tool for analyzing PK/PD data. This simulation-base study aims to determine whether machine learning methods, including support vector regression (SVR) and Random forest (RF) which specifically designed for the prediction of blood serum concentration or clearance, could be an effective replacement for the Lasso covariate selection method in nonlinear mixed effect models. Accordingly, the predictive performance of penalized regression Lasso, SVR and RF regression was compared to detect the associations between clearance and model covariates. PK data was simulated from a one-compartment model with oral administration. Covariates were created by sampling from a multivariate standard normal distribution with different levels of correlation. The true covariates influenced only clearance at different magnitudes. Lasso, RF and SVR were compared in terms of mean absolute prediction error (MAE). The results show that SVR performed the best in small data sets, even in those in which a high correlation existed between covariates. This makes SVR a promising method for covariate selection in nonlinear mixed-effect models.