Predicting drug metabolism and pharmacokinetics features of in-house compounds by a hybrid machine-learning model

2021 
Abstract We constructed machine learning-based pharmacokinetic prediction models with very high performance. The models were trained on 26138 and 16613 compounds involved in metabolic stability and cytochrome P450 inhibition, respectively. Because the compound features largely differed between the publicly available and in-house compounds, the models learned on the public data could not predict the in-house compounds, suggesting that outside of a certain applicability domain (AD), the prediction results are unreliable and can mislead the design of novel compounds. To exclude the uncertain prediction results, we constructed another machine learning model that determines whether the newly designed compound is inside or outside the AD. The AD was evaluated multi-dimensionally with some explanatory variables: The structural similarities and the probability obtained from the pharmacokinetic prediction model. The accuracy of predicting metabolic stability was 79.9% on the test set, increasing significantly to 93.6% after excluding the low-reliability compounds. The model properly classified the reliability of the compounds. After learning on the in-house compounds, the reliability model classified almost all (90%) of the public compounds as low reliability, indicating that the AD was properly determined by the model.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    25
    References
    1
    Citations
    NaN
    KQI
    []