Filling the gaps: Data Imputation Methods for Drug Discovery
Jiahao Yu, and David Marcus
In UK-QSAR Spring 2024 Meeting , Apr 2024
Drug discovery datasets are often shown as sparse, noisy, and heterogeneous. To facilitate drug discovery projects and to ensure the effectiveness of Machine Learning (ML) algorithms and predictive models, it is necessary sometimes to find methods that can fill in the gaps in this data. In this poster, we discuss several classic and state-of-the-art data imputation methods and compare their performance with classic QSAR modelling. We found that data imputation models can usually outperform classic QSAR models, however some are unsuitable for data imputation in drug discovery, and some will require extensive calculation time.