Molecular sampling
Prerequisites
Oracle functions used in the benchmarks require additional dependencies (GuacaMol, PyTorch Geometric, etc.). Please follow the instructions below to make sure these dependencies are installed properly.
If you are using Pixi, make sure to activate the dev environment:
If you installed the environment manually using conda/mamba and PyPI packages, make sure the eval extra dependencies are installed (refer to the installation instructions).
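Before launching a long benchmark run, it can help to confirm that the optional oracle packages are visible to the active Python environment. The following is a minimal sketch, not part of PrexSyn itself: it assumes the import names guacamol and torch_geometric for the two packages named above, and the helper missing_dependencies is a hypothetical name introduced only for this illustration. Extend the mapping if your setup needs further packages.

```python
from importlib.util import find_spec

# Assumed top-level import names for the optional oracle dependencies.
# These are illustrative; adjust the mapping to match your environment.
ORACLE_MODULES = {
    "GuacaMol": "guacamol",
    "PyTorch Geometric": "torch_geometric",
}


def missing_dependencies(modules=ORACLE_MODULES):
    """Return display names of packages whose top-level module cannot be located.

    find_spec only looks the module up; it does not import it, so this check
    is fast but will not catch errors raised at import time.
    """
    return [name for name, module in modules.items() if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing oracle dependencies:", ", ".join(missing))
    else:
        print("All listed oracle dependencies were found.")
```

If the script reports a missing package, revisit the installation instructions rather than installing packages individually, since the eval extra may pin specific versions.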
GuacaMol benchmark
To reproduce the results in Table 2 of the PrexSyn paper, run the following benchmark script:
This script runs the molecular optimization tasks sequentially, repeating each task 5 times.
A subset of tasks can be specified using the -t or --task argument, and the full list of available tasks can be found in the script file.
For example, to run only the Amlodipine MPO and Celecoxib Rediscovery tasks, use:
Results will be saved in the outputs/benchmarks/optim directory. For example, the result for Amlodipine MPO can be found at outputs/benchmarks/optim/amlodipine/log.txt. The log file should be similar to the following, which corresponds to the results in Table 2:
sEH proxy
Run the following command to reproduce the sEH proxy task results:
The log file is written to outputs/benchmarks/optim/sEH_proxy/log.txt and should be similar to the following: