tune_substructure#
- nvmolkit.autotune.tune_substructure(targets: ~typing.Sequence[~rdkit.Chem.rdchem.Mol], queries: ~typing.Sequence[~rdkit.Chem.rdchem.Mol], *, api: ~typing.Callable = <function hasSubstructMatch>, maxMatches: int = 0, uniquify: bool = False, gpuIds: ~typing.Iterable[int] | None = None, calibration_set: ~typing.Iterable[int] | None = None, calibration_fraction: float = 0.1, calibration_max_size: int = 2000, target_seconds_per_trial: float = 10.0, n_trials: int = 30, search_space_overrides: dict[str, ~typing.Any] | None = None, cpu_budget: int | None = None, sampler: ~typing.Any = None, seed: int | None = None, verbose: bool = False) TuneResult#
Tune
SubstructSearchConfigfor a substructure-search workflow.- Parameters:
targets – Library of target molecules. Calibration trials are run on a (possibly auto-subsampled) slice of these targets.
queries – Query molecules. The same query set is used for every trial.
api – Which substructure-search entry point to tune. One of
hasSubstructMatch(),countSubstructMatches(), orgetSubstructMatches().maxMatches –
maxMatchesargument forwarded to the resulting config. Held constant across trials.uniquify –
uniquifyflag forwarded to the resulting config.gpuIds – GPU device IDs to use. Fixed across the study.
calibration_set – Optional explicit indices into
targets.calibration_fraction – Fraction of the workload to auto-sample.
calibration_max_size – Cap on the auto-sampled calibration size.
target_seconds_per_trial – Target wall-clock budget for one trial.
n_trials – Number of Optuna trials to run after warm-up.
search_space_overrides – Optional overrides for
batchSize,workerThreads, orpreprocessingThreadsranges.cpu_budget – Optional explicit cap on total CPU threads. The default (
None) usesos.cpu_count(). The joint constraintnum_gpus * workerThreads + preprocessingThreads <= cpu_budgetis enforced when sampling each trial. Set this when normalizing tuning runs across machines with different core counts so the search space stays comparable.sampler – Optional Optuna sampler.
seed – Seed for the default sampler.
verbose – Print warm-up and trial diagnostics.
- Returns:
TuneResultwithbest_configset to a fully-populatedSubstructSearchConfiginstance.