You are currently on a failover version of the Materials Cloud Archive hosted at CINECA, Italy.
Click here to access the main Materials Cloud Archive.
Note: If the link above redirects you to this page, it means that the Archive is currently offline due to maintenance. We will be back online as soon as possible.
This version is read-only: you can view published records and download files, but you cannot create new records or make changes to existing ones.

×

Recommended by

Indexed by

Bayesian hierarchical models for quantitative estimates for performance metrics applied to saddle search algorithms

Rohit Goswami1,2*

1 Science Institute and Faculty of Physical Sciences, University of Iceland, Reykjavík, Iceland

2 Department of Mechanical and Materials Engineering, Queen’s University, Kingston, Ontario, Canada, K7L 3N6

* Corresponding authors emails: rgoswami@ieee.org
DOI10.24435/materialscloud:xc-5e [version v1]

Publication date: Jun 01, 2025

How to cite this record

Rohit Goswami, Bayesian hierarchical models for quantitative estimates for performance metrics applied to saddle search algorithms, Materials Cloud Archive 2025.91 (2025), https://doi.org/10.24435/materialscloud:xc-5e

Description

The increasing use of high-throughput computational chemistry demands rigorous methods for evaluating algorithm performance. We present a Bayesian hierarchical modeling paradigm (brms/Stan) for analyzing key performance metrics: function evaluations, computation time, and success/failure. This framework accounts for variability across different systems and functionals, providing reliable uncertainty estimates beyond subjective visual assessments or frequentist limitations. We applied this to compare conjugate gradient (CG) and L-BFGS algorithms for the Dimer method's rotation phase (in EON, with/without removal of external rotations) on a benchmark of 500 initial saddle search approximations, analyzing over 2000 runs. Our results show CG rotations generally outperform L-BFGS, exhibiting a statistically credible, small reduction in PES calls and significantly higher odds of successful convergence. Conversely, enabling rotation removal incurred a substantial PES call penalty without a corresponding credible improvement in success odds in the implementation studied. These findings, from our novel Bayesian hierarchical modeling application, suggest CG may be preferable for Dimer rotational optimization in similar contexts. This robust statistical framework highlights benefits for revisiting optimization strategies, quantifying uncertainty, and facilitating improved high-throughput computational chemistry methods. This record contains the saddle search output logs for EON with NWChem across four settings, with/without external rotation and the use of CG/LBFGS for the rotational phase of the dimer. The record also includes fitted Bayesian Hierarchical models for performance and success analysis. These models and data are used to generate the figures and validate the analysis in the manuscript. For details, refer to the code in the associated GitHub repository.

Materials Cloud sections using this data

No Explore or Discover sections associated with this archive record.

Files

File name Size Description
hpc.tar.xz
MD5md5:5cd1034a2d822b45a4e918445b9c2e86
192.3 MiB Output logs for EON Dimer across CG-LBFGS rotations and with/without external rotation removal
models_and_preds.tar.xz
MD5md5:a18d30d132e6dcfcd56b57823371a5c1
1.5 GiB Exported BRMS models and predictions, both as R objects and parquet files with stan code.
readme.txt
MD5md5:d5e4323e70b1aec0dd4fa2901c99a8b5
4.5 KiB README containing data structure and usage instructions for the R objects and FAIR alternative.

License

Files and data are licensed under the terms of the following license: Materials Cloud non-exclusive license to distribute v1.0.
Metadata, except for email addresses, are licensed under the Creative Commons Attribution Share-Alike 4.0 International license.

External references

Preprint (Preprint describing the model and analysis of the data in the record.)
Software (Software collection used to generate the models and data in this record.)

Keywords

saddle-search performance-modeling transition-state success-modeling

Version history:

2025.91 (version v1) [This version] Jun 01, 2025 DOI10.24435/materialscloud:xc-5e