You are currently on a failover version of the Materials Cloud Archive hosted at CINECA, Italy.
Click here to access the main Materials Cloud Archive.
Note: If the link above redirects you to this page, it means that the Archive is currently offline due to maintenance. We will be back online as soon as possible.
This version is read-only: you can view published records and download files, but you cannot create new records or make changes to existing ones.

Published November 28, 2021 | Version v1
Dataset Open

3DMolNet: a generative network for molecular structures

  • 1. Department of Mathematics and Computer Science, University of Basel, Switzerland

* Contact person

Description

With the recent advances in machine learning for quantum chemistry, it is now possible to predict the chemical properties of compounds and to generate novel molecules. Existing generative models mostly use a string- or graph-based representation, but the precise three-dimensional coordinates of the atoms are usually not encoded. First attempts in this direction have been proposed, where autoregressive or GAN-based models generate atom coordinates. Those either lack a latent space in the autoregressive setting, such that a smooth exploration of the compound space is not possible, or cannot generalize to varying chemical compositions. We propose a new approach to efficiently generate molecular structures that are not restricted to a fixed size or composition. Our model is based on the variational autoencoder which learns a translation-, rotation-, and permutation-invariant low-dimensional representation of molecules. Our experiments yield a mean reconstruction error below 0.05 Angstrom, outperforming the current state-of-the-art methods by a factor of four, and which is even lower than the spatial quantization error of most chemical descriptors. The compositional and structural validity of newly generated molecules has been confirmed by quantum chemical methods in a set of experiments.

Files

File preview

files_description.md

All files

Files (169.2 KiB)

Name Size
md5:69f7bef45590e23b6581046de791ed43
176 Bytes Preview Download
md5:e8c420d2b4c526109f5a4a8b4341234b
169.0 KiB Preview Download

References

Preprint
V Nesterov, M Wieser, V Roth - arXiv preprint arXiv:2010.06477, 2020