GTAM: A Molecular Pretraining Model with Geometric Triangle Awareness

Published in Bioinformatics, 2024

Author: Xiaoyang Hou*, Tian Zhu*, Milong Ren*, Bo Duan, Chunming Zhang, Dongbo Bu, Shiwei Sun

Abstract:

Motivation: Molecular representation learning is pivotal for advancing deep learning applications in quantum chemistry and drug discovery. Existing methods for molecular representation learning often fall short of fully capturing the intricate interactions within chemical bonds of 2D topological graphs and the multifaceted effects of 3D geometric conformations.

Results: To overcome these challenges, we present a novel contrastive learning strategy for molecular representation learning, named Geometric Triangle Awareness Model (GTAM). This method integrates innovative molecular encoders for both 2D graphs and 3D conformations, enabling the accurate capture of geometric dependencies among edges in graph-based molecular structures. Furthermore, GTAM is bolstered by the development of two contrastive training objectives designed to facilitate the direct transfer of edge information between 2D topological graphs and 3D geometric conformations, enhancing the functionality of the molecular encoders. Through extensive evaluations on a range of 2D and 3D downstream tasks, our model has demonstrated superior performance over existing approaches.

Download paper here

GitHub Source Code