Repurformer: Transformers for Repurposing-Aware Molecule Generation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Generating as diverse molecules as possible with desired properties is crucial for drug discovery research, which invokes many approaches based on deep generative models today. Despite recent advancements in these models, particularly in variational autoencoders (VAEs), generative adversarial networks (GANs), Transformers, and diffusion models, a significant challenge known as the sample bias problem remains. This problem occurs when generated molecules targeting the same protein tend to be structurally similar, reducing the diversity of generation. To address this, we propose leveraging multi-hop relationships among proteins and compounds. Our model, Repurformer, integrates bi-directional pretraining with Fast Fourier Transform (FFT) and low-pass filtering (LPF) to capture complex interactions and generate diverse molecules. A series of experiments on BindingDB dataset confirm that Repurformer successfully creates substitutes for anchor compounds that resemble positive compounds, increasing diversity between the anchor and generated compounds.

Original languageEnglish
Title of host publicationLang + Mol 2024 - 1st Workshop on Language + Molecules, Proceedings of the Workshop
EditorsCarl Edwards, Qingyun Wang, Manling Li, Lawrence Zhao, Tom Hope, Heng Ji
PublisherAssociation for Computational Linguistics (ACL)
Pages117-128
Number of pages12
ISBN (Electronic)9798891761483
StatePublished - 2024
Event1st Workshop on Language + Molecules, Lang + Mol 2024 - co-located with ACL 2024 - Bangkok, Thailand
Duration: 15 Aug 2024 → …

Publication series

NameLang + Mol 2024 - 1st Workshop on Language + Molecules, Proceedings of the Workshop

Conference

Conference1st Workshop on Language + Molecules, Lang + Mol 2024 - co-located with ACL 2024
Country/TerritoryThailand
CityBangkok
Period15/08/24 → …

Fingerprint

Dive into the research topics of 'Repurformer: Transformers for Repurposing-Aware Molecule Generation'. Together they form a unique fingerprint.

Cite this