AiGPro: a multi-tasks model for profiling of GPCRs for agonist and antagonist
Journal of Cheminformatics volume 17, Article number: 12 (2025)
Abstract
G protein-coupled receptors (GPCRs) play vital roles in various physiological processes, making them attractive drug discovery targets. Meanwhile, deep learning techniques have revolutionized drug discovery by providing efficient tools that expedite the identification and optimization of ligands. However, existing models for GPCRs often focus on a single target or a small subset of GPCRs, or employ binary classification, constraining their applicability for high-throughput virtual screening. To address these issues, we introduce AiGPro, a novel multi-task model designed to predict small-molecule agonist (EC50) and antagonist (IC50) activities across 231 human GPCRs, making it a first-in-class solution for large-scale GPCR profiling.
Leveraging multi-scale context aggregation and bidirectional multi-head cross-attention mechanisms, our approach demonstrates that ensemble models may not be necessary for predicting complex GPCR states and small-molecule interactions. Through extensive validation using stratified tenfold cross-validation, AiGPro achieves robust performance with a Pearson's correlation coefficient of 0.91, indicating broad generalizability. This sets a new standard in GPCR studies, outperforming previous approaches. Moreover, our first-in-class multi-task model can predict agonist and antagonist activities across a wide range of GPCRs, offering a comprehensive perspective on ligand bioactivity within this diverse superfamily. To facilitate easy accessibility, we have deployed a web-based platform for model access at https://aicadd.ssu.ac.kr/AiGPro.
Scientific Contribution We introduce a deep learning-based multi-task model to generalize the agonist and antagonist bioactivity prediction for GPCRs accurately. The model is implemented on a user-friendly web server to facilitate rapid screening of small-molecule libraries, expediting GPCR-targeted drug discovery. Covering a diverse set of 231 GPCR targets, the platform delivers a robust, scalable solution for advancing GPCR-focused therapeutic development.
The proposed framework incorporates an innovative dual-label prediction strategy, enabling the simultaneous classification of molecules as agonists, antagonists, or both. Each prediction is further accompanied by a confidence score, offering a quantitative measure of activity likelihood. This advancement moves beyond conventional models focusing solely on binding affinity, providing a more comprehensive understanding of ligand-receptor interactions.
At the core of our model lies the Bi-Directional Multi-Head Cross-Attention (BMCA) module, a novel architecture that captures forward and backward contextual embeddings of protein and ligand features. By leveraging BMCA, the model effectively integrates structural and sequence-level information, ensuring a precise representation of molecular interactions. Results show that this approach is highly accurate in binding affinity predictions and consistent across diverse GPCR families.
By unifying agonist and antagonist bioactivity prediction into a single model architecture, we bridge a critical gap in GPCR modeling. This enhances prediction accuracy and accelerates virtual screening workflows, offering a valuable and innovative solution for advancing GPCR-targeted drug discovery.
Graphical Abstract

Introduction
G-protein coupled receptors (GPCRs) are a vast family of transmembrane proteins that play a critical role in numerous cellular signaling pathways. They facilitate the transmission of signals from outside the cell to the inside by regulating G proteins. They are involved in multiple signaling pathways activated by various chemical compounds, hormones, and neurotransmitters, influencing crucial cellular processes such as growth, differentiation, vision, olfaction, and gustation [1]. Out of the 826 human GPCRs, approximately 350 non-olfactory members are considered druggable, with 165 validated as drug targets [2]. Given their critical role in fundamental physiological functions, it is not surprising that they are associated with neurodegenerative and psychiatric disorders, such as Parkinson's and Alzheimer's disease (AD) [2]. Despite challenges in drug development for Alzheimer's, clinical trials exploring GPCR agonism in treatment are underway [3, 4]. The human GPCR family is categorized into classes A (rhodopsin), B (secretin and adhesion), C (glutamate), and F (Frizzled) subfamilies based on amino acid sequences. Notably, approved drugs for neuropsychiatric diseases mainly target class A and C GPCRs, underscoring their significance in therapeutic strategies. Understanding and targeting specific GPCR classes offer potential breakthroughs in treating complex neurological conditions. Remarkably, one-third of currently available drugs target GPCRs, addressing a spectrum of human diseases, including cardiac malfunction, obesity, asthma, and migraines. GPCRs account for 12% of all human protein drug targets and contribute to the therapeutic effects of 34% of small molecule drugs [2, 5, 6]. Certain drugs, exemplified by clozapine, initially designed for specific protein targets, have been retrospectively demonstrated to exert clinical actions by modulating multiple GPCR proteins [7,8,9]. This underscores the unique polypharmacological profiles associated with GPCR modulation.
As of December 2023, approximately 35% (about 700) of all US FDA-approved drugs were reported to act on GPCR targets [6, 10]. Furthermore, 321 drugs targeting GPCRs are currently in clinical trials, 66 of which target GPCRs not presently addressed by approved drugs. Examples of drugs in clinical trials include LJPC-501, INT-767, and RX-10045 [5]. Between 2011 and 2015, drugs that target GPCRs generated over $900 billion in sales [11]. Collectively, GPCRs, along with related proteins upstream or downstream of GPCRs, constitute approximately 17% of all protein targets for approved drugs [6], with GPCRs themselves accounting for about 12%, underscoring their vital role in drug development and therapeutic interventions [6]. This emphasizes the significance of GPCRs as critical players in pharmaceutical research and treatment modalities.
The structural elucidation of GPCRs began in 2000 with the resolution of bovine rhodopsin, marking a continuous increase in experimental GPCR structures. Despite progress, only 70 unique GPCRs have been characterized among 370 GPCR-ligand complexes with resolved structures [12]. Among these structures, 25 GPCRs have both agonist and antagonist binding, 33 exclusively with antagonist binding, 11 solely with agonist binding, and one without any ligand bound, providing a detailed overview of GPCR conformational diversity [12]. The scarcity of high-resolution GPCR structures challenges understanding activation mechanisms and hinders structure-based drug design [13]. Experimental efforts and computational advancements like molecular dynamics (MD) and machine learning (ML) have produced high-quality models systematically cataloged in repositories such as GPCRdb [14, 15] and GPCR-EXP [16]. However, many GPCRs still lack experimental 3D data. In the absence of receptor structures, alternative ligand-based techniques, such as quantitative structure–activity relationship (QSAR) models, have been explored [17]. Datasets detailing small-molecule activity against GPCRs offer opportunities for in silico ligand-based screening, including the application of ML models.
Recent advancements in computational approaches have significantly contributed to understanding protein interactions with ligands [18,19,20,21,22,23]. Several classification models have been developed to discern the activity of GPCR ligands, ranging from simple binary prediction (active/inactive) to predicting the bioactivity of antagonists/agonists against a single GPCR or a small subset of GPCRs. One classification model was developed using hub and cycle structures of ligands, along with the amino acid motif sequences of GPCRs [24]. Based on UniProt and the Database of Interacting Proteins (DIP), a Random Forest (RF) model was developed with a focus on specific and important types of GPCRs, employing different types of sequence-based features to improve prediction accuracy [25]. The Helix encoder, a compound-protein interaction (CPI) model explicitly designed for class A GPCRs, employs attention-based convolutional neural networks (CNNs) [26]. GPCRLigNet, on the other hand, is an ML-based feed-forward neural network (FFN) incorporating dilated graph convolutional networks (GCN), trained with a diverse dataset to conduct binary classification into active and inactive GPCR ligands [27]. DeepREAL employs a multi-scale modeling approach to analyze genome-wide ligand-induced receptor activities through transfer learning from a pre-trained binary interaction classification model [28]. SDTNBI, or Substructure-Drug-Target Network-Based Inference, prioritizes potential targets for old drugs, failed drugs, and new chemical entities by integrating network analysis and chemoinformatics to bridge the gap between novel chemical entities and the established Drug-Target Interaction (DTI) network [29]. A two-step RF-based binary classifier performed similarly to SDTNBI, with an AUC of 0.795 [30]. DTI-MLCD innovatively transforms DTI prediction from binary to multi-label classification, incorporating community detection for label correlations using a fast greedy algorithm [31]. It adapts feature representations based on dataset-specific requirements, achieving competitive performance while addressing the computational load and label-correlation issues inherent in binary methods [31]. Some studies have focused on a specific target; for instance, in [32], an RF model was developed to classify ligands based on molecular fingerprint features against the Human Adenosine Receptor type 2A (A2AR), which is implicated in neurodegenerative diseases like Parkinson's and in cancer and is a proven druggable target [32,33,34]. Docking and ML were also used to identify the pharmacological activity of ligands for the β2 adrenergic receptor, focusing on the specific residues involved in both agonist and antagonist interactions [35]. Another study focused on analyzing ligand features using molecular fingerprints and embeddings for GPR151, applying numerous classical feature-selection algorithms and DL models [36]. However, the interaction of compounds with GPCRs is more complex than binary or two mutually exclusive classes, i.e., agonist or antagonist. Other subtle activity classes include neutral antagonists, ligands that are both agonists and antagonists, inverse agonists, and partial agonists. A multi-class model could be a more suitable choice; however, the need for such cleanly labeled data makes it a challenging problem. In practice, predicting the degree of activity of an unseen ligand in each state, i.e., a regression model for both agonist and antagonist activity, would be more helpful.
More relevant efforts are screening Lasso of ECFPs and the deep neural nets (SED) approach, comprising ECFP generation, critical substructure selection, and bioactivity prediction using a DNN regression model [37]. This method was applied to 16 GPCRs (Classes A, B, C, and F, spanning 13 subfamilies). Further, they also used weighted DL and RF with five types of molecular fingerprints to develop the WDL-RF methods, which extended to 26 GPCRs, covering the same classes as their previous article [38]. GCN has also effectively predicted bioactivity against diverse targets, including 33 GPCRs [39]. Further, pdCSM-GPCR, another graph-based model, predicts bioactivity across 36 primary GPCR targets [40]. Recently, ensemble models employing five algorithms demonstrated a robust predictive capability for EC50 values of human orphan GPCRs, achieving a Pearson's correlation coefficient of 0.85 through training on 200 GPCRs utilizing MSA, physiochemical properties, and molecular fingerprints [41].
Despite extensive efforts in GPCR research, current methodologies predominantly center on classifying active and inactive or characterizing agonist and antagonist attributes, limiting comprehensive small molecule profiling against GPCRs, especially regarding bioactivity properties. Existing models for regression tasks are scarce and often focus on a limited GPCR subset, underscoring the complexities in accurately predicting bioactivity for small molecules against GPCRs. This gap requires a more comprehensive approach to deciphering the complexities of GPCR interactions.
Recently, attention-based models, proven highly successful in natural language processing (NLP) tasks, have found significant utility in drug-target affinity (DTA) and drug-target interaction (DTI) prediction [29, 31, 42, 43]. AiKPro introduced structurally validated multiple sequence alignments (svMSA) and multi-head attention (MHA) with cross-attention between kinase and ligand and showed improved results compared to previous models [22, 44]. Additionally, in KinScan, the integration of multi-scale context aggregation (MSCA) and a deep context encoder (DCE) resulted in a significant improvement in the prediction of bioactivity values [42]. Motivated by these advancements, our present study extends this approach with AiGPro. AiGPro is a single multi-task model based on a bi-directional multi-head cross-attention (BMCA) network with an applicability domain spanning the largest number of GPCRs to date (n = 231). To our knowledge, no other model covers this many human druggable GPCRs within its applicability domain. Several experiments demonstrated that it outperforms existing models, including ensemble models, in both accuracy and applicability domain. Additionally, to enhance accessibility, we offer AiGPro as a web service, accessible free of charge at https://aicadd.ssu.ac.kr/AiGPro.
Methodology
Data collection and pre-processing
We focused on constructing a diverse and comprehensive dataset for model training to address the current challenge. For this, we retrieved datasets from two databases: GLASS and GPCRdb. Last updated in February 2019, the GLASS database offered a repository of 562,871 curated GPCR-ligand interaction records featuring 342,539 ligands and 3,056 GPCRs with experimentally measured binding affinities. Simultaneously, GPCRdb, updated as of October 25, 2023, contained data on 424 GPCRs, 217,578 ligands, and 481,718 bioactivities. We then followed stringent filtration procedures, which excluded bioactivity values other than IC50, Ki, and EC50, duplicate pairs, compounds that could not be sanitized by RDKit, and non-standard experimental activity values, keeping only absolute values or those with ">" or "<" signs. The resultant dataset featured 231 distinct human GPCRs and 276,183 small molecules, forming 405,246 interactions comprising 44% antagonist and 56% agonist interactions. A separate dataset, not overlapping with the training data, containing 11,464 interactions with 11,259 unique ligands (52.78% of which are unseen) and a similar agonist/antagonist ratio, was used as the independent test set. More comprehensive details of the training dataset are given in Fig. 1. Additional file 1 is accessible at https://aicadd.ssu.ac.kr/supportedgpcr.
The Details of Data Used in the Study. A displays the number of proteins with agonist, antagonist, or both data types. B shows the number of proteins in each GPCR class. C illustrates the total number of interaction pairs and their categorization into agonist, antagonist, and both classes. D shows the count of unique ligands in different types of interaction with GPCRs. E displays the distribution of agonist and antagonist bioactivity. Finally, F demonstrates the relationship between the LogP and molecular weight of ligands for both the training and test datasets. Please note that (A–E) are specific to the training dataset
Distinguishing between antagonist and agonist datasets, we categorized the combined IC50 and Ki data as antagonistic, while the EC50 data represented agonists. Finally, for the remaining data, following [45], the experimental bioactivity (BA) values were perturbed with a small amount of noise and then transformed into the negative log of bioactivity (pBA) values.
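As a rough illustration of this transform (the exact noise scheme and the nM unit assumption follow common practice and [45]; they are assumptions here, not a reproduction of the paper's equation), activities can be converted to pBA as follows:

```python
import numpy as np

def to_pba(ba_nm: np.ndarray, noise_scale: float = 0.0, seed: int = 0) -> np.ndarray:
    """Convert experimental bioactivity values (assumed to be in nM) to pBA.

    pBA = -log10(BA * 1e-9), i.e., the negative log of the molar activity.
    An optional small jitter can be applied before the transform; the exact
    noise scheme used in the paper is not reproduced here.
    """
    rng = np.random.default_rng(seed)
    ba = ba_nm.astype(float)
    if noise_scale > 0:
        ba = ba * (1.0 + rng.normal(0.0, noise_scale, size=ba.shape))
    return -np.log10(np.clip(ba, 1e-10, None) * 1e-9)

# Example: 100 nM corresponds to pBA = 7.0
print(to_pba(np.array([100.0])))  # ~[7.]
```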
Sequence encodings
We used 1D sequences to represent both protein and chemical compounds. These 1D sequences consist of the MSA of proteins and Simplified Molecular Input Line Entry System (SMILES) strings of compounds. We employed structure-based alignment of protein sequences to encode the 3D structural information of proteins into a 1D sequence. This method provides a comprehensive representation of the structural features of proteins and allows us to gain valuable insights into their similarities and differences. On the other hand, SMILES is a concise ASCII string widely used for describing ligand chemical structures and efficiently encapsulating information about atoms, bonds, rings, and other molecular components.
Protein
For the protein sequence, a structure-based MSA was performed using the complete GPCR protein sequences of all unique proteins, facilitated by the GPCRdb sequence-alignment RESTful API available at https://gpcrdb.org/services/. We describe here the encoding of a single protein sequence from the MSA of all GPCRs. Given the protein MSA \(M=\{{P}_{1},{P}_{2},\dots ,{P}_{n}\}\), where \({P}_{i}\) is the protein at the i-th index of the MSA, each protein is \({P}_{i}=({a}_{1},{a}_{2},\dots ,{a}_{n})\), \(a\in [A, \text{'-'}]\), where \({a}_{i}\) represents the i-th amino acid, n the length of the sequence, and A the set of amino acid types. In \({P}_{i}\), along with the common amino acids, the gap character '-' is included, an inherent feature of MSA that represents gaps in the alignment. We encoded the protein sequence using a tokenization function T to obtain the tokenized sequence \(TP=\{{t}_{1},{t}_{2},\dots ,{t}_{n}\}\), where each \({t}_{i}=T({a}_{i})\in {[N]}^{t}\) is the token corresponding to \({a}_{i}\).
Here \({[N]}^{t}\) denotes the token vocabulary, which contains 25 elements, including the special tokens \({TOKEN}_{sp}\in \{\text{PAD}, \text{UNK}, \text{START}, \text{STOP}\}\). T encodes amino acids and gaps as discrete numerical values, facilitating computational operations and analysis within the MSA framework. In this study, n = 231 proteins and the maximum length of an aligned, tokenized protein sequence is 1,900. Each tokenized amino acid \({t}_{i}\in {[N]}^{t}\) is then embedded into a \({d}_{p}\)-dimensional vector via an embedding layer.
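A minimal sketch of this tokenization and embedding step is shown below. The exact token ordering and vocabulary construction are assumptions; the 25-symbol alphabet (20 amino acids, the MSA gap '-', and four special tokens), the 1,900-token maximum length, and the 32-dimensional embedding follow the description above.

```python
import torch
import torch.nn as nn

# Hypothetical 25-symbol protein vocabulary: special tokens, 20 amino acids, and the MSA gap '-'
SPECIALS = ["PAD", "UNK", "START", "STOP"]
AMINO_ACIDS = list("ACDEFGHIKLMNPQRSTVWY")
PROT_VOCAB = {tok: i for i, tok in enumerate(SPECIALS + AMINO_ACIDS + ["-"])}
MAX_PROT_LEN = 1900   # maximum aligned-sequence length reported above

def tokenize_protein(aligned_seq: str) -> torch.Tensor:
    """Map one MSA row (amino acids plus '-' gaps) to integer tokens, padded to MAX_PROT_LEN."""
    ids = [PROT_VOCAB["START"]]
    ids += [PROT_VOCAB.get(ch, PROT_VOCAB["UNK"]) for ch in aligned_seq[: MAX_PROT_LEN - 2]]
    ids.append(PROT_VOCAB["STOP"])
    ids += [PROT_VOCAB["PAD"]] * (MAX_PROT_LEN - len(ids))
    return torch.tensor(ids, dtype=torch.long)

# dp-dimensional embedding (dp = 32 in the paper)
prot_embedding = nn.Embedding(num_embeddings=len(PROT_VOCAB), embedding_dim=32,
                              padding_idx=PROT_VOCAB["PAD"])
tokens = tokenize_protein("MKT-AV--LL")          # toy aligned sequence
vectors = prot_embedding(tokens.unsqueeze(0))    # shape: (1, 1900, 32)
```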
Ligand
Consider a ligand \(C=\{{c}_{1},{c}_{2},\dots ,{c}_{m}\}\), the SMILES string of a ligand, with m the length of the string and \({c}_{i}\) the i-th character of the SMILES. Tokenization yields \(TC=\{{t}_{1},{t}_{2},\dots ,{t}_{n}\}\), where each \({t}_{i}=T({c}_{i})\in {[D]}^{t}\) is the token corresponding to \({c}_{i}\).
Here \({t}_{n}\) is the length of the tokenized SMILES string and \({[D]}^{t}\) represents the complete token vocabulary for SMILES, a dictionary of 575 characters that also includes \({TOKEN}_{sp}\). We then embedded each \({t}_{i}\in {[D]}^{t}\) into a \({d}_{l}\)-dimensional vector via an embedding layer. We also utilized positional embeddings alongside a class token (agonist or antagonist), as shown in Fig. 2, which was embedded into \({d}_{c}\) dimensions matching \({d}_{l}\) and \({d}_{p}\); both \({d}_{p}\) and \({d}_{l}\) were set to 32. Positional embedding was applied to all sequences, while the class-label embedding was concatenated to the \({d}_{l}\) and \({d}_{p}\) representations.
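The following sketch illustrates the ligand-side embedding described above, assuming a prepend-style class token and learned positional embeddings; the actual SMILES tokenization into the 575-symbol vocabulary and the exact concatenation scheme may differ from this simplification.

```python
import torch
import torch.nn as nn

class LigandEmbedder(nn.Module):
    """Sketch of the ligand-side embedding: token + positional embeddings plus a
    class token for the requested activity type (0 = agonist, 1 = antagonist).
    The real model uses a 575-token SMILES vocabulary; the prepend-style class
    token and maximum length here are illustrative assumptions."""

    def __init__(self, vocab_size: int = 575, d_l: int = 32, max_len: int = 256):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_l, padding_idx=0)
        self.pos_emb = nn.Embedding(max_len + 1, d_l)   # +1 for the class-token slot
        self.cls_emb = nn.Embedding(2, d_l)             # agonist / antagonist state

    def forward(self, token_ids: torch.Tensor, state: torch.Tensor) -> torch.Tensor:
        # token_ids: (batch, seq_len), state: (batch,)
        batch, seq_len = token_ids.shape
        cls = self.cls_emb(state).unsqueeze(1)                  # (batch, 1, d_l)
        x = torch.cat([cls, self.tok_emb(token_ids)], dim=1)    # prepend class token
        pos = torch.arange(seq_len + 1, device=token_ids.device)
        return x + self.pos_emb(pos)                            # add positional embedding

emb = LigandEmbedder()
ids = torch.randint(1, 575, (2, 64))
out = emb(ids, torch.tensor([0, 1]))                            # shape: (2, 65, 32)
```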
Schematic Representation of the AiGPro Architecture. The diagram illustrates the proposed framework, which includes four modules: (1) tokenizing and embedding the protein sequence from the MSA and the compound SMILES inputs, and data representation; (2) the Multi-Scale Context-Aggregation module, based on dilated convolution, to extract multiscale features from both the input protein sequence and the compound SMILES; (3) the bidirectional multi-head cross-attention (BMCA) for intermolecular features between the protein and ligand; and (4) the output module to predict the unknown interaction in a drug–target pair, which can address classification and regression tasks based on user consideration
Molecular feature encoding
Following [23], we calculated a 170-dimensional molecular descriptor vector to extract relevant features for evaluating the physicochemical attributes of chemical compounds using RDKit [46]. These descriptors include Lipinski parameters, topological/topochemical descriptors of molecules, atom-based LogP and molar refractivity (MR), hybrid EState-VSA descriptors analogous to MOE van der Waals Surface Area (VSA) descriptors, QED descriptors, and basic EState descriptors, among others. We also added a Gasteiger charge descriptor, a 512-dimensional vector that captures the charge distribution across all constituent atoms within the compound. Integrating these molecular features allows us to assess properties spanning diverse physicochemical domains of the molecules and provides valuable insights into the compound's overall charge distribution, enhancing our understanding of its inherent characteristics.
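A hedged sketch of this featurization with RDKit is shown below; it computes only a small, illustrative subset of the 170 descriptors and assumes a simple zero-padding scheme for the 512-dimensional Gasteiger charge vector.

```python
import numpy as np
from rdkit import Chem
from rdkit.Chem import Descriptors, QED, rdPartialCharges

def molecular_features(smiles: str, charge_dim: int = 512) -> np.ndarray:
    """Illustrative subset of the descriptor vector: a few RDKit physicochemical
    descriptors plus a fixed-length Gasteiger charge vector. The full 170-descriptor
    set used in the paper is not reproduced here, and the padding scheme for the
    charge vector is an assumption."""
    mol = Chem.MolFromSmiles(smiles)
    if mol is None:
        raise ValueError(f"Unparsable SMILES: {smiles}")

    desc = [
        Descriptors.MolWt(mol), Descriptors.MolLogP(mol), Descriptors.MolMR(mol),
        Descriptors.NumHDonors(mol), Descriptors.NumHAcceptors(mol),
        Descriptors.TPSA(mol), QED.qed(mol),
    ]

    # Per-atom Gasteiger partial charges, zero-padded/truncated to a fixed length
    rdPartialCharges.ComputeGasteigerCharges(mol)
    charges = [atom.GetDoubleProp("_GasteigerCharge") for atom in mol.GetAtoms()]
    charges = (charges + [0.0] * charge_dim)[:charge_dim]

    return np.array(desc + charges, dtype=np.float32)

features = molecular_features("CC(=O)Oc1ccccc1C(=O)O")  # aspirin
```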
AiGPro architecture
A schematic overview of the proposed multi-task model, AiGPro, is shown in Fig. 2. The model can be divided into the following parts: the input data representations, the multi-scale context aggregation (MSCA), the bi-directional multi-head cross-attention (BMCA), and the final output block for the prediction outputs. The MSCA block uses dilated convolution to expand its receptive field without compromising resolution or coverage, extracting short- and long-distance interaction information for the BMCA, which learns to extract meaningful interrelationships between distant atoms or residues. We used a setup for MSCA and multi-head attention (MHA) similar to that described in [23].
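The sketch below illustrates the idea of multi-scale context aggregation using parallel dilated 1D convolutions; the dilation rates, kernel size, and channel widths are illustrative assumptions rather than the exact AiGPro settings.

```python
import torch
import torch.nn as nn

class MultiScaleContextAggregation(nn.Module):
    """Minimal sketch of an MSCA-style block: parallel 1D convolutions with
    increasing dilation rates aggregate short- and long-range context without
    reducing sequence resolution. The dilation rates and widths are
    illustrative, not the exact AiGPro configuration."""

    def __init__(self, d_model: int = 32, dilations=(1, 2, 4, 8)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Conv1d(d_model, d_model, kernel_size=3, dilation=d, padding=d)
            for d in dilations
        ])
        self.proj = nn.Conv1d(d_model * len(dilations), d_model, kernel_size=1)
        self.act = nn.LeakyReLU(0.01)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model) -> Conv1d expects (batch, channels, seq_len)
        h = x.transpose(1, 2)
        multi_scale = torch.cat([self.act(branch(h)) for branch in self.branches], dim=1)
        return self.proj(multi_scale).transpose(1, 2)   # back to (batch, seq_len, d_model)

msca = MultiScaleContextAggregation()
out = msca(torch.randn(2, 1900, 32))                    # (2, 1900, 32)
```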
AiGPro builds upon the scaled dot-product attention mechanism introduced by Vaswani et al., a powerful method for calculating the connections and weighted sums between different elements in a given sequence [47]. MHA relies on self-attention, comprising multiple layers, followed by an FFN. In the architecture, MHA layers are integral, each composed of multiple attention heads. These layers leverage scaled dot-product attention, requiring query (Q), key (K), and value (V) matrices. The corresponding projection matrices, \({W}_{i}^{Q}\in {\mathbb{R}}^{{d}_{model}\times {d}_{k}}\), \({W}_{i}^{K}\in {\mathbb{R}}^{{d}_{model}\times {d}_{k}}\), and \({W}_{i}^{V}\in {\mathbb{R}}^{{d}_{model}\times {d}_{v}}\), are learnable weight matrices. Here, Q = K = V is the input protein or ligand representation for the MHA.
We applied the multi-head self-attention mechanism h times, utilizing distinct linear projections to enhance performance. The MHA computes the scaled dot-product attention in parallel on the projected queries, keys, and values, producing output values of \({d}_{model}/h\) dimensions:
\(\text{Attention}(Q,K,V)=\text{softmax}\left(\frac{Q{K}^{T}}{\sqrt{{d}_{k}}}\right)V\)
\(\text{MultiHead}(Q,K,V)=\text{Concat}({\text{head}}_{1},\dots ,{\text{head}}_{h}){W}^{T}\)
where \({W}^{T}\in {\mathbb{R}}^{h{d}_{v}\times {d}_{model}}\) is a learnable weight matrix and \(\frac{1}{\sqrt{{d}_{k}}}\) is the scale factor. The output of the MHA is then fed into the feed-forward layers, where \({R}_{intra}\) is the learned representation.
Bi-directional multi-head cross-attention module (BMCA)
Figure 2 shows how the intra-molecular features and relationships between elements are learned: the DCE for the protein and the ligand outputs \(({R}_{intra}^{p})^{{d}_{model}}\) and \(({R}_{intra}^{l})^{{d}_{model}}\), respectively. However, the information on the intermolecular dependency between the protein and ligand is still missing. Thus, within the BMCA, ligand and protein features undergo successive processing through the intermolecular bi-directional cross-attention layers, yielding multimodal data-augmentation features specific to ligands and proteins as well as combined intermolecular features. BMCA is built upon MHA with \({h}_{cross}\) attention heads and takes the learned representations \(({R}_{intra}^{p})^{{d}_{model}}\) and \(({R}_{intra}^{l})^{{d}_{model}}\) as input.
In BMCA, for the protein query, \({Q}_{forward}=({R}_{intra}^{p})^{{d}_{model}}\) and \({K}_{forward}={V}_{forward}=({R}_{intra}^{l})^{{d}_{model}}\); similarly, for the ligand query, \({Q}_{backward}=({R}_{intra}^{l})^{{d}_{model}}\) and \({K}_{backward}={V}_{backward}=({R}_{intra}^{p})^{{d}_{model}}\). The BMCA outputs \({R}_{inter}^{p}\in {\mathbb{R}}^{a\times {d}_{model}}\) and \({R}_{inter}^{l}\in {\mathbb{R}}^{a\times {d}_{model}}\), the learned representations of the intermolecular features, which are combined to form the final intermolecular representation (see the sketch below).
Since BMCA is based on an attention mechanism, it comprises two such layers (x = 2), each followed by a residual connection and layer normalization. To counter overfitting, dropout layers are inserted after each computational layer, stochastically deactivating hidden-unit activations to enhance model generalization beyond the training set.
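A minimal PyTorch sketch of one such bi-directional cross-attention layer is given below; the head count, dropout rate, and the pooling-and-concatenation fusion at the end are assumptions for illustration only.

```python
import torch
import torch.nn as nn

class BMCALayer(nn.Module):
    """Sketch of one bi-directional cross-attention layer: the protein
    representation attends over the ligand (forward) and the ligand over the
    protein (backward), each followed by a residual connection, layer
    normalization, and dropout. Head count and the fusion step are illustrative."""

    def __init__(self, d_model: int = 32, n_heads: int = 4, dropout: float = 0.1):
        super().__init__()
        self.fwd_attn = nn.MultiheadAttention(d_model, n_heads, dropout=dropout, batch_first=True)
        self.bwd_attn = nn.MultiheadAttention(d_model, n_heads, dropout=dropout, batch_first=True)
        self.norm_p = nn.LayerNorm(d_model)
        self.norm_l = nn.LayerNorm(d_model)
        self.drop = nn.Dropout(dropout)

    def forward(self, r_protein: torch.Tensor, r_ligand: torch.Tensor):
        # Forward direction: protein queries, ligand keys/values
        p_ctx, _ = self.fwd_attn(r_protein, r_ligand, r_ligand)
        r_inter_p = self.norm_p(r_protein + self.drop(p_ctx))
        # Backward direction: ligand queries, protein keys/values
        l_ctx, _ = self.bwd_attn(r_ligand, r_protein, r_protein)
        r_inter_l = self.norm_l(r_ligand + self.drop(l_ctx))
        return r_inter_p, r_inter_l

layer = BMCALayer()
p = torch.randn(2, 1900, 32)   # R_intra^p
l = torch.randn(2, 65, 32)     # R_intra^l
p_out, l_out = layer(p, l)

# The paper stacks x = 2 such layers; the intermolecular features are then
# combined (here, pooled and concatenated as an assumed fusion) into the final representation.
combined = torch.cat([p_out.mean(dim=1), l_out.mean(dim=1)], dim=-1)   # (2, 64)
```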
The molecular features of the compounds are normalized and projected to a hidden state, MF, using two projection layers. For the i-th projection layer, \(i\in [1,2]\), \({h}_{i}^{c}\) is the output vector of layer i and \({W}_{i}^{c}\in {\mathbb{R}}^{{d}_{i-1}\times {d}_{i}}\) is a learnable weight matrix, so that \({h}_{i}^{c}=\phi ({W}_{i}^{c}{h}_{i-1}^{c})\), where \({h}_{0}^{c}\) is the normalized molecular feature vector, \(\phi\) is the nonlinearity, and \(MF={h}_{2}^{c}\).
The final context aggregation block merges the representations obtained from the BMCA, the backbone, and the projected molecular features, as shown in Fig. 2. Additionally, we added the class embedding. This captures local and global information for inter- and intra-molecular interactions, which helps refine the representation for downstream tasks. The merged representation is then passed through the final DCE to obtain a compressed, combined global representation, which is passed on to the output block for the final prediction.
Output block
The output block comprises a multi-layer perceptron (MLP) consisting of three fully connected neural network (FCN) layers. Each FCN layer except the last uses a Leaky Rectified Linear Unit (Leaky-ReLU) activation function with a negative slope of 0.01, followed by a dropout layer to mitigate overfitting. The output, \(pBA\), is the predicted bioactivity value between the protein and the ligand.
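A minimal sketch of such an output block is shown below; the hidden widths, input dimension, and dropout rate are illustrative assumptions.

```python
import torch.nn as nn

# Minimal sketch of the output block: a three-layer MLP in which every layer
# except the last uses Leaky-ReLU (negative slope 0.01) followed by dropout.
# The input width (96) and hidden widths (256, 128) are illustrative assumptions.
output_block = nn.Sequential(
    nn.Linear(96, 256),  nn.LeakyReLU(0.01), nn.Dropout(0.2),
    nn.Linear(256, 128), nn.LeakyReLU(0.01), nn.Dropout(0.2),
    nn.Linear(128, 1),   # predicted pBA value
)
```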
Model implementation and training detail
The model was developed and implemented using PyTorch and Python 3.11. It was trained on an NVIDIA RTX 4090 (24 GB) GPU with CUDA 11.7, using the AdamW optimizer with a learning rate of 0.003 and a weight decay of 0.001. Dropout and L2 regularization techniques were applied to prevent overfitting, and overfitting was checked on validation data every 10 epochs. Mixed precision and an early-stopping strategy were utilized to optimize the training process. See Table 1 for more details.
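The following sketch mirrors the reported optimizer settings (AdamW, learning rate 0.003, weight decay 0.001) with mixed-precision training; the dummy model, loss choice, and scheduler/early-stopping details are placeholders, not the actual AiGPro training code.

```python
import torch
import torch.nn as nn

# AdamW with the reported hyperparameters; the tiny linear model stands in for
# the full AiGPro network and the MSE loss is an assumption for regression.
model = nn.Linear(128, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-3, weight_decay=1e-3)
scaler = torch.cuda.amp.GradScaler(enabled=torch.cuda.is_available())
criterion = nn.MSELoss()

def train_step(x: torch.Tensor, y: torch.Tensor) -> float:
    optimizer.zero_grad()
    with torch.cuda.amp.autocast(enabled=torch.cuda.is_available()):
        loss = criterion(model(x).squeeze(-1), y)   # mixed-precision forward pass
    scaler.scale(loss).backward()                   # scaled backward pass
    scaler.step(optimizer)
    scaler.update()
    return loss.item()
```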
Evaluation metrics
In this study, several evaluation metrics were computed to assess the model's performance on the test set and to facilitate a comparison of its predictive power. We used Pearson's correlation coefficient (CC), the mean squared error (MSE), and the coefficient of determination (\({R}^{2}\)) to evaluate the model's predictions.
For model assessment, we computed the concordance index (CI), which measures the concordance probability between the experimental and predicted values. CI can be defined as
\(CI=\frac{1}{Z}\sum_{{\delta }_{i}>{\delta }_{j}}h({m}_{i}-{m}_{j})\)
where \({\delta }_{i}\) and \({m}_{i}\) represent the experimental and predicted values for the i-th data point, and Z is the normalization constant (the number of comparable pairs). For a pair with the greater affinity \({\delta }_{i}\) and the smaller affinity \({\delta }_{j}\), the corresponding predictions are \({m}_{i}\) and \({m}_{j}\), respectively. The step function h(x) is defined as
\(h(x)=\left\{\begin{array}{ll}1, & x>0\\ 0.5, & x=0\\ 0, & x<0\end{array}\right.\)
The CI values range from 0 to 1, where 1 signifies the optimal outcome.
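For reference, a direct (O(n²)) implementation of this definition is sketched below.

```python
import numpy as np

def concordance_index(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Concordance index as defined above: over all pairs with different
    experimental affinities, count 1 when the predictions are correctly
    ordered and 0.5 on ties, normalized by the number of comparable pairs (Z)."""
    score, z = 0.0, 0
    n = len(y_true)
    for i in range(n):
        for j in range(n):
            if y_true[i] > y_true[j]:                 # delta_i > delta_j
                z += 1
                diff = y_pred[i] - y_pred[j]          # m_i - m_j
                score += 1.0 if diff > 0 else (0.5 if diff == 0 else 0.0)
    return score / z if z > 0 else 0.0

ci = concordance_index(np.array([7.2, 6.1, 8.0]), np.array([7.0, 6.5, 7.9]))  # = 1.0
```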
Furthermore, we utilized the Matthews correlation coefficient (MCC), a robust statistical metric well suited for evaluating models on binary classification [48]. In addition to MCC, we also employed the area under the receiver operating characteristic curve (AUC-ROC) and Cohen's kappa to comprehensively evaluate the performance of our models in classification tasks. To ensure a thorough assessment, we conducted a stratified K-fold cross-validation (K = 10) to confirm the usability, reliability, and generalizability of AiGPro.
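These classification metrics and the stratified split can be computed with scikit-learn, as in the illustrative snippet below (dummy labels and scores only).

```python
import numpy as np
from sklearn.metrics import matthews_corrcoef, roc_auc_score, cohen_kappa_score
from sklearn.model_selection import StratifiedKFold

# Dummy labels/scores purely for illustration of the metric calls
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_score = np.array([0.9, 0.2, 0.7, 0.6, 0.4, 0.1, 0.8, 0.3])
y_pred = (y_score >= 0.5).astype(int)

mcc = matthews_corrcoef(y_true, y_pred)
auc = roc_auc_score(y_true, y_score)
kappa = cohen_kappa_score(y_true, y_pred)

# Stratified K-fold (K = 10) preserves the class ratio in every fold
skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=42)
# for train_idx, val_idx in skf.split(features, labels): ...
```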
Web server implementations and deployment
To provide an accessible end-to-end solution, we have deployed AiGPro as a web platform using FastAPI and Nginx as the backend and reverse proxy server for load balancing. This reduces the difficulty for users without a computational background to test the model without downloading and installing anything. The User Interface (UI) is developed using React JSX, Vite, and Tailwind CSS frameworks. The predicted target activity value table is presented using React DataTables, while interactive plots and figures are generated using Plotly.js and the D3 library.
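A minimal backend sketch in the spirit of this deployment is shown below; the endpoint path, request schema, and the predict_profile helper are hypothetical and do not describe the actual AiGPro API.

```python
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI(title="AiGPro sketch")

class Query(BaseModel):
    smiles: str

def predict_profile(smiles: str) -> list[dict]:
    """Hypothetical stand-in for model inference; returns dummy values here."""
    return [{"target": "P29274", "agonist_pEC50": 5.0, "antagonist_pIC50": 6.0}]

@app.post("/predict")
def predict(query: Query):
    # In the deployed service, the trained model would profile the compound
    # against all 231 supported GPCRs for both activity types.
    return {"smiles": query.smiles, "profile": predict_profile(query.smiles)}
```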
Real-world application test: a case study on Alzheimer’s disease (AD)
As proof of concept and to test the applicability domain, we tested our model and its limitations in a real-world application. Our focus was on AD, so we curated data on GPCRs involved in the disease that were not in the training dataset to use as an external test set. This dataset consists of 4895 unique ligands, which form 6050 GPCR-ligand pairs, of which 5508 are antagonist interactions and 542 are agonist bioactivity data points. The dataset contains 8803 unique ligands interacting with four GPCRs: Adenosine receptor A2a (P29274), Muscarinic acetylcholine receptor M1 (P11229), Muscarinic acetylcholine receptor M3 (P20309), and Muscarinic acetylcholine receptor M2 (P08172). These proteins are known to play a role in AD and are dysregulated in cognition-related brain regions of AD patients [33, 34]. Some of these GPCRs have garnered significant interest due to numerous studies supporting them as credible targets for repurposing existing drugs or for designing and discovering new drugs with clinical potential [49].
The adenosine receptor A2a interacts with 2,662 unique ligands, forming 2,695 interactions. The Muscarinic acetylcholine receptor M1 interacts with 1,265 unique ligands, forming 1,284 interactions. The Muscarinic acetylcholine receptor M3 interacts with 1,078 unique ligands, forming 1,084 interactions. The Muscarinic acetylcholine receptor M2 interacts with 982 ligands, forming 987 interactions. See Table 5 for more details on the AD test dataset.
Results and discussion
Model development
To develop robust prediction models for orphan GPCRs, we curated a comprehensive dataset consisting of 98,391 agonistic ligands, 165,639 antagonistic ligands, and 12,153 dual agonist-antagonistic ligands, covering 231 GPCRs as shown in Fig. 1. Of these GPCRs, 43 had only agonistic ligand interactions, 16 proteins had exclusively antagonistic ligand activities, and 172 GPCRs with at least one ligand exhibited both agonistic and antagonistic activities. This dataset was extracted from multiple publicly available databases. GPCRs are complex because they can adopt different conformational states, resulting in a single ligand exhibiting different activities (e.g., agonist, antagonist, or both). Consequently, traditional single-task or multi-task models, which predict only bioactivity value, are insufficient for distinguishing between these states, making them unsuitable for profiling applications.
In our earlier work, AiKPro [22], we utilized one-hot encoding with svMSA for protein representation and 3D ensemble features for ligands, successfully capturing their structural information for bioactivity prediction in kinases. While this approach yielded accurate results, the requirement for computationally expensive 3D ensemble features for ligands was a limitation. Subsequently, in KinScan [23], we advanced this methodology by employing embedding-based representations with MSCA and DCE. This approach combined dilated convolutions with data-specific feature engineering for svMSA. By doing so, we eliminated the need for 3D ensemble ligand features while achieving predictive performance superior to our previous and other existing models for kinases.
For the current work on GPCRs, we adopted a similar foundational approach from KinScan while addressing their inherent complexities. GPCRs are membrane proteins with multi-state interactions, which cause them to exhibit different bioactivities for the same ligand under varying conditions.
To tackle this, we developed BMCA, a novel methodology that captures structural information tailored to the required output. BMCA enables our model to predict bioactivity values for both agonist and antagonist dynamically. For instance, if the input specifies agonistic activity, the model predicts a value distinct from that of antagonistic activity, even for the same ligand. This state-dependent prediction capability makes BMCA highly suited for modeling the complex, multi-state interactions characteristic of GPCRs.
Performance evaluation
Accurately predicting the binding affinity between proteins and compounds is crucial in drug discovery to differentiate between meaningful interactions and those with secondary targets, also known as off-targets. GPCRs are one of the most important targets, and many drugs target them. However, existing models only cover a single target or a small number of GPCRs because the complexity of GPCRs, being membrane proteins, limits the availability of high-quality data. To overcome this issue, an effort has been made to combine multiple ML models, creating an ensemble model to predict GPCR bioactivity values [40, 41]. Even though this approach adds integration and computation complexity, limitations remain in generality, accuracy, and applicability to broad GPCRs for large-scale profiling.
In this regard, we initially developed two separate models, AiG-ANT and AiG-AGO, to predict the bioactivity of antagonists and agonists against GPCRs. We trained these models on separate datasets comprising 183,466 antagonist and 229,312 agonist instances and evaluated them using distinct test sets for antagonist and agonist samples, respectively. We extensively evaluated the models on test data to ensure their reliability in real-world scenarios and to demonstrate their strong generalization to unseen compounds. As shown in Table 3, the AiG-ANT model performed well on the independent antagonist test set, with an \({R}^{2}\) value of 0.773 and a corresponding CC of 0.879 for antagonist bioactivity predictions. The AiG-AGO model also showed promising results, with an \({R}^{2}\) of 0.719 and a CC of 0.853 for agonist bioactivity predictions on the independent agonist dataset.
However, a discrepancy in performance between agonist and antagonist evaluations was evident. This was due to the limited range of EC50 values for agonists: around 90% of all agonist instances had pEC50 values between 4 and 5. In contrast, the antagonist data were more normally distributed, with a standard deviation of about 1.39, compared to the narrower spread of 1.04 for the agonist dataset. To address this issue, we developed a multi-task model, AiGPro, and trained it on a combined dataset of agonist and antagonist samples. AiGPro showed superior performance to the single-task models, with an \({R}^{2}\) of 0.829 and a CC of 0.912, surpassing the individual single-task models. This approach improved performance significantly, by over 7–16%, and allowed us to integrate the bioactivity categories seamlessly. A similar trend of increased performance on combined datasets relative to single ones was also observed in previous research [41]. This can be attributed to the enhanced ability of DL models to exploit larger volumes of data and the use of conditional labeling to facilitate better fitting to the data distributions.
Furthermore, to mitigate concerns regarding overfitting, we conducted a rigorous tenfold stratified CV, as shown in Fig. 3, and evaluated performance on an independent test set, yielding similar results, as shown in Table 3. Thus, our framework presents a versatile, general, and innovative approach to exploring the intricate mechanisms underlying agonistic and antagonistic ligand interactions in GPCR systems. The optimal settings used to train AiGPro depend on various parameters, such as the embedding size, the number of heads in the MHA, the number of layers in the MSCA and BMCA, the number of epochs, the dropout rate, the learning rate, and so on. These parameters are crucial for determining the performance of AiGPro and were determined based on KinScan and hyperparameter searches. For more detailed specifications of these parameter settings, see Table 1 and Additional File 2.
Comparison with existing options
To our knowledge, AiGPro is the first multi-task neural network based on the transformer's attention mechanism that can accurately predict bioactivity values, i.e., antagonist IC50 and agonist EC50, of small molecules to profile them against 231 GPCRs. We found that pdCSM-GPCR, a graph-based model, is the closest to our model in applicability; however, it is limited to only 36 GPCRs, significantly restricting its applicability domain.
We compared the performance of AiGPro against pdCSM-GPCR for predicting ligand activity using pdCSM-GPCR's test dataset. AiGPro's capability to predict both agonist and antagonist activity values, as shown in Table 3, is a crucial consideration for successful therapeutic development efforts, especially in GPCR-related drug discovery. A significant limitation of existing models, including pdCSM-GPCR, is their inability to distinguish such crucial information. Since this test dataset does not contain an activity-type label, we treated it as an outlier dataset for AiGPro, which poses a significant challenge for accurately predicting activity values. We predicted both activity types and considered only the lowest value in the metric comparison against pdCSM-GPCR.
The results, summarized in Table 2 and Additional file 4 (Figures S1 and S2), show that AiGPro performed well against pdCSM-GPCR for a large number of GPCRs, with MSE ranging from as low as 0.01 for Q99835 up to 2.2. However, we observed that AiGPro performed relatively worse for some GPCRs, such as Q14833 and P30968, with MSE values as high as 2, even though these MSE values are lower than those of pdCSM-GPCR in some cases.
Further analysis was conducted on proteins, as presented in Table S1; we observed that the imbalance in the dataset ratio between agonists and antagonists likely contributed to the higher MSE values for two of the proteins. Interestingly, despite having a balanced dataset, the protein associated with UniProt ID Q9HC97 also exhibited poor performance. A deeper investigation revealed that most data for Q9HC97 consisted of log activity values lower than 5, suggesting that a large proportion of inactive data can negatively impact the model’s predictive accuracy.
This study highlights the unique strengths and limitations of AiGPro and pdCSM-GPCR in predicting ligand activity for class A GPCRs. While pdCSM-GPCR shows some strengths in specialized scenarios, AiGPro’s broader applicability and generalizability across a more comprehensive range of GPCRs make it a promising tool for advancing GPCR-targeted drug discovery. However, the study also underscores the need for models that can accurately distinguish between different types of ligand activities, an area that remains critical for the field.
Wei-Cheng et al. recently published models that were trained on a dataset of 200 GPCRs using EC50 data to predict agonist and antagonist activity values with single-task (STL-AG) and multitask (MTL) models [41]. However, there are concerns about potential biases resulting from merging training and validation datasets, particularly in the MTL training of models. This is an essential difference from the methodology used by AiGPro, which does not incorporate such merging, resulting in a more robust and impartial evaluation framework.
As shown in Tables 2 and 3, our single-task models, AiG-ANT and AiG-AGO, demonstrated strong predictive performance, with CC values of up to 0.879 for antagonists and 0.853 for agonists. In contrast, the best-performing STL and MTL models, even when integrating the training and validation data within the multitask framework (MTL-AG-ATG) and using various feature combinations, including additional mol2Vec (M2V) feature vectors, achieved only slight improvements, with CC values rising from 0.80 to 0.85, which is lower than most of the AiG models except AiG-AGO-B and AiG-ANT-B. AiGPro, however, stands out as the best performer, exceeding these ensemble models with a remarkable CC of 0.913 and an \({R}^{2}\) of 0.833 on the test set. These results suggest that our novel multi-task attention-based bidirectional model can learn complex relationships between GPCRs and ligands. The disparities in MSE and MAE across models in Table 3 underscore the inherent scale dependency of these metrics, necessitating careful consideration during comparative analyses.
Furthermore, we evaluated the model's efficacy in identifying active and inactive ligands, defining active ligands as those with a potency of less than 100 nM. As depicted in Fig. 4A, B, AiGPro exhibited robust overall performance, with a slight decrease for agonists compared to antagonists. This discrepancy may be attributed to the relatively smaller number of active ligands in the training dataset (see Fig. 5), which influences the model's ability to generalize effectively to this category.
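For clarity, the 100 nM potency cutoff corresponds to pBA > 7 on the negative-log scale (since -log10(100 × 10⁻⁹) = 7), as in the short labeling sketch below.

```python
import numpy as np

def label_active(pba_values: np.ndarray, threshold_nm: float = 100.0) -> np.ndarray:
    """Label ligands as active (1) when their potency is better than the cutoff,
    i.e., when pBA exceeds the negative log of the cutoff in molar units."""
    pba_cutoff = -np.log10(threshold_nm * 1e-9)     # = 7.0 for 100 nM
    return (pba_values > pba_cutoff).astype(int)

labels = label_active(np.array([6.2, 7.5, 8.1]))    # -> [0, 1, 1]
```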
Overall, this demonstrates that the AiGPro has a balanced capability for generalization and accuracy in predicting with a broad applicability domain, enabling large-scale high throughput screening for GPCR ligands.
Ablation study
In this study, we aimed to evaluate the importance and efficiency of different components of the AiGPro design for extracting meaningful information that can help make accurate bioactivity predictions. To achieve this, we use the same datasets for training and testing and conduct ablation experiments to understand the contribution of each component, such as MSCA, DCE, molecular features, and BMCA. Although the importance of some of these components has been highlighted in previous studies [23], BMCA is a new addition that requires a dedicated examination of its efficacy and relevance. Thus, we conducted ablation experiments to assess the impact of the BMCA module on the AiGPro model's performance.
As shown in Table 4, the removal of BMCA had a substantially varied influence on the predictive capabilities of the single- and multi-task models. The single-task models without BMCA, namely AiG-AGO and AiG-ANT, performed very well; however, adding BMCA to these models (AiG-ANT-B and AiG-AGO-B) led to a significant reduction in performance, with CC dropping to 0.829 and 0.794 from 0.879 and 0.853 for the ANT and AGO models, respectively. These single-task models were based on a previous study whose design was well suited for predicting bioactivity, and this held in the current study as well. Nevertheless, the absence of the BMCA in the multi-task model resulted in a significant decrease in performance, with a CC of only 0.759, approximately 5% lower than the weakest single-task model, AiG-AGO-B. However, including the BMCA led to a substantial improvement in performance, surpassing even the strongest single-task models and achieving a CC of 0.912.
As a result, the single-task model cannot take advantage of BMCA, and these models perform inferiorly with the proposed architecture. Overall, our ablation experiments provide compelling evidence supporting the significance of the BMCA module within the AiGPro architecture. By elucidating its critical role in information extraction and predictive accuracy, our study contributes valuable insights into advancing computational methodologies for bioactivity prediction in drug discovery and development using a multi-task model.
Applicability test on Alzheimer's related proteins
To verify the practical applicability of the model, we conducted a case study on GPCRs implicated in AD, namely Adenosine receptor A2a, Muscarinic acetylcholine receptor (mAChR) M1, mAChR M2, and mAChR M3. The A2A adenosine receptor, a vital member of the P1 purinergic receptor family, significantly influences the pathophysiology of various neurodegenerative disorders, including AD. Its regulatory effects on neurons and glial cells modulate synaptic transmission and neuroinflammation. Notably, the A2A receptor is the most extensively studied adenosine subtype concerning its effects on neurodegenerative diseases and the availability of selective receptor antagonists currently undergoing clinical evaluation.
Likewise, the involvement of mAChR M1, M2, and M3 in AD is well-documented, with several ongoing clinical investigations [49]. Notably, the M1 subtype has witnessed the development of orthosteric ligands like xanomeline and, recently, HTL9936, progressing from preclinical models to human trials. While allosteric ligands for M1-mAChR are in early developmental stages, promising data from preclinical studies underscore their potential efficacy [49].
Experimental evidence underscores the crucial role of M1-mAChR in cognitive function, supported by studies demonstrating cognitive deficits upon genetic ablation or pharmacological inhibition of M1-mAChR signaling in rodents. Conversely, activation of M1-mAChR has been shown to ameliorate learning and memory deficits in preclinical models of neurodegeneration and human patients with central nervous system disorders such as schizophrenia [49,50,51].
The M2-mAChR subtype exhibits widespread expression throughout crucial brain regions involved in cognition, and its antagonism has shown potential in rescuing cognitive deficits in neurodegeneration in rodent models [52]. In contrast, the M3-mAChR subtype exhibits the lowest expression levels in the central nervous system, primarily localized in the hypothalamus. While its precise role remains unclear, studies using knockout and phospho-deficient knockin mice suggest a potential involvement of M3-mAChR in cognitive function [53].
We evaluated the predictive capabilities of AiGPro in comparison to existing models, such as pdCSM-GPCR, and general methodologies like Directed Message Passing Neural Network (D-MPNN) models implemented in Chemprop [54]. For Chemprop/D-MPNN, we trained multiple models: two single-task models (one for agonists and another for antagonists) and a multi-task model for both activities, using the same dataset. These models were then tested on the Alzheimer’s dataset, as shown in Table 5, which included ligands with both agonistic and antagonistic activities, providing challenges to the models like pdCSM, which do not differentiate between these activities. Notably, only one of the four GPCRs analyzed falls within pdCSM-GPCR’s scope.
As shown in Table 6, Figure S3, and Figure S4, AiGPro outperformed other predictive models across various metrics. However, there were instances where Chemprop models delivered comparable or slightly superior performance. For protein P20309, AiGPro achieved the highest \({R}^{2}\) of 0.865 and the lowest MSE of 0.417 for all ligands. Chemprop(Multi) and Chemprop(Anta) exhibited \({R}^{2}\) values close to AiGPro's, at 0.858 and 0.863, respectively; however, their MSE values were slightly higher, reflecting less precise predictions. Similarly, for P08172, Chemprop(Anta) and Chemprop(Multi) achieved comparable or marginally better \({R}^{2}\) values of 0.790 and 0.813, respectively, compared to AiGPro's 0.805. Additionally, their MSE values for all ligands were close to AiGPro's 0.465, with Chemprop(Multi) slightly outperforming AiGPro. For P29274, Chemprop(Anta) and Chemprop(Multi) performed marginally better than AiGPro in \({R}^{2}\) and MSE. In contrast, AiGPro outperformed all other models for P11229, achieving an \({R}^{2}\) of 0.712 compared to Chemprop(Anta) and Chemprop(Multi), which scored 0.630 and 0.653, respectively.
One of the most notable trends is observed in Chemprop's performance for agonist and antagonist predictions. While the Chemprop models, particularly Chemprop(Multi) and Chemprop(Anta), demonstrated competitive performance for antagonist activity prediction, all Chemprop models failed for agonistic activity prediction. Negative \({R}^{2}\) values, such as -1.229 for P29274 and -0.861 for P08172, highlight Chemprop(Ago)'s inability to generalize or make meaningful predictions for agonists. This failure was further corroborated by higher MSE values and lower CI scores for agonists across all tested proteins, underscoring a critical limitation in Chemprop's generalizability. In contrast, AiGPro demonstrated robust performance across both agonists and antagonists, consistently delivering high predictive accuracy and reliability for all proteins, even on skewed datasets such as P08172, and outperforming the Chemprop models in overall generalization. Likewise, pdCSM-GPCR's inability to distinguish agonists from antagonists underscores its lack of versatility, whereas AiGPro's ability to maintain high performance across diverse ligand types makes it a versatile tool for large-scale GPCR profiling. These results emphasize the limitations of existing models in handling mixed-activity datasets and underscore AiGPro's broader applicability and reliability in GPCR profiling.
Further, we also performed classification tests, the findings of which are presented in Fig. 6A, B. These findings highlight AiGPro's robust performance on classification tasks with novel datasets. Demonstrating notably high Cohen's kappa, ROC-AUC, and MCC values for both agonist and antagonist ligands, these results affirm the reliability of AiGPro and its robust performance in classification tasks. Such validation underscores its potential importance in advancing practical research and enhancing GPCR-targeted drug discovery efforts.
Limitations
While AiGPro demonstrates significant advancements in predicting both agonistic and antagonistic activities across GPCRs, it has certain limitations. The model’s performance relies heavily on the availability of high-quality training data. Despite our dataset being among the most comprehensive for GPCRs, it remains sparse for specific targets. This data sparsity particularly impacts the model’s generalizability for underrepresented GPCR families and ligands with rare activity profiles, with agonistic ligands notably underrepresented. Although our approach addresses some challenges associated with traditional multi-task models, data imbalance remains a significant limitation. Agonist data is considerably less abundant than antagonist data, as illustrated in Fig. 1. This imbalance has resulted in higher MSE values exceeding 1.00 for proteins like Q14833 and Q16602, as shown in Table 2, introducing biases that can affect prediction accuracy for these targets. Furthermore, as demonstrated in the case study, AiGPro may not be well-suited for single-target predictions and may underperform compared to single-task models specifically optimized for antagonistic activity.
Moreover, AiGPro does not fully account for experimental variations, such as receptor conformations from 3D structures, which can significantly influence bioactivity values. Although innovative data-processing techniques were applied to reduce inconsistencies, its capacity to model complex ligand behaviors, such as partial agonism or antagonism, is not yet fully explored, indicating potential areas for further development. These challenges underscore the need for the development of innovative techniques to enhance AiGPro's robustness and broader applicability.
AiGPro web service
We have developed a user-friendly web server, accessible at https://aicadd.ssu.ac.kr/AiGPro, to facilitate the utilization of the AiGPro models for individuals with limited coding expertise. See Fig. 7. This online platform enables users to submit a SMILES string representing their query compounds, generating a profile against 231 GPCRs. The computed results, presented as activity scores, are conveniently organized in a paginated table, with each page displaying 10 predictions encompassing both antagonist and agonist compounds, and can be downloaded in CSV file format for further analysis. The tool is designed to determine the nature of given small molecules, categorizing them as agonists, antagonists, or inactive compounds for GPCR proteins.
An overview of the AiGPro end-to-end web platform with a user-friendly interface for easily using the AiGPro model. A The main page of the AiGPro web server, showing the input for SMILES queries. B The resultant output, which includes agonist and antagonist values from the AiGPro prediction, can be downloaded for further analysis
The platform’s efficient processing speed and user-friendly interface make it invaluable for drug screening and design endeavors.
Conclusion
This study presents AiGPro, a novel bi-directional multi-head cross-attention model incorporating multi-scale context aggregation, leveraging the self-attention mechanism and dilated convolution. Our proposed framework facilitates the comprehensive exploration and learning of both intra- and intermolecular features of GPCRs and ligands, thereby enhancing generalizability for accurate prediction of bioactivity values for both agonist (EC50) and antagonist (IC50) activities.
GPCRs play a pivotal role in human pathophysiology, making them a prime target for drug discovery. However, the complexity of GPCRs and the scarcity of high-quality data have led to limited applicability of prior ML approaches. AiGPro overcomes these challenges, demonstrating exceptional performance and applicability domain across 231 GPCRs, thus establishing itself as the first-in-class method for GPCR profiling, setting new benchmarks in accuracy and efficacy for identifying and eliminating off-targets. This advancement holds promise for accelerating GPCR drug development by facilitating high throughput screening, compound evaluation, prioritization, and prediction of activity profiles.
Our results demonstrated that an innovative model could predict both agonist and antagonist bioactivity values of GPCR ligands with superior performance compared to complex ensemble models, eliminating the need for ensemble models. Further, we have developed and deployed an end-to-end platform accessible at https://aicadd.ssu.ac.kr/AiGPro, enabling convenient access to AiGPro models for the identification of off-targets against GPCRs, thereby offering scalable, rapid, and precise profiling of small molecules. The community can leverage the user-friendly web server AiGPro to enrich molecule libraries for screening purposes and facilitate rational GPCR ligand design.
Data availability
AiGPro predictive models have been made available via a freely accessible and easy-to-use web server at https://aicadd.ssu.ac.kr/AiGPro. The datasets used to develop the models can be accessed from GLASS (https://zhanggroup.org/GLASS/) and GPCRdb (https://gpcrdb.org/), the code is available at https://github.com/Chemoinfomatics/AiGPro, and model weights can be downloaded from https://aicadd.ssu.ac.kr/download. No datasets were generated or analysed during the current study.
References
Liu N, Wang Y, Li T, Feng X (2021) G-protein coupled receptors (GPCRs): signaling pathways, characterization, and functions in insect physiology and toxicology. Int J Mol Sci. https://doi.org/10.3390/IJMS22105260
Yang D, Zhou Q, Labroska V et al (2021) G protein-coupled receptors: structure- and function-based drug discovery. Signal Transduct Target Ther 1(6):1–27. https://doi.org/10.1038/s41392-020-00435-w
Wong TS, Li G, Li S et al (2023) G protein-coupled receptors in neurodegenerative diseases and psychiatric disorders. Signal Transduct Target Ther 1(8):1–57. https://doi.org/10.1038/s41392-023-01427-2
Velloso JPL, Kovacs AS, Pires DEV, Ascher DB (2024) AI-driven GPCR analysis, engineering, and targeting. Curr Opin Pharmacol 74:102427. https://doi.org/10.1016/J.COPH.2023.102427
Hauser AS, Attwood MM, Rask-Andersen M et al (2017) Trends in GPCR drug discovery: new agents, targets and indications. Nat Rev Drug Discov 12(16):829–842. https://doi.org/10.1038/nrd.2017.178
Sriram K, Insel PA (2018) G protein-coupled receptors as targets for approved drugs: how many targets and how many drugs? Mol Pharmacol 93:251–258. https://doi.org/10.1124/MOL.117.111062/-/DC1
Fribourg M, Moreno JL, Holloway T et al (2011) Decoding the signaling of a GPCR heteromeric complex reveals a unifying mechanism of action of antipsychotic drugs. Cell 147:1011–1023. https://doi.org/10.1016/j.cell.2011.09.055
Schmid CL, Streicher JM, Meltzer HY, Bohn LM (2014) Clozapine acts as an agonist at serotonin 2A receptors to counter MK-801-induced behaviors through a βarrestin2-independent activation of Akt. Neuropsychopharmacology 8(39):1902–1913. https://doi.org/10.1038/npp.2014.38
Jendryka M, Palchaudhuri M, Ursu D et al (2019) Pharmacokinetic and pharmacodynamic actions of clozapine-N-oxide, clozapine, and compound 21 in DREADD-based chemogenetics in mice. Sci Rep 1(9):1–14. https://doi.org/10.1038/s41598-019-41088-2
Cheng L, Xia F, Li Z et al (2023) Structure, function and drug discovery of GPCR signaling. Mol Biomed 4:46. https://doi.org/10.1186/S43556-023-00156-W
Oprea TI, Bologa CG, Brunak S et al (2018) Unexplored therapeutic opportunities in the human genome. Nat Rev Drug Discov 5(17):317–332. https://doi.org/10.1038/nrd.2018.14
Congreve M, de Graaf C, Swain NA, Tate CG (2020) Impact of GPCR structures on drug discovery. Cell 181:81–91. https://doi.org/10.1016/J.CELL.2020.03.003
Chen Z, Ren X, Zhou Y, Huang N (2024) Exploring structure-based drug discovery of GPCRs beyond the orthosteric binding site. hLife. https://doi.org/10.1016/J.HLIFE.2024.01.002
Kooistra AJ, Mordalski S, Pándy-Szekeres G et al (2021) GPCRdb in 2021: integrating GPCR sequence, structure and function. Nucleic Acids Res 49:D335–D343. https://doi.org/10.1093/NAR/GKAA1080
Pándy-Szekeres G, Caroli J, Mamyrbekov A et al (2023) GPCRdb in 2023: state-specific structure models using AlphaFold2 and new ligand resources. Nucleic Acids Res 51:D395–D402. https://doi.org/10.1093/NAR/GKAC1013
Chan WKB, Zhang Y (2020) Virtual screening of human class-A GPCRs using ligand profiles built on multiple ligand-receptor interactions. J Mol Biol 432:4872. https://doi.org/10.1016/J.JMB.2020.07.003
Ahmed M, Hasani HJ, Kalyaanamoorthy S, Barakat K (2021) GPCR_LigandClassify.py; a rigorous machine learning classifier for GPCR targeting compounds. Sci Rep 11:9510. https://doi.org/10.1038/S41598-021-88939-5
Wang K, Zhou R, Tang J, Li M (2023) GraphscoreDTA: optimized graph neural network for protein–ligand binding affinity prediction. Bioinformatics. https://doi.org/10.1093/BIOINFORMATICS/BTAD340
Yousefi N, Yazdani-Jahromi M, Tayebi A et al (2023) BindingSite-AugmentedDTA: enabling a next-generation pipeline for interpretable prediction models in drug repurposing. Brief Bioinform 24:1–13. https://doi.org/10.1093/BIB/BBAD136
Yazdani-Jahromi M, Yousefi N, Tayebi A et al (2022) AttentionSiteDTI: an interpretable graph-based model for drug-target interaction prediction using NLP sentence-level relation classification. Brief Bioinform 23:1–14. https://doi.org/10.1093/BIB/BBAC272
Park H, Brahma R, Shin JM, Cho KH (2022) Prediction of human cytochrome P450 inhibition using bio-selectivity induced deep neural network. Bull Korean Chem Soc 43:261–269. https://doi.org/10.1002/BKCS.12445
Park H, Hong S, Lee M et al (2023) AiKPro: deep learning model for kinome-wide bioactivity profiling using structure-based sequence alignments and molecular 3D conformer ensemble descriptors. Sci Rep 1(13):1–12. https://doi.org/10.1038/s41598-023-37456-8
Brahma R, Shin JM, Cho KH (2023) KinScan: AI-based rapid profiling of activity across the kinome. Brief Bioinform. https://doi.org/10.1093/BIB/BBAD396
Seo S, Choi J, Ahn SK et al (2018) Prediction of GPCR-ligand binding using machine learning algorithms. Comput Math Methods Med. https://doi.org/10.1155/2018/6565241
Karimi S, Ahmadi M, Goudarzi F, Ferdousi R (2020) A computational model for GPCR-ligand interaction prediction. J Integr Bioinform 18:155–165. https://doi.org/10.1515/JIB-2019-0084
Yamane H, Ishida T (2023) Helix encoder: a compound-protein interaction prediction model specifically designed for class A GPCRs. Front Bioinform 3:1193025. https://doi.org/10.3389/FBINF.2023.1193025
Remington JM, McKay KT, Beckage NB et al (2023) GPCRLigNet: rapid screening for GPCR active ligands using machine learning. J Comput Aided Mol Des 37:147–156. https://doi.org/10.1007/S10822-023-00497-2
Cai T, Abbu KA, Liu Y, Xie L (2022) DeepREAL: a deep learning powered multi-scale modeling framework for predicting out-of-distribution ligand-induced GPCR activity. Bioinformatics 38:2561–2570. https://doi.org/10.1093/BIOINFORMATICS/BTAC154
Wu Z, Cheng F, Li J et al (2017) SDTNBI: an integrated network and chemoinformatics tool for systematic prediction of drug–target interactions and drug repositioning. Brief Bioinform 18:333–347. https://doi.org/10.1093/BIB/BBW012
Oh J, Ceong H, Na D, Park C (2022) A machine learning model for classifying G-protein-coupled receptors as agonists or antagonists. BMC Bioinform 23:1–10. https://doi.org/10.1186/S12859-022-04877-7
Chu Y, Shan X, Chen T et al (2021) DTI-MLCD: predicting drug-target interactions using multi-label learning with community detection method. Brief Bioinform 22:1–15. https://doi.org/10.1093/BIB/BBAA205
Goßen J, Ribeiro RP, Bier D et al (2023) AI-based identification of therapeutic agents targeting GPCRs: introducing ligand type classifiers and systems biology. Chem Sci 14:8651–8661. https://doi.org/10.1039/D3SC02352D
Merighi S, Borea PA, Varani K et al (2022) A2A adenosine receptor antagonists in neurodegenerative diseases. Curr Med Chem 29:4138. https://doi.org/10.2174/0929867328666211129122550
Al-Attraqchi OHA, Attimarad M, Venugopala KN et al (2019) Adenosine A2A receptor as a potential drug target—current status and future perspectives. Curr Pharm Des 25:2716–2740. https://doi.org/10.2174/1381612825666190716113444
Jiménez-Rosés M, Morgan BA, Jimenez Sigstad M et al (2022) Combined docking and machine learning identify key molecular determinants of ligand pharmacological activity on β2 adrenoceptor. Pharmacol Res Perspect. https://doi.org/10.1002/PRP2.994
Xu H, Zhang B, Liu Q (2023) Deep learning-based classification model for GPR151 activator activity prediction. BMC Bioinform. https://doi.org/10.1186/S12859-023-05369-Y
Wu J, Liu B, Chan WKB et al (2019) Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors. Bioinformatics 35:i324–i332. https://doi.org/10.1093/BIOINFORMATICS/BTZ336
Wu J, Zhang Q, Wu W et al (2018) WDL-RF: predicting bioactivities of ligand molecules acting with G protein-coupled receptors by combining weighted deep learning and random forest. Bioinformatics 34:2271. https://doi.org/10.1093/BIOINFORMATICS/BTY070
Sakai M, Nagayasu K, Shibui N et al (2021) Prediction of pharmacological activities from chemical structures with graph convolutional neural networks. Sci Rep 1(11):1–14. https://doi.org/10.1038/s41598-020-80113-7
Velloso JPL, Ascher DB, Pires DEV (2021) pdCSM-GPCR: predicting potent GPCR ligands with graph-based signatures. Bioinform Adv. https://doi.org/10.1093/BIOADV/VBAB031
Huang WC, Lin WT, Hung MS et al (2024) Decrypting orphan GPCR drug discovery via multitask learning. J Cheminform 16:1–11. https://doi.org/10.1186/S13321-024-00806-3
Bhatnagar R, Sardar S, Beheshti M, Podichetty JT (2022) How can natural language processing help model informed drug development?: a review. JAMIA Open 5:1–14. https://doi.org/10.1093/JAMIAOPEN/OOAC043
Hu Z, Liu W, Zhang C et al (2023) SAM-DTA: a sequence-agnostic model for drug–target binding affinity prediction. Brief Bioinform 24:1–15. https://doi.org/10.1093/BIB/BBAC533
Huang Z, Zhang P, Deng L (2023) DeepCoVDR: deep transfer learning with graph transformer and cross-attention for predicting COVID-19 drug response. Bioinformatics 39:i475–i483. https://doi.org/10.1093/BIOINFORMATICS/BTAD244
Thakur A, Kumar A, Sharma V, Mehta V (2022) PIC50: an open source tool for interconversion of PIC50 values and IC50 for efficient data representation and analysis. bioRxiv. https://doi.org/10.1101/2022.10.15.512366
RDKit. https://www.rdkit.org/
Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. Adv Neural Inf Process Syst 30:5999–6009
Chicco D, Jurman G (2020) The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics 21:1–13. https://doi.org/10.1186/S12864-019-6413-7
Dwomoh L, Tejeda GS, Tobin AB (2022) Targeting the M1 muscarinic acetylcholine receptor in Alzheimer’s disease. Neuronal Signal 6:20210004. https://doi.org/10.1042/NS20210004
Brown AJH, Bradley SJ, Marshall FH et al (2021) From structure to clinic: design of a muscarinic M1 receptor agonist with the potential to treat Alzheimer’s disease. Cell 184:5886-5901.e22. https://doi.org/10.1016/J.CELL.2021.11.001
Shirey JK, Brady AE, Jones PJ et al (2009) A selective allosteric potentiator of the m1 muscarinic acetylcholine receptor increases activity of medial prefrontal cortical neurons and restores impairments in reversal learning. J Neurosci 29:14271. https://doi.org/10.1523/JNEUROSCI.3930-09.2009
Rowe WB, O’Donnell JP, Pearson D et al (2003) Long-term effects of BIBN-99, a selective muscarinic M2 receptor antagonist, on improving spatial memory performance in aged cognitively impaired rats. Behav Brain Res 145:171–178. https://doi.org/10.1016/S0166-4328(03)00116-5
Poulin B, Butcher A, McWilliams P et al (2010) The M3-muscarinic receptor regulates learning and memory in a receptor phosphorylation/arrestin-dependent manner. Proc Natl Acad Sci USA 107:9440–9445. https://doi.org/10.1073/PNAS.0914801107
Heid E, Greenman KP, Chung Y et al (2024) Chemprop: a machine learning package for chemical property prediction. J Chem Inf Model 64:9–17. https://doi.org/10.1021/ACS.JCIM.3C01250
Acknowledgements
This work was supported by KREONET (Korea Research Environment Open NETwork), which is managed and operated by KISTI (Korea Institute of Science and Technology Information). We thank the National Institute for International Education (NIIED), Government of the Republic of Korea, for the Global Korea Scholarship awarded to R.B. This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education (2021R1A6A1A10044154).
Author information
Authors and Affiliations
Contributions
Experiments, coding, writing, web server, database development, and maintenance were conducted by R.B under the supervision of K.H.C, with conceptualization by J.M.S and K.H.C. S.H.M helped to build the web server. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Brahma, R., Moon, S., Shin, JM. et al. AiGPro: a multi-tasks model for profiling of GPCRs for agonist and antagonist. J Cheminform 17, 12 (2025). https://doi.org/10.1186/s13321-024-00945-7