1
|
Rai GP, Shanker A. The coevolutionary landscape of drug resistance in epidermal growth factor receptor: A cancer perspective. Comput Biol Med 2025; 189:110001. [PMID: 40073493 DOI: 10.1016/j.compbiomed.2025.110001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2024] [Revised: 03/03/2025] [Accepted: 03/04/2025] [Indexed: 03/14/2025]
Abstract
Epidermal growth factor receptor (EGFR), the first receptor tyrosine kinase, plays a critical role in neoplastic metastasis, angiogenesis, tumor invasion, and apoptosis, making it a prime target for treating non-small cell lung cancer (NSCLC). Although tyrosine kinase inhibitors (TKIs) have shown high efficacy and promise for cancer patients, resistance to these drugs often develops within a year due to alterations. The present study investigates the compensatory alterations in EGFR to understand the evolutionary process behind drug resistance. Our findings reveal that coevolutionary alterations expand the drug-binding pocket; leading to reduced drug efficacy and suggested that such changes significantly influence the structural adaptation of the EGFR against these drugs. Analysis such as root mean square deviation (RMSD), root mean square fluctuation (RMSF), solvent accessible surface area (SASA), principal component analysis (PCA), and free energy landscape (FEL) demonstrated that structures of wild EGFR docked with gefitinib are more stable which suggests its susceptibility towards drug than coevolution dependent double mutant. The findings were supported by MM-GBSA binding affinity analysis. The insights from this study highlighted the evolution-induced structural changes which contributes to drug resistance in EGFR and may certainly aid in designing more effective drugs.
Collapse
Affiliation(s)
- Gyan Prakash Rai
- Department of Bioinformatics, Central University of South Bihar, Gaya, Bihar, 824236, India
| | - Asheesh Shanker
- Department of Bioinformatics, Central University of South Bihar, Gaya, Bihar, 824236, India.
| |
Collapse
|
2
|
Fiorote J, Alves J, Stock L, Treptow W. Investigating Statistical Conditions of Coevolutionary Signals that Enable Algorithmic Predictions of Protein Partners. J Chem Inf Model 2025; 65:4107-4115. [PMID: 40232741 PMCID: PMC12042258 DOI: 10.1021/acs.jcim.5c00052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2025] [Revised: 04/01/2025] [Accepted: 04/02/2025] [Indexed: 04/16/2025]
Abstract
This study examines the statistical conditions of coevolutionary signals that allow algorithmic predictions of protein partners based on amino acid sequences rather than 3D structures. It introduces a Markov stochastic model that predicts the number of correct protein partners based on coevolutionary information. The model defines state probabilities using a Poisson mixture of normal distributions, with key parameters including the total number of protein sequences M, the coevolutionary information gap α, and variance σ02. The model suggests that algorithmic approaches that maximize coevolutionary information cannot effectively resolve partners in protein families with a large number of sequences M ≥ 100. The model shows that true-positive (TP) rates can be enhanced by disregarding mismatches among similar sequences. This approach allows a distinction, in terms of {α, σ02}, between optimized solutions with trivial errors and other degenerate solutions. Our findings enable the a priori classification of protein families where partners can be reliably predicted by ignoring trivial errors between similar sequences, advancing the understanding of coevolutionary models for large protein data sets.
Collapse
Affiliation(s)
- José Fiorote
- Laboratório
de Biologia Teórica e Computacional (LBTC), Universidade de Brasília, Brasilia, DF 70910-900, Brasil
| | - João Alves
- Laboratório
de Biologia Teórica e Computacional (LBTC), Universidade de Brasília, Brasilia, DF 70910-900, Brasil
| | - Letícia Stock
- Ben May Department
for Cancer Research, University of Chicago, Chicago, Illinois 60637, United States
| | - Werner Treptow
- Laboratório
de Biologia Teórica e Computacional (LBTC), Universidade de Brasília, Brasilia, DF 70910-900, Brasil
| |
Collapse
|
3
|
Tibery DV, Nunes JAA, da Mata DO, Menezes LFS, de Souza ACB, Fernandes-Pedrosa MDF, Treptow W, Schwartz EF. Unveiling Tst3, a Multi-Target Gating Modifier Scorpion α Toxin from Tityus stigmurus Venom of Northeast Brazil: Evaluation and Comparison with Well-Studied Ts3 Toxin of Tityus serrulatus. Toxins (Basel) 2024; 16:257. [PMID: 38922152 PMCID: PMC11209618 DOI: 10.3390/toxins16060257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 05/27/2024] [Accepted: 05/30/2024] [Indexed: 06/27/2024] Open
Abstract
Studies on the interaction sites of peptide toxins and ion channels typically involve site-directed mutations in toxins. However, natural mutant toxins exist among them, offering insights into how the evolutionary process has conserved crucial sequences for activities and molecular target selection. In this study, we present a comparative investigation using electrophysiological approaches and computational analysis between two alpha toxins from evolutionarily close scorpion species of the genus Tityus, namely, Tst3 and Ts3 from T. stigmurus and T. serrulatus, respectively. These toxins exhibit three natural substitutions near the C-terminal region, which is directly involved in the interaction between alpha toxins and Nav channels. Additionally, we characterized the activity of the Tst3 toxin on Nav1.1-Nav1.7 channels. The three natural changes between the toxins did not alter sensitivity to Nav1.4, maintaining similar intensities regarding their ability to alter opening probabilities, delay fast inactivation, and induce persistent currents. Computational analysis demonstrated a preference for the down conformation of VSD4 and a shift in the conformational equilibrium towards this state. This illustrates that the sequence of these toxins retained the necessary information, even with alterations in the interaction site region. Through electrophysiological and computational analyses, screening of the Tst3 toxin on sodium isoform revealed its classification as a classic α-NaTx with a broad spectrum of activity. It effectively delays fast inactivation across all tested isoforms. Structural analysis of molecular energetics at the interface of the VSD4-Tst3 complex further confirmed this effect.
Collapse
Affiliation(s)
- Diogo Vieira Tibery
- Laboratório de Neurofarmacologia, Departamento de Ciências Fisiológicas, Universidade de Brasília (UnB), Brasília 70910-900, Distrito Federal, Brazil; (D.V.T.); (D.O.d.M.); (L.F.S.M.); (A.C.B.d.S.)
| | - João Antonio Alves Nunes
- Laboratório de Biologia Teórica e Computacional (LBTC), Departamento de Biologia Celular, Universidade de Brasília (UnB), Brasília 70910-900, Distrito Federal, Brazil; (J.A.A.N.); (W.T.)
| | - Daniel Oliveira da Mata
- Laboratório de Neurofarmacologia, Departamento de Ciências Fisiológicas, Universidade de Brasília (UnB), Brasília 70910-900, Distrito Federal, Brazil; (D.V.T.); (D.O.d.M.); (L.F.S.M.); (A.C.B.d.S.)
| | - Luis Felipe Santos Menezes
- Laboratório de Neurofarmacologia, Departamento de Ciências Fisiológicas, Universidade de Brasília (UnB), Brasília 70910-900, Distrito Federal, Brazil; (D.V.T.); (D.O.d.M.); (L.F.S.M.); (A.C.B.d.S.)
| | - Adolfo Carlos Barros de Souza
- Laboratório de Neurofarmacologia, Departamento de Ciências Fisiológicas, Universidade de Brasília (UnB), Brasília 70910-900, Distrito Federal, Brazil; (D.V.T.); (D.O.d.M.); (L.F.S.M.); (A.C.B.d.S.)
| | - Matheus de Freitas Fernandes-Pedrosa
- Laboratório de Tecnologia e Biotecnologia Farmacêutica, Departamento de Farmácia, Universidade Federal do Rio Grande do Norte (UFRN), Natal 59012-570, Rio Grande do Norte, Brazil;
| | - Werner Treptow
- Laboratório de Biologia Teórica e Computacional (LBTC), Departamento de Biologia Celular, Universidade de Brasília (UnB), Brasília 70910-900, Distrito Federal, Brazil; (J.A.A.N.); (W.T.)
| | - Elisabeth Ferroni Schwartz
- Laboratório de Neurofarmacologia, Departamento de Ciências Fisiológicas, Universidade de Brasília (UnB), Brasília 70910-900, Distrito Federal, Brazil; (D.V.T.); (D.O.d.M.); (L.F.S.M.); (A.C.B.d.S.)
| |
Collapse
|
4
|
Koyama K, Hashimoto K, Nagao C, Mizuguchi K. Attention network for predicting T-cell receptor-peptide binding can associate attention with interpretable protein structural properties. FRONTIERS IN BIOINFORMATICS 2023; 3:1274599. [PMID: 38170146 PMCID: PMC10759225 DOI: 10.3389/fbinf.2023.1274599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 11/27/2023] [Indexed: 01/05/2024] Open
Abstract
Understanding how a T-cell receptor (TCR) recognizes its specific ligand peptide is crucial for gaining an insight into biological functions and disease mechanisms. Despite its importance, experimentally determining TCR-peptide-major histocompatibility complex (TCR-pMHC) interactions is expensive and time-consuming. To address this challenge, computational methods have been proposed, but they are typically evaluated by internal retrospective validation only, and few researchers have incorporated and tested an attention layer from language models into structural information. Therefore, in this study, we developed a machine learning model based on a modified version of Transformer, a source-target attention neural network, to predict the TCR-pMHC interaction solely from the amino acid sequences of the TCR complementarity-determining region (CDR) 3 and the peptide. This model achieved competitive performance on a benchmark dataset of the TCR-pMHC interaction, as well as on a truly new external dataset. Additionally, by analyzing the results of binding predictions, we associated the neural network weights with protein structural properties. By classifying the residues into large- and small-attention groups, we identified statistically significant properties associated with the largely attended residues such as hydrogen bonds within CDR3. The dataset that we created and the ability of our model to provide an interpretable prediction of TCR-peptide binding should increase our knowledge about molecular recognition and pave the way for designing new therapeutics.
Collapse
Affiliation(s)
- Kyohei Koyama
- Laboratory for Computational Biology, Institute for Protein Research, Osaka University, Osaka, Japan
- National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
- Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan
| | - Kosuke Hashimoto
- Laboratory for Computational Biology, Institute for Protein Research, Osaka University, Osaka, Japan
| | - Chioko Nagao
- Laboratory for Computational Biology, Institute for Protein Research, Osaka University, Osaka, Japan
| | - Kenji Mizuguchi
- Laboratory for Computational Biology, Institute for Protein Research, Osaka University, Osaka, Japan
- National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
- Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan
| |
Collapse
|
5
|
Mukherjee I, Chakrabarti S. Co-evolutionary landscape at the interface and non-interface regions of protein-protein interaction complexes. Comput Struct Biotechnol J 2021; 19:3779-3795. [PMID: 34285778 PMCID: PMC8271121 DOI: 10.1016/j.csbj.2021.06.039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 06/22/2021] [Accepted: 06/22/2021] [Indexed: 11/16/2022] Open
Abstract
Proteins involved in interactions throughout the course of evolution tend to co-evolve and compensatory changes may occur in interacting proteins to maintain or refine such interactions. However, certain residue pair alterations may prove to be detrimental for functional interactions. Hence, determining co-evolutionary pairings that could be structurally or functionally relevant for maintaining the conservation of an inter-protein interaction is important. Inter-protein co-evolution analysis in several complexes utilizing multiple existing methodologies suggested that co-evolutionary pairings can occur in spatially proximal and distant regions in inter-protein interactions. Subsequently, the Co-Var (Correlated Variation) method based on mutual information and Bhattacharyya coefficient was developed, validated, and found to perform relatively better than CAPS and EV-complex. Interestingly, while applying the Co-Var measure and EV-complex program on a set of protein-protein interaction complexes, co-evolutionary pairings were obtained in interface and non-interface regions in protein complexes. The Co-Var approach involves determining high degree co-evolutionary pairings that include multiple co-evolutionary connections between particular co-evolved residue positions in one protein with multiple residue positions in the binding partner. Detailed analyses of high degree co-evolutionary pairings in protein-protein complexes involved in cancer metastasis suggested that most of the residue positions forming such co-evolutionary connections mainly occurred within functional domains of constituent proteins and substitution mutations were also common among these positions. The physiological relevance of these predictions suggested that Co-Var can predict residues that could be crucial for preserving functional protein-protein interactions. Finally, Co-Var web server (http://www.hpppi.iicb.res.in/ishi/covar/index.html) that implements this methodology identifies co-evolutionary pairings in intra and inter-protein interactions.
Collapse
Affiliation(s)
- Ishita Mukherjee
- Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research (CSIR) - Indian Institute of Chemical Biology (IICB), Kolkata, West Bengal 700032, India
| | - Saikat Chakrabarti
- Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research (CSIR) - Indian Institute of Chemical Biology (IICB), Kolkata, West Bengal 700032, India
| |
Collapse
|
6
|
Trivial and nontrivial error sources account for misidentification of protein partners in mutual information approaches. Sci Rep 2021; 11:6902. [PMID: 33767294 PMCID: PMC7994710 DOI: 10.1038/s41598-021-86455-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Accepted: 03/15/2021] [Indexed: 12/01/2022] Open
Abstract
The problem of finding the correct set of partners for a given pair of interacting protein families based on multi-sequence alignments (MSAs) has received great attention over the years. Recently, the native contacts of two interacting proteins were shown to store the strongest mutual information (MI) signal to discriminate MSA concatenations with the largest fraction of correct pairings. Although that signal might be of practical relevance in the search for an effective heuristic to solve the problem, the number of MSA concatenations with near-native MI is large, imposing severe limitations. Here, a Genetic Algorithm that explores possible MSA concatenations according to a MI maximization criteria is shown to find degenerate solutions with two error sources, arising from mismatches among (i) similar and (ii) non-similar sequences. If mistakes made among similar sequences are disregarded, type-(i) solutions are found to resolve correct pairings at best true positive (TP) rates of 70%—far above the very same estimates in type-(ii) solutions. A machine learning classification algorithm helps to show further that differences between optimized solutions based on TP rates are not artificial and may have biological meaning associated with the three-dimensional distribution of the MI signal. Type-(i) solutions may therefore correspond to reliable results for predictive purposes, found here to be more likely obtained via MI maximization across protein systems having a minimum critical number of amino acid contacts on their interaction surfaces (N > 200).
Collapse
|