López-Cortés XA, Lara G, Fernández N, Manríquez-Troncoso JM, Venthur H. Insight into the Relationships Between Chemical, Protein and Functional Variables in the PBP/GOBP Family in Moths Based on Machine Learning.
Int J Mol Sci 2025;
26:2302. [PMID:
40076924 PMCID:
PMC11901117 DOI:
10.3390/ijms26052302]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2025] [Revised: 02/13/2025] [Accepted: 02/19/2025] [Indexed: 03/14/2025] Open
Abstract
During their lives, insects must cope with a plethora of chemicals, of which a few will have an impact at the behavioral level. To detect these chemicals, insects use several protein families located in their main olfactory organs, the antennae. Inside the antennae, odorant-binding proteins (OBPs), as the most studied protein family, bind volatile chemicals to transport them. Pheromone-binding proteins (PBPs) and general-odorant-binding proteins (GOPBs) are two subclasses of OBPs and have evolved in moths with a putative olfactory role. Predictions for OBP-chemical interactions have remained limited, and functional data collected over the years unused. In this study, chemical, protein and functional data were curated, and related datasets were created with descriptors. Regression algorithms were implemented and their performance evaluated. Our results indicate that XGBoostRegressor exhibits the best performance (R2 of 0.76, RMSE of 0.28 and MAE of 0.20), followed by GradientBoostingRegressor and LightGBMRegressor. To the best of our knowledge, this is the first study showing a correlation among chemical, protein and functional data, particularly in the context of the PBP/GOBP family of proteins in moths.
Collapse