1
|
Zhang F, Hirama Y, Onishi S, Mori T, Ono N, Kanaya S. Design of Fragrance Formulations with Antiviral Activity Using Bayesian Optimization. Microorganisms 2024; 12:1568. [PMID: 39203410 PMCID: PMC11356527 DOI: 10.3390/microorganisms12081568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 07/25/2024] [Accepted: 07/25/2024] [Indexed: 09/03/2024] Open
Abstract
In case of future viral threats, including the proposed Disease X that has been discussed since the emergence of the COVID-19 pandemic in March 2020, our research has focused on the development of antiviral strategies using fragrance compounds with known antiviral activity. Despite the recognized antiviral properties of mixtures of certain fragrance compounds, there has been a lack of a systematic approach to optimize these mixtures. Confronted with the significant combinatorial challenge and the complexity of the compound formulation space, we employed Bayesian optimization, guided by Gaussian Process Regression (GPR), to systematically explore and identify formulations with demonstrable antiviral efficacy. This approach required the transformation of the characteristics of formulations into quantifiable feature values using molecular descriptors, subsequently modeling these data to predict and propose formulations with likely antiviral efficacy enhancements. The predicted formulations underwent experimental testing, resulting in the identification of combinations capable of inactivating 99.99% of viruses, including a notably efficacious formulation of five distinct fragrance types. This model demonstrates high predictive accuracy (coefficient determination Rcv2 > 0.7) and suggests a new frontier in antiviral strategy development. Our findings indicate the powerful potential of computational modeling to surpass human analytical capabilities in the pursuit of complex, fragrance-based antiviral formulations.
Collapse
Affiliation(s)
- Fan Zhang
- Material Science Research, Kao Corporation, 1334 Minato, Wakayama-shi 640-8580, Wakayama, Japan;
- Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma 630-0192, Nara, Japan;
| | - Yui Hirama
- Biological Science Research, Kao Corporation, 2606 Akabane, Ichikai-machi, Haga-gun 321-3426, Tochigi, Japan; (Y.H.); (S.O.); (T.M.)
| | - Shintaro Onishi
- Biological Science Research, Kao Corporation, 2606 Akabane, Ichikai-machi, Haga-gun 321-3426, Tochigi, Japan; (Y.H.); (S.O.); (T.M.)
| | - Takuya Mori
- Biological Science Research, Kao Corporation, 2606 Akabane, Ichikai-machi, Haga-gun 321-3426, Tochigi, Japan; (Y.H.); (S.O.); (T.M.)
| | - Naoaki Ono
- Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma 630-0192, Nara, Japan;
- Data Science Center, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma 630-0192, Nara, Japan
| | - Shigehiko Kanaya
- Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma 630-0192, Nara, Japan;
- Data Science Center, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma 630-0192, Nara, Japan
| |
Collapse
|
2
|
Shirasawa R, Takaki K, Miyao T. Generalizability Improvement of Interpretable Symbolic Regression Models for Quantitative Structure-Activity Relationships. ACS OMEGA 2024; 9:9463-9474. [PMID: 38434845 PMCID: PMC10905595 DOI: 10.1021/acsomega.3c09047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/20/2024] [Accepted: 01/26/2024] [Indexed: 03/05/2024]
Abstract
In the pursuit of optimal quantitative structure-activity relationship (QSAR) models, two key factors are paramount: the robustness of predictive ability and the interpretability of the model. Symbolic regression (SR) searches for the mathematical expressions that explain a training data set. Thus, the models provided by SR are globally interpretable. We previously proposed an SR method that can generate interpretable expressions by humans. This study introduces an enhanced symbolic regression method, termed filter-induced genetic programming 2 (FIGP2), as an extension of our previously proposed SR method. FIGP2 is designed to improve the generalizability of SR models and to be applicable to data sets in which cost-intensive descriptors are employed. The FIGP2 method incorporates two major improvements: a modified domain filter to eradicate diverging expressions based on optimal calculation and the introduction of a stability metric to penalize expressions that would lead to overfitting. Our retrospective comparative analysis using 12 structure-activity relationship data sets revealed that FIGP2 surpassed the previously proposed SR method and conventional modeling methods, such as support vector regression and multivariate linear regression in terms of predictive performance. Generated mathematical expressions by FIGP2 were relatively simple and not divergent in the domain of function. Taken together, FIGP2 can be used for making interpretable regression models with predictive ability.
Collapse
Affiliation(s)
- Raku Shirasawa
- Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
- Advanced Research Laboratory, Technology Infrastructure Center, Technology Platform, Sony Group Corporation, Atsugi Tec., 4-14-1 Asahi-cho, Atsugi-shi, Kanagawa 243-0014, Japan
| | - Katsushi Takaki
- Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
| | - Tomoyuki Miyao
- Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
- Data Science Center, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
| |
Collapse
|