1
|
Li G, Li J, Tian Y, Zhao Y, Pang X, Yan A. Machine learning-based classification models for non-covalent Bruton's tyrosine kinase inhibitors: predictive ability and interpretability. Mol Divers 2023:10.1007/s11030-023-10696-6. [PMID: 37479824 DOI: 10.1007/s11030-023-10696-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 07/07/2023] [Indexed: 07/23/2023]
Abstract
In this study, we built classification models using machine learning techniques to predict the bioactivity of non-covalent inhibitors of Bruton's tyrosine kinase (BTK) and to provide interpretable and transparent explanations for these predictions. To achieve this, we gathered data on BTK inhibitors from the Reaxys and ChEMBL databases, removing compounds with covalent bonds and duplicates to obtain a dataset of 3895 inhibitors of non-covalent. These inhibitors were characterized using MACCS fingerprints and Morgan fingerprints, and four traditional machine learning algorithms (decision trees (DT), random forests (RF), support vector machines (SVM), and extreme gradient boosting (XGBoost)) were used to build 16 classification models. In addition, four deep learning models were developed using deep neural networks (DNN). The best model, Model D_4, which was built using XGBoost and MACCS fingerprints, achieved an accuracy of 94.1% and a Matthews correlation coefficient (MCC) of 0.75 on the test set. To provide interpretable explanations, we employed the SHAP method to decompose the predicted values into the contributions of each feature. We also used K-means dimensionality reduction and hierarchical clustering to visualize the clustering effects of molecular structures of the inhibitors. The results of this study were validated using crystal structures, and we found that the interaction between the BTK amino acid residue and the important features of clustered scaffold was consistent with the known properties of the complex crystal structures. Overall, our models demonstrated high predictive ability and a qualitative model can be converted to a quantitative model to some extent by SHAP, making them valuable for guiding the design of new BTK inhibitors with desired activity.
Collapse
Affiliation(s)
- Guo Li
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, People's Republic of China
| | - Jiaxuan Li
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, People's Republic of China
| | - Yujia Tian
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, People's Republic of China
| | - Yunyang Zhao
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, People's Republic of China
| | - Xiaoyang Pang
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, People's Republic of China
| | - Aixia Yan
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, People's Republic of China.
| |
Collapse
|
2
|
Jeon YJ, Lee Y, Yang JS, Park YS, Jung SJ. Physical and mental health characteristics related to trust in and intention to receive COVID-19 vaccination: results from a Korean community-based longitudinal study. Epidemiol Health 2022; 44:e2022064. [PMID: 35940179 PMCID: PMC9943634 DOI: 10.4178/epih.e2022064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 08/03/2022] [Indexed: 01/12/2023] Open
Abstract
OBJECTIVES The aim of this study was to explore factors affecting attitudes toward coronavirus disease 2019 (COVID-19) vaccination, including socio-demographic characteristics and mental health status during the pandemic. METHODS This study analyzed responses from 1,768 participants who were originally included in a community cohort study and responded to 3 online surveys related to COVID-19 (March 2020 to March 2021). The k-means method was used to cluster trust in and intention to receive COVID-19 vaccination. Baseline (2013-2018) socio-demographic characteristics, physical health status, and depressive symptoms were analyzed as exposure variables, and current mental health status was included in the analyses. RESULTS Almost half of all participants were classified into the moderate trust and high intention cluster (n=838, 47.4%); those with high trust and high intention accounted only for 16.9%. They tended to be older, had high-income levels, and engaged in regular physical activity at baseline (p<0.05), and their sleep quality and psychological resilience were relatively high compared to other groups. CONCLUSIONS Our results suggest that more efforts are required to enhance the perceived need for and trust in COVID-19 vaccination.
Collapse
Affiliation(s)
- Ye Jin Jeon
- Department of Public Health, Graduate School, Yonsei University, Seoul, Korea
| | - Youngrong Lee
- Department of Preventive Medicine, Yonsei University College of Medicine, Seoul, Korea
| | - Ji Su Yang
- Department of Public Health, Graduate School, Yonsei University, Seoul, Korea
| | - Young Su Park
- Department of Health Studies, Haverford College, Haverford, PA, USA
| | - Sun Jae Jung
- Department of Public Health, Graduate School, Yonsei University, Seoul, Korea,Department of Preventive Medicine, Yonsei University College of Medicine, Seoul, Korea,Correspondence: Sun Jae Jung Department of Preventive Medicine, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul 03722, Korea E-mail:
| |
Collapse
|
3
|
Cong Y, Endo T. Multi-Omics and Artificial Intelligence-Guided Drug Repositioning: Prospects, Challenges, and Lessons Learned from COVID-19. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2022; 26:361-371. [PMID: 35759424 DOI: 10.1089/omi.2022.0068] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Drug repurposing is of interest for therapeutics innovation in many human diseases including coronavirus disease 2019 (COVID-19). Methodological innovations in drug repurposing are currently being empowered by convergence of omics systems science and digital transformation of life sciences. This expert review article offers a systematic summary of the application of artificial intelligence (AI), particularly machine learning (ML), to drug repurposing and classifies and introduces the common clustering, dimensionality reduction, and other methods. We highlight, as a present-day high-profile example, the involvement of AI/ML-based drug discovery in the COVID-19 pandemic and discuss the collection and sharing of diverse data types, and the possible futures awaiting drug repurposing in an era of AI/ML and digital technologies. The article provides new insights on convergence of multi-omics and AI-based drug repurposing. We conclude with reflections on the various pathways to expedite innovation in drug development through drug repurposing for prompt responses to the current COVID-19 pandemic and future ecological crises in the 21st century.
Collapse
Affiliation(s)
- Yi Cong
- Laboratory of Information Biology, Information Science and Technology, Hokkaido University, Sapporo, Japan
| | - Toshinori Endo
- Laboratory of Information Biology, Information Science and Technology, Hokkaido University, Sapporo, Japan
| |
Collapse
|
4
|
Jiao SJ, Liu LY, Liu Q. A Hybrid Deep Learning Model for Recognizing Actions of Distracted Drivers. SENSORS 2021; 21:s21217424. [PMID: 34770728 PMCID: PMC8588220 DOI: 10.3390/s21217424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Revised: 10/30/2021] [Accepted: 11/04/2021] [Indexed: 11/17/2022]
Abstract
With the rapid spreading of in-vehicle information systems such as smartphones, navigation systems, and radios, the number of traffic accidents caused by driver distractions shows an increasing trend. Timely identification and warning are deemed to be crucial for distracted driving and the establishment of driver assistance systems is of great value. However, almost all research on the recognition of the driver’s distracted actions using computer vision methods neglected the importance of temporal information for action recognition. This paper proposes a hybrid deep learning model for recognizing the actions of distracted drivers. Specifically, we used OpenPose to obtain skeleton information of the human body and then constructed the vector angle and modulus ratio of the human body structure as features to describe the driver’s actions, thereby realizing the fusion of deep network features and artificial features, which improve the information density of spatial features. The K-means clustering algorithm was used to preselect the original frames, and the method of inter-frame comparison was used to obtain the final keyframe sequence by comparing the Euclidean distance between manually constructed vectors representing frames and the vector representing the cluster center. Finally, we constructed a two-layer long short-term memory neural network to obtain more effective spatiotemporal features, and one softmax layer to identify the distracted driver’s action. The experimental results based on the collected dataset prove the effectiveness of this framework, and it can provide a theoretical basis for the establishment of vehicle distraction warning systems.
Collapse
|
5
|
Wu C, Peng Q, Lee J, Leibnitz K, Xia Y. Effective hierarchical clustering based on structural similarities in nearest neighbor graphs. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107295] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
6
|
Shi C, Zhang R, Yu Y, Sun X, Lin X. A SLIC-DBSCAN Based Algorithm for Extracting Effective Sky Region from a Single Star Image. SENSORS 2021; 21:s21175786. [PMID: 34502677 PMCID: PMC8434426 DOI: 10.3390/s21175786] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 08/17/2021] [Accepted: 08/24/2021] [Indexed: 11/16/2022]
Abstract
The star tracker is widely used for high-accuracy missions due to its high accuracy position high autonomy and low power consumption. On the other hand, the ability of interference suppression of the star tracker has always been a hot issue of concern. A SLIC-DBSCAN-based algorithm for extracting effective information from a single image with strong interference has been developed in this paper to remove interferences. Firstly, the restricted LC (luminance-based contrast) transformation is utilized to enhance the contrast between background noise and the large-area interference. Then, SLIC (the simple linear iterative clustering) algorithm is adopted to segment the saliency map and in this process, optimized parameters are harnessed. Finally, from these segments, features are extracted and superpixels with similar features are combined by using DBSCAN (density-based spatial clustering of applications with noise). The proposed algorithm is proved effective by successfully removing large-area interference and extracting star spots from the sky region of the real star image.
Collapse
Affiliation(s)
- Chenguang Shi
- Innovation Academy for Microsatellites of Chinese Academy of Sciences, Room 426, Building 4, 99 Haike Road, Shanghai 201203, China; (C.S.); (Y.Y.); (X.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China;
| | - Rui Zhang
- Innovation Academy for Microsatellites of Chinese Academy of Sciences, Room 426, Building 4, 99 Haike Road, Shanghai 201203, China; (C.S.); (Y.Y.); (X.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China;
- Correspondence:
| | - Yong Yu
- Innovation Academy for Microsatellites of Chinese Academy of Sciences, Room 426, Building 4, 99 Haike Road, Shanghai 201203, China; (C.S.); (Y.Y.); (X.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China;
| | - Xingzhe Sun
- University of Chinese Academy of Sciences, Beijing 100049, China;
| | - Xiaodong Lin
- Innovation Academy for Microsatellites of Chinese Academy of Sciences, Room 426, Building 4, 99 Haike Road, Shanghai 201203, China; (C.S.); (Y.Y.); (X.L.)
- University of Chinese Academy of Sciences, Beijing 100049, China;
| |
Collapse
|
7
|
Zeng S, Tong X, Sang N. Study on multi-center fuzzy C-means algorithm based on transitive closure and spectral clustering. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2013.11.020] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
8
|
Corsini P, Lazzerini B, Marcelloni F. A Fuzzy Relational Clustering Algorithm Based on a Dissimilarity Measure Extracted From Data. ACTA ACUST UNITED AC 2004; 34:775-82. [PMID: 15369122 DOI: 10.1109/tsmcb.2003.817041] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
One of the critical aspects of clustering algorithms is the correct identification of the dissimilarity measure used to drive the partitioning of the data set. The dissimilarity measure induces the cluster shape and therefore determines the success of clustering algorithms. As cluster shapes change from a data set to another, dissimilarity measures should be extracted from data. To this aim, we exploit some pairs of points with known dissimilarity value to teach a dissimilarity relation to a feed-forward neural network. Then, we use the neural dissimilarity measure to guide an unsupervised relational clustering algorithm. Experiments on synthetic data sets and on the Iris data set show that the relational clustering algorithm based on the neural dissimilarity outperforms some popular clustering algorithms (with possible partial supervision) based on spatial dissimilarity.
Collapse
Affiliation(s)
- Paolo Corsini
- Dipartimento di Ingegneria dell'Informazione: Elettronica, Informatica, Telecomunicazioni University of Pisa, Via Diotisalvi, 2-56122 Pisa, Italy.
| | | | | |
Collapse
|