1
|
Cheng J, Zhang Y, Zhang H, Ji S, Lu M. TransFOL: A Logical Query Model for Complex Relational Reasoning in Drug-Drug Interaction. IEEE J Biomed Health Inform 2024; 28:4975-4985. [PMID: 38743532 DOI: 10.1109/jbhi.2024.3401035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Predicting drug-drug interaction (DDI) plays a crucial role in drug recommendation and discovery. However, wet lab methods are prohibitively expensive and time-consuming due to drug interactions. In recent years, deep learning methods have gained widespread use in drug reasoning. Although these methods have demonstrated effectiveness, they can only predict the interaction between a drug pair and do not contain any other information. However, DDI is greatly affected by various other biomedical factors (such as the dose of the drug). As a result, it is challenging to apply them to more complex and meaningful reasoning tasks. Therefore, this study regards DDI as a link prediction problem on knowledge graphs and proposes a DDI prediction model based on Cross-Transformer and Graph Convolutional Networks (GCNs) in first-order logical query form, TransFOL. In the model, a biomedical query graph is first built to learn the embedding representation. Subsequently, an enhancement module is designed to aggregate the semantics of entities and relations. Cross-Transformer is used for encoding to obtain semantic information between nodes, and GCN is used to gather neighbour information further and predict inference results. To evaluate the performance of TransFOL on common DDI tasks, we conduct experiments on two benchmark datasets. The experimental results indicate that our model outperforms state-of-the-art methods on traditional DDI tasks. Additionally, we introduce different biomedical information in the other two experiments to make the settings more realistic. Experimental results verify the strong drug reasoning ability and generalization of TransFOL in complex settings.
Collapse
|
2
|
Yu L, Xu Z, Qiu W, Xiao X. MSDSE: Predicting drug-side effects based on multi-scale features and deep multi-structure neural network. Comput Biol Med 2024; 169:107812. [PMID: 38091725 DOI: 10.1016/j.compbiomed.2023.107812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 11/10/2023] [Accepted: 12/03/2023] [Indexed: 02/08/2024]
Abstract
Unexpected side effects may accompany the research stage and post-marketing of drugs. These accidents lead to drug development failure and even endanger patients' health. Thus, it is essential to recognize the unknown drug-side effects. Most existing methods in silico find the answer from the association network or similarity network of drugs while ignoring the drug-intrinsic attributes. The limitation is that they can only handle drugs in the maturation stage. To be suitable for early drug-side effect screening, we conceive a multi-structural deep learning framework, MSDSE, which synthetically considers the multi-scale features derived from the drug. MSDSE can jointly learn SMILES sequence-based word embedding, substructure-based molecular fingerprint, and chemical structure-based graph embedding. In the preprocessing stage of MSDSE, we project all features to the abstract space with the same dimension. MSDSE builds a bi-level channel strategy, including a convolutional neural network module with an Inception structure and a multi-head Self-Attention module, to learn and integrate multi-modal features from local to global perspectives. Finally, MSDSE regards the prediction of drug-side effects as pair-wise learning and outputs the pair-wise probability of drug-side effects through the inner product operation. MSDSE is evaluated and analyzed on benchmark datasets and performs optimally compared to other baseline models. We also set up the ablation study to explain the rationality of the feature approach and model structure. Moreover, we select model partial prediction results for the case study to reveal actual capability. The original data are available at http://github.com/yuliyi/MSDSE.
Collapse
Affiliation(s)
- Liyi Yu
- School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen, 333403, China
| | - Zhaochun Xu
- School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen, 333403, China
| | - Wangren Qiu
- School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen, 333403, China
| | - Xuan Xiao
- School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen, 333403, China.
| |
Collapse
|
3
|
Chalasani SH, Syed J, Ramesh M, Patil V, Pramod Kumar T. Artificial intelligence in the field of pharmacy practice: A literature review. EXPLORATORY RESEARCH IN CLINICAL AND SOCIAL PHARMACY 2023; 12:100346. [PMID: 37885437 PMCID: PMC10598710 DOI: 10.1016/j.rcsop.2023.100346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 10/06/2023] [Accepted: 10/07/2023] [Indexed: 10/28/2023] Open
Abstract
Artificial intelligence (AI) is a transformative technology used in various industrial sectors including healthcare. In pharmacy practice, AI has the potential to significantly improve medication management and patient care. This review explores various AI applications in the field of pharmacy practice. The incorporation of AI technologies provides pharmacists with tools and systems that help them make accurate and evidence-based clinical decisions. By using AI algorithms and Machine Learning, pharmacists can analyze a large volume of patient data, including medical records, laboratory results, and medication profiles, aiding them in identifying potential drug-drug interactions, assessing the safety and efficacy of medicines, and making informed recommendations tailored to individual patient requirements. Various AI models have been developed to predict and detect adverse drug events, assist clinical decision support systems with medication-related decisions, automate dispensing processes in community pharmacies, optimize medication dosages, detect drug-drug interactions, improve adherence through smart technologies, detect and prevent medication errors, provide medication therapy management services, and support telemedicine initiatives. By incorporating AI into clinical practice, health care professionals can augment their decision-making processes and provide patients with personalized care. AI allows for greater collaboration between different healthcare services provided to a single patient. For patients, AI may be a useful tool for providing guidance on how and when to take a medication, aiding in patient education, and promoting medication adherence and AI may be used to know how and where to obtain the most cost-effective healthcare and how best to communicate with healthcare professionals, optimize the health monitoring using wearables devices, provide everyday lifestyle and health guidance, and integrate diet and exercise.
Collapse
Affiliation(s)
- Sri Harsha Chalasani
- Dept. of Pharmacy Practice, JSS College of Pharmacy, JSS Academy of Higher Education & Research, Mysuru 15, Karnataka, India
| | - Jehath Syed
- Dept. of Pharmacy Practice, JSS College of Pharmacy, JSS Academy of Higher Education & Research, Mysuru 15, Karnataka, India
| | - Madhan Ramesh
- Dept. of Pharmacy Practice, JSS College of Pharmacy, JSS Academy of Higher Education & Research, Mysuru 15, Karnataka, India
| | - Vikram Patil
- Dept. of Radiology, JSS Medical College & Hospital, JSS Academy of Higher Education & Research, Mysuru 15, Karnataka, India
| | | |
Collapse
|
4
|
Xuan P, Xu K, Cui H, Nakaguchi T, Zhang T. Graph generative and adversarial strategy-enhanced node feature learning and self-calibrated pairwise attribute encoding for prediction of drug-related side effects. Front Pharmacol 2023; 14:1257842. [PMID: 37731739 PMCID: PMC10507253 DOI: 10.3389/fphar.2023.1257842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 08/17/2023] [Indexed: 09/22/2023] Open
Abstract
Background: Inferring drug-related side effects is beneficial for reducing drug development cost and time. Current computational prediction methods have concentrated on graph reasoning over heterogeneous graphs comprising the drug and side effect nodes. However, the various topologies and node attributes within multiple drug-side effect heterogeneous graphs have not been completely exploited. Methods: We proposed a new drug-side effect association prediction method, GGSC, to deeply integrate the diverse topologies and attributes from multiple heterogeneous graphs and the self-calibration attributes of each drug-side effect node pair. First, we created two heterogeneous graphs comprising the drug and side effect nodes and their related similarity and association connections. Since each heterogeneous graph has its specific topology and node attributes, a node feature learning strategy was designed and the learning for each graph was enhanced from a graph generative and adversarial perspective. We constructed a generator based on a graph convolutional autoencoder to encode the topological structure and node attributes from the whole heterogeneous graph and then generate the node features embedding the graph topology. A discriminator based on multilayer perceptron was designed to distinguish the generated topological features from the original ones. We also designed representation-level attention to discriminate the contributions of topological representations from multiple heterogeneous graphs and adaptively fused them. Finally, we constructed a self-calibration module based on convolutional neural networks to guide pairwise attribute learning through the features of the small latent space. Results: The comparison experiment results showed that GGSC had higher prediction performance than several state-of-the-art prediction methods. The ablation experiments demonstrated the effectiveness of topological enhancement learning, representation-level attention, and self-calibrated pairwise attribute learning. In addition, case studies over five drugs demonstrated GGSC's ability in discovering the potential drug-related side effect candidates. Conclusion: We proposed a drug-side effect association prediction method, and the method is beneficial for screening the reliable association candidates for the biologists to discover the actual associations.
Collapse
Affiliation(s)
- Ping Xuan
- Department of Computer Science, School of Engineering, Shantou University, Shantou, China
| | - Kai Xu
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VI, Australia
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba, Japan
| | - Tiangang Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
- School of Mathematical Science, Heilongjiang University, Harbin, China
| |
Collapse
|
5
|
Due A. What are side effects? EUROPEAN JOURNAL FOR PHILOSOPHY OF SCIENCE 2023; 13:16. [PMID: 36936702 PMCID: PMC10006551 DOI: 10.1007/s13194-023-00519-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 02/21/2023] [Indexed: 06/18/2023]
Abstract
Side effects are ubiquitous in medicine and they often play a role in treatment decisions for patients and clinicians alike. Philosophers and health researchers often use side effects to illustrate issues with contemporary medical research and practice. However, technical definitions of 'side effect' differ among health authorities. Thus, determining the side effects of an intervention can differ depending on whose definition we assume. Here I review some of the common definitions of side effect and highlight their issues. In response, I offer an account of side effects as jointly (i) unintended and (ii) effects due to the causal capacities or invariances of an intervention. I discuss (i) by examining the intentions or reasons behind therapeutic interventions, and I discuss (ii) by appealing to a manipulationist model of causation. The analysis here highlights that side effects are conceptually distinct from related outcomes like adverse events, adverse drug reactions, and placebo effects. The analysis also allows for reflection on the utility of 'side effect' as a technical term in medical research and practice.
Collapse
Affiliation(s)
- Austin Due
- Department of History and Philosophy of Science, University of Pittsburgh, Pittsburgh, PA USA
| |
Collapse
|
6
|
Das P, Mazumder DH. An extensive survey on the use of supervised machine learning techniques in the past two decades for prediction of drug side effects. Artif Intell Rev 2023; 56:1-28. [PMID: 36819660 PMCID: PMC9930028 DOI: 10.1007/s10462-023-10413-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2023] [Indexed: 02/19/2023]
Abstract
Approved drugs for sale must be effective and safe, implying that the drug's advantages outweigh its known harmful side effects. Side effects (SE) of drugs are one of the common reasons for drug failure that may halt the whole drug discovery pipeline. The side effects might vary from minor concerns like a runny nose to potentially life-threatening issues like liver damage, heart attack, and death. Therefore, predicting the side effects of the drug is vital in drug development, discovery, and design. Supervised machine learning-based side effects prediction task has recently received much attention since it reduces time, chemical waste, design complexity, risk of failure, and cost. The advancement of supervised learning approaches for predicting side effects have emerged as essential computational tools. Supervised machine learning technique provides early information on drug side effects to develop an effective drug based on drug properties. Still, there are several challenges to predicting drug side effects. Thus, a near-exhaustive survey is carried out in this paper on the use of supervised machine learning approaches employed in drug side effects prediction tasks in the past two decades. In addition, this paper also summarized the drug descriptor required for the side effects prediction task, commonly utilized drug properties sources, computational models, and their performances. Finally, the research gap, open problems, and challenges for the further supervised learning-based side effects prediction task have been discussed.
Collapse
Affiliation(s)
- Pranab Das
- Department of Computer Science and Engineering, National Institute of Technology Nagaland, Chumukedima, Dimapur, Nagaland 797103 India
| | - Dilwar Hussain Mazumder
- Department of Computer Science and Engineering, National Institute of Technology Nagaland, Chumukedima, Dimapur, Nagaland 797103 India
| |
Collapse
|
7
|
McNair D. Artificial Intelligence and Machine Learning for Lead-to-Candidate Decision-Making and Beyond. Annu Rev Pharmacol Toxicol 2023; 63:77-97. [PMID: 35679624 DOI: 10.1146/annurev-pharmtox-051921-023255] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The use of artificial intelligence (AI) and machine learning (ML) in pharmaceutical research and development has to date focused on research: target identification; docking-, fragment-, and motif-based generation of compound libraries; modeling of synthesis feasibility; rank-ordering likely hits according to structural and chemometric similarity to compounds having known activity and affinity to the target(s); optimizing a smaller library for synthesis and high-throughput screening; and combining evidence from screening to support hit-to-lead decisions. Applying AI/ML methods to lead optimization and lead-to-candidate (L2C) decision-making has shown slower progress, especially regarding predicting absorption, distribution, metabolism, excretion, and toxicology properties. The present review surveys reasons why this is so, reports progress that has occurred in recent years, and summarizes some of the issues that remain. Effective AI/ML tools to derisk L2C and later phases of development are important to accelerate the pharmaceutical development process, ameliorate escalating development costs, and achieve greater success rates.
Collapse
Affiliation(s)
- Douglas McNair
- Global Health, Integrated Development, Bill & Melinda Gates Foundation, Seattle, Washington, USA;
| |
Collapse
|
8
|
Uner OC, Kuru HI, Cinbis RG, Tastan O, Cicek AE. DeepSide: A Deep Learning Approach for Drug Side Effect Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:330-339. [PMID: 34995191 DOI: 10.1109/tcbb.2022.3141103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Drug failures due to unforeseen adverse effects at clinical trials pose health risks for the participants and lead to substantial financial losses. Side effect prediction algorithms have the potential to guide the drug design process. LINCS L1000 dataset provides a vast resource of cell line gene expression data perturbed by different drugs and creates a knowledge base for context specific features. The state-of-the-art approach that aims at using context specific information relies on only the high-quality experiments in LINCS L1000 and discards a large portion of the experiments. In this study, our goal is to boost the prediction performance by utilizing this data to its full extent. We experiment with 5 deep learning architectures. We find that a multi-modal architecture produces the best predictive performance among multi-layer perceptron-based architectures when drug chemical structure (CS), and the full set of drug perturbed gene expression profiles (GEX) are used as modalities. Overall, we observe that the CS is more informative than the GEX. A convolutional neural network-based model that uses only SMILES string representation of the drugs achieves the best results and provides 13.0% macro-AUC and 3.1% micro-AUC improvements over the state-of-the-art. We also show that the model is able to predict side effect-drug pairs that are reported in the literature but was missing in the ground truth side effect dataset. DeepSide is available at http://github.com/OnurUner/DeepSide.
Collapse
|
9
|
Drug Treatment Effect Model Based on MODWT and Hawkes Self-Exciting Point Process. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:4038290. [PMID: 36277000 PMCID: PMC9586769 DOI: 10.1155/2022/4038290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 08/23/2022] [Accepted: 09/27/2022] [Indexed: 12/03/2022]
Abstract
In precision medicine, especially in the pharmacodynamic area, the lack of an adequate long-term drug effect monitoring model leads to a quite low robustness to the instant drug treatment. Modelling the effect of drug based on the monitoring variables is essential to measure the drug benefit and its side effect preciously. In order to model the complex drug behavior in the context of time series, a sin function is selected to describe the basic trend of heart rate variable that is medically monitored. A Hawkes self-exciting point process model is chosen to describe the effect caused by multiple and sequential drug usage at different time points. The model considers the time lag between the drug given time and the drug effect during the whole drug emission period. A cumulative Gamma distribution is employed to describe the time lag effect. Simulation results demonstrate the established model effectively when describing the baseline trend and the drug effect with low noise levels, where the maximal overlap discrete wavelet transformation is utilized for the information decomposition in the frequency zone. The real data of the variables heart rate and drug liquemin from a medical database is analyzed. Instead of the original time series, scale variable s4 is selected according to the Granger cointegration test. The results show that the model accurately characterizes the cumulative drug effect with the Pearson correlation test value as 0.22, which is more significant for the value under 0.1. In the future, the model can be extended to more complicated scenarios through taking into account multiple monitoring variables and different kinds of drugs.
Collapse
|
10
|
Decoding kinase-adverse event associations for small molecule kinase inhibitors. Nat Commun 2022; 13:4349. [PMID: 35896580 PMCID: PMC9329312 DOI: 10.1038/s41467-022-32033-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 07/14/2022] [Indexed: 11/08/2022] Open
Abstract
Small molecule kinase inhibitors (SMKIs) are being approved at a fast pace under expedited programs for anticancer treatment. In this study, we construct a multi-domain dataset from a total of 4638 patients in the registrational trials of 16 FDA-approved SMKIs and employ a machine-learning model to examine the relationships between kinase targets and adverse events (AEs). Internal and external (datasets from two independent SMKIs) validations have been conducted to verify the usefulness of the established model. We systematically evaluate the potential associations between 442 kinases with 2145 AEs and made publicly accessible an interactive web application “Identification of Kinase-Specific Signal” (https://gongj.shinyapps.io/ml4ki). The developed model (1) provides a platform for experimentalists to identify and verify undiscovered KI-AE pairs, (2) serves as a precision-medicine tool to mitigate individual patient safety risks by forecasting clinical safety signals and (3) can function as a modern drug development tool to screen and compare SMKI target therapies from the safety perspective. Small molecule kinase inhibitors (SMKIs) are being approved at a fast pace under expedited programs for anticancer treatment. Here, the authors employ a machine-learning model to examine the relationships between kinase targets and adverse events in the trials of 16 FDA-approved SMKIs.
Collapse
|
11
|
Yao J, Sun W, Jian Z, Wu Q, Wang X. Effective knowledge graph embeddings based on multidirectional semantics relations for polypharmacy side effects prediction. Bioinformatics 2022; 38:2315-2322. [PMID: 35176135 DOI: 10.1093/bioinformatics/btac094] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Revised: 01/25/2022] [Accepted: 02/15/2022] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Polypharmacy is the combined use of drugs for the treatment of diseases. However, it often shows a high risk of side effects. Due to unnecessary interactions of combined drugs, the side effects of polypharmacy increase the risk of disease and even lead to death. Thus, obtaining abundant and comprehensive information on the side effects of polypharmacy is a vital task in the healthcare industry. Early traditional methods used machine learning techniques to predict side effects. However, they often make costly efforts to extract features of drugs for prediction. Later, several methods based on knowledge graphs are proposed. They are reported to outperform traditional methods. However, they still show limited performance by failing to model complex relations of side effects among drugs. RESULTS To resolve the above problems, we propose a novel model by further incorporating complex relations of side effects into knowledge graph embeddings. Our model can translate and transmit multidirectional semantics with fewer parameters, leading to better scalability in large-scale knowledge graphs. Experimental evaluation shows that our model outperforms state-of-the-art models in terms of the average area under the ROC and precision-recall curves. AVAILABILITY AND IMPLEMENTATION Code and data are available at: https://github.com/galaxysunwen/MSTE-master.
Collapse
Affiliation(s)
- Junfeng Yao
- School of Informatics, Xiamen University, Xiamen, Fujian 361005, China.,Institute of Artificial Intelligence, Xiamen University, Xiamen, Fujian 361005, China.,Key Laboratory of Digital Protection and Intelligent Processing of Intangible Cultural Heritage of Fujian and Taiwan Ministry of Culture and Tourism, Xiamen University, Xiamen, Fujian 361005, China
| | - Wen Sun
- School of Informatics, Xiamen University, Xiamen, Fujian 361005, China
| | - Zhongquan Jian
- School of Informatics, Xiamen University, Xiamen, Fujian 361005, China.,Institute of Artificial Intelligence, Xiamen University, Xiamen, Fujian 361005, China
| | - Qingqiang Wu
- School of Informatics, Xiamen University, Xiamen, Fujian 361005, China.,Institute of Artificial Intelligence, Xiamen University, Xiamen, Fujian 361005, China.,Key Laboratory of Digital Protection and Intelligent Processing of Intangible Cultural Heritage of Fujian and Taiwan Ministry of Culture and Tourism, Xiamen University, Xiamen, Fujian 361005, China
| | - Xiaoli Wang
- School of Informatics, Xiamen University, Xiamen, Fujian 361005, China
| |
Collapse
|
12
|
Kingsmore KM, Puglisi CE, Grammer AC, Lipsky PE. An introduction to machine learning and analysis of its use in rheumatic diseases. Nat Rev Rheumatol 2021; 17:710-730. [PMID: 34728818 DOI: 10.1038/s41584-021-00708-w] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/04/2021] [Indexed: 02/07/2023]
Abstract
Machine learning (ML) is a computerized analytical technique that is being increasingly employed in biomedicine. ML often provides an advantage over explicitly programmed strategies in the analysis of multidimensional information by recognizing relationships in the data that were not previously appreciated. As such, the use of ML in rheumatology is increasing, and numerous studies have employed ML to classify patients with rheumatic autoimmune inflammatory diseases (RAIDs) from medical records and imaging, biometric or gene expression data. However, these studies are limited by sample size, the accuracy of sample labelling, and absence of datasets for external validation. In addition, there is potential for ML models to overfit or underfit the data and, thereby, these models might produce results that cannot be replicated in an unrelated dataset. In this Review, we introduce the basic principles of ML and discuss its current strengths and weaknesses in the classification of patients with RAIDs. Moreover, we highlight the successful analysis of the same type of input data (for example, medical records) with different algorithms, illustrating the potential plasticity of this analytical approach. Altogether, a better understanding of ML and the future application of advanced analytical techniques based on this approach, coupled with the increasing availability of biomedical data, may facilitate the development of meaningful precision medicine for patients with RAIDs.
Collapse
Affiliation(s)
| | | | - Amrie C Grammer
- AMPEL BioSolutions and RILITE Research Institute, Charlottesville, VA, USA
| | - Peter E Lipsky
- AMPEL BioSolutions and RILITE Research Institute, Charlottesville, VA, USA
| |
Collapse
|
13
|
Abstract
AbstractNowadays, a vast amount of clinical data scattered across different sites on the Internet hinders users from finding helpful information for their well-being improvement. Besides, the overload of medical information (e.g., on drugs, medical tests, and treatment suggestions) have brought many difficulties to medical professionals in making patient-oriented decisions. These issues raise the need to apply recommender systems in the healthcare domain to help both, end-users and medical professionals, make more efficient and accurate health-related decisions. In this article, we provide a systematic overview of existing research on healthcare recommender systems. Different from existing related overview papers, our article provides insights into recommendation scenarios and recommendation approaches. Examples thereof are food recommendation, drug recommendation, health status prediction, healthcare service recommendation, and healthcare professional recommendation. Additionally, we develop working examples to give a deep understanding of recommendation algorithms. Finally, we discuss challenges concerning the development of healthcare recommender systems in the future.
Collapse
|
14
|
Bresso E, Monnin P, Bousquet C, Calvier FE, Ndiaye NC, Petitpain N, Smaïl-Tabbone M, Coulet A. Investigating ADR mechanisms with Explainable AI: a feasibility study with knowledge graph mining. BMC Med Inform Decis Mak 2021; 21:171. [PMID: 34039343 PMCID: PMC8157660 DOI: 10.1186/s12911-021-01518-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 05/05/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Adverse drug reactions (ADRs) are statistically characterized within randomized clinical trials and postmarketing pharmacovigilance, but their molecular mechanism remains unknown in most cases. This is true even for hepatic or skin toxicities, which are classically monitored during drug design. Aside from clinical trials, many elements of knowledge about drug ingredients are available in open-access knowledge graphs, such as their properties, interactions, or involvements in pathways. In addition, drug classifications that label drugs as either causative or not for several ADRs, have been established. METHODS We propose in this paper to mine knowledge graphs for identifying biomolecular features that may enable automatically reproducing expert classifications that distinguish drugs causative or not for a given type of ADR. In an Explainable AI perspective, we explore simple classification techniques such as Decision Trees and Classification Rules because they provide human-readable models, which explain the classification itself, but may also provide elements of explanation for molecular mechanisms behind ADRs. In summary, (1) we mine a knowledge graph for features; (2) we train classifiers at distinguishing, on the basis of extracted features, drugs associated or not with two commonly monitored ADRs: drug-induced liver injuries (DILI) and severe cutaneous adverse reactions (SCAR); (3) we isolate features that are both efficient in reproducing expert classifications and interpretable by experts (i.e., Gene Ontology terms, drug targets, or pathway names); and (4) we manually evaluate in a mini-study how they may be explanatory. RESULTS Extracted features reproduce with a good fidelity classifications of drugs causative or not for DILI and SCAR (Accuracy = 0.74 and 0.81, respectively). Experts fully agreed that 73% and 38% of the most discriminative features are possibly explanatory for DILI and SCAR, respectively; and partially agreed (2/3) for 90% and 77% of them. CONCLUSION Knowledge graphs provide sufficiently diverse features to enable simple and explainable models to distinguish between drugs that are causative or not for ADRs. In addition to explaining classifications, most discriminative features appear to be good candidates for investigating ADR mechanisms further.
Collapse
Affiliation(s)
- Emmanuel Bresso
- Université de Lorraine, CNRS, Inria, LORIA, Nancy, France
- Centre d’Investigations Cliniques Plurithématique 1433, Inserm 1116, CHRU de Nancy, Université de Lorraine, Nancy, France
| | - Pierre Monnin
- Université de Lorraine, CNRS, Inria, LORIA, Nancy, France
- Orange, Belfort, France
| | - Cédric Bousquet
- Service de santé publique et information médicale, CHU de Saint Etienne, Saint Etienne, France
- Sorbonne Université, Inserm, Université Paris 13, LIMICS, Paris, France
| | - François-Elie Calvier
- Service de santé publique et information médicale, CHU de Saint Etienne, Saint Etienne, France
| | | | - Nadine Petitpain
- Centre Régional de Pharmacovigilance, CHRU of Nancy, Nancy, France
| | | | - Adrien Coulet
- Université de Lorraine, CNRS, Inria, LORIA, Nancy, France
- Inria Paris, Paris, France
- Centre de Recherche des Cordeliers, INSERM, Sorbonne Université, Université de Paris, Paris, France
| |
Collapse
|
15
|
Tran TNT, Felfernig A, Trattner C, Holzinger A. Recommender systems in the healthcare domain: state-of-the-art and research issues. J Intell Inf Syst 2020. [DOI: 10.1007/s10844-020-00633-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
AbstractNowadays, a vast amount of clinical data scattered across different sites on the Internet hinders users from finding helpful information for their well-being improvement. Besides, the overload of medical information (e.g., on drugs, medical tests, and treatment suggestions) have brought many difficulties to medical professionals in making patient-oriented decisions. These issues raise the need to apply recommender systems in the healthcare domain to help both, end-users and medical professionals, make more efficient and accurate health-related decisions. In this article, we provide a systematic overview of existing research on healthcare recommender systems. Different from existing related overview papers, our article provides insights into recommendation scenarios and recommendation approaches. Examples thereof are food recommendation, drug recommendation, health status prediction, healthcare service recommendation, and healthcare professional recommendation. Additionally, we develop working examples to give a deep understanding of recommendation algorithms. Finally, we discuss challenges concerning the development of healthcare recommender systems in the future.
Collapse
|
16
|
Galeano D, Li S, Gerstein M, Paccanaro A. Predicting the frequencies of drug side effects. Nat Commun 2020; 11:4575. [PMID: 32917868 PMCID: PMC7486409 DOI: 10.1038/s41467-020-18305-y] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Accepted: 08/07/2020] [Indexed: 12/25/2022] Open
Abstract
A central issue in drug risk-benefit assessment is identifying frequencies of side effects in humans. Currently, frequencies are experimentally determined in randomised controlled clinical trials. We present a machine learning framework for computationally predicting frequencies of drug side effects. Our matrix decomposition algorithm learns latent signatures of drugs and side effects that are both reproducible and biologically interpretable. We show the usefulness of our approach on 759 structurally and therapeutically diverse drugs and 994 side effects from all human physiological systems. Our approach can be applied to any drug for which a small number of side effect frequencies have been identified, in order to predict the frequencies of further, yet unidentified, side effects. We show that our model is informative of the biology underlying drug activity: individual components of the drug signatures are related to the distinct anatomical categories of the drugs and to the specific drug routes of administration. Currently, the frequencies of drug side effects are determined in randomised controlled clinical trials. Here the authors develop an interpretable machine learning approach to predict the frequencies of unknown side effects for drugs with a small number of determined side effect frequencies.
Collapse
Affiliation(s)
- Diego Galeano
- Department of Computer Science, Centre for Systems and Synthetic Biology, Royal Holloway, University of London, Egham Hill, Egham, UK.,School of Applied Mathematics, Fundação Getulio Vargas, Rio de Janeiro, Brazil
| | - Shantao Li
- Department of Computer Science and Department of Biomedical Data Sciences, Stanford University, Stanford, CA, USA
| | - Mark Gerstein
- Department of Molecular Biophysics and Biochemistry, Department of Computer Science, and Department of Statistics and Data Science, Yale University, New Haven, CT, 06520, USA
| | - Alberto Paccanaro
- Department of Computer Science, Centre for Systems and Synthetic Biology, Royal Holloway, University of London, Egham Hill, Egham, UK. .,School of Applied Mathematics, Fundação Getulio Vargas, Rio de Janeiro, Brazil.
| |
Collapse
|
17
|
Deng S, Sun Y, Zhao T, Hu Y, Zang T. A Review of Drug Side Effect Identification Methods. Curr Pharm Des 2020; 26:3096-3104. [PMID: 32532187 DOI: 10.2174/1381612826666200612163819] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Accepted: 05/18/2020] [Indexed: 11/22/2022]
Abstract
Drug side effects have become an important indicator for evaluating the safety of drugs. There are two main factors in the frequent occurrence of drug safety problems; on the one hand, the clinical understanding of drug side effects is insufficient, leading to frequent adverse drug reactions, while on the other hand, due to the long-term period and complexity of clinical trials, side effects of approved drugs on the market cannot be reported in a timely manner. Therefore, many researchers have focused on developing methods to identify drug side effects. In this review, we summarize the methods of identifying drug side effects and common databases in this field. We classified methods of identifying side effects into four categories: biological experimental, machine learning, text mining and network methods. We point out the key points of each kind of method. In addition, we also explain the advantages and disadvantages of each method. Finally, we propose future research directions.
Collapse
Affiliation(s)
- Shuai Deng
- College of Science, Beijing Forestry University, Beijing, China
| | - Yige Sun
- Microbiology Department, Harbin Medical University, Harbin, 150081, China
| | - Tianyi Zhao
- School of Life Science and Technology, Department of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Yang Hu
- School of Life Science and Technology, Department of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Tianyi Zang
- School of Life Science and Technology, Department of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| |
Collapse
|
18
|
Jiang H, Qiu Y, Hou W, Cheng X, Yim MY, Ching WK. Drug Side-Effect Profiles Prediction: From Empirical to Structural Risk Minimization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:402-410. [PMID: 29994681 DOI: 10.1109/tcbb.2018.2850884] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The identification of drug side-effects is considered to be an important step in drug design, which could not only shorten the time but also reduce the cost of drug development. In this paper, we investigate the relationship between the potential side-effects of drug candidates and their chemical structures. The preliminary Regularized Regression (RR) model for drug side-effects prediction has promising features in the efficiency of model training and the existence of a closed form solution. It performs better than other state-of-the-art methods, in terms of minimum accuracy and average accuracy. In order to dig inside how drug structure will associate with side effect, we further propose weighted GTS (Generalized T-Student Kernel: WGTS) SVM model from a structural risk minimization perspective. The SVM model proposed in this paper provides a better understanding of drug side-effects in the process of drug development. The usefulness of the WGTS model lies in the superior performance in a cross validation setting on 888 approved drugs with 1385 side-effects profiling from SIDER database. This work is expected to shed light on intriguing studies that predict potential un-identifying side-effects and suggest how we can avoid drug side-effects by the removal of some distinguished chemical structures.
Collapse
|
19
|
Drug repurposing to improve treatment of rheumatic autoimmune inflammatory diseases. Nat Rev Rheumatol 2019; 16:32-52. [PMID: 31831878 DOI: 10.1038/s41584-019-0337-0] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/01/2019] [Indexed: 02/08/2023]
Abstract
The past century has been characterized by intensive efforts, within both academia and the pharmaceutical industry, to introduce new treatments to individuals with rheumatic autoimmune inflammatory diseases (RAIDs), often by 'borrowing' treatments already employed in one RAID or previously used in an entirely different disease, a concept known as drug repurposing. However, despite sharing some clinical manifestations and immune dysregulation, disease pathogenesis and phenotype vary greatly among RAIDs, and limited understanding of their aetiology has made repurposing drugs for RAIDs challenging. Nevertheless, the past century has been characterized by different 'waves' of repurposing. Early drug repurposing occurred in academia and was based on serendipitous observations or perceived disease similarity, often driven by the availability and popularity of drug classes. Since the 1990s, most biologic therapies have been developed for one or several RAIDs and then tested among the others, with varying levels of success. The past two decades have seen data-driven repurposing characterized by signature-based approaches that rely on molecular biology and genomics. Additionally, many data-driven strategies employ computational modelling and machine learning to integrate multiple sources of data. Together, these repurposing periods have led to advances in the treatment for many RAIDs.
Collapse
|
20
|
Poleksic A, Xie L. Predicting serious rare adverse reactions of novel chemicals. Bioinformatics 2019; 34:2835-2842. [PMID: 29617731 PMCID: PMC6084596 DOI: 10.1093/bioinformatics/bty193] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2017] [Accepted: 03/28/2018] [Indexed: 02/02/2023] Open
Abstract
Motivation Adverse drug reactions (ADRs) are one of the main causes of death and a major financial burden on the world’s economy. Due to the limitations of the animal model, computational prediction of serious and rare ADRs is invaluable. However, current state-of-the-art computational methods do not yield significantly better predictions of rare ADRs than random guessing. Results We present a novel method, based on the theory of ‘compressed sensing’ (CS), which can accurately predict serious side-effects of candidate and market drugs. Not only is our method able to infer new chemical-ADR associations using existing noisy, biased and incomplete databases, but our data also demonstrate that the accuracy of CS in predicting a serious ADR for a candidate drug increases with increasing knowledge of other ADRs associated with the drug. In practice, this means that as the candidate drug moves up the different stages of clinical trials, the prediction accuracy of our method will increase accordingly. Availability and implementation The program is available at https://github.com/poleksic/side-effects. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Aleksandar Poleksic
- Department of Computer Science, University of Northern Iowa, Cedar Falls, IA, USA
| | - Lei Xie
- Department of Computer Science, Hunter College, The Graduate Center, The City University of New York, New York, NY, USA.,Ph.D. Program in Computer Science, Biochemistry and Biology, The Graduate Center, The City University of New York, New York, NY, USA
| |
Collapse
|
21
|
Zhang W, Liu X, Chen Y, Wu W, Wang W, Li X. Feature-derived graph regularized matrix factorization for predicting drug side effects. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2018.01.085] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
22
|
Zhu Y, Elemento O, Pathak J, Wang F. Drug knowledge bases and their applications in biomedical informatics research. Brief Bioinform 2018; 20:1308-1321. [DOI: 10.1093/bib/bbx169] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Revised: 11/15/2017] [Indexed: 11/14/2022] Open
Abstract
Abstract
Recent advances in biomedical research have generated a large volume of drug-related data. To effectively handle this flood of data, many initiatives have been taken to help researchers make good use of them. As the results of these initiatives, many drug knowledge bases have been constructed. They range from simple ones with specific focuses to comprehensive ones that contain information on almost every aspect of a drug. These curated drug knowledge bases have made significant contributions to the development of efficient and effective health information technologies for better health-care service delivery. Understanding and comparing existing drug knowledge bases and how they are applied in various biomedical studies will help us recognize the state of the art and design better knowledge bases in the future. In addition, researchers can get insights on novel applications of the drug knowledge bases through a review of successful use cases. In this study, we provide a review of existing popular drug knowledge bases and their applications in drug-related studies. We discuss challenges in constructing and using drug knowledge bases as well as future research directions toward a better ecosystem of drug knowledge bases.
Collapse
|
23
|
Bean DM, Wu H, Iqbal E, Dzahini O, Ibrahim ZM, Broadbent M, Stewart R, Dobson RJB. Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records. Sci Rep 2017; 7:16416. [PMID: 29180758 PMCID: PMC5703951 DOI: 10.1038/s41598-017-16674-x] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Accepted: 11/16/2017] [Indexed: 01/31/2023] Open
Abstract
Unknown adverse reactions to drugs available on the market present a significant health risk and limit accurate judgement of the cost/benefit trade-off for medications. Machine learning has the potential to predict unknown adverse reactions from current knowledge. We constructed a knowledge graph containing four types of node: drugs, protein targets, indications and adverse reactions. Using this graph, we developed a machine learning algorithm based on a simple enrichment test and first demonstrated this method performs extremely well at classifying known causes of adverse reactions (AUC 0.92). A cross validation scheme in which 10% of drug-adverse reaction edges were systematically deleted per fold showed that the method correctly predicts 68% of the deleted edges on average. Next, a subset of adverse reactions that could be reliably detected in anonymised electronic health records from South London and Maudsley NHS Foundation Trust were used to validate predictions from the model that are not currently known in public databases. High-confidence predictions were validated in electronic records significantly more frequently than random models, and outperformed standard methods (logistic regression, decision trees and support vector machines). This approach has the potential to improve patient safety by predicting adverse reactions that were not observed during randomised trials.
Collapse
Affiliation(s)
- Daniel M Bean
- Department of Biostatistics and Health Informatics, Institute of Psychiatry Psychology and Neuroscience, King's College London, 16 De Crespigny Park, London, SE5 8AF, United Kingdom
| | - Honghan Wu
- Department of Biostatistics and Health Informatics, Institute of Psychiatry Psychology and Neuroscience, King's College London, 16 De Crespigny Park, London, SE5 8AF, United Kingdom
| | - Ehtesham Iqbal
- Department of Biostatistics and Health Informatics, Institute of Psychiatry Psychology and Neuroscience, King's College London, 16 De Crespigny Park, London, SE5 8AF, United Kingdom
| | - Olubanke Dzahini
- South London and Maudsley NHS Foundation Trust, Denmark Hill, London, SE5 8AZ, United Kingdom
- Institute of Pharmaceutical Science, King's College, London, 5th Floor, Franklin-Wilkins Building, 150 Stamford Street, London, SE1 9NH, United Kingdom
| | - Zina M Ibrahim
- Department of Biostatistics and Health Informatics, Institute of Psychiatry Psychology and Neuroscience, King's College London, 16 De Crespigny Park, London, SE5 8AF, United Kingdom
- Farr Institute of Health Informatics Research, UCL Institute of Health Informatics, University College London, London, WC1E 6BT, United Kingdom
| | - Matthew Broadbent
- South London and Maudsley NHS Foundation Trust, Denmark Hill, London, SE5 8AZ, United Kingdom
| | - Robert Stewart
- South London and Maudsley NHS Foundation Trust, Denmark Hill, London, SE5 8AZ, United Kingdom
- Institute of Psychiatry, Psychology and Neuroscience, King's College London, 16 De Crespigny Park, London, SE5 8AF, United Kingdom
| | - Richard J B Dobson
- Department of Biostatistics and Health Informatics, Institute of Psychiatry Psychology and Neuroscience, King's College London, 16 De Crespigny Park, London, SE5 8AF, United Kingdom.
- Farr Institute of Health Informatics Research, UCL Institute of Health Informatics, University College London, London, WC1E 6BT, United Kingdom.
| |
Collapse
|
24
|
Muñoz E, Nováček V, Vandenbussche PY. Facilitating prediction of adverse drug reactions by using knowledge graphs and multi-label learning models. Brief Bioinform 2017; 20:190-202. [DOI: 10.1093/bib/bbx099] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Indexed: 01/02/2023] Open
Affiliation(s)
- Emir Muñoz
- Fujitsu Ireland Ltd., Co. Dublin, Ireland
- Insight Centre for Data Analytics, NUI Galway, Co. Galway, Ireland
| | - Vít Nováček
- Insight Centre for Data Analytics, NUI Galway, Co. Galway, Ireland
| | | |
Collapse
|
25
|
Manczinger M, Bodnár VÁ, Papp BT, Bolla SB, Szabó K, Balázs B, Csányi E, Szél E, Erős G, Kemény L. Drug Repurposing by Simulating Flow Through Protein-Protein Interaction Networks. Clin Pharmacol Ther 2017. [PMID: 28643328 PMCID: PMC5836852 DOI: 10.1002/cpt.769] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
As drug development is extremely expensive, the identification of novel indications for in‐market drugs is financially attractive. Multiple algorithms are used to support such drug repurposing, but highly reliable methods combining simulation of intracellular networks and machine learning are currently not available. We developed an algorithm that simulates drug effects on the flow of information through protein–protein interaction networks, and used support vector machine to identify potentially effective drugs in our model disease, psoriasis. Using this method, we screened about 1,500 marketed and investigational substances, identified 51 drugs that were potentially effective, and selected three of them for experimental confirmation. All drugs inhibited tumor necrosis factor alpha‐induced nuclear factor kappa B activity in vitro, suggesting they might be effective for treating psoriasis in humans. Additionally, these drugs significantly inhibited imiquimod‐induced ear thickening and inflammation in the mouse model of the disease. All results suggest high prediction performance for the algorithm.
Collapse
Affiliation(s)
- M Manczinger
- Department of Dermatology and Allergology, University of Szeged, Hungary.,MTA-SZTE Dermatological Research Group, University of Szeged, Hungary
| | - V Á Bodnár
- Department of Dermatology and Allergology, University of Szeged, Hungary
| | - B T Papp
- Department of Dermatology and Allergology, University of Szeged, Hungary.,Szeged Scientists Academy, Hungary
| | - S B Bolla
- Department of Dermatology and Allergology, University of Szeged, Hungary
| | - K Szabó
- Department of Dermatology and Allergology, University of Szeged, Hungary.,MTA-SZTE Dermatological Research Group, University of Szeged, Hungary
| | - B Balázs
- Department of Pharmaceutical Technology, University of Szeged, Hungary
| | - E Csányi
- Department of Pharmaceutical Technology, University of Szeged, Hungary
| | - E Szél
- Department of Dermatology and Allergology, University of Szeged, Hungary
| | - G Erős
- Department of Dermatology and Allergology, University of Szeged, Hungary
| | - L Kemény
- Department of Dermatology and Allergology, University of Szeged, Hungary.,MTA-SZTE Dermatological Research Group, University of Szeged, Hungary
| |
Collapse
|
26
|
Quantitative prediction of drug side effects based on drug-related features. Interdiscip Sci 2017; 9:434-444. [DOI: 10.1007/s12539-017-0236-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Revised: 04/29/2017] [Accepted: 05/03/2017] [Indexed: 01/07/2023]
|
27
|
Chen X, Shi H, Yang F, Yang L, Lv Y, Wang S, Dai E, Sun D, Jiang W. Large-scale identification of adverse drug reaction-related proteins through a random walk model. Sci Rep 2016; 6:36325. [PMID: 27805066 PMCID: PMC5090865 DOI: 10.1038/srep36325] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Accepted: 10/13/2016] [Indexed: 12/19/2022] Open
Abstract
Adverse drug reactions (ADRs) are responsible for drug failure in clinical trials and affect life quality of patients. The identification of ADRs during the early phases of drug development is an important task. Therefore, predicting potential protein targets eliciting ADRs is essential for understanding the pathogenesis of ADRs. In this study, we proposed a computational algorithm,Integrated Network for Protein-ADR relations (INPADR), to infer potential protein-ADR relations based on an integrated network. First, the integrated network was constructed by connecting the protein-protein interaction network and the ADR similarity network using known protein-ADR relations. Then, candidate protein-ADR relations were further prioritized by performing a random walk with restart on this integrated network. Leave-one-out cross validation was used to evaluate the ability of the INPADR. An AUC of 0.8486 was obtained, which was a significant improvement compared to previous methods. We also applied the INPADR to two ADRs to evaluate its accuracy. The results suggested that the INPADR is capable of finding novel protein-ADR relations. This study provides new insight to our understanding of ADRs. The predicted ADR-related proteins will provide a reference for preclinical safety pharmacology studies and facilitate the identification of ADRs during the early phases of drug development.
Collapse
Affiliation(s)
- Xiaowen Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Hongbo Shi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Feng Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yingli Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Shuyuan Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Enyu Dai
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Dianjun Sun
- Center for Endemic Disease Control, Chinese Center for Disease Control and Prevention, Harbin Medical University, Harbin, 150081, China
| | - Wei Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| |
Collapse
|
28
|
|
29
|
Mining Chemical Activity Status from High-Throughput Screening Assays. PLoS One 2015; 10:e0144426. [PMID: 26658480 PMCID: PMC4682830 DOI: 10.1371/journal.pone.0144426] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Accepted: 11/18/2015] [Indexed: 01/20/2023] Open
Abstract
High-throughput screening (HTS) experiments provide a valuable resource that reports biological activity of numerous chemical compounds relative to their molecular targets. Building computational models that accurately predict such activity status (active vs. inactive) in specific assays is a challenging task given the large volume of data and frequently small proportion of active compounds relative to the inactive ones. We developed a method, DRAMOTE, to predict activity status of chemical compounds in HTP activity assays. For a class of HTP assays, our method achieves considerably better results than the current state-of-the-art-solutions. We achieved this by modification of a minority oversampling technique. To demonstrate that DRAMOTE is performing better than the other methods, we performed a comprehensive comparison analysis with several other methods and evaluated them on data from 11 PubChem assays through 1,350 experiments that involved approximately 500,000 interactions between chemicals and their target proteins. As an example of potential use, we applied DRAMOTE to develop robust models for predicting FDA approved drugs that have high probability to interact with the thyroid stimulating hormone receptor (TSHR) in humans. Our findings are further partially and indirectly supported by 3D docking results and literature information. The results based on approximately 500,000 interactions suggest that DRAMOTE has performed the best and that it can be used for developing robust virtual screening models. The datasets and implementation of all solutions are available as a MATLAB toolbox online at www.cbrc.kaust.edu.sa/dramote and can be found on Figshare.
Collapse
|
30
|
Zhang W, Liu F, Luo L, Zhang J. Predicting drug side effects by multi-label learning and ensemble learning. BMC Bioinformatics 2015; 16:365. [PMID: 26537615 PMCID: PMC4634905 DOI: 10.1186/s12859-015-0774-y] [Citation(s) in RCA: 100] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2015] [Accepted: 10/14/2015] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Predicting drug side effects is an important topic in the drug discovery. Although several machine learning methods have been proposed to predict side effects, there is still space for improvements. Firstly, the side effect prediction is a multi-label learning task, and we can adopt the multi-label learning techniques for it. Secondly, drug-related features are associated with side effects, and feature dimensions have specific biological meanings. Recognizing critical dimensions and reducing irrelevant dimensions may help to reveal the causes of side effects. METHODS In this paper, we propose a novel method 'feature selection-based multi-label k-nearest neighbor method' (FS-MLKNN), which can simultaneously determine critical feature dimensions and construct high-accuracy multi-label prediction models. RESULTS Computational experiments demonstrate that FS-MLKNN leads to good performances as well as explainable results. To achieve better performances, we further develop the ensemble learning model by integrating individual feature-based FS-MLKNN models. When compared with other state-of-the-art methods, the ensemble method produces better performances on benchmark datasets. CONCLUSIONS In conclusion, FS-MLKNN and the ensemble method are promising tools for the side effect prediction. The source code and datasets are available in the Additional file 1.
Collapse
Affiliation(s)
- Wen Zhang
- School of Computer, Wuhan University, Wuhan, 430072, China. .,Research Institute of Shenzhen, Wuhan University, Shenzhen, 518057, China.
| | - Feng Liu
- International School of software, Wuhan University, Wuhan, 430072, China.
| | - Longqiang Luo
- School of Mathematics and Statistics, Wuhan University, Wuhan, 430072, China.
| | - Jingxia Zhang
- School of Mathematics and Statistics, Wuhan University, Wuhan, 430072, China.
| |
Collapse
|
31
|
Pérez-Nueno VI, Souchet M, Karaboga AS, Ritchie DW. GESSE: Predicting Drug Side Effects from Drug–Target Relationships. J Chem Inf Model 2015; 55:1804-23. [DOI: 10.1021/acs.jcim.5b00120] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Violeta I. Pérez-Nueno
- Harmonic Pharma, Espace Transfert, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| | - Michel Souchet
- Harmonic Pharma, Espace Transfert, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| | - Arnaud S. Karaboga
- Harmonic Pharma, Espace Transfert, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| | - David W. Ritchie
- INRIA Nancy − Grand Est, Equipe Capsid, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| |
Collapse
|
32
|
In silico assessment of adverse drug reactions and associated mechanisms. Drug Discov Today 2015; 21:58-71. [PMID: 26272036 DOI: 10.1016/j.drudis.2015.07.018] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2015] [Revised: 07/15/2015] [Accepted: 07/31/2015] [Indexed: 12/31/2022]
Abstract
During recent years, various in silico approaches have been developed to estimate chemical and biological drug features, for example chemical fragments, protein targets, pathways, among others, that correlate with adverse drug reactions (ADRs) and explain the associated mechanisms. These features have also been used for the creation of predictive models that enable estimation of ADRs during the early stages of drug development. In this review, we discuss various in silico approaches to predict these features for a certain drug, estimate correlations with ADRs, establish causal relationships between selected features and ADR mechanisms and create corresponding predictive models.
Collapse
|
33
|
Baker NC, Fourches D, Tropsha A. Drug Side Effect Profiles as Molecular Descriptors for Predictive Modeling of Target Bioactivity. Mol Inform 2015; 34:160-70. [DOI: 10.1002/minf.201400134] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2014] [Accepted: 12/16/2014] [Indexed: 11/05/2022]
|