1
|
Vinh T, Nguyen L, Trinh QH, Nguyen-Vo TH, Nguyen BP. Predicting Cardiotoxicity of Molecules Using Attention-Based Graph Neural Networks. J Chem Inf Model 2024; 64:1816-1827. [PMID: 38438914 DOI: 10.1021/acs.jcim.3c01286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2024]
Abstract
In drug discovery, the search for new and effective medications is often hindered by concerns about toxicity. Numerous promising molecules fail to pass the later phases of drug development due to strict toxicity assessments. This challenge significantly increases the cost, time, and human effort needed to discover new therapeutic molecules. Additionally, a considerable number of drugs already on the market have been withdrawn or re-evaluated because of their unwanted side effects. Among the various types of toxicity, drug-induced heart damage is a severe adverse effect commonly associated with several medications, especially those used in cancer treatments. Although a number of computational approaches have been proposed to identify the cardiotoxicity of molecules, the performance and interpretability of the existing approaches are limited. In our study, we proposed a more effective computational framework to predict the cardiotoxicity of molecules using an attention-based graph neural network. Experimental results indicated that the proposed framework outperformed the other methods. The stability of the model was also confirmed by our experiments. To assist researchers in evaluating the cardiotoxicity of molecules, we have developed an easy-to-use online web server that incorporates our model.
Collapse
Affiliation(s)
- Tuan Vinh
- Department of Chemistry, Emory University, 201 Dowman Drive, Atlanta, Georgia 30322-1007, United States
| | - Loc Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6012, New Zealand
| | - Quang H Trinh
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Vietnam
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6012, New Zealand
- School of Innovation, Design and Technology, Wellington Institute of Technology, 21 Kensington Avenue, Lower Hutt 5012, New Zealand
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6012, New Zealand
| |
Collapse
|
2
|
Mbinta JF, Wang AX, Nguyen BP, Paynter J, Awuni PMA, Pine R, Sporle AA, Bowe S, Simpson CR. Herpes zoster vaccine safety in the Aotearoa New Zealand population: a self-controlled case series study. Nat Commun 2023; 14:4330. [PMID: 37468475 DOI: 10.1038/s41467-023-39595-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 06/19/2023] [Indexed: 07/21/2023] Open
Abstract
In Aotearoa New Zealand, zoster vaccine live is used for the prevention of zoster and associated complications in adults. This study assessed the risk of pre-specified serious adverse events following zoster vaccine live immunisation among adults in routine clinical practice. We conducted a self-controlled case series study using routinely collected national data. We compared the incidence of serious adverse events during the at-risk period with the control period. Rate ratios were estimated using Conditional Poisson regression models. Falsification outcomes analyses were used to evaluate biases in our study population. From April 2018 to July 2021, 278,375 received the vaccine. The rate ratio of serious adverse events following immunisation was 0·43 (95% confidence interval [CI]: 0·37-0·50). There was no significant increase in the risk of cerebrovascular accidents, acute myocardial infarction, acute pericarditis, acute myocarditis, and Ramsay-Hunt Syndrome. The herpes zoster vaccine is safe in adults in Aotearoa New Zealand.
Collapse
Affiliation(s)
- James F Mbinta
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand.
| | - Alex X Wang
- School of Mathematics and Statistics, Wellington Faculty of Engineering, Victoria University of Wellington, Wellington, New Zealand
| | - Binh P Nguyen
- School of Mathematics and Statistics, Wellington Faculty of Engineering, Victoria University of Wellington, Wellington, New Zealand
| | - Janine Paynter
- Department of General Practice & Primary Healthcare, University of Auckland, Auckland, New Zealand
| | | | - Russell Pine
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
| | - Andrew A Sporle
- iNZight Analytics Ltd., Department of Statistics, Faculty of Science, University of Auckland, Auckland, New Zealand
| | - Steve Bowe
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
| | - Colin R Simpson
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand.
- Usher Institute, The University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
3
|
Nguyen-Vo TH, Trinh QH, Nguyen L, Nguyen-Hoang PU, Rahardja S, Nguyen BP. i4mC-GRU: Identifying DNA N 4-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features. Comput Struct Biotechnol J 2023; 21:3045-3053. [PMID: 37273848 PMCID: PMC10238585 DOI: 10.1016/j.csbj.2023.05.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 05/12/2023] [Accepted: 05/12/2023] [Indexed: 06/06/2023] Open
Abstract
N4-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Since the 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propose an effective computational method called i4mC-GRU using a gated recurrent unit and duplet sequence-embedded features to predict potential 4mC sites in mouse (Mus musculus) genomes. To fairly assess the performance of the model, we compared our method with several state-of-the-art methods using two different benchmark datasets. Our results showed that i4mC-GRU achieved area under the receiver operating characteristic curve values of 0.97 and 0.89 and area under the precision-recall curve values of 0.98 and 0.90 on the first and second benchmark datasets, respectively. Briefly, our method outperformed existing methods in predicting 4mC sites in mouse genomes. Also, we deployed i4mC-GRU as an online web server, supporting users in genomics studies.
Collapse
Affiliation(s)
- Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
- School of Innovation, Design and Technology, Wellington Institute of Technology, Wellington 5012, New Zealand
| | - Quang H. Trinh
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Vietnam
| | - Loc Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| | - Phuong-Uyen Nguyen-Hoang
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China
- Infocomm Technology Cluster, Singapore Institute of Technology, Singapore 138683, Singapore
| | - Binh P. Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| |
Collapse
|
4
|
Nghiem N, Atkinson J, Nguyen BP, Tran-Duy A, Wilson N. Predicting high health-cost users among people with cardiovascular disease using machine learning and nationwide linked social administrative datasets. Health Econ Rev 2023; 13:9. [PMID: 36738348 PMCID: PMC9898915 DOI: 10.1186/s13561-023-00422-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 01/23/2023] [Indexed: 06/18/2023]
Abstract
OBJECTIVES To optimise planning of public health services, the impact of high-cost users needs to be considered. However, most of the existing statistical models for costs do not include many clinical and social variables from administrative data that are associated with elevated health care resource use, and are increasingly available. This study aimed to use machine learning approaches and big data to predict high-cost users among people with cardiovascular disease (CVD). METHODS We used nationally representative linked datasets in New Zealand to predict CVD prevalent cases with the most expensive cost belonging to the top quintiles by cost. We compared the performance of four popular machine learning models (L1-regularised logistic regression, classification trees, k-nearest neighbourhood (KNN) and random forest) with the traditional regression models. RESULTS The machine learning models had far better accuracy in predicting high health-cost users compared with the logistic models. The harmony score F1 (combining sensitivity and positive predictive value) of the machine learning models ranged from 30.6% to 41.2% (compared with 8.6-9.1% for the logistic models). Previous health costs, income, age, chronic health conditions, deprivation, and receiving a social security benefit were among the most important predictors of the CVD high-cost users. CONCLUSIONS This study provides additional evidence that machine learning can be used as a tool together with big data in health economics for identification of new risk factors and prediction of high-cost users with CVD. As such, machine learning may potentially assist with health services planning and preventive measures to improve population health while potentially saving healthcare costs.
Collapse
Affiliation(s)
- Nhung Nghiem
- Department of Public Health, University of Otago, Wellington, New Zealand.
| | - June Atkinson
- Department of Public Health, University of Otago, Wellington, New Zealand
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
| | - An Tran-Duy
- Centre for Health Policy, Melbourne School of Population and Global Health, University of Melbourne, Melbourne, Australia
| | - Nick Wilson
- Department of Public Health, University of Otago, Wellington, New Zealand
| |
Collapse
|
5
|
Wang AX, Chukova SS, Nguyen BP. Ensemble k-Nearest Neighbors based on Centroid Displacement. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.02.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
|
6
|
Mbinta JF, Wang AX, Nguyen BP, Paynter J, Awuni PMA, Pine R, Sporle AA, Simpson CR. Herpes zoster vaccine effectiveness against herpes zoster and postherpetic neuralgia in New Zealand: a retrospective cohort study. Lancet Reg Health West Pac 2023; 31:100601. [PMID: 36879782 PMCID: PMC9985042 DOI: 10.1016/j.lanwpc.2022.100601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
BACKGROUND Herpes zoster (HZ) and associated complications cause significant burden to older people. A HZ vaccination programme was introduced in Aotearoa New Zealand in April 2018 with a single dose vaccine for those aged 65 years and a four-year catch up for 66-80 year-olds. This study aimed to assess the 'real-world' effectiveness of the zoster vaccine live (ZVL) against HZ and postherpetic neuralgia (PHN). METHODS We conducted a nationwide retrospective matched cohort study from 1 April 2018 to 1 April 2021 using a linked de-identified patient level Ministry of Health data platform. A Cox proportional hazards model was used to estimate ZVL vaccine effectiveness (VE) against HZ and PHN adjusting for covariates. Multiple outcomes were assessed in the primary (hospitalised HZ and PHN - primary diagnosis) and secondary (hospitalised HZ and PHN: primary and secondary diagnosis, community HZ) analyses. A sub-group analysis was carried out in, adults ≥ 65 years old, immunocompromised adults, Māori, and Pacific populations. FINDINGS A total of 824,142 (274,272 vaccinated with ZVL matched with 549,870 unvaccinated) New Zealand residents were included in the study. The matched population was 93.4% immunocompetent, 52.2% female, 80.2% European (level 1 ethnic codes), and 64.5% were 65-74 years old (mean age = 71.1±5.0). Vaccinated versus unvaccinated incidence of hospitalised HZ was 0.16 vs. 0.31/1,000 person-years and 0.03 vs. 0.08/1000 person-years for PHN. In the primary analysis, the adjusted overall VE against hospitalised HZ and hospitalised PHN was 57.8% (95% CI: 41.1-69.8) and 73.7% (95% CI:14.0-92.0) respectively. In adults ≥ 65 years old, the VE against hospitalised HZ was 54.4% (95% CI: 36.0-67.5) and VE against hospitalised PHN was 75·5% (95% CI: 19.9-92.5). In the secondary analysis, the VE against community HZ was 30.0% (95% CI: 25.6-34.5). The ZVL VE against hospitalised HZ for immunocompromised adults was 51.1% (95% CI: 23.1-69.5), and PHN hospitalisation was 67.6% (95% CI: 9.3-88.4). The VE against HZ hospitalisation for Māori was 45.2% (95% CI: -23.2-75.6) and for Pacific Peoples was 52.2% (95% CI: -40.6 -83·7). INTERPRETATION ZVL was associated with a reduction in risk of hospitalisation from HZ and PHN in the New Zealand population. FUNDING Wellington Doctoral Scholarship awarded to JFM.
Collapse
Key Words
- AI diseases, Autoimmune diseases
- Adj HR, Adjusted hazard ratio
- CI, Confidence interval
- COPD, Chronic obstructive pulmonary diseases
- CVD, Cerebrovascular diseases
- DHB, District health board
- DM, Diabetes mellitus
- HR, Hazard ratio
- HZ, Herpes zoster
- Herpes zoster
- ICD-10-AM-iii, International Statistical Classification of Diseases and Related Health Problems-Tenth Revision-Australian Modification
- IHD, Ischaemic heart diseases
- MELAA, Middle Eastern / Latin American / African
- NZ, New Zealand
- NZDep2013, New Zealand Socioeconomic 2013 deprivation index
- New Zealand
- PHN, Postherpetic neuralgia
- PPV, Positive predictive value
- Postherpetic neuralgia
- RCTs, Randomised control trials
- VZV, Varicella zoster virus
- Varicella zoster virus
- ZVL, Zoster vaccine live
- Zoster vaccine live
Collapse
Affiliation(s)
- James F. Mbinta
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
| | - Alex X. Wang
- School of Mathematics and Statistics, Wellington Faculty of Engineering, Victoria University of Wellington, Wellington, New Zealand
| | - Binh P. Nguyen
- School of Mathematics and Statistics, Wellington Faculty of Engineering, Victoria University of Wellington, Wellington, New Zealand
| | - Janine Paynter
- Department of General Practice & Primary Healthcare, The University of Auckland, Auckland, New Zealand
| | | | - Russell Pine
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
| | - Andrew A. Sporle
- iNZight Analytics Ltd; Department of Statistics, Faculty of Science, University of Auckland, Auckland, New Zealand
| | - Colin R. Simpson
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
- Usher Institute, The University of Edinburgh, Edinburgh, UK
| |
Collapse
|
7
|
Rahardja S, Wang M, Nguyen BP, Fränti P, Rahardja S. Correction: A lightweight classification of adaptor proteins using transformer networks. BMC Bioinformatics 2023; 24:15. [PMID: 36631775 PMCID: PMC9835347 DOI: 10.1186/s12859-022-05131-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Affiliation(s)
- Sylwan Rahardja
- grid.9668.10000 0001 0726 2490School of Computing, University of Eastern Finland, Joensuu, Finland
| | - Mou Wang
- grid.440588.50000 0001 0307 1240School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an, 710072 China
| | - Binh P. Nguyen
- grid.267827.e0000 0001 2292 3111School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
| | - Pasi Fränti
- grid.9668.10000 0001 0726 2490School of Computing, University of Eastern Finland, Joensuu, Finland
| | - Susanto Rahardja
- grid.440588.50000 0001 0307 1240School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an, 710072 China ,grid.486188.b0000 0004 1790 4399Singapore Institute of Technology, Singapore, 138683 Singapore
| |
Collapse
|
8
|
Do TTT, Nguyen-Vo TH, Pham HT, Trinh QH, Nguyen BP. iNSP-GCAAP: Identifying nonclassical secreted proteins using global composition of amino acid properties. Proteomics 2023; 23:e2100134. [PMID: 36401584 DOI: 10.1002/pmic.202100134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Revised: 08/02/2022] [Accepted: 11/10/2022] [Indexed: 11/21/2022]
Abstract
Nonclassical secreted proteins (NSPs) refer to a group of proteins released into the extracellular environment under the facilitation of different biological transporting pathways apart from the Sec/Tat system. As experimental determination of NSPs is often costly and requires skilled handling techniques, computational approaches are necessary. In this study, we introduce iNSP-GCAAP, a computational prediction framework, to identify NSPs. We propose using global composition of a customized set of amino acid properties to encode sequence data and use the random forest (RF) algorithm for classification. We used the training dataset introduced by Zhang et al. (Bioinformatics, 36(3), 704-712, 2020) to develop our model and test it with the independent test set in the same study. The area under the receiver operating characteristic curve on that test set was 0.9256, which outperformed other state-of-the-art methods using the same datasets. Our framework is also deployed as a user-friendly web-based application to support the research community to predict NSPs.
Collapse
Affiliation(s)
- Trang T T Do
- School of Innovation, Design and Technology, Wellington Institute of Technology, Lower Hutt, New Zealand
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
| | - Hung T Pham
- Faculty of Information Technology, Posts and Telecommunications Institute of Technology, Hanoi, Vietnam
| | - Quang H Trinh
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi, Vietnam
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
| |
Collapse
|
9
|
Nguyen QH, Ngo HH, Nguyen-Vo TH, Do TT, Rahardja S, Nguyen BP. eMIC-AntiKP: Estimating minimum inhibitory concentrations of antibiotics towards Klebsiella pneumoniae using deep learning. Comput Struct Biotechnol J 2022; 21:751-757. [PMID: 36659924 PMCID: PMC9827358 DOI: 10.1016/j.csbj.2022.12.041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 12/22/2022] [Accepted: 12/23/2022] [Indexed: 12/27/2022] Open
Abstract
Nowadays, antibiotic resistance has become one of the most concerning problems that directly affects the recovery process of patients. For years, numerous efforts have been made to efficiently use antimicrobial drugs with appropriate doses not only to exterminate microbes but also stringently constrain any chances for bacterial evolution. However, choosing proper antibiotics is not a straightforward and time-effective process because well-defined drugs can only be given to patients after determining microbic taxonomy and evaluating minimum inhibitory concentrations (MICs). Besides conventional methods, numerous computer-aided frameworks have been recently developed using computational advances and public data sources of clinical antimicrobial resistance. In this study, we introduce eMIC-AntiKP, a computational framework specifically designed to predict the MIC values of 20 antibiotics towards Klebsiella pneumoniae. Our prediction models were constructed using convolutional neural networks and k-mer counting-based features. The model for cefepime has the most limited performance with a test 1-tier accuracy of 0.49, while the model for ampicillin has the highest performance with a test 1-tier accuracy of 1.00. Most models have satisfactory performance, with test accuracies ranging from about 0.70-0.90. The significance of eMIC-AntiKP is the effective utilization of computing resources to make it a compact and portable tool for most moderately configured computers. We provide users with two options, including an online web server for basic analysis and an offline package for deeper analysis and technical modification.
Collapse
Affiliation(s)
- Quang H. Nguyen
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Viet Nam
| | - Hoang H. Ngo
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Viet Nam
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| | - Trang T.T. Do
- School of Innovation, Design and Technology, Wellington Institute of Technology, Lower Hutt 5012, New Zealand
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China,Infocomm Technology Cluster, Singapore Institute of Technology, Singapore 138683, Singapore,Corresponding author at: School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China.
| | - Binh P. Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand,Corresponding author.
| |
Collapse
|
10
|
Rahardja S, Wang M, Nguyen BP, Fränti P, Rahardja S. A lightweight classification of adaptor proteins using transformer networks. BMC Bioinformatics 2022; 23:461. [PMID: 36333658 PMCID: PMC9635127 DOI: 10.1186/s12859-022-05000-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 09/13/2022] [Indexed: 11/06/2022] Open
Abstract
BACKGROUND Adaptor proteins play a key role in intercellular signal transduction, and dysfunctional adaptor proteins result in diseases. Understanding its structure is the first step to tackling the associated conditions, spurring ongoing interest in research into adaptor proteins with bioinformatics and computational biology. Our study aims to introduce a small, new, and superior model for protein classification, pushing the boundaries with new machine learning algorithms. RESULTS We propose a novel transformer based model which includes convolutional block and fully connected layer. We input protein sequences from a database, extract PSSM features, then process it via our deep learning model. The proposed model is efficient and highly compact, achieving state-of-the-art performance in terms of area under the receiver operating characteristic curve, Matthew's Correlation Coefficient and Receiver Operating Characteristics curve. Despite merely 20 hidden nodes translating to approximately 1% of the complexity of previous best known methods, the proposed model is still superior in results and computational efficiency. CONCLUSIONS The proposed model is the first transformer model used for recognizing adaptor protein, and outperforms all existing methods, having PSSM profiles as inputs that comprises convolutional blocks, transformer and fully connected layers for the use of classifying adaptor proteins.
Collapse
Affiliation(s)
- Sylwan Rahardja
- grid.9668.10000 0001 0726 2490School of Computing, University of Eastern Finland, Joensuu, Finland
| | - Mou Wang
- grid.440588.50000 0001 0307 1240School of Marine Science and Technology, Northwestern Polytechnical University and Singapore Institute of Technology, 710072 Xi’an, China
| | - Binh P. Nguyen
- grid.267827.e0000 0001 2292 3111School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
| | - Pasi Fränti
- grid.9668.10000 0001 0726 2490School of Computing, University of Eastern Finland, Joensuu, Finland
| | - Susanto Rahardja
- grid.440588.50000 0001 0307 1240School of Marine Science and Technology, Northwestern Polytechnical University and Singapore Institute of Technology, 710072 Xi’an, China ,grid.486188.b0000 0004 1790 4399Singapore Institute of Technology, Singapore, 138683 Singapore
| |
Collapse
|
11
|
Nguyen-Vo TH, Trinh QH, Nguyen L, Nguyen-Hoang PU, Rahardja S, Nguyen BP. iPromoter-Seqvec: identifying promoters using bidirectional long short-term memory and sequence-embedded features. BMC Genomics 2022; 23:681. [PMID: 36192696 PMCID: PMC9531353 DOI: 10.1186/s12864-022-08829-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 08/08/2022] [Indexed: 11/30/2022] Open
Abstract
Background Promoters, non-coding DNA sequences located at upstream regions of the transcription start site of genes/gene clusters, are essential regulatory elements for the initiation and regulation of transcriptional processes. Furthermore, identifying promoters in DNA sequences and genomes significantly contributes to discovering entire structures of genes of interest. Therefore, exploration of promoter regions is one of the most imperative topics in molecular genetics and biology. Besides experimental techniques, computational methods have been developed to predict promoters. In this study, we propose iPromoter-Seqvec – an efficient computational model to predict TATA and non-TATA promoters in human and mouse genomes using bidirectional long short-term memory neural networks in combination with sequence-embedded features extracted from input sequences. The promoter and non-promoter sequences were retrieved from the Eukaryotic Promoter database and then were refined to create four benchmark datasets. Results The area under the receiver operating characteristic curve (AUCROC) and the area under the precision-recall curve (AUCPR) were used as two key metrics to evaluate model performance. Results on independent test sets showed that iPromoter-Seqvec outperformed other state-of-the-art methods with AUCROC values ranging from 0.85 to 0.99 and AUCPR values ranging from 0.86 to 0.99. Models predicting TATA promoters in both species had slightly higher predictive power compared to those predicting non-TATA promoters. With a novel idea of constructing artificial non-promoter sequences based on promoter sequences, our models were able to learn highly specific characteristics discriminating promoters from non-promoters to improve predictive efficiency. Conclusions iPromoter-Seqvec is a stable and robust model for predicting both TATA and non-TATA promoters in human and mouse genomes. Our proposed method was also deployed as an online web server with a user-friendly interface to support research communities. Links to our source codes and web server are available at https://github.com/mldlproject/2022-iPromoter-Seqvec. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08829-6.
Collapse
Affiliation(s)
- Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand
| | - Quang H Trinh
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, 100000, Hanoi, Vietnam
| | - Loc Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand
| | - Phuong-Uyen Nguyen-Hoang
- Computational Biology Center, International University - VNU HCMC, Quarter 6, Linh Trung Ward, Thu Duc District, 700000, Ho Chi Minh City, Vietnam
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, 127 West Youyi Road, 710072, Xi'an, China. .,Infocomm Technology Cluster, Singapore Institute of Technology, 10 Dover Drive, 138683, Singapore, Singapore.
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand.
| |
Collapse
|
12
|
Nguyen Q, Tran HV, Nguyen BP, Do TTT. Identifying Transcription Factors That Prefer Binding to Methylated DNA Using Reduced G-Gap Dipeptide Composition. ACS Omega 2022; 7:32322-32330. [PMID: 36119976 PMCID: PMC9475634 DOI: 10.1021/acsomega.2c03696] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 08/23/2022] [Indexed: 06/15/2023]
Abstract
Transcription factors (TFs) play an important role in gene expression and regulation of 3D genome conformation. TFs have ability to bind to specific DNA fragments called enhancers and promoters. Some TFs bind to promoter DNA fragments which are near the transcription initiation site and form complexes that allow polymerase enzymes to bind to initiate transcription. Previous studies showed that methylated DNAs had ability to inhibit and prevent TFs from binding to DNA fragments. However, recent studies have found that there were TFs that could bind to methylated DNA fragments. The identification of these TFs is an important steppingstone to a better understanding of cellular gene expression mechanisms. However, as experimental methods are often time-consuming and labor-intensive, developing computational methods is essential. In this study, we propose two machine learning methods for two problems: (1) identifying TFs and (2) identifying TFs that prefer binding to methylated DNA targets (TFPMs). For the TF identification problem, the proposed method uses the position-specific scoring matrix for data representation and a deep convolutional neural network for modeling. This method achieved 90.56% sensitivity, 83.96% specificity, and an area under the receiver operating characteristic curve (AUC) of 0.9596 on an independent test set. For the TFPM identification problem, we propose to use the reduced g-gap dipeptide composition for data representation and the support vector machine algorithm for modeling. This method achieved 82.61% sensitivity, 64.86% specificity, and an AUC of 0.8486 on another independent test set. These results are higher than those of other studies on the same problems.
Collapse
Affiliation(s)
- Quang
H. Nguyen
- School
of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, Hanoi 100000, Vietnam
| | - Hoang V. Tran
- School
of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, Hanoi 100000, Vietnam
| | - Binh P. Nguyen
- School
of Mathematics and Statistics, Victoria
University of Wellington, Kelburn Parade, Wellington 6140, New Zealand
| | - Trang T. T. Do
- School
of Innovation, Design and Technology, Wellington
Institute of Technology, 21 Kensington Avenue, Lower Hutt 5012, New Zealand
| |
Collapse
|
13
|
Mbinta JF, Nguyen BP, Awuni PMA, Paynter J, Simpson CR. Post-licensure zoster vaccine effectiveness against herpes zoster and postherpetic neuralgia in older adults: a systematic review and meta-analysis. Lancet Healthy Longev 2022; 3:e263-e275. [PMID: 36098300 DOI: 10.1016/s2666-7568(22)00039-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 02/15/2022] [Accepted: 02/17/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Given the substantial impact of herpes zoster on health and quality of life, and its considerable economic burden, prevention through vaccination is a priority. We aimed to evaluate the effectiveness of the herpes zoster vaccines (recombinant zoster vaccine [RZV] and zoster vaccine live [ZVL]) against incident herpes zoster and postherpetic neuralgia in older adults. METHODS We did a systematic review and meta-analysis of studies assessing the effectiveness of herpes zoster vaccines in adults aged 50 years or older, compared with no vaccination or another vaccine. We searched published literature on MEDLINE, Embase, Cochrane Library, Cumulative Index to Nursing and Allied Health Literature, ProQuest Central, and Dimensions, as well as unpublished studies, grey literature, and the reference lists of included studies. Observational studies published in any language between May 25, 2006, and Dec 31, 2020, were included. Eligible studies were appraised for methodological quality using standardised critical appraisal instruments from the Joanna Briggs Institute, and data were extracted from selected studies using a standardised tool. Random-effects meta-analysis models were used to estimate pooled vaccine effectiveness for outcomes of interest (herpes zoster, herpes zoster ophthalmicus, and postherpetic neuralgia) among clinically and methodologically comparable studies, with a fixed-effects model also used for herpes zoster ophthalmicus. Vaccine effectiveness was also assessed in people with comorbidities. As a post-hoc analysis, a forward citation search was done on Jan 31, 2021. This study is registered on PROSPERO, CRD42021232383. FINDINGS Our search identified 1240 studies, of which 1162 were excluded based on title and abstract screening. A further 56 articles were excluded on reading the full text. 22 studies (21 cohort studies and one case-control study, involving 9 536 086 participants and 3·35 million person-years in the USA, UK, Canada, and Sweden) were included in the quantitative analysis. Of these, 13 articles were included in the meta-analysis. The overall quality of evidence was very low for all outcomes. The pooled vaccine effectiveness for ZVL against herpes zoster in adults was 45·9% (95% CI 42·2-49·4; seven studies). The vaccine effectiveness for ZVL against postherpetic neuralgia was 59·7% (58·4-89·7; three studies) and against herpes zoster ophthalmicus (in a fixed-effects model) was 30·0% (20·5-38·4; two studies). ZVL was effective in preventing herpes zoster in people with comorbidities, including diabetes (vaccine effectiveness 49·8%, 45·1-54·1; three studies), chronic kidney disease (54·3%, 49·0-59·1; four studies), liver disease (52·9%, 41·6-62·1; two studies), heart disease (52·3%, 45·0-58·7; two studies), and lung disease (49·0%, 32·2-66·2; two studies). In a post-hoc analysis of two studies from the USA published after 2020, the pooled vaccine effectiveness for RZV against herpes zoster in adults was 79·2% (57·6-89·7). Substantial heterogeneity (I2≥75%) was observed in 50% of the meta-analyses. INTERPRETATION ZVL and RZV are effective in preventing herpes zoster in routine clinical practice. ZVL also reduces the risk of postherpetic neuralgia. Selection bias and confounding by unmeasured variables are inherent challenges of observational studies based on large health-care databases. Nevertheless, these findings will reassure policy makers, health practitioners, and the public that the vaccinations currently available for herpes zoster vaccination programmes are effective at preventing herpes zoster and related complications. FUNDING None.
Collapse
Affiliation(s)
- James F Mbinta
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand.
| | - Binh P Nguyen
- School of Mathematics and Statistics, Wellington Faculty of Engineering, Victoria University of Wellington, Wellington, New Zealand
| | | | - Janine Paynter
- Department of General Practice and Primary Healthcare, The University of Auckland, Auckland, New Zealand
| | - Colin R Simpson
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand; Usher Institute, The University of Edinburgh, Edinburgh, UK
| |
Collapse
|
14
|
Truong VT, Nguyen BP, Nguyen-Vo TH, Mazur W, Chung ES, Palmer C, Tretter JT, Alsaied T, Pham VT, Do HQ, Do PTN, Pham VN, Ha BN, Chau HN, Le TK. Application of machine learning in screening for congenital heart diseases using fetal echocardiography. Int J Cardiovasc Imaging 2022; 38:10.1007/s10554-022-02566-3. [PMID: 35192082 DOI: 10.1007/s10554-022-02566-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Accepted: 02/13/2022] [Indexed: 11/05/2022]
Abstract
There is a growing body of literature supporting the utilization of machine learning (ML) to improve diagnosis and prognosis tools of cardiovascular disease. The current study was to investigate the impact that the ML framework may have on the sensitivity of predicting the presence or absence of congenital heart disease (CHD) using fetal echocardiography. A comprehensive fetal echocardiogram including 2D cardiac chamber quantification, valvar assessments, assessment of great vessel morphology, and Doppler-derived blood flow interrogation was recorded. The postnatal echocardiogram was used to ascertain the diagnosis of CHD. A random forest (RF) algorithm with a nested tenfold cross-validation was used to train models for assessing the presence of CHD. The study population was derived from a database of 3910 singleton fetuses with maternal age of 28.8 ± 5.2 years and gestational age at the time of fetal echocardiography of 22.0 weeks (IQR 21-24). The proportion of CHD was 14.1% for the studied cohort confirmed by post-natal echocardiograms. Our proposed RF-based framework provided a sensitivity of 0.85, a specificity of 0.88, a positive predictive value of 0.55 and a negative predictive value of 0.97 to detect the CHD with the mean of mean ROC curves of 0.94 and the mean of mean PR curves of 0.84. Additionally, six first features, including cardiac axis, peak velocity of blood flow across the pulmonic valve, cardiothoracic ratio, pulmonary valvar annulus diameter, right ventricular end-diastolic diameter, and aortic valvar annulus diameter, are essential features that play crucial roles in adding more predictive values to the model in detecting patients with CHD. ML using RF can provide increased sensitivity in prenatal CHD screening with very good performance. The incorporation of ML algorithms into fetal echocardiography may further standardize the assessment for CHD.
Collapse
Affiliation(s)
- Vien T Truong
- The Christ Hospital Health Network, Cincinnati, OH, USA
- The Lindner Research Center, Cincinnati, OH, USA
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
| | | | | | | | - Justin T Tretter
- Cincinnati Children's Hospital Medical Center, University of Cincinnati, College of Medicine, Cincinnati, OH, USA
| | - Tarek Alsaied
- Cincinnati Children's Hospital Medical Center, University of Cincinnati, College of Medicine, Cincinnati, OH, USA
| | - Vy T Pham
- Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - Huan Q Do
- Heart Institute of HCMC, Ho Chi Minh City, Vietnam
| | | | - Vinh N Pham
- Heart Center, Tam Anh General Hospital, Ho Chi Minh City, Vietnam
| | - Ban N Ha
- Heart Institute of HCMC, Ho Chi Minh City, Vietnam
| | - Hoa N Chau
- University of Medicine and Pharmacy, Ho Chi Minh City, Vietnam
| | - Tuyen K Le
- Heart Institute of HCMC, Ho Chi Minh City, Vietnam.
| |
Collapse
|
15
|
Nguyen L, Nguyen Vo TH, Trinh QH, Nguyen BH, Nguyen-Hoang PU, Le L, Nguyen BP. iANP-EC: Identifying Anticancer Natural Products Using Ensemble Learning Incorporated with Evolutionary Computation. J Chem Inf Model 2022; 62:5080-5089. [PMID: 35157472 DOI: 10.1021/acs.jcim.1c00920] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Cancer is one of the most deadly diseases that annually kills millions of people worldwide. The investigation on anticancer medicines has never ceased to seek better and more adaptive agents with fewer side effects. Besides chemically synthetic anticancer compounds, natural products are scientifically proved as a highly potential alternative source for anticancer drug discovery. Along with experimental approaches being used to find anticancer drug candidates, computational approaches have been developed to virtually screen for potential anticancer compounds. In this study, we construct an ensemble computational framework, called iANP-EC, using machine learning approaches incorporated with evolutionary computation. Four learning algorithms (k-NN, SVM, RF, and XGB) and four molecular representation schemes are used to build a set of classifiers, among which the top-four best-performing classifiers are selected to form an ensemble classifier. Particle swarm optimization (PSO) is used to optimise the weights used to combined the four top classifiers. The models are developed by a set of curated 997 compounds which are collected from the NPACT and CancerHSP databases. The results show that iANP-EC is a stable, robust, and effective framework that achieves an AUC-ROC value of 0.9193 and an AUC-PR value of 0.8366. The comparative analysis of molecular substructures between natural anticarcinogens and nonanticarcinogens partially unveils several key substructures that drive anticancerous activities. We also deploy the proposed ensemble model as an online web server with a user-friendly interface to support the research community in identifying natural products with anticancer activities.
Collapse
Affiliation(s)
- Loc Nguyen
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Thanh-Hoang Nguyen Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| | - Quang H Trinh
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam.,School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Vietnam
| | - Bach Hoai Nguyen
- School of Engineering and Computer Science, Victoria University of Wellington, Wellington 6140, New Zealand
| | - Phuong-Uyen Nguyen-Hoang
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Ly Le
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam.,Vingroup Big Data Institute, Ha Noi 100000, Vietnam
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| |
Collapse
|
16
|
Le NQK, Nguyen BP. Prediction of FMN Binding Sites in Electron Transport Chains Based on 2-D CNN and PSSM Profiles. IEEE/ACM Trans Comput Biol Bioinform 2021; 18:2189-2197. [PMID: 31380767 DOI: 10.1109/tcbb.2019.2932416] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Flavin mono-nucleotides (FMNs) are cofactors that hold responsibility for carrying and transferring electrons in the electron transport chain stage of cellular respiration. Without being facilitated by FMNs, energy production is stagnant due to the interruption in most of the cellular processes. Investigation on FMN's functions, therefore, can gain holistic understanding about human diseases and molecular information on drug targets. We proposed a deep learning model using a two-dimensional convolutional neural network and position specific scoring matrices that could identify FMN interacting residues with the sensitivity of 83.7 percent, specificity of 99.2 percent, accuracy of 98.2 percent, and Matthews correlation coefficients of 0.85 for an independent dataset containing 141 FMN binding sites and 1,920 non-FMN binding sites. The proposed method outperformed other previous studies using similar evaluation metrics. Our positive outcome can also promote the utilization of deep learning in dealing with various problems in bioinformatics and computational biology.
Collapse
|
17
|
Nguyen-Vo TH, Trinh QH, Nguyen L, Nguyen-Hoang PU, Nguyen TN, Nguyen DT, Nguyen BP, Le L. iCYP-MFE: Identifying Human Cytochrome P450 Inhibitors Using Multitask Learning and Molecular Fingerprint-Embedded Encoding. J Chem Inf Model 2021; 62:5059-5068. [PMID: 34672553 DOI: 10.1021/acs.jcim.1c00628] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The human cytochrome P450 (CYP) superfamily holds responsibilities for the metabolism of both endogenous and exogenous compounds such as drugs, cellular metabolites, and toxins. The inhibition exerted on the CYP enzymes is closely associated with adverse drug reactions encompassing metabolic failures and induced side effects. In modern drug discovery, identification of potential CYP inhibitors is, therefore, highly essential. Alongside experimental approaches, numerous computational models have been proposed to address this biochemical issue. In this study, we introduce iCYP-MFE, a computational framework for virtual screening on CYP inhibitors toward 1A2, 2C9, 2C19, 2D6, and 3A4 isoforms. iCYP-MFE contains a set of five robust, stable, and effective prediction models developed using multitask learning incorporated with molecular fingerprint-embedded features. The results show that multitask learning can remarkably leverage useful information from related tasks to promote global performance. Comparative analysis indicates that iCYP-MFE achieves three predominant tasks, one equivalent task, and one less effective task compared to state-of-the-art methods. The area under the receiver operating characteristic curve (AUC-ROC) and the area under the precision-recall curve (AUC-PR) were two decisive metrics used for model evaluation. The prediction task for CYP2D6-inhibition achieves the highest AUC-ROC value of 0.93 while the prediction task for CYP1A2-inhibition obtains the highest AUC-PR value of 0.92. The substructural analysis preliminarily explains the nature of the CYP-inhibitory activity of compounds. An online web server for iCYP-MFE with a user-friendly interface was also deployed to support scientific communities in identifying CYP inhibitors.
Collapse
Affiliation(s)
- Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand
| | - Quang H Trinh
- Computational Biology Center, International University-VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Loc Nguyen
- Computational Biology Center, International University-VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Phuong-Uyen Nguyen-Hoang
- Computational Biology Center, International University-VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Thien-Ngan Nguyen
- Computational Biology Center, International University-VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Dung T Nguyen
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Vietnam
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand
| | - Ly Le
- Computational Biology Center, International University-VNU HCMC, Ho Chi Minh City 700000, Vietnam.,Vingroup Big Data Institute, Ha Noi 100000, Vietnam
| |
Collapse
|
18
|
Nguyen T, Adnan M, Nguyen BP, de Ligt J, Geoghegan JL, Dean R, Jefferies S, Baker MG, Seah WK, Sporle AA, French NP, Murdoch DR, Welch D, Simpson CR. COVID-19 vaccine strategies for Aotearoa New Zealand: a mathematical modelling study. Lancet Reg Health West Pac 2021; 15:100256. [PMID: 34426804 PMCID: PMC8375363 DOI: 10.1016/j.lanwpc.2021.100256] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 07/15/2021] [Accepted: 07/29/2021] [Indexed: 12/16/2022]
Abstract
Background: COVID-19 elimination measures, including border closures have been applied in New Zealand. We have modelled the potential effect of vaccination programmes for opening borders. Methods: We used a deterministic age-stratified Susceptible, Exposed, Infectious, Recovered (SEIR) model. We minimised spread by varying the age-stratified vaccine allocation to find the minimum herd immunity requirements (the effective reproduction number Reff<1 with closed borders) under various vaccine effectiveness (VE) scenarios and R0 values. We ran two-year open-border simulations for two vaccine strategies: minimising Reff and targeting high-risk groups. Findings: Targeting of high-risk groups will result in lower hospitalisations and deaths in most scenarios. Reaching the herd immunity threshold (HIT) with a vaccine of 90% VE against disease and 80% VE against infection requires at least 86•5% total population uptake for R0=4•5 (with high vaccination coverage for 30-49-year-olds) and 98•1% uptake for R0=6. In a two-year open-border scenario with 10 overseas cases daily and 90% total population vaccine uptake (including 0-15 year olds) with the same vaccine, the strategy of targeting high-risk groups is close to achieving HIT, with an estimated 11,400 total hospitalisations (peak 324 active and 36 new daily cases in hospitals), and 1,030 total deaths. Interpretation: Targeting high-risk groups for vaccination will result in fewer hospitalisations and deaths with open borders compared to targeting reduced transmission. With a highly effective vaccine and a high total uptake, opening borders will result in increasing cases, hospitalisations, and deaths. Other public health and social measures will still be required as part of an effective pandemic response. Funding: This project was funded by the Health Research Council [20/1018]. Research in context.
Collapse
Affiliation(s)
- Trung Nguyen
- Institute of Environmental Science and Research, New Zealand
| | - Mehnaz Adnan
- Institute of Environmental Science and Research, New Zealand
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, New Zealand
| | - Joep de Ligt
- Institute of Environmental Science and Research, New Zealand
| | - Jemma L Geoghegan
- Department of Microbiology and Immunology, University of Otago, New Zealand and Institute of Environmental Science and Research, New Zealand
| | - Richard Dean
- Institute of Environmental Science and Research, New Zealand
| | - Sarah Jefferies
- Institute of Environmental Science and Research, New Zealand
| | - Michael G Baker
- Department of Public Health, University of Otago, New Zealand
| | - Winston Kg Seah
- School of Engineering and Computer Science, Victoria University of Wellington, New Zealand
| | - Andrew A Sporle
- Department of Statistics, The University of Auckland, New Zealand and iNZight Analytics Ltd
| | | | - David R Murdoch
- Department of Pathology and Biomedical Science, University of Otago, New Zealand
| | - David Welch
- School of Computer Science, The University of Auckland, New Zealand
| | - Colin R Simpson
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand.,Usher Institute, The University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
19
|
Nguyen-Vo TH, Trinh QH, Nguyen L, Do TTT, Chua MCH, Nguyen BP. Predicting Antimalarial Activity in Natural Products Using Pretrained Bidirectional Encoder Representations from Transformers. J Chem Inf Model 2021; 62:5050-5058. [DOI: 10.1021/acs.jcim.1c00584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand
| | - Quang H. Trinh
- Computational Biology Center, International University−VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Loc Nguyen
- Computational Biology Center, International University−VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Trang T. T. Do
- School of Business and Information Technology, Wellington Institute of Technology, 21 Kensington Avenue, Lower Hutt 5012, New Zealand
| | - Matthew Chin Heng Chua
- Institute of Systems Science, National University of Singapore, 29 Heng Mui Keng Terrace, Singapore 119620, Singapore
| | - Binh P. Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand
| |
Collapse
|
20
|
Nguyen QH, Nguyen BP, Nguyen TB, Do TT, Mbinta JF, Simpson CR. Stacking segment-based CNN with SVM for recognition of atrial fibrillation from single-lead ECG recordings. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102672] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
21
|
Mbinta JF, Nguyen BP, Awuni PMA, Eme PE, Simpson CR. Postlicensure herpes zoster vaccine effectiveness: systematic review protocol. BMJ Open 2021; 11:e040964. [PMID: 33622942 PMCID: PMC7907883 DOI: 10.1136/bmjopen-2020-040964] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 01/10/2021] [Accepted: 01/21/2021] [Indexed: 11/04/2022] Open
Abstract
INTRODUCTION Herpes zoster (HZ) and associated complications inflict substantial morbidity and associated healthcare and socioeconomic burdens. Current treatments are not fully effective, especially among the most vulnerable populations. Two HZ vaccines are available and are part of the national immunisation programmes in many countries. This review will evaluate the effectiveness of zoster vaccines against incident HZ and postherpetic neuralgia in adults 50 years and older. METHODS AND ANALYSIS The key information sources that will be searched include MEDLINE (Ovid), Embase (Ovid), Cochrane libraries and CINAHL. This search will consider postlicensure observational studies published in all languages between 2006 and 2020 that assessed the effectiveness of HZ/zoster vaccines in adults 50 years and older. The identification of studies will be complemented with the search of reference lists and citations, and contact with authors of papers to request missing or additional data, where required. Following the search, all identified citations will be collated, and duplicates will be removed. Titles and abstracts will then be screened by two independent reviewers for assessment against the inclusion criteria for the review. Selected studies will follow the process of critical appraisal, data extraction and data synthesis. Statistical analyses will be performed using a random-effect model. ETHICS AND DISSEMINATION Formal ethical approval is not required, as primary data will not be collected. The review will be disseminated in peer-reviewed publications and conference presentations.
Collapse
Affiliation(s)
- James F Mbinta
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
| | - Binh P Nguyen
- School of Mathematics and Statistics, Wellington Faculty of Science, Victoria University of Wellington, Wellington, New Zealand
| | - Prosper Mandela A Awuni
- Chifley Business School, Torrens University Australia, Brisbane Campus, Fortitude Valley, Queensland, Australia
| | - Paul E Eme
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
| | - Colin R Simpson
- School of Health, Wellington Faculty of Health, Victoria University of Wellington, Wellington, New Zealand
- Usher Institute, The University of Edinburgh, Edinburgh, UK
| |
Collapse
|
22
|
Nguyen-Vo TH, Nguyen L, Do N, Le PH, Nguyen TN, Nguyen BP, Le L. Predicting Drug-Induced Liver Injury Using Convolutional Neural Network and Molecular Fingerprint-Embedded Features. ACS Omega 2020; 5:25432-25439. [PMID: 33043223 PMCID: PMC7542839 DOI: 10.1021/acsomega.0c03866] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 09/11/2020] [Indexed: 05/10/2023]
Abstract
As a critical issue in drug development and postmarketing safety surveillance, drug-induced liver injury (DILI) leads to failures in clinical trials as well as retractions of on-market approved drugs. Therefore, it is important to identify DILI compounds in the early-stages through in silico and in vivo studies. It is difficult using conventional safety testing methods, since the predictive power of most of the existing frameworks is insufficiently effective to address this pharmacological issue. In our study, we employ a natural language processing (NLP) inspired computational framework using convolutional neural networks and molecular fingerprint-embedded features. Our development set and independent test set have 1597 and 322 compounds, respectively. These samples were collected from previous studies and matched with established chemical databases for structural validity. Our study comes up with an average accuracy of 0.89, Matthews's correlation coefficient (MCC) of 0.80, and an AUC of 0.96. Our results show a significant improvement in the AUC values compared to the recent best model with a boost of 6.67%, from 0.90 to 0.96. Also, based on our findings, molecular fingerprint-embedded featurizer is an effective molecular representation for future biological and biochemical studies besides the application of classic molecular fingerprints.
Collapse
Affiliation(s)
- Thanh-Hoang Nguyen-Vo
- School of Mathematics
and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| | - Loc Nguyen
- Computational Biology Center, International University—VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Nguyet Do
- Computational Biology Center, International University—VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Phuc H. Le
- Computational Biology Center, International University—VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Thien-Ngan Nguyen
- Computational Biology Center, International University—VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Binh P. Nguyen
- School of Mathematics
and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
- . Phone: (+64) 4 463 5233. ext 8896
| | - Ly Le
- Computational Biology Center, International University—VNU HCMC, Ho Chi Minh City 700000, Vietnam
- Vingroup Big Data Institute, Ha Noi 100000, Vietnam
- . Phone: (+84) 906-578-836
| |
Collapse
|
23
|
Hussain S, Anees A, Das A, Nguyen BP, Marzuki M, Lin S, Wright G, Singhal A. High-content image generation for drug discovery using generative adversarial networks. Neural Netw 2020; 132:353-363. [PMID: 32977280 DOI: 10.1016/j.neunet.2020.09.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 06/11/2020] [Accepted: 09/07/2020] [Indexed: 10/23/2022]
Abstract
Immense amount of high-content image data generated in drug discovery screening requires computationally driven automated analysis. Emergence of advanced machine learning algorithms, like deep learning models, has transformed the interpretation and analysis of imaging data. However, deep learning methods generally require large number of high-quality data samples, which could be limited during preclinical investigations. To address this issue, we propose a generative modeling based computational framework to synthesize images, which can be used for phenotypic profiling of perturbations induced by drug compounds. We investigated the use of three variants of Generative Adversarial Network (GAN) in our framework, viz., a basic Vanilla GAN, Deep Convolutional GAN (DCGAN) and Progressive GAN (ProGAN), and found DCGAN to be most efficient in generating realistic synthetic images. A pre-trained convolutional neural network (CNN) was used to extract features of both real and synthetic images, followed by a classification model trained on real and synthetic images. The quality of synthesized images was evaluated by comparing their feature distributions with that of real images. The DCGAN-based framework was applied to high-content image data from a drug screen to synthesize high-quality cellular images, which were used to augment the real image data. The augmented dataset was shown to yield better classification performance compared with that obtained using only real images. We also demonstrated the application of proposed method on the generation of bacterial images and computed feature distributions for bacterial images specific to different drug treatments. In summary, our results showed that the proposed DCGAN-based framework can be utilized to generate realistic synthetic high-content images, thus enabling the study of drug-induced effects on cells and bacteria.
Collapse
Affiliation(s)
- Shaista Hussain
- Institute of High Performance Computing, A*STAR, 138673, Singapore.
| | - Ayesha Anees
- Institute of High Performance Computing, A*STAR, 138673, Singapore
| | - Ankit Das
- Institute of High Performance Computing, A*STAR, 138673, Singapore
| | - Binh P Nguyen
- School of Mathematics and Statistics, VUW, 6140, New Zealand
| | | | - Shuping Lin
- Skin Research Institute of Singapore, A*STAR, 138648, Singapore
| | - Graham Wright
- Skin Research Institute of Singapore, A*STAR, 138648, Singapore
| | - Amit Singhal
- Singapore Immunology Network, A*STAR, 138648, Singapore.
| |
Collapse
|
24
|
Le TK, Truong V, Nguyen-Vo TH, Nguyen BP, Ngo TN, Bui QV, Pham TK, Tretter J, Taylor M, Levy P, Chung E, Mazur W, Do HQ, Do PT, Pham VN, Chau HN. APPLICATION OF MACHINE LEARNING IN SCREENING OF CONGENITAL HEART DISEASES USING FETAL ECHOCARDIOGRAPHY. J Am Coll Cardiol 2020. [DOI: 10.1016/s0735-1097(20)31275-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
25
|
Nguyen BP, Nguyen QH, Doan-Ngoc GN, Nguyen-Vo TH, Rahardja S. iProDNA-CapsNet: identifying protein-DNA binding residues using capsule neural networks. BMC Bioinformatics 2019; 20:634. [PMID: 31881828 PMCID: PMC6933727 DOI: 10.1186/s12859-019-3295-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 11/26/2019] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Since protein-DNA interactions are highly essential to diverse biological events, accurately positioning the location of the DNA-binding residues is necessary. This biological issue, however, is currently a challenging task in the age of post-genomic where data on protein sequences have expanded very fast. In this study, we propose iProDNA-CapsNet - a new prediction model identifying protein-DNA binding residues using an ensemble of capsule neural networks (CapsNets) on position specific scoring matrix (PSMM) profiles. The use of CapsNets promises an innovative approach to determine the location of DNA-binding residues. In this study, the benchmark datasets introduced by Hu et al. (2017), i.e., PDNA-543 and PDNA-TEST, were used to train and evaluate the model, respectively. To fairly assess the model performance, comparative analysis between iProDNA-CapsNet and existing state-of-the-art methods was done. RESULTS Under the decision threshold corresponding to false positive rate (FPR) ≈ 5%, the accuracy, sensitivity, precision, and Matthews's correlation coefficient (MCC) of our model is increased by about 2.0%, 2.0%, 14.0%, and 5.0% with respect to TargetDNA (Hu et al., 2017) and 1.0%, 75.0%, 45.0%, and 77.0% with respect to BindN+ (Wang et al., 2010), respectively. With regards to other methods not reporting their threshold settings, iProDNA-CapsNet also shows a significant improvement in performance based on most of the evaluation metrics. Even with different patterns of change among the models, iProDNA-CapsNets remains to be the best model having top performance in most of the metrics, especially MCC which is boosted from about 8.0% to 220.0%. CONCLUSIONS According to all evaluation metrics under various decision thresholds, iProDNA-CapsNet shows better performance compared to the two current best models (BindN and TargetDNA). Our proposed approach also shows that CapsNet can potentially be used and adopted in other biological applications.
Collapse
Affiliation(s)
- Binh P. Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, Wellington, 6140 New Zealand
| | - Quang H. Nguyen
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, Hanoi, 100000 Vietnam
| | - Giang-Nam Doan-Ngoc
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, Hanoi, 100000 Vietnam
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, Wellington, 6140 New Zealand
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, 127 West Youyi Road, Xi’an, 710072 China
| |
Collapse
|
26
|
Khanh Le NQ, Nguyen QH, Chen X, Rahardja S, Nguyen BP. Classification of adaptor proteins using recurrent neural networks and PSSM profiles. BMC Genomics 2019; 20:966. [PMID: 31874633 PMCID: PMC6929330 DOI: 10.1186/s12864-019-6335-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 11/25/2019] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Adaptor proteins are carrier proteins that play a crucial role in signal transduction. They commonly consist of several modular domains, each having its own binding activity and operating by forming complexes with other intracellular-signaling molecules. Many studies determined that the adaptor proteins had been implicated in a variety of human diseases. Therefore, creating a precise model to predict the function of adaptor proteins is one of the vital tasks in bioinformatics and computational biology. Few computational biology studies have been conducted to predict the protein functions, and in most of those studies, position specific scoring matrix (PSSM) profiles had been used as the features to be fed into the neural networks. However, the neural networks could not reach the optimal result because the sequential information in PSSMs has been lost. This study proposes an innovative approach by incorporating recurrent neural networks (RNNs) and PSSM profiles to resolve this problem. RESULTS Compared to other state-of-the-art methods which had been applied successfully in other problems, our method achieves enhancement in all of the common measurement metrics. The area under the receiver operating characteristic curve (AUC) metric in prediction of adaptor proteins in the cross-validation and independent datasets are 0.893 and 0.853, respectively. CONCLUSIONS This study opens a research path that can promote the use of RNNs and PSSM profiles in bioinformatics and computational biology. Our approach is reproducible by scientists that aim to improve the performance results of different protein function prediction problems. Our source code and datasets are available at https://github.com/ngphubinh/adaptors.
Collapse
Affiliation(s)
- Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, Taipei Medical University, Keelung Road, Da'an Distric, Taipei City 106, Taiwan (R.O.C.)
| | - Quang H Nguyen
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, Hanoi 100000, Vietnam
| | - Xuan Chen
- Beijing Genomics Institute, 21 Hongan 3rd Street, Shenzhen 518083, China
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, 127 West Youyi Road, Xi'an 710072, China.
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, Wellington 6140, New Zealand
| |
Collapse
|
27
|
Nguyen QH, Nguyen-Vo TH, Le NQK, Do TTT, Rahardja S, Nguyen BP. iEnhancer-ECNN: identifying enhancers and their strength using ensembles of convolutional neural networks. BMC Genomics 2019; 20:951. [PMID: 31874637 PMCID: PMC6929481 DOI: 10.1186/s12864-019-6336-3] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Enhancers are non-coding DNA fragments which are crucial in gene regulation (e.g. transcription and translation). Having high locational variation and free scattering in 98% of non-encoding genomes, enhancer identification is, therefore, more complicated than other genetic factors. To address this biological issue, several in silico studies have been done to identify and classify enhancer sequences among a myriad of DNA sequences using computational advances. Although recent studies have come up with improved performance, shortfalls in these learning models still remain. To overcome limitations of existing learning models, we introduce iEnhancer-ECNN, an efficient prediction framework using one-hot encoding and k-mers for data transformation and ensembles of convolutional neural networks for model construction, to identify enhancers and classify their strength. The benchmark dataset from Liu et al.'s study was used to develop and evaluate the ensemble models. A comparative analysis between iEnhancer-ECNN and existing state-of-the-art methods was done to fairly assess the model performance. RESULTS Our experimental results demonstrates that iEnhancer-ECNN has better performance compared to other state-of-the-art methods using the same dataset. The accuracy of the ensemble model for enhancer identification (layer 1) and enhancer classification (layer 2) are 0.769 and 0.678, respectively. Compared to other related studies, improvements in the Area Under the Receiver Operating Characteristic Curve (AUC), sensitivity, and Matthews's correlation coefficient (MCC) of our models are remarkable, especially for the model of layer 2 with about 11.0%, 46.5%, and 65.0%, respectively. CONCLUSIONS iEnhancer-ECNN outperforms other previously proposed methods with significant improvement in most of the evaluation metrics. Strong growths in the MCC of both layers are highly meaningful in assuring the stability of our models.
Collapse
Affiliation(s)
- Quang H Nguyen
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, Hanoi 100000, Vietnam
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, Wellington, 6142, New Zealand
| | - Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, Taipei Medical University, Keelung Road, Da'an Distric, Taipei City, 106, Taiwan (R.O.C.)
| | - Trang T T Do
- Institute of Research and Development, Duy Tan University, Danang 550000, Vietnam
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, 127 West Youyi Road, Xi'an 710072, China.
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, Wellington, 6142, New Zealand.
| |
Collapse
|
28
|
Nguyen BP, Pham HN, Tran H, Nghiem N, Nguyen QH, Do TTT, Tran CT, Simpson CR. Predicting the onset of type 2 diabetes using wide and deep learning with electronic health records. Comput Methods Programs Biomed 2019; 182:105055. [PMID: 31505379 DOI: 10.1016/j.cmpb.2019.105055] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 08/17/2019] [Accepted: 08/27/2019] [Indexed: 06/10/2023]
Abstract
OBJECTIVE Diabetes is responsible for considerable morbidity, healthcare utilisation and mortality in both developed and developing countries. Currently, methods of treating diabetes are inadequate and costly so prevention becomes an important step in reducing the burden of diabetes and its complications. Electronic health records (EHRs) for each individual or a population have become important tools in understanding developing trends of diseases. Using EHRs to predict the onset of diabetes could improve the quality and efficiency of medical care. In this paper, we apply a wide and deep learning model that combines the strength of a generalised linear model with various features and a deep feed-forward neural network to improve the prediction of the onset of type 2 diabetes mellitus (T2DM). MATERIALS AND METHODS The proposed method was implemented by training various models into a logistic loss function using a stochastic gradient descent. We applied this model using public hospital record data provided by the Practice Fusion EHRs for the United States population. The dataset consists of de-identified electronic health records for 9948 patients, of which 1904 have been diagnosed with T2DM. Prediction of diabetes in 2012 was based on data obtained from previous years (2009-2011). The imbalance class of the model was handled by Synthetic Minority Oversampling Technique (SMOTE) for each cross-validation training fold to analyse the performance when synthetic examples for the minority class are created. We used SMOTE of 150 and 300 percent, in which 300 percent means that three new synthetic instances are created for each minority class instance. This results in the approximated diabetes:non-diabetes distributions in the training set of 1:2 and 1:1, respectively. RESULTS Our final ensemble model not using SMOTE obtained an accuracy of 84.28%, area under the receiver operating characteristic curve (AUC) of 84.13%, sensitivity of 31.17% and specificity of 96.85%. Using SMOTE of 150 and 300 percent did not improve AUC (83.33% and 82.12%, respectively) but increased sensitivity (49.40% and 71.57%, respectively) with a moderate decrease in specificity (90.16% and 76.59%, respectively). DISCUSSION AND CONCLUSIONS Our algorithm has further optimised the prediction of diabetes onset using a novel state-of-the-art machine learning algorithm: the wide and deep learning neural network architecture.
Collapse
Affiliation(s)
- Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand.
| | - Hung N Pham
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet Road, Hanoi 100000, Vietnam
| | - Hop Tran
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand
| | - Nhung Nghiem
- Department of Public Health, University of Otago, 23A Mein Street, Wellington 6021, New Zealand
| | - Quang H Nguyen
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet Road, Hanoi 100000, Vietnam
| | - Trang T T Do
- Institute for Infocomm Research, Agency for Science, Technology and Research, 1 Fusionopolis Way, Singapore 138632, Singapore
| | - Cao Truong Tran
- Faculty of Information Technology, Le Quy Don Technical University, 236 Hoang Quoc Viet Street, Hanoi 100000, Vietnam
| | - Colin R Simpson
- Faculty of Health, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand; Usher Institute, The University of Edinburgh, Edinburgh, EH89AG, United Kingdom
| |
Collapse
|
29
|
Abstract
BACKGROUND Confetti fluorescence and other multi-color genetic labelling strategies are useful for observing stem cell regeneration and for other problems of cell lineage tracing. One difficulty of such strategies is segmenting the cell boundaries, which is a very different problem from segmenting color images from the real world. This paper addresses the difficulties and presents a superpixel-based framework for segmentation of regenerated muscle fibers in mice. RESULTS We propose to integrate an edge detector into a superpixel algorithm and customize the method for multi-channel images. The enhanced superpixel method outperforms the original and another advanced superpixel algorithm in terms of both boundary recall and under-segmentation error. Our framework was applied to cross-section and lateral section images of regenerated muscle fibers from confetti-fluorescent mice. Compared with "ground-truth" segmentations, our framework yielded median Dice similarity coefficients of 0.92 and higher. CONCLUSION Our segmentation framework is flexible and provides very good segmentations of multi-color muscle fibers. We anticipate our methods will be useful for segmenting a variety of tissues in confetti fluorecent mice and in mice with similar multi-color labels.
Collapse
Affiliation(s)
- Binh P Nguyen
- Centre for Computational Biology, and Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, Singapore, 169857, Singapore.,BioSystems and Micromechanics (BioSyM) Singapore - MIT Alliance for Research and Technology, Singapore, 138602, Singapore
| | - Hans Heemskerk
- Centre for Computational Biology, and Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, Singapore, 169857, Singapore.,BioSystems and Micromechanics (BioSyM) Singapore - MIT Alliance for Research and Technology, Singapore, 138602, Singapore
| | - Peter T C So
- BioSystems and Micromechanics (BioSyM) Singapore - MIT Alliance for Research and Technology, Singapore, 138602, Singapore.,Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Lisa Tucker-Kellogg
- Centre for Computational Biology, and Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, Singapore, 169857, Singapore. .,BioSystems and Micromechanics (BioSyM) Singapore - MIT Alliance for Research and Technology, Singapore, 138602, Singapore.
| |
Collapse
|
30
|
Wen R, Tay WL, Nguyen BP, Chng CB, Chui CK. Hand gesture guided robot-assisted surgery based on a direct augmented reality interface. Comput Methods Programs Biomed 2014; 116:68-80. [PMID: 24438993 DOI: 10.1016/j.cmpb.2013.12.018] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Revised: 12/17/2013] [Accepted: 12/22/2013] [Indexed: 06/03/2023]
Abstract
Radiofrequency (RF) ablation is a good alternative to hepatic resection for treatment of liver tumors. However, accurate needle insertion requires precise hand-eye coordination and is also affected by the difficulty of RF needle navigation. This paper proposes a cooperative surgical robot system, guided by hand gestures and supported by an augmented reality (AR)-based surgical field, for robot-assisted percutaneous treatment. It establishes a robot-assisted natural AR guidance mechanism that incorporates the advantages of the following three aspects: AR visual guidance information, surgeon's experiences and accuracy of robotic surgery. A projector-based AR environment is directly overlaid on a patient to display preoperative and intraoperative information, while a mobile surgical robot system implements specified RF needle insertion plans. Natural hand gestures are used as an intuitive and robust method to interact with both the AR system and surgical robot. The proposed system was evaluated on a mannequin model. Experimental results demonstrated that hand gesture guidance was able to effectively guide the surgical robot, and the robot-assisted implementation was found to improve the accuracy of needle insertion. This human-robot cooperative mechanism is a promising approach for precise transcutaneous ablation therapy.
Collapse
Affiliation(s)
- Rong Wen
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117576, Singapore
| | - Wei-Liang Tay
- Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, Singapore 117583, Singapore
| | - Binh P Nguyen
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117576, Singapore.
| | - Chin-Boon Chng
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117576, Singapore
| | - Chee-Kong Chui
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117576, Singapore
| |
Collapse
|
31
|
Nguyen BP, Chui CK, Ong SH, Chang S. An efficient compression scheme for 4-D medical images using hierarchical vector quantization and motion compensation. Comput Biol Med 2011; 41:843-56. [PMID: 21802074 DOI: 10.1016/j.compbiomed.2011.07.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2010] [Revised: 03/20/2011] [Accepted: 07/08/2011] [Indexed: 10/17/2022]
Abstract
This paper proposes an efficient compression scheme for compressing time-varying medical volumetric data. The scheme uses 3-D motion estimation to create a homogenous preprocessed data to be compressed by a 3-D image compression algorithm using hierarchical vector quantization. A new block distortion measure, called variance of residual (VOR), and three 3-D fast block matching algorithms are used to improve the motion estimation process in term of speed and data fidelity. The 3-D image compression process involves the application of two different encoding techniques based on the homogeneity of input data. Our method can achieve a higher fidelity and faster decompression time compared to other lossy compression methods producing similar compression ratios. The combination of 3-D motion estimation using VOR and hierarchical vector quantization contributes to the good performance.
Collapse
Affiliation(s)
- Binh P Nguyen
- Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, Singapore 117576, Singapore.
| | | | | | | |
Collapse
|
32
|
Nguyen BP, Ren XD, Schwartz MA, Carter WG. Ligation of integrin alpha 3beta 1 by laminin 5 at the wound edge activates Rho-dependent adhesion of leading keratinocytes on collagen. J Biol Chem 2001; 276:43860-70. [PMID: 11571278 DOI: 10.1074/jbc.m103404200] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Wounding of the epidermis signals the transition of keratinocytes from quiescent anchorage on endogenous basement membrane laminin 5 to migration on exposed dermal collagen. In this study, we attempt to characterize activation signals that transform quiescent keratinocytes into migratory leading cells at the wound edge. Previously, we reported that adhesion and spreading on collagen via integrin alpha(2)beta(1) by cultured human foreskin keratinocytes (HFKs) requires RhoGTP, a regulator of actin stress fibers. In contrast, adhesion and spreading on laminin 5 requires integrins alpha(3)beta(1) and alpha(6)beta(4) and is dependent on phosphoinositide 3-hydroxykinase (Nguyen, B. P., Gil, S. G., and Carter, W. G. (2000) J. Biol. Chem. 275, 31896-31907). Here, we report that quiescent HFKs do not adhere to collagen but adhere and spread on laminin 5. By using collagen adhesion as one criterion for conversion to a "leading wound cell," we found that activation of collagen adhesion requires elevation of RhoGTP. Adhesion of quiescent HFKs to laminin 5 via integrin alpha(3)beta(1) and alpha(6)beta(4) is sufficient to increase levels of RhoGTP required for adhesion and spreading on collagen. Consistently, adhesion of quiescent HFKs to laminin 5, but not collagen, also promotes expression of the precursor form of laminin 5, a characteristic of leading keratinocytes in the epidermal outgrowth. We suggest that wounding of quiescent epidermis initiates adhesion and spreading of keratinocytes at the wound edge on endogenous basement membrane laminin 5 via alpha(3)beta(1) and alpha(6)beta(4) in a Rho-independent mechanism. Spreading on endogenous laminin 5 via alpha(3)beta(1) is necessary but not sufficient to elevate expression of precursor laminin 5 and RhoGTP, allowing for subsequent collagen adhesion via alpha(2)beta(1), all characteristics of leading keratinocytes in the epidermal outgrowth.
Collapse
Affiliation(s)
- B P Nguyen
- Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | | | | | | |
Collapse
|
33
|
Abstract
Deposition of laminin 5 over exposed dermal collagen in epidermal wounds is an early event in repair of the basement membrane. We report that deposition of laminin 5 onto collagen switches adhesion and signaling from collagen-dependent to laminin 5-dependent. Ligation of laminin 5 by integrin alpha(6)beta(4) activates phosphoinositide 3-OH-kinase (PI3K) signaling. This activation allows for adhesion and spreading via integrin alpha(3)beta(1) on laminin 5 independent of RhoGTPase, a regulator of actin stress fibers. In contrast, adhesion and spreading on collagen via alpha(2)beta(1) is Rho-dependent and is inhibited by toxin B, a Rho inhibitor. Deposition of laminin 5 and ligation of alpha(6)beta(4) increases PI3K-dependent production of phosphoinositide di- and triphosphates, PI3K activity, and phosphorylation of downstream target protein c-Jun NH(2)-terminal kinase. Conversely, blocking laminin 5-deposition with brefeldin A, an inhibitor of vesicle transport, or with anti-laminin 5 monoclonal antibodies abolishes the PI3K-dependent spreading mediated by alpha(3)beta(1) and phosphorylation of c-Jun NH(2)-terminal kinase. Studies with keratinocytes lacking alpha(6)beta(4) or laminin 5 confirm that deposition of laminin 5 and ligation by alpha(6)beta(4) are required for PI3K-dependent spreading via alpha(3)beta(1). We suggest that deposition of laminin 5 onto the collagen substratum, as in wound repair, enables human foreskin keratinocytes to interact via alpha(6)beta(4) and to switch from a RhoGTPase-dependent adhesion on collagen to a PI3K-dependent adhesion and spreading mediated by integrin alpha(3)beta(1) on laminin 5.
Collapse
Affiliation(s)
- B P Nguyen
- Fred Hutchinson Cancer Research Center, Seattle, Washington 98109 and the Department of Pathobiology, University of Washington, Seattle, Washington 98195, USA
| | | | | |
Collapse
|
34
|
Abstract
Adhesion of keratinocytes in a wound outgrowth to laminin 5 in the basement membrane via integrins alpha6beta4 and alpha3beta1 is distinct from adhesion to dermal collagen via alpha2beta1 or to fibronectin via alpha5beta1. Leading cells in the outgrowth are distinguished from following keratinocytes by deposition of laminin 5, failure to communicate via gap junctions and sensitivity to toxin B, an inhibitor of RhoGTPase. Laminin 5 deposited by leading keratinocytes onto dermal collagen dominates over dermal ligands and changes the cell signals required for adhesion from collagen-dependent to laminin-5-dependent. Thus, deposition of laminin 5 can instruct keratinocytes to switch from an activated phenotype to a quiescent and integrated epithelial phenotype.
Collapse
Affiliation(s)
- B P Nguyen
- Fred Hutchinson Cancer Research Center, Division of Basic Sciences, A3-015, 1100 Fairview Avenue North, Washington 98109, Seattle, USA
| | | | | | | |
Collapse
|
35
|
Lampe PD, Nguyen BP, Gil S, Usui M, Olerud J, Takada Y, Carter WG. Cellular interaction of integrin alpha3beta1 with laminin 5 promotes gap junctional communication. J Cell Biol 1998; 143:1735-47. [PMID: 9852164 PMCID: PMC2132974 DOI: 10.1083/jcb.143.6.1735] [Citation(s) in RCA: 136] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/1998] [Revised: 10/21/1998] [Indexed: 12/04/2022] Open
Abstract
Wounding of skin activates epidermal cell migration over exposed dermal collagen and fibronectin and over laminin 5 secreted into the provisional basement membrane. Gap junctional intercellular communication (GJIC) has been proposed to integrate the individual motile cells into a synchronized colony. We found that outgrowths of human keratinocytes in wounds or epibole cultures display parallel changes in the expression of laminin 5, integrin alpha3beta1, E-cadherin, and the gap junctional protein connexin 43. Adhesion of keratinocytes on laminin 5, collagen, and fibronectin was found to differentially regulate GJIC. When keratinocytes were adhered on laminin 5, both structural (assembly of connexin 43 in gap junctions) and functional (dye transfer) assays showed a two- to threefold increase compared with collagen and five- to eightfold over fibronectin. Based on studies with immobilized integrin antibody and integrin-transfected Chinese hamster ovary cells, the interaction of integrin alpha3beta1 with laminin 5 was sufficient to promote GJIC. Mapping of intermediate steps in the pathway linking alpha3beta1-laminin 5 interactions to GJIC indicated that protein trafficking and Rho signaling were both required. We suggest that adhesion of epithelial cells to laminin 5 in the basement membrane via alpha3beta1 promotes GJIC that integrates individual cells into synchronized epiboles.
Collapse
Affiliation(s)
- P D Lampe
- Divisions of Basic Sciences and Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA.
| | | | | | | | | | | | | |
Collapse
|
36
|
Abstract
BACKGROUND Optimal asthma management requires accurate assessment of asthma severity. OBJECTIVE To compare patients' perceptions of their asthma severity with that obtained by using the guidelines published by the National Asthma Education and Prevention Program's Expert Panel and with functional impairment measured by spirometry and numeric criteria of the American Thoracic Society. METHODS We enrolled 323 patients age 18 to 50 years who were members of the Kaiser Foundation Health Plan for > or = 1 year in a randomized control trial of an asthma education program. Each had a confirmed diagnosis of bronchial asthma and had been receiving antiasthma medication for > or = 1 year. Patients rated the severity of their asthma. Office spirometry was performed, and, using the Mini-Wright peak flow meter, patients kept 2-week diaries of at-home recordings of morning and evening peak expiratory flow rates. RESULTS A statistically significant association was noted between patients' perceptions of asthma severity and both medication severity rating (P < .001) and diurnal variation rating (P = .003) and evening peak expiratory flow rate percentage (P = .019). In comparison with a severity composite based on criteria of the National Asthma Education Program, 54% of patients accurately estimated asthma severity, 27% overestimated, and 20% underestimated severity. CONCLUSION A clinically significant proportion of asthmatic patients substantially underestimate disease severity and thereby may be at risk of increased mortality or morbidity.
Collapse
Affiliation(s)
- B P Nguyen
- Department of Allergy, Kaiser Permanente Medical Center, San Francisco, California, USA
| | | | | |
Collapse
|
37
|
Abstract
In an open-label study, we compared the efficacy and safety of intravenous infusion of fenoldopam mesylate with that of sodium nitroprusside in patients with severe hypertension or in hypertensive crisis. Both antihypertensive medications were infused at a maximal dose increment of 0.2 microgram/kg/min (fenoldopam) and 1 microgram/kg/min (nitroprusside), with a maximal infusion rate of 1.5 micrograms/kg/min fenoldopam mesylate or 8 micrograms/kg/min sodium nitroprusside. Once the desired reduction in diastolic blood pressure was achieved (less than 110 mm Hg if initial diastolic blood pressure was 120-149 mm Hg, or by at least 40 mm Hg if initial diastolic blood pressure was 150-190 mm Hg), the maximal infusion rate used was maintained for at least 1 hour, and then, the infusion was slowed gradually over 2 hours. After the infusion treatment, patients remained in the hospital for 2 days of follow-up. Both antihypertensive agents successfully controlled the blood pressure in all the patients by the end of the maintenance periods. Between the baseline and the end of the maintenance period, analysis of variance showed that the changes in the variables induced by fenoldopam mesylate did not differ significantly from those induced by sodium nitroprusside. The incidence of side effects listed were similar in both groups of patients. The detection of toxic levels of thiocyanate in two patients treated with nitroprusside, however, shows that fenoldopam might be preferable for the control of a hypertensive crisis or severe hypertension in patients with decreased renal function.
Collapse
MESH Headings
- 2,3,4,5-Tetrahydro-7,8-dihydroxy-1-phenyl-1H-3-benzazepine/adverse effects
- 2,3,4,5-Tetrahydro-7,8-dihydroxy-1-phenyl-1H-3-benzazepine/analogs & derivatives
- 2,3,4,5-Tetrahydro-7,8-dihydroxy-1-phenyl-1H-3-benzazepine/therapeutic use
- Adult
- Blood Pressure/drug effects
- Blood Urea Nitrogen
- Creatine/blood
- Fenoldopam
- Ferricyanides/therapeutic use
- Heart Rate/drug effects
- Humans
- Hypertension/drug therapy
- Hypertension/metabolism
- Hypertension/physiopathology
- Injections, Intravenous
- Middle Aged
- Nitroprusside/adverse effects
- Nitroprusside/therapeutic use
- Randomized Controlled Trials as Topic
Collapse
Affiliation(s)
- E Reisin
- Nephrology Section, Louisiana State University, New Orleans
| | | | | | | | | |
Collapse
|
38
|
Affiliation(s)
- B P Nguyen
- Department of Medicine and Pathology, LSU School of Medicine, New Orleans 70112-2822
| | | | | |
Collapse
|
39
|
Saenz C, Sanders CV, Nguyen BP, Reisin E. Macronodular lesions associated with Staphylococcus aureus bacteremia. A condition that resembles a 'lymphocutaneous' syndrome. Arch Intern Med 1987; 147:793. [PMID: 3827470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
A 54-year-old man, diagnosed as having Wegener's granulomatosis and treated with a regimen of cyclophosphamide and prednisone and hemodialysis, was found to have Staphylococcus aureus in blood, urine, and pus that were removed from the infected area. He had unusual macronodular lesions of the skin that resembled the lymphocutaneous syndrome. These lesions resolved with antibiotic therapy. To our knowledge, this condition has not been described previously.
Collapse
|