Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

24
(from Reference Citation Analysis)

Article PDFs (6)

Cited by > 0 (10)

Searched Name

Bayesian neural network

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Wang Z, Li YP, Huang GH, Gong JW, Li YF, Zhang Q. A factorial-analysis-based Bayesian neural network method for quantifying China's CO₂ emissions under dual-carbon target. Sci Total Environ 2024;920:170698. [PMID: 38342455 DOI: 10.1016/j.scitotenv.2024.170698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 01/10/2024] [Accepted: 02/02/2024] [Indexed: 02/13/2024]

Abstract

Energy-structure transformation and CO2-emission reduction are becoming particularly urgent for China and many other countries. Development of effective methods that are capable of quantifying and predicting CO2 emissions to achieve carbon neutrality is desired. This study advances a factorial-analysis-based Bayesian neural network (abbreviated as FABNN) method to reflect the complex relationship between inputs and outputs as well as reveal the individual and interactive effects of multiple factors affecting CO2 emissions. FABNN is then applied to analyzing CO2 emissions of China (abbreviated as CEC), where multiple factors involve in energy (e.g., the consumption of natural gas, CONG), economic (e.g., Gross domestic product, GDP) and social (e.g., the rate of urbanization, ROU) aspects are investigated and 512 scenarios are designed to achieve the national dual carbon targets (i.e., carbon peak before 2030 and carbon neutrality by 2060). Comparing to the conventional machine learning methods, FABNN performs better in calibration and validation results, indicating that FABNN is suitable for CEC simulation and prediction. Results disclose that the top three factors affecting CEC under the dual‑carbon target are GDP, CONG, and ROU; energy, economic and social contributions are 43.5 %, 34.6 % and 21.9 %, respectively. CEC reaches its carbon peak during 2027-2032 and achieve carbon neutrality during 2053-2057 under all scenarios. Under the optimal scenario (S195), the CO2-emission reduction potential is about 772.2 million tonnes and the consumptions of coal, petroleum and natural gas can be respectively reduced by 3.1 %, 9.9 % and 23.0 % compared to the worst scenario (S466). The results can provide solid support for national energy-structure transformation and CO2-emission reduction to achieve carbon-peak and carbon-neutrality targets.

Collapse

Dong Z, Chen X, Ritter J, Bai L, Huang J. American society of anesthesiologists physical status classification significantly affects the performances of machine learning models in intraoperative hypotension inference. J Clin Anesth 2024;92:111309. [PMID: 37922642 PMCID: PMC10873053 DOI: 10.1016/j.jclinane.2023.111309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 09/24/2023] [Accepted: 10/24/2023] [Indexed: 11/07/2023]

Abstract

STUDY OBJECTIVE

To explore how American Society of Anesthesiologists (ASA) physical status classification affects different machine learning models in hypotension prediction and whether the prediction uncertainty could be quantified.

DESIGN

Observational Studies SETTING: UofL health hospital PATIENTS: This study involved 562 hysterectomy surgeries performed on patients (≥ 18 years) between June 2020 and July 2021.

INTERVENTIONS

None MEASUREMENTS: Preoperative and intraoperative data is collected. Three parametric machine learning models, including Bayesian generalized linear model (BGLM), Bayesian neural network (BNN), a newly proposed BNN with multivariate mixed responses (BNNMR), and one nonparametric model, Gaussian Process (GP), were explored to predict patients' diastolic and systolic blood pressures (continuous responses) and patients' hypotensive event (binary response) for the next five minutes. Data was separated into American Society of Anesthesiologists (ASA) physical status class 1- 4 before being read in by four machine learning models. Statistical analysis and models' constructions are performed in Python. Sensitivity, specificity, and the confidence/credible intervals were used to evaluate the prediction performance of each model for each ASA physical status class.

MAIN RESULTS

ASA physical status classes require distinct models to accurately predict intraoperative blood pressures and hypotensive events. Overall, high sensitivity (above 0.85) and low uncertainty can be achieved by all models for ASA class 4 patients. In contrast, models trained without controlling ASA classes yielded lower sensitivity (below 0.5) and larger uncertainty. Particularly, in terms of predicting binary hypotensive event, for ASA physical status class 1, BNNMR yields the highest sensitivity of 1. For classes 2 and 3, BNN has the highest sensitivity of 0.429 and 0.415, respectively. For class 4, BNNMR and GP are tied with the highest sensitivity of 0.857. On the other hand, the sensitivity is just 0.031, 0.429, 0.165 and 0.305 for BNNMR, BNN, GBLM and GP models respectively, when training data is not divided by ASA physical status classes. In terms of predicting systolic blood pressure, the GP regression yields the lowest root mean squared errors (RMSE) of 2.072, 7.539, 9.214 and 0.295 for ASA physical status classes 1, 2, 3 and 4, respectively, but a RMSE of 126.894 if model is trained without controlling the ASA physical status class. The RMSEs for other models are far higher. RMSEs are 2.175, 13.861, 17.560 and 22.426 for classes 1, 2, 3 and 4 respectively for the BGLM. In terms of predicting diastolic blood pressure, the GP regression yields the lowest RMSEs of 2.152, 6.573, 5.371 and 0.831 for ASA physical status classes 1, 2, 3 and 4, respectively; RMSE of 8.084 if model is trained without controlling the ASA physical status class. The RMSEs for other models are far higher. Finally, in terms of the width of the 95% confidence interval of the mean prediction for systolic and diastolic blood pressures, GP regression gives narrower confidence interval with much smaller margin of error across all four ASA physical status classes.

CONCLUSIONS

Different ASA physical status classes present different data distributions, and thus calls for distinct machine learning models to improve prediction accuracy and reduce predictive uncertainty. Uncertainty quantification enabled by Bayesian inference provides valuable information for clinicians as an additional metric to evaluate performance of machine learning models for medical decision making.

Collapse

Dai H, Liu Y, Wang J, Ren J, Gao Y, Dong Z, Zhao B. Large-scale spatiotemporal deep learning predicting urban residential indoor PM_2.5 concentration. Environ Int 2023;182:108343. [PMID: 38029622 DOI: 10.1016/j.envint.2023.108343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 11/09/2023] [Accepted: 11/20/2023] [Indexed: 12/01/2023]

Lu J, Huang Z, Zhuang B, Cheng Z, Guo J, Lou H. Development and evaluation of a robotic system for lumbar puncture and epidural steroid injection. Front Neurorobot 2023;17:1253761. [PMID: 37881516 PMCID: PMC10595035 DOI: 10.3389/fnbot.2023.1253761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 08/11/2023] [Indexed: 10/27/2023] Open

Tao C. Applications of Bayesian Neural Networks in Outlier Detection. Big Data 2023;11:369-386. [PMID: 36706252 DOI: 10.1089/big.2021.0343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]

Abstract

Anomaly detection is crucial in a variety of domains, such as fraud detection, disease diagnosis, and equipment defect detection. With the development of deep learning, anomaly detection with Bayesian neural networks (BNNs) becomes a novel research topic in recent years. This article aims to propose a widely applicable method of outlier detection (a category of anomaly detection) using BNNs based on uncertainty measurement. There are three kinds of uncertainties generated in the prediction of BNNs: epistemic uncertainty, aleatoric uncertainty, and (model) misspecification uncertainty. Although the approaches in previous studies are adopted to measure epistemic and aleatoric uncertainty, a new method of utilizing loss functions to quantify misspecification uncertainty is proposed in this article. Then, these three uncertainty sources are merged together by specific combination models to construct total prediction uncertainty. In this study, the key idea is that the observations with high total prediction uncertainty should correspond to outliers in the data. The method of this research is applied to the experiments on Modified National Institute of Standards and Technology (MNIST) dataset and Taxi dataset, respectively. From the results, if the network is appropriately constructed and well-trained and model parameters are carefully tuned, most anomalous images in MNIST dataset and all the abnormal traffic periods in Taxi dataset can be nicely detected. In addition, the performance of this method is compared with the BNN anomaly detection methods proposed before and the classical Local Outlier Factor and Density-Based Spatial Clustering of Applications with Noise methods. This study links the classification of uncertainties in essence with anomaly detection and takes the lead to consider combining different uncertainty sources to reform detection outcomes instead of using only single uncertainty each time.

Collapse

Gao D, Xie X, Wei D. A Design Methodology for Fault-Tolerant Neuromorphic Computing Using Bayesian Neural Network. Micromachines (Basel) 2023;14:1840. [PMID: 37893277 PMCID: PMC10608997 DOI: 10.3390/mi14101840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 09/22/2023] [Accepted: 09/25/2023] [Indexed: 10/29/2023]

Goka R, Moroto Y, Maeda K, Ogawa T, Haseyama M. Prediction of Shooting Events in Soccer Videos Using Complete Bipartite Graphs and Players' Spatial-Temporal Relations. Sensors (Basel) 2023;23:s23094506. [PMID: 37177712 PMCID: PMC10181557 DOI: 10.3390/s23094506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 05/01/2023] [Accepted: 05/03/2023] [Indexed: 05/15/2023]

Zhang Q, Bu Z, Chen K, Long Q. Differentially Private Bayesian Neural Networks on Accuracy, Privacy and Reliability. Mach Learn Knowl Discov Databases 2023;13716:604-619. [PMID: 37602203 PMCID: PMC10438902 DOI: 10.1007/978-3-031-26412-2_37] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/22/2023]

Gong H, Leng S, Baffour F, Yu L, Fletcher JG, McCollough CH. Multi-energy CT material decomposition using Bayesian deep convolutional neural network with explicit penalty of uncertainty and bias. Proc SPIE Int Soc Opt Eng 2023;12463:124633M. [PMID: 37063491 PMCID: PMC10099768 DOI: 10.1117/12.2654317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]

Abstract

Convolutional neural network (CNN)-based material decomposition has the potential to improve image quality (visual appearance) and quantitative accuracy of material maps. Most methods use deterministic CNNs with mean-square-error loss to provide point-estimates of mass densities. Point estimates can be over-confident as the reliability of CNNs is frequently compromised by bias and two major uncertainties - data and model uncertainties originating from noise in inputs and train-test data dissimilarity, respectively. Also, mean-square-error lacks explicit control of uncertainty and bias. To tackle these problems, a Bayesian dual-task CNN (BDT-CNN) with explicit penalization of uncertainty and bias was developed. It is a probabilistic CNN that concurrently conducts material classification and quantification and allows for pixel-wise modeling of bias, data uncertainty, and model uncertainty. CNN was trained with images of physical and simulated tissue-mimicking inserts at varying mass densities. Hydroxyapatite (nominal density 400mg/cc) and blood (nominal density 1095mg/cc) inserts were placed in different-sized body phantoms (30 - 45cm) and used to evaluate mean-absolute-bias (MAB) in predicted mass densities across different images at routine- and half-routine-dose. Patient CT exams were collected to assess generalizability of BDT-CNN in the presence of anatomical background. Noise insertion was used to simulate patient exams at half- and quarter-routine-dose. The deterministic dual-task CNN was used as baseline. In phantoms, BDT-CNN improved consistency of insert delineation, especially edges, and reduced overall bias (average MAB for hydroxyapatite: BDT-CNN 5.4mgHA/cc, baseline 11.0mgHA/cc and blood: BDT-CNN 8.9mgBlood/cc, baseline 14.0mgBlood/cc). In patient images, BDT-CNN improved detail preservation, lesion conspicuity, and structural consistency across different dose levels.

Collapse

Gong H, Yu L, Leng S, Hsieh SS, Fletcher JG, McCollough CH. Patient-specific uncertainty and bias quantification of non-transparent convolutional neural network model through knowledge distillation and Bayesian deep learning. Proc SPIE Int Soc Opt Eng 2023;12463:124631K. [PMID: 37063493 PMCID: PMC10100102 DOI: 10.1117/12.2654318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]

Lee K, Cho D, Jang J, Choi K, Jeong HO, Seo J, Jeong WK, Lee S. RAMP: response-aware multi-task learning with contrastive regularization for cancer drug response prediction. Brief Bioinform 2023;24:6865135. [PMID: 36460623 DOI: 10.1093/bib/bbac504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 10/13/2022] [Accepted: 10/24/2022] [Indexed: 12/05/2022] Open

Phan TC, Pranata A, Farragher J, Bryant A, Nguyen HT, Chai R. Machine Learning Derived Lifting Techniques and Pain Self-Efficacy in People with Chronic Low Back Pain. Sensors (Basel) 2022;22:s22176694. [PMID: 36081153 PMCID: PMC9460822 DOI: 10.3390/s22176694] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 08/16/2022] [Accepted: 08/31/2022] [Indexed: 05/14/2023]

Lee HH, Kim H. Bayesian deep learning-based ¹ H-MRS of the brain: Metabolite quantification with uncertainty estimation using Monte Carlo dropout. Magn Reson Med 2022;88:38-52. [PMID: 35344604 DOI: 10.1002/mrm.29214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 01/14/2022] [Accepted: 02/11/2022] [Indexed: 11/09/2022]

Abstract

PURPOSE

To develop a Bayesian convolutional neural network (BCNN) with Monte Carlo dropout sampling for metabolite quantification with simultaneous uncertainty estimation in deep learning-based proton MRS of the brain.

METHODS

Human brain spectra were simulated using basis spectra for 17 metabolites and macromolecules (N = 100 000) at 3.0 Tesla. In addition, actual in vivo spectra (N = 5) were modified by adjusting SNR and linewidth with increasing severity of spectral degradation (N = 50). A BCNN was trained on the simulated spectra to generate a noise-free, line-narrowed, macromolecule signal-removed, metabolite-only spectrum from a typical human brain spectrum. At inference, each input spectrum was Monte Carlo dropout sampled (50 times), and the resulting mean spectrum and variance spectrum were used for metabolite quantification and uncertainty estimation, respectively.

RESULTS

Using the simulated spectra, the mean absolute percent errors of the BCNN-predicted metabolite content were < 10% for Cr, Glu, Gln, mI, NAA, and Tau (< 5% for Glu, NAA, and mI). For all metabolites, the correlations (r's) between the ground-truth error and BCNN-predicted uncertainty ranged 0.72-0.94 (0.83 ± 0.06; p < 0.001). Using the modified in vivo spectra, the extent of variation in the estimated metabolite content against the increasing severity of spectral degradation tended to be smaller with BCNN than with linear combination of model spectra (LCModel). Overall, the variation in metabolite content tended to be more highly correlated with the uncertainty from BCNN than with the Cramér-Rao lower-bounds from LCModel (0.938 ± 0.019 vs. 0.881 ± 0.057 [p = 0.115]).

CONCLUSION

The BCNN with Monte Carlo dropout sampling may be used in deep learning-based MRS for the estimation of uncertainty in the machine-predicted metabolite content, which is important in the clinical application of deep learning-based MRS.

Collapse

Sun Y, Song Q, Liang F. Learning Sparse Deep Neural Networks with a Spike-and-Slab Prior. Stat Probab Lett 2022;180:109246. [PMID: 34744226 PMCID: PMC8570537 DOI: 10.1016/j.spl.2021.109246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Sharma S, Chatterjee S. Winsorization for Robust Bayesian Neural Networks. Entropy (Basel) 2021;23:1546. [PMID: 34828244 DOI: 10.3390/e23111546] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 11/15/2021] [Accepted: 11/18/2021] [Indexed: 11/17/2022]

Xiong H, Berkovsky S, Romano M, Sharan RV, Liu S, Coiera E, McLellan LF. Prediction of anxiety disorders using a feature ensemble based bayesian neural network. J Biomed Inform 2021;123:103921. [PMID: 34628061 DOI: 10.1016/j.jbi.2021.103921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 09/16/2021] [Accepted: 09/29/2021] [Indexed: 11/25/2022]

Wang D, Yu J, Chen L, Li X, Jiang H, Chen K, Zheng M, Luo X. A hybrid framework for improving uncertainty quantification in deep learning-based QSAR regression modeling. J Cheminform 2021;13:69. [PMID: 34544485 PMCID: PMC8454160 DOI: 10.1186/s13321-021-00551-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 09/05/2021] [Indexed: 11/24/2022] Open

Affiliation(s)

Dingyan Wang Shanghai Key Laboratory of Forensic Medicine, Academy of Forensic Science, Shanghai, 200063, China University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China
Jie Yu University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China
Lifan Chen University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China
Xutong Li University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China
Hualiang Jiang University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China
Kaixian Chen University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China
Mingyue Zheng University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China. Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China.
Xiaomin Luo Shanghai Key Laboratory of Forensic Medicine, Academy of Forensic Science, Shanghai, 200063, China. University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing, 100049, China. Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China.

Collapse

Gokmen T. Enabling Training of Neural Networks on Noisy Hardware. Front Artif Intell 2021;4:699148. [PMID: 34568813 PMCID: PMC8458875 DOI: 10.3389/frai.2021.699148] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 08/16/2021] [Indexed: 11/13/2022] Open

Abstract

Deep neural networks (DNNs) are typically trained using the conventional stochastic gradient descent (SGD) algorithm. However, SGD performs poorly when applied to train networks on non-ideal analog hardware composed of resistive device arrays with non-symmetric conductance modulation characteristics. Recently we proposed a new algorithm, the Tiki-Taka algorithm, that overcomes this stringent symmetry requirement. Here we build on top of Tiki-Taka and describe a more robust algorithm that further relaxes other stringent hardware requirements. This more robust second version of the Tiki-Taka algorithm (referred to as TTv2) 1. decreases the number of device conductance states requirement from 1000s of states to only 10s of states, 2. increases the noise tolerance to the device conductance modulations by about 100x, and 3. increases the noise tolerance to the matrix-vector multiplication performed by the analog arrays by about 10x. Empirical simulation results show that TTv2 can train various neural networks close to their ideal accuracy even at extremely noisy hardware settings. TTv2 achieves these capabilities by complementing the original Tiki-Taka algorithm with lightweight and low computational complexity digital filtering operations performed outside the analog arrays. Therefore, the implementation cost of TTv2 compared to SGD and Tiki-Taka is minimal, and it maintains the usual power and speed benefits of using analog hardware for training workloads. Here we also show how to extract the neural network from the analog hardware once the training is complete for further model deployment. Similar to Bayesian model averaging, we form analog hardware compatible averages over the neural network weights derived from TTv2 iterates. This model average then can be transferred to another analog or digital hardware with notable improvements in test accuracy, transcending the trained model itself. In short, we describe an end-to-end training and model extraction technique for extremely noisy crossbar-based analog hardware that can be used to accelerate DNN training workloads and match the performance of full-precision SGD.

Collapse

Shi L, Copot C, Vanlanduit S. A Bayesian Deep Neural Network for Safe Visual Servoing in Human-Robot Interaction. Front Robot AI 2021;8:687031. [PMID: 34222355 PMCID: PMC8247479 DOI: 10.3389/frobt.2021.687031] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Accepted: 05/24/2021] [Indexed: 11/13/2022] Open

Cocco L, Tonelli R, Marchesi M. Predictions of bitcoin prices through machine learning based frameworks. PeerJ Comput Sci 2021;7:e413. [PMID: 33834099 PMCID: PMC8022579 DOI: 10.7717/peerj-cs.413] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Accepted: 02/04/2021] [Indexed: 06/02/2023]

Godefroy G, Arnal B, Bossy E. Compensating for visibility artefacts in photoacoustic imaging with a deep learning approach providing prediction uncertainties. Photoacoustics 2021;21:100218. [PMID: 33364161 PMCID: PMC7750172 DOI: 10.1016/j.pacs.2020.100218] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Revised: 10/15/2020] [Accepted: 10/17/2020] [Indexed: 05/04/2023]

Xue W, Guo T, Ni D. Left ventricle quantification with sample-level confidence estimation via Bayesian neural network. Comput Med Imaging Graph 2020;84:101753. [PMID: 32755759 DOI: 10.1016/j.compmedimag.2020.101753] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 06/24/2020] [Accepted: 07/03/2020] [Indexed: 11/28/2022]

Manogaran G, Shakeel PM, Fouad H, Nam Y, Baskar S, Chilamkurti N, Sundarasekar R. Wearable IoT Smart-Log Patch: An Edge Computing-Based Bayesian Deep Learning Network System for Multi Access Physical Monitoring System. Sensors (Basel) 2019;19:E3030. [PMID: 31324070 DOI: 10.3390/s19133030] [Citation(s) in RCA: 126] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 06/26/2019] [Accepted: 07/01/2019] [Indexed: 12/27/2022]

Yin T, Zhu HP. Probabilistic Damage Detection of a Steel Truss Bridge Model by Optimally Designed Bayesian Neural Network. Sensors (Basel) 2018;18:E3371. [PMID: 30304848 DOI: 10.3390/s18103371] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 10/03/2018] [Accepted: 10/06/2018] [Indexed: 12/01/2022]