1
|
Yang G, Xiao Q, Zhang Z, Yu Z, Wang X, Lu Q. Exploring AI in metasurface structures with forward and inverse design. iScience 2025; 28:111995. [PMID: 40104054 PMCID: PMC11914293 DOI: 10.1016/j.isci.2025.111995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2025] Open
Abstract
As an artificially manufactured planar device, a metasurface structure can produce unusual electromagnetic responses by harnessing four basic characteristics of the light wave. Traditional design processes rely on numerical algorithms combined with parameter optimization. However, such methods are often time-consuming and struggle to match actual responses. This paper aims to give a unique perspective to classify the artificial intelligence(AI)-enabled design, dividing it into forward and inverse designs according to the mapping relationship between variables and performance. Forward designs are driven by intelligent algorithms; neural networks are one of the principal ways to realize reverse design. This paper reviews recent progress in AI-enabled metasurface design, examining the principles, advantages, and potential applications. A rich content and detailed comparison can help build a holistic understanding of metasurface design. Moreover, the authors believe that this systematic and detailed review will pave the way for future research and the selection of practical applications.
Collapse
Affiliation(s)
- Guantai Yang
- Frontiers Science Center for Flexible Electronics (FSCFE) Institute of Flexible Electronics (IFE), Northwestern Polytechnical University, Xi'an 710072, China
- School of Automation, Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an 710072, China
| | - Qingxiong Xiao
- School of Automation, Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an 710072, China
| | - Zhilin Zhang
- School of Automation, Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an 710072, China
| | - Zhe Yu
- College of Optical Science and Engineering, Zhejiang University, Hangzhou 310027, China
| | - Xiaoxu Wang
- School of Automation, Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an 710072, China
| | - Qianbo Lu
- Frontiers Science Center for Flexible Electronics (FSCFE) Institute of Flexible Electronics (IFE), Northwestern Polytechnical University, Xi'an 710072, China
| |
Collapse
|
2
|
Roy A, Saha P, Gautam N, Schwenker F, Sarkar R. Adaptive genetic algorithm based deep feature selector for cancer detection in lung histopathological images. Sci Rep 2025; 15:4803. [PMID: 39922836 PMCID: PMC11807166 DOI: 10.1038/s41598-025-86362-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 01/10/2025] [Indexed: 02/10/2025] Open
Abstract
Cancer is a global health concern because of a significant mortality rate and a wide range of affected organs. Early detection and accurate classification of cancer types are crucial for effective treatment. Imaging tests on different image modalities such as Histopathology images, provide valuable insights into the cellular and architectural features of tissues, allowing pathologists to make diagnosis, determine disease stages, and guide treatment decisions. They are an essential tool in the study and understanding of diseases, aiding in research, education, and patient care. Convolutional neural network based pretrained deep learning models can be used successfully to detect lung cancer. In this study, we have used a channel attention-enabled deep learning model as a feature extractor followed by an adaptive Genetic Algorithm (GA) based feature selector. Here, we calculate the fitness score of each chromosome (i.e., a candidate solution) using a filter method, instead of a classifier. Further, the GA optimized feature vector is fed to the K-nearest neighbors classifier for final classification. The proposed method shows a promising result with an overall accuracy of 99.75% on the LC25000 dataset, which is a publicly available dataset of lung histopathological images. The source code for this work can be found https://github.com/priyam-03/GA-Feature-Selector-Lung-Cancer .
Collapse
Affiliation(s)
- Avigyan Roy
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| | - Priyam Saha
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| | - Nandita Gautam
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India.
| | | | - Ram Sarkar
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| |
Collapse
|
3
|
Li J, Zheng M, Dong D, Xie X. PC-CS-YOLO: High-Precision Obstacle Detection for Visually Impaired Safety. SENSORS (BASEL, SWITZERLAND) 2025; 25:534. [PMID: 39860905 PMCID: PMC11768684 DOI: 10.3390/s25020534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2024] [Revised: 01/10/2025] [Accepted: 01/16/2025] [Indexed: 01/27/2025]
Abstract
The issue of obstacle avoidance and safety for visually impaired individuals has been a major topic of research. However, complex street environments still pose significant challenges for blind obstacle detection systems. Existing solutions often fail to provide real-time, accurate obstacle avoidance decisions. In this study, we propose a blind obstacle detection system based on the PC-CS-YOLO model. The system improves the backbone network by adopting the partial convolutional feed-forward network (PCFN) to reduce computational redundancy. Additionally, to enhance the network's robustness in multi-scale feature fusion, we introduce the Cross-Scale Attention Fusion (CSAF) mechanism, which integrates features from different sensory domains to achieve superior performance. Compared to state-of-the-art networks, our system shows improvements of 2.0%, 3.9%, and 1.5% in precision, recall, and mAP50, respectively. When evaluated on a GPU, the inference speed is 20.6 ms, which is 15.3 ms faster than YOLO11, meeting the real-time requirements for blind obstacle avoidance systems.
Collapse
Affiliation(s)
- Jincheng Li
- School of Artificial Intelligence and Computer Science, Nantong University, Nantong 226019, China; (J.L.); (M.Z.); (D.D.)
| | - Menglin Zheng
- School of Artificial Intelligence and Computer Science, Nantong University, Nantong 226019, China; (J.L.); (M.Z.); (D.D.)
| | - Danyang Dong
- School of Artificial Intelligence and Computer Science, Nantong University, Nantong 226019, China; (J.L.); (M.Z.); (D.D.)
| | - Xing Xie
- Engineering Training Center, Nantong University, Nantong 226019, China
| |
Collapse
|
4
|
Li M, Feng Y, Wu X. AttentionTTE: a deep learning model for estimated time of arrival. Front Artif Intell 2024; 7:1258086. [PMID: 39247849 PMCID: PMC11378341 DOI: 10.3389/frai.2024.1258086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 07/22/2024] [Indexed: 09/10/2024] Open
Abstract
Estimating travel time (ETA) for arbitrary paths is crucial in urban intelligent transportation systems. Previous studies primarily focus on constructing complex feature systems for individual road segments or sub-segments, which fail to effectively model the influence of each road segment on others. To address this issue, we propose an end-to-end model, AttentionTTE. It utilizes a self-attention mechanism to capture global spatial correlations and a recurrent neural network to capture temporal dependencies from local spatial correlations. Additionally, a multi-task learning module integrates global spatial correlations and temporal dependencies to estimate the travel time for both the entire path and each local path. We evaluate our model on a large trajectory dataset, and extensive experimental results demonstrate that AttentionTTE achieves state-of-the-art performance compared to other methods.
Collapse
Affiliation(s)
- Mu Li
- School of Computer Science and Engineering, Beihang University, Beijing, China
| | - Yijun Feng
- School of Computer Science and Engineering, Beihang University, Beijing, China
| | - Xiangdong Wu
- Ecole Centrale de Pékin, Beihang University, Beijing, China
| |
Collapse
|
5
|
Bai X, Zhang N, Cao X, Chen W. Prediction of PM 2.5 concentration based on a CNN-LSTM neural network algorithm. PeerJ 2024; 12:e17811. [PMID: 39131620 PMCID: PMC11313410 DOI: 10.7717/peerj.17811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Accepted: 07/03/2024] [Indexed: 08/13/2024] Open
Abstract
Fine particulate matter (PM2.5) is a major air pollutant affecting human survival, development and health. By predicting the spatial distribution concentration of PM2.5, pollutant sources can be better traced, allowing measures to protect human health to be implemented. Thus, the purpose of this study is to predict and analyze the PM2.5 concentration of stations based on the integrated deep learning of a convolutional neural network long short-term memory (CNN-LSTM) model. To solve the complexity and nonlinear characteristics of PM2.5 time series data problems, we adopted the CNN-LSTM deep learning model. We collected the PM2.5data of Qingdao in 2020 as well as meteorological factors such as temperature, wind speed and air pressure for pre-processing and characteristic analysis. Then, the CNN-LSTM deep learning model was integrated to capture the temporal and spatial features and trends in the data. The CNN layer was used to extract spatial features, while the LSTM layer was used to learn time dependencies. Through comparative experiments and model evaluation, we found that the CNN-LSTM model can achieve excellent PM2.5 prediction performance. The results show that the coefficient of determination (R2) is 0.91, and the root mean square error (RMSE) is 8.216 µg/m3. The CNN-LSTM model achieves better prediction accuracy and generalizability compared with those of the CNN and LSTM models (R2 values of 0.85 and 0.83, respectively, and RMSE values of 11.356 and 14.367, respectively). Finally, we analyzed and explained the predicted results. We also found that some meteorological factors (such as air temperature, pressure, and wind speed) have significant effects on the PM2.5 concentration at ground stations in Qingdao. In summary, by using deep learning methods, we obtained better prediction performance and revealed the association between PM2.5 concentration and meteorological factors. These findings are of great significance for improving the quality of the atmospheric environment and protecting public health.
Collapse
Affiliation(s)
- Xuesong Bai
- School of Information and Control Engineering, Qingdao University of Technology, Qingdao, Qingdao City, Shandong Province, China
| | - Na Zhang
- School of Information and Control Engineering, Qingdao University of Technology, Qingdao, Qingdao City, Shandong Province, China
| | - Xiaoyi Cao
- Key Laboratory for Semi-Arid Climate Change of the Ministry of Education, College of Atmospheric Sciences, Lanzhou University, Lanzhou, Lanzhou City, Gansu Province, China
| | - Wenqian Chen
- School of Information and Control Engineering, Qingdao University of Technology, Qingdao, Qingdao City, Shandong Province, China
| |
Collapse
|
6
|
Dai W, Zhu W, Zhou G, Liu G, Xu J, Zhou H, Hu Y, Liu Z, Li J, Li L. AISOA-SSformer: An Effective Image Segmentation Method for Rice Leaf Disease Based on the Transformer Architecture. PLANT PHENOMICS (WASHINGTON, D.C.) 2024; 6:0218. [PMID: 39105185 PMCID: PMC11298559 DOI: 10.34133/plantphenomics.0218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Accepted: 06/21/2024] [Indexed: 08/07/2024]
Abstract
Rice leaf diseases have an important impact on modern farming, threatening crop health and yield. Accurate semantic segmentation techniques are crucial for segmenting diseased leaf parts and assisting farmers in disease identification. However, the diversity of rice growing environments and the complexity of leaf diseases pose challenges. To address these issues, this study introduces an innovative semantic segmentation algorithm for rice leaf pests and diseases based on the Transformer architecture AISOA-SSformer. First, it features the sparse global-update perceptron for real-time parameter updating, enhancing model stability and accuracy in learning irregular leaf features. Second, the salient feature attention mechanism is introduced to separate and reorganize features using the spatial reconstruction module (SRM) and channel reconstruction module (CRM), focusing on salient feature extraction and reducing background interference. Additionally, the annealing-integrated sparrow optimization algorithm fine-tunes the sparrow algorithm, gradually reducing the stochastic search amplitude to minimize loss. This enhances the model's adaptability and robustness, particularly against fuzzy edge features. The experimental results show that AISOA-SSformer achieves an 83.1% MIoU, an 80.3% Dice coefficient, and a 76.5% recall on a homemade dataset, with a model size of only 14.71 million parameters. Compared with other popular algorithms, it demonstrates greater accuracy in rice leaf disease segmentation. This method effectively improves segmentation, providing valuable insights for modern plantation management. The data and code used in this study will be open sourced at https://github.com/ZhouGuoXiong/Rice-Leaf-Disease-Segmentation-Dataset-Code.
Collapse
Affiliation(s)
- Weisi Dai
- Faculty of Electronic Information and Physics,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Wenke Zhu
- College of Bangor,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Guoxiong Zhou
- Faculty of Electronic Information and Physics,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Genhua Liu
- Faculty of Electronic Information and Physics,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Jiaxin Xu
- Faculty of Electronic Information and Physics,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Hongliang Zhou
- Faculty of Electronic Information and Physics,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Yahui Hu
- Plant Protection Institute, Hunan Academy of Agricultural Sciences, Changsha, 410125 Hunan, China
| | - Zewei Liu
- Faculty of Electronic Information and Physics,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Jinyang Li
- Faculty of Electronic Information and Physics,
Central South University of Forestry and Technology, Changsha, 410004 Hunan, China
| | - Liujun Li
- Department of Soil and Water Systems,
University of Idaho, Moscow, ID 83844, USA
| |
Collapse
|
7
|
Liu X, Chen J, Zhang Q, Zhang X, Wei E, Wang N, Wang Q, Wang J, Chen J. Floating on groundwater: Insight of multi-source remote sensing for Qaidam basin. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 365:121513. [PMID: 38909574 DOI: 10.1016/j.jenvman.2024.121513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 05/03/2024] [Accepted: 06/16/2024] [Indexed: 06/25/2024]
Abstract
Situated in the north of the Qinghai-Tibet Plateau, the Qaidam Basin experiences limited precipitation and significant evaporation. Despite these conditions, it stands out as one of the most densely distributed lakes in China. The formation of these lakes is controversial: whether the lake water primarily originates from local precipitation or external water sources. To address this issue, this paper explores the recharge sources of lakes in the Qaidam Basin and the circulation patterns of groundwater from a remote sensing perspective. Based on deep learning networks, we optimized the soft object regions of the Object-Contextual Representations Network (OCRNet) and proposed the Remote·Sensing Adaptive-Improved OCRNet (RSA-IOCRNet). Compared with seven other networks, RSA-IOCRNet obtained better experimental results and was used to construct an area sequence of 16 major lakes in the Qaidam Basin. Combined with multi-source data, the comprehensive analysis indicates no significant correlation between climatic factors and lake changes, while an obvious correlation between lakes and groundwater changes in the eastern Qaidam, consisting with the results of the field survey. Deep-circulating groundwater recharges numerous Qaidam lakes through upwelling from fault zones, such as Gasikule Lake and Xiaochaidan Lake. Groundwater in the Qaidam Basin is more depleted in hydrogen-oxygen isotope characteristics than surface water in the basin, but similar to some river water in the endorheic Tibetan Plateau. This indicates that Tibetan seepage water, estimated at approximately 540 billion m3/a, is transported through the Qaidam Basin via deep circulation. Moreover, it rises to recharge the groundwater and lakes within this basin through fracture zones, extending to various arid and semi-arid regions such as Taitema Lake. This work provides a new perspective on the impact of deep groundwater on lakes and water circulation in these areas.
Collapse
Affiliation(s)
- Xiangmei Liu
- College of Artificial Intelligence and Automation, Hohai University, Changzhou, 213200, China
| | - Jiaqi Chen
- College of Information Science and Engineering, Hohai University, Changzhou, 213200, China.
| | - Qiwen Zhang
- Navy Submarine Academy, Qingdao, 266199, China
| | - Xi Zhang
- School of Earth Science and Engineering, Hohai University, Nanjing, 211100, China; Institute of Hydrogeology and Environmental Geology, Chinese Academy of Geological Sciences, Xiamen, 361000, China
| | - Ersa Wei
- College of Computer Science and Software Engineering, Hohai University, Nanjing, 211100, China
| | - Nuoya Wang
- College of Information Science and Engineering, Hohai University, Changzhou, 213200, China
| | - Qingwei Wang
- College of Computer Science and Software Engineering, Hohai University, Nanjing, 211100, China
| | - Jiahan Wang
- College of Information Science and Engineering, Hohai University, Changzhou, 213200, China
| | - Jiansheng Chen
- School of Earth Science and Engineering, Hohai University, Nanjing, 211100, China
| |
Collapse
|
8
|
Yilmaz S, Tasyurek M, Amuk M, Celik M, Canger EM. Developing deep learning methods for classification of teeth in dental panoramic radiography. Oral Surg Oral Med Oral Pathol Oral Radiol 2024; 138:118-127. [PMID: 37316425 DOI: 10.1016/j.oooo.2023.02.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 09/13/2022] [Accepted: 02/10/2023] [Indexed: 06/16/2023]
Abstract
OBJECTIVES We aimed to develop an artificial intelligence-based clinical dental decision-support system using deep-learning methods to reduce diagnostic interpretation error and time and increase the effectiveness of dental treatment and classification. STUDY DESIGN We compared the performance of 2 deep-learning methods, You Only Look Once V4 (YOLO-V4) and Faster Regions with the Convolutional Neural Networks (R-CNN), for tooth classification in dental panoramic radiography for tooth classification in dental panoramic radiography to determine which is more successful in terms of accuracy, time, and detection ability. Using a method based on deep-learning models trained on a semantic segmentation task, we analyzed 1200 panoramic radiographs selected retrospectively. In the classification process, our model identified 36 classes, including 32 teeth and 4 impacted teeth. RESULTS The YOLO-V4 method achieved a mean 99.90% precision, 99.18% recall, and 99.54% F1 score. The Faster R-CNN method achieved a mean 93.67% precision, 90.79% recall, and 92.21% F1 score. Experimental evaluations showed that the YOLO-V4 method outperformed the Faster R-CNN method in terms of accuracy of predicted teeth in the tooth classification process, speed of tooth classification, and ability to detect impacted and erupted third molars. CONCLUSIONS The YOLO-V4 method outperforms the Faster R-CNN method in terms of accuracy of tooth prediction, speed of detection, and ability to detect impacted third molars and erupted third molars. The proposed deep learning based methods can assist dentists in clinical decision making, save time, and reduce the negative effects of stress and fatigue in daily practice.
Collapse
Affiliation(s)
- Serkan Yilmaz
- Faculty of Dentistry, Department of Oral and Maxillofacial Radiology, Erciyes University, Kayseri, Turkey
| | - Murat Tasyurek
- Department of Computer Engineering, Kayseri University, Kayseri, Turkey
| | - Mehmet Amuk
- Faculty of Dentistry, Department of Oral and Maxillofacial Radiology, Erciyes University, Kayseri, Turkey
| | - Mete Celik
- Department of Computer Engineering, Erciyes University, Kayseri, Turkey
| | - Emin Murat Canger
- Faculty of Dentistry, Department of Oral and Maxillofacial Radiology, Erciyes University, Kayseri, Turkey.
| |
Collapse
|
9
|
Ma X, Li T, Deng J, Li T, Li J, Chang C, Wang R, Li G, Qi T, Hao S. Infrared and Visible Image Fusion Algorithm Based on Double-Domain Transform Filter and Contrast Transform Feature Extraction. SENSORS (BASEL, SWITZERLAND) 2024; 24:3949. [PMID: 38931733 PMCID: PMC11207559 DOI: 10.3390/s24123949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 06/15/2024] [Accepted: 06/16/2024] [Indexed: 06/28/2024]
Abstract
Current challenges in visible and infrared image fusion include color information distortion, texture detail loss, and target edge blur. To address these issues, a fusion algorithm based on double-domain transform filter and nonlinear contrast transform feature extraction (DDCTFuse) is proposed. First, for the problem of incomplete detail extraction that exists in the traditional transform domain image decomposition, an adaptive high-pass filter is proposed to decompose images into high-frequency and low-frequency portions. Second, in order to address the issue of fuzzy fusion target caused by contrast loss during the fusion process, a novel feature extraction algorithm is devised based on a novel nonlinear transform function. Finally, the fusion results are optimized and color-corrected by our proposed spatial-domain logical filter, in order to solve the color loss and edge blur generated in the fusion process. To validate the benefits of the proposed algorithm, nine classical algorithms are compared on the LLVIP, MSRS, INO, and Roadscene datasets. The results of these experiments indicate that the proposed fusion algorithm exhibits distinct targets, provides comprehensive scene information, and offers significant image contrast.
Collapse
Affiliation(s)
- Xu Ma
- College of Safety Science and Engineering, Xi’an University of Science and Technology, Xi’an 710054, China;
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Tianqi Li
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Jun Deng
- College of Safety Science and Engineering, Xi’an University of Science and Technology, Xi’an 710054, China;
| | - Tong Li
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Jiahao Li
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Chi Chang
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Rui Wang
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Guoliang Li
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Tianrui Qi
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| | - Shuai Hao
- College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an 710054, China; (T.L.); (T.L.); (J.L.); (C.C.); (R.W.); (G.L.); (T.Q.); (S.H.)
| |
Collapse
|
10
|
Liu X, He Y, Li J, Yan R, Li X, Huang H. A Comparative Review on Enhancing Visual Simultaneous Localization and Mapping with Deep Semantic Segmentation. SENSORS (BASEL, SWITZERLAND) 2024; 24:3388. [PMID: 38894177 PMCID: PMC11174785 DOI: 10.3390/s24113388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 05/16/2024] [Accepted: 05/21/2024] [Indexed: 06/21/2024]
Abstract
Visual simultaneous localization and mapping (VSLAM) enhances the navigation of autonomous agents in unfamiliar environments by progressively constructing maps and estimating poses. However, conventional VSLAM pipelines often exhibited degraded performance in dynamic environments featuring mobile objects. Recent research in deep learning led to notable progress in semantic segmentation, which involves assigning semantic labels to image pixels. The integration of semantic segmentation into VSLAM can effectively differentiate between static and dynamic elements in intricate scenes. This paper provided a comprehensive comparative review on leveraging semantic segmentation to improve major components of VSLAM, including visual odometry, loop closure detection, and environmental mapping. Key principles and methods for both traditional VSLAM and deep semantic segmentation were introduced. This paper presented an overview and comparative analysis of the technical implementations of semantic integration across various modules of the VSLAM pipeline. Furthermore, it examined the features and potential use cases associated with the fusion of VSLAM and semantics. It was found that the existing VSLAM model continued to face challenges related to computational complexity. Promising future research directions were identified, including efficient model design, multimodal fusion, online adaptation, dynamic scene reconstruction, and end-to-end joint optimization. This review shed light on the emerging paradigm of semantic VSLAM and how deep learning-enabled semantic reasoning could unlock new capabilities for autonomous intelligent systems to operate reliably in the real world.
Collapse
Affiliation(s)
- Xiwen Liu
- Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natura Resources, Shenzhen 518034, China;
- School of Smart City, Chongqing Jiaotong University, Chongqing 400074, China; (Y.H.); (R.Y.); (X.L.)
| | - Yong He
- School of Smart City, Chongqing Jiaotong University, Chongqing 400074, China; (Y.H.); (R.Y.); (X.L.)
| | - Jue Li
- College of Traffic & Transportation, Chongqing Jiaotong University, Chongqing 400074, China
| | - Rui Yan
- School of Smart City, Chongqing Jiaotong University, Chongqing 400074, China; (Y.H.); (R.Y.); (X.L.)
| | - Xiaoyu Li
- School of Smart City, Chongqing Jiaotong University, Chongqing 400074, China; (Y.H.); (R.Y.); (X.L.)
| | - Hui Huang
- Chongqing Digital City Technology Co., Ltd., Chongqing 400074, China;
| |
Collapse
|
11
|
Li Y, Liu B, Deng J, Guo Y, Du H. Image-based molecular representation learning for drug development: a survey. Brief Bioinform 2024; 25:bbae294. [PMID: 38920347 PMCID: PMC11200195 DOI: 10.1093/bib/bbae294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 05/19/2024] [Accepted: 06/08/2024] [Indexed: 06/27/2024] Open
Abstract
Artificial intelligence (AI) powered drug development has received remarkable attention in recent years. It addresses the limitations of traditional experimental methods that are costly and time-consuming. While there have been many surveys attempting to summarize related research, they only focus on general AI or specific aspects such as natural language processing and graph neural network. Considering the rapid advance on computer vision, using the molecular image to enable AI appears to be a more intuitive and effective approach since each chemical substance has a unique visual representation. In this paper, we provide the first survey on image-based molecular representation for drug development. The survey proposes a taxonomy based on the learning paradigms in computer vision and reviews a large number of corresponding papers, highlighting the contributions of molecular visual representation in drug development. Besides, we discuss the applications, limitations and future directions in the field. We hope this survey could offer valuable insight into the use of image-based molecular representation learning in the context of drug development.
Collapse
Affiliation(s)
- Yue Li
- Division of Gastroenterology, Dongzhimen Hospital, Beijing University of Chinese Medicine, No. 5 Haiyun Warehouse, 100700, Beijing, China
| | - Bingyan Liu
- School of Computer Science, Beijing University of Posts and Telecommunications, No.10 Xituchen Street, 100876, Beijing, China
| | - Jinyan Deng
- Division of Gastroenterology, Dongzhimen Hospital, Beijing University of Chinese Medicine, No. 5 Haiyun Warehouse, 100700, Beijing, China
| | - Yi Guo
- Division of Gastroenterology, Dongzhimen Hospital, Beijing University of Chinese Medicine, No. 5 Haiyun Warehouse, 100700, Beijing, China
| | - Hongbo Du
- Division of Gastroenterology, Dongzhimen Hospital, Beijing University of Chinese Medicine, No. 5 Haiyun Warehouse, 100700, Beijing, China
- Institute of Liver Disease, Beijing University of Chinese Medicine, No. 5 Haiyun Warehouse, 100700, Beijing, China
| |
Collapse
|
12
|
Lin X, Huang X, Wang L. Underwater object detection method based on learnable query recall mechanism and lightweight adapter. PLoS One 2024; 19:e0298739. [PMID: 38416764 PMCID: PMC10901356 DOI: 10.1371/journal.pone.0298739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Accepted: 01/29/2024] [Indexed: 03/01/2024] Open
Abstract
With the rapid development of ocean observation technology, underwater object detection has begun to occupy an essential position in the fields of aquaculture, environmental monitoring, marine science, etc. However, due to the problems unique to underwater images such as severe noise, blurred objects, and multi-scale, deep learning-based target detection algorithms lack sufficient capabilities to cope with these challenges. To address these issues, we improve DETR to make it well suited for underwater scenarios. First, a simple and effective learnable query recall mechanism is proposed to mitigate the effect of noise and can significantly improve the detection performance of the object. Second, for underwater small and irregular object detection, a lightweight adapter is designed to provide multi-scale features for the encoding and decoding stages. Third, the regression mechanism of the bounding box is optimized using the combination loss of smooth L1 and CIoU. Finally, we validate the designed network against other state-of-the-art methods on the RUOD dataset. The experimental results show that the proposed method is effective.
Collapse
Affiliation(s)
- Xi Lin
- Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| | - Xixia Huang
- Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| | - Le Wang
- Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| |
Collapse
|
13
|
Wu K, Xu L, Li X, Zhang Y, Yue Z, Gao Y, Chen Y. Named entity recognition of rice genes and phenotypes based on BiGRU neural networks. Comput Biol Chem 2024; 108:107977. [PMID: 37995493 DOI: 10.1016/j.compbiolchem.2023.107977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 10/26/2023] [Accepted: 10/29/2023] [Indexed: 11/25/2023]
Abstract
Named Entity Recognition (NER) is a fundamental but crucial task in natural language processing (NLP) and big data analysis, with wide application range. NER for rice genes and phenotypes is a technique to identify genes and phenotypes from a large amount of text. NER for rice genes and phenotypes can facilitate the acquisition of information in the field of crops and provide references for our research on higher quality crops. At the same time, named entity recognition still faces many challenges. In this paper, we propose an improved bidirectional gated recurrent unit neural network (BI-GRU) method, which is used to automatically identify the required entities (i.e. gene names, rice phenotypes) from relevant rice literature and patents. The neural network model is combined with the Softmax function to directly output the probabilities of labels, forming the BI-GRU-SF model. With the ability of deep learning methods, the semantic information in the context can be learned without the need for feature engineering. Finally, we conducted experiments, and the results showed that our proposed model provided better performance compared to other models. All datasets and resource codes of BI-GRU-SF are available at https://github.com/qqeeqq/NER for academic use.
Collapse
Affiliation(s)
- Kangjie Wu
- School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China
| | - Liqian Xu
- School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China
| | - Xinxiang Li
- School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China
| | - Youhua Zhang
- School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China
| | - Zhenyu Yue
- School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China
| | - Yujia Gao
- School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China
| | - Yiqiong Chen
- School of Information and Computer, Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
| |
Collapse
|
14
|
Liu H, Cheng J, Cao J, Katib I. Preassigned-time synchronization for complex-valued memristive neural networks with reaction-diffusion terms and Markov parameters. Neural Netw 2024; 169:520-531. [PMID: 37948970 DOI: 10.1016/j.neunet.2023.11.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Revised: 10/01/2023] [Accepted: 11/05/2023] [Indexed: 11/12/2023]
Abstract
This study addresses the preassigned-time synchronization for complex-valued memristive neural networks with reaction-diffusion terms and Markov parameters. Employing a preassigned-time stable control strategy, two distinct controllers with varying power exponent parameters are designed to ensure that synchronization can be achieved within a predefined time frame. Unlike existing finite/fixed-time results, a priori specification of the settling time is addressed. Furthermore, Green's formula and boundary conditions are efficiently applied to overcome potential symmetry loss. Additionally, the activation function's constraint range is more lenient compared to existing constraints. Finally, the effectiveness of the presented methods are demonstrated through two examples.
Collapse
Affiliation(s)
- Hongliang Liu
- School of Mathematics and Physics, University of South China, Hengyang, 421001, PR China
| | - Jun Cheng
- School of Mathematics and Statistics, Guangxi Normal University, Guilin, 541004, PR China.
| | - Jinde Cao
- School of Mathematics, Southeast University, Nanjing 210096, PR China
| | - Iyad Katib
- Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| |
Collapse
|
15
|
Zamani MG, Nikoo MR, Jahanshahi S, Barzegar R, Meydani A. Forecasting water quality variable using deep learning and weighted averaging ensemble models. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:124316-124340. [PMID: 37996598 DOI: 10.1007/s11356-023-30774-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 10/27/2023] [Indexed: 11/25/2023]
Abstract
Water quality variables, including chlorophyll-a (Chl-a), play a pivotal role in comprehending and evaluating the condition of aquatic ecosystems. Chl-a, a pigment present in diverse aquatic organisms, notably algae and cyanobacteria, serves as a valuable indicator of water quality. Thus, the objectives of this study encompass: (1) the assessment of the predictive capabilities of four deep learning (DL) models - namely, recurrent neural network (RNN), long short-term memory (LSTM), gated recurrence unit (GRU), and temporal convolutional network (TCN) - in forecasting Chl-a concentrations; (2) the incorporation of these DL models into ensemble models (EMs) employing genetic algorithm (GA) and non-dominated sorting genetic algorithm (NSGA-II) to harness the strengths of each standalone model; and (3) the evaluation of the efficacy of the developed EMs. Utilizing data collected at 15-min intervals from Small Prespa Lake (SPL) in Greece, the models employed hourly Chl-a concentration lag times, extending up to 6 h, as models' inputs to forecast Chla (t+1). The proposed models underwent training on 70% of the dataset and were subsequently validated on the remaining 30%. Among the standalone DL models, the GRU model exhibited superior performance in Chl-a forecasting, surpassing the RNN, LSTM, and TCN models by 8%, 2%, and 2%, respectively. Furthermore, the integration of DL models through single-objective GA and multi-objective NSGA-II optimization algorithms yielded hybrid models adept at effectively forecasting both low and high Chl-a concentrations. The ensemble model based on NSGA-II outperformed standalone DL models as well as the GA-based model across a range of evaluation indices. For instance, considering the R-squared metric, the study's findings demonstrated that the EM-NSGA-II stands out with exceptional effectiveness compared to DL and EM-GA models, showcasing improvements of 14% (RNN), 8% (LSTM), 6% (GRU), 8% (TCN), and 3% (EM-GA) during the testing phase.
Collapse
Affiliation(s)
- Mohammad G Zamani
- Department of Water Resources Engineering, Faculty of Civil, Water and Environmental Engineering, Amirkabir University of Technology, Tehran, Iran
| | - Mohammad Reza Nikoo
- Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat, Oman.
| | - Sina Jahanshahi
- Department of Water Resources Engineering, Faculty of Civil, Water and Environmental Engineering, University of Tehran, Tehran, Iran
| | - Rahim Barzegar
- Groundwater Research Group (GRES), Research Institute on Mines and Environment (RIME), Université du Québec en Abitibi-Témiscamingue (UQAT), Amos, Québec, Canada
| | - Amirreza Meydani
- Department of Geography and Spatial Sciences, University of Delaware, Newark, DE, USA
| |
Collapse
|
16
|
Lou LL, Qiu WR, Liu Z, Xu ZC, Xiao X, Huang SF. Stacking-ac4C: an ensemble model using mixed features for identifying n4-acetylcytidine in mRNA. Front Immunol 2023; 14:1267755. [PMID: 38094296 PMCID: PMC10716444 DOI: 10.3389/fimmu.2023.1267755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 11/14/2023] [Indexed: 12/18/2023] Open
Abstract
N4-acetylcytidine (ac4C) is a modification of cytidine at the nitrogen-4 position, playing a significant role in the translation process of mRNA. However, the precise mechanism and details of how ac4C modifies translated mRNA remain unclear. Since identifying ac4C sites using conventional experimental methods is both labor-intensive and time-consuming, there is an urgent need for a method that can promptly recognize ac4C sites. In this paper, we propose a comprehensive ensemble learning model, the Stacking-based heterogeneous integrated ac4C model, engineered explicitly to identify ac4C sites. This innovative model integrates three distinct feature extraction methodologies: Kmer, electron-ion interaction pseudo-potential values (PseEIIP), and pseudo-K-tuple nucleotide composition (PseKNC). The model also incorporates the robust Cluster Centroids algorithm to enhance its performance in dealing with imbalanced data and alleviate underfitting issues. Our independent testing experiments indicate that our proposed model improves the Mcc by 15.61% and the ROC by 5.97% compared to existing models. To test our model's adaptability, we also utilized a balanced dataset assembled by the authors of iRNA-ac4C. Our model showed an increase in Sn of 4.1%, an increase in Acc of nearly 1%, and ROC improvement of 0.35% on this balanced dataset. The code for our model is freely accessible at https://github.com/louliliang/ST-ac4C.git, allowing users to quickly build their model without dealing with complicated mathematical equations.
Collapse
Affiliation(s)
- Li-Liang Lou
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| | - Wang-Ren Qiu
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| | - Zi Liu
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| | - Zhao-Chun Xu
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| | - Xuan Xiao
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, China
| | - Shun-Fa Huang
- School of Information Engineering , Jingdezhen University, Jingdezhen, China
| |
Collapse
|
17
|
Reveles-Gómez LC, Luna-García H, Celaya-Padilla JM, Barría-Huidobro C, Gamboa-Rosales H, Solís-Robles R, Arceo-Olague JG, Galván-Tejada JI, Galván-Tejada CE, Rondon D, Villalba-Condori KO. Detection of Pedestrians in Reverse Camera Using Multimodal Convolutional Neural Networks. SENSORS (BASEL, SWITZERLAND) 2023; 23:7559. [PMID: 37688015 PMCID: PMC10490826 DOI: 10.3390/s23177559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 08/22/2023] [Accepted: 08/25/2023] [Indexed: 09/10/2023]
Abstract
In recent years, the application of artificial intelligence (AI) in the automotive industry has led to the development of intelligent systems focused on road safety, aiming to improve protection for drivers and pedestrians worldwide to reduce the number of accidents yearly. One of the most critical functions of these systems is pedestrian detection, as it is crucial for the safety of everyone involved in road traffic. However, pedestrian detection goes beyond the front of the vehicle; it is also essential to consider the vehicle's rear since pedestrian collisions occur when the car is in reverse drive. To contribute to the solution of this problem, this research proposes a model based on convolutional neural networks (CNN) using a proposed one-dimensional architecture and the Inception V3 architecture to fuse the information from the backup camera and the distance measured by the ultrasonic sensors, to detect pedestrians when the vehicle is reversing. In addition, specific data collection was performed to build a database for the research. The proposed model showed outstanding results with 99.85% accuracy and 99.86% correct classification performance, demonstrating that it is possible to achieve the goal of pedestrian detection using CNN by fusing two types of data.
Collapse
Affiliation(s)
- Luis C. Reveles-Gómez
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - Huizilopoztli Luna-García
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - José M. Celaya-Padilla
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - Cristian Barría-Huidobro
- Centro de Investigación en Ciberseguridad, Universidad Mayor de Chile, Manuel Montt 367, Providencia 7500628, Chile
| | - Hamurabi Gamboa-Rosales
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - Roberto Solís-Robles
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - José G. Arceo-Olague
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - Jorge I. Galván-Tejada
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - Carlos E. Galván-Tejada
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico; (L.C.R.-G.)
| | - David Rondon
- Departamento Estudios Generales, Universidad Continental, Arequipa 04001, Peru
| | | |
Collapse
|
18
|
Jiang B, Zhang K, Liu X, Lu Y. Prediction model with multi-point relationship fusion via graph convolutional network: A case study on mining-induced surface subsidence. PLoS One 2023; 18:e0289846. [PMID: 37585397 PMCID: PMC10431667 DOI: 10.1371/journal.pone.0289846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 07/27/2023] [Indexed: 08/18/2023] Open
Abstract
Accurate prediction of surface subsidence is of significance for analyzing the pattern of mining-induced surface subsidence, and for mining under buildings, railways, and water bodies. To address the problem that the existing prediction models ignore the correlation between subsidence points, resulting in large prediction errors, a Multi-point Relationship Fusion prediction model based on Graph Convolutional Networks (MRF-GCN) for mining-induced subsidence was proposed. Taking the surface subsidence in 82/83 mining area of Yuandian No. 2 Mine in Anhui Province in eastern China as an example, the surface deformation data obtained from 250 InSAR images captured by Sentinel-1A satellite from 2018 to 2022, combined with GNSS observation data, were used for modeling. The deformation pattern of each single observation point was obtained by feeding their deformation observation data into the LSTM encoder, after that, the relationship graph was created based on the correlation between points in the observation network and MRF-GCN was established. Then the prediction results came out through a nonlinear activation function of neural network. The research shows that the R2R2 value of MRF-GCN model was 0.865 0, much larger than that of Long-Short Term Memory (LSTM) and other conventional models, while mean square error (MSE) of MRF-GCN model was 1.59 899, much smaller than that of LSTM and other conventional models. Therefore, the MRF-GCN model has better prediction accuracy than other models and can be applied to predicting surface subsidence in large areas.
Collapse
Affiliation(s)
- Baoxing Jiang
- State Key Laboratory of Mining Response and Disaster Prevention and Control in Deep Coal Mines, Huainan, China
- School of Geomatics, Anhui University of Science and Technology, Huainan, China
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Kun Zhang
- State Key Laboratory of Mining Response and Disaster Prevention and Control in Deep Coal Mines, Huainan, China
- School of Geomatics, Anhui University of Science and Technology, Huainan, China
| | - Xiaopeng Liu
- State Key Laboratory of Mining Response and Disaster Prevention and Control in Deep Coal Mines, Huainan, China
- School of Geomatics, Anhui University of Science and Technology, Huainan, China
| | - Yuxi Lu
- School of Geomatics, Anhui University of Science and Technology, Huainan, China
- Faculty of Land Resources Engineering, Kunming University of Science and Technology, Kunming, China
| |
Collapse
|
19
|
Qian J, Li H, Wang J, He L. Recent Advances in Explainable Artificial Intelligence for Magnetic Resonance Imaging. Diagnostics (Basel) 2023; 13:1571. [PMID: 37174962 PMCID: PMC10178221 DOI: 10.3390/diagnostics13091571] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 03/29/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023] Open
Abstract
Advances in artificial intelligence (AI), especially deep learning (DL), have facilitated magnetic resonance imaging (MRI) data analysis, enabling AI-assisted medical image diagnoses and prognoses. However, most of the DL models are considered as "black boxes". There is an unmet need to demystify DL models so domain experts can trust these high-performance DL models. This has resulted in a sub-domain of AI research called explainable artificial intelligence (XAI). In the last decade, many experts have dedicated their efforts to developing novel XAI methods that are competent at visualizing and explaining the logic behind data-driven DL models. However, XAI techniques are still in their infancy for medical MRI image analysis. This study aims to outline the XAI applications that are able to interpret DL models for MRI data analysis. We first introduce several common MRI data modalities. Then, a brief history of DL models is discussed. Next, we highlight XAI frameworks and elaborate on the principles of multiple popular XAI methods. Moreover, studies on XAI applications in MRI image analysis are reviewed across the tissues/organs of the human body. A quantitative analysis is conducted to reveal the insights of MRI researchers on these XAI techniques. Finally, evaluations of XAI methods are discussed. This survey presents recent advances in the XAI domain for explaining the DL models that have been utilized in MRI applications.
Collapse
Affiliation(s)
- Jinzhao Qian
- Imaging Research Center, Department of Radiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Computer Science, University of Cincinnati, Cincinnati, OH 45221, USA
| | - Hailong Li
- Imaging Research Center, Department of Radiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Radiology, College of Medicine, University of Cincinnati, Cincinnati, OH 45221, USA
| | - Junqi Wang
- Imaging Research Center, Department of Radiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
| | - Lili He
- Imaging Research Center, Department of Radiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Computer Science, University of Cincinnati, Cincinnati, OH 45221, USA
- Department of Radiology, College of Medicine, University of Cincinnati, Cincinnati, OH 45221, USA
| |
Collapse
|
20
|
Li Q, Hu S, Shimasaki K, Ishii I. An Active Multi-Object Ultrafast Tracking System with CNN-Based Hybrid Object Detection. SENSORS (BASEL, SWITZERLAND) 2023; 23:4150. [PMID: 37112491 PMCID: PMC10145589 DOI: 10.3390/s23084150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 04/09/2023] [Accepted: 04/11/2023] [Indexed: 06/19/2023]
Abstract
This study proposes a visual tracking system that can detect and track multiple fast-moving appearance-varying targets simultaneously with 500 fps image processing. The system comprises a high-speed camera and a pan-tilt galvanometer system, which can rapidly generate large-scale high-definition images of the wide monitored area. We developed a CNN-based hybrid tracking algorithm that can robustly track multiple high-speed moving objects simultaneously. Experimental results demonstrate that our system can track up to three moving objects with velocities lower than 30 m per second simultaneously within an 8-m range. The effectiveness of our system was demonstrated through several experiments conducted on simultaneous zoom shooting of multiple moving objects (persons and bottles) in a natural outdoor scene. Moreover, our system demonstrates high robustness to target loss and crossing situations.
Collapse
Affiliation(s)
| | | | | | - Idaku Ishii
- Correspondence: ; Tel.: +81-82-424-7692; Fax: +81-82-422-7158
| |
Collapse
|
21
|
Chaudhary H, Sajid M, Kaushik S, Allahem A. Stability analysis of chaotic generalized Lotka-Volterra system via active compound difference anti-synchronization method. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:9410-9422. [PMID: 37161249 DOI: 10.3934/mbe.2023413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
This work deals with a systematic approach for the investigation of compound difference anti-synchronization (CDAS) scheme among chaotic generalized Lotka-Volterra biological systems (GLVBSs). First, an active control strategy (ACS) of nonlinear type is described which is specifically based on Lyapunov's stability analysis (LSA) and master-slave framework. In addition, the biological control law having nonlinear expression is constructed for attaining asymptotic stability pattern for the error dynamics of the discussed GLVBSs. Also, simulation results through MATLAB environment are executed for illustrating the efficacy and correctness of considered CDAS approach. Remarkably, our attained analytical outcomes have been in outstanding conformity with the numerical outcomes. The investigated CDAS strategy has numerous significant applications to the fields of encryption and secure communication.
Collapse
Affiliation(s)
- Harindri Chaudhary
- Department of Mathematics, Deshbandhu College, University of Delhi, New Delhi 110019, India
| | - Mohammad Sajid
- Department of Mechanical Engineering, College of Engineering, Qassim University, Buraydah 51452, Saudi Arabia
| | - Santosh Kaushik
- Department of Mathematics, Bhagini Nivedita College, University of Delhi, New Delhi 110043, India
| | - Ali Allahem
- Department of Mathematics, College of Science, Qassim University, Buraydah 51452, Saudi Arabia
| |
Collapse
|
22
|
Yu H, Park E. A harmless webtoon for all: An automatic age-restriction prediction system for webtoon contents. TELEMATICS AND INFORMATICS 2022. [DOI: 10.1016/j.tele.2022.101906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
23
|
da Cunha CR, Aoki N, Ferry DK, Lai YC. A method for finding the background potential of quantum devices from scanning gate microscopy data using machine learning. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac6ec7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
The inverse problem of estimating the background potential from measurements of the local density of states is a challenging issue in quantum mechanics. Even more difficult is to do this estimation using approximate methods such as scanning gate microscopy (SGM). Here, we propose a machine-learning-based solution by exploiting adaptive cellular neural networks (CNNs). In the paradigmatic setting of a quantum point contact, the training data consist of potential-SGM functional relations represented by image pairs. These are generated by the recursive Green’s function method. We demonstrate that the CNN-based machine learning framework can predict the background potential corresponding to the experimental image data. This is confirmed by analyzing the estimated potential with image processing techniques based on the comparison between the charge densities and those obtained using different techniques. Correlation analysis of the images suggests the possibility of estimating different contributions to the background potential. In particular, our results indicate that both charge puddles and fixed impurities contribute to the spatial patterns found in the SGM data. Our work represents a timely contribution to the rapidly evolving field of exploiting machine learning to solve difficult problems in physics.
Collapse
|
24
|
Hassaoui M, Hanini M, El Kafhali S. Domain generated algorithms detection applying a combination of a deep feature selection and traditional machine learning models. JOURNAL OF COMPUTER SECURITY 2022. [DOI: 10.3233/jcs-210139] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The use of command and control (C2) servers in cyberattacks has risen considerably, attackers frequently employ the domain generated algorithm (DGA) technique to conceal their C2 servers. Various machine learning models have been suggested for binary identification of domain names as either benign or DGA domain. The Existing techniques are inefficient and have real-time detection issues and are also very data hypersensitive, therefore, they can be circumvented by the attackers. The main problem this article addresses is how to automatically detect DGA in a way that does not rely solely on reverse engineering, not strongly affected by data size, and allows detection of this DGA in real time. This paper presents DTFS-DGA model that combine neural networks models with traditional machine learning models and maintains its performance even if the data size changes to detect DGA in real time. The model uses 15 linguistics and networks features with the features extracted by long short-term memory and convolutional neural network to classify domain names using random forest and support vector machines. The comprehensive experimental findings confirm the suggested model’s accuracy. To be precise, the model achieve an average accuracy of 99.8 % for the classification.
Collapse
Affiliation(s)
- Mohamed Hassaoui
- Computer, Networks, Mobility and Modeling laboratory, Hassan First University of settat, Faculty of Sciences and Techniques, Settat, Morocco
| | - Mohamed Hanini
- Computer, Networks, Mobility and Modeling laboratory, Hassan First University, Faculty of Sciences and Techniques, Settat, Morocco
| | - Said El Kafhali
- Computer, Networks, Mobility and Modeling laboratory, Hassan First University, Faculty of Sciences and Techniques, Settat, Morocco
| |
Collapse
|
25
|
Mahmoud AN, Vanderveken F, Ciubotaru F, Adelmann C, Hamdioui S, Cotofana S. A Spin Wave-Based Approximate 4:2 Compressor: Seeking the most energy-efficient digital computing paradigm. IEEE NANOTECHNOLOGY MAGAZINE 2022. [DOI: 10.1109/mnano.2021.3126095] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
26
|
Smith JT, Ochoa M, Faulkner D, Haskins G, Intes X. Deep learning in macroscopic diffuse optical imaging. JOURNAL OF BIOMEDICAL OPTICS 2022; 27:JBO-210288VRR. [PMID: 35218169 PMCID: PMC8881080 DOI: 10.1117/1.jbo.27.2.020901] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Accepted: 02/09/2022] [Indexed: 05/02/2023]
Abstract
SIGNIFICANCE Biomedical optics system design, image formation, and image analysis have primarily been guided by classical physical modeling and signal processing methodologies. Recently, however, deep learning (DL) has become a major paradigm in computational modeling and has demonstrated utility in numerous scientific domains and various forms of data analysis. AIM We aim to comprehensively review the use of DL applied to macroscopic diffuse optical imaging (DOI). APPROACH First, we provide a layman introduction to DL. Then, the review summarizes current DL work in some of the most active areas of this field, including optical properties retrieval, fluorescence lifetime imaging, and diffuse optical tomography. RESULTS The advantages of using DL for DOI versus conventional inverse solvers cited in the literature reviewed herein are numerous. These include, among others, a decrease in analysis time (often by many orders of magnitude), increased quantitative reconstruction quality, robustness to noise, and the unique capability to learn complex end-to-end relationships. CONCLUSIONS The heavily validated capability of DL's use across a wide range of complex inverse solving methodologies has enormous potential to bring novel DOI modalities, otherwise deemed impractical for clinical translation, to the patient's bedside.
Collapse
Affiliation(s)
- Jason T. Smith
- Rensselaer Polytechnic Institute, Department of Biomedical Engineering, Troy, New York, United States
| | - Marien Ochoa
- Rensselaer Polytechnic Institute, Department of Biomedical Engineering, Troy, New York, United States
| | - Denzel Faulkner
- Rensselaer Polytechnic Institute, Department of Biomedical Engineering, Troy, New York, United States
| | - Grant Haskins
- Rensselaer Polytechnic Institute, Department of Biomedical Engineering, Troy, New York, United States
| | - Xavier Intes
- Rensselaer Polytechnic Institute, Center for Modeling, Simulation and Imaging for Medicine, Troy, New York, United States
| |
Collapse
|
27
|
India perspective: CNN-LSTM hybrid deep learning model-based COVID-19 prediction and current status of medical resource availability. Soft comput 2021; 26:645-664. [PMID: 34815733 PMCID: PMC8603002 DOI: 10.1007/s00500-021-06490-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/24/2021] [Indexed: 12/26/2022]
Abstract
The epidemic situation may cause severe social and economic impacts on a country. So, there is a need for a trustworthy prediction model that can offer better prediction results. The forecasting result will help in making the prevention policies and remedial action in time, and thus, we can reduce the overall social and economic impacts on the country. This article introduces a CNN-LSTM hybrid deep learning prediction model, which can correctly forecast the COVID-19 epidemic across India. The proposed model uses convolutional layers, to extract meaningful information and learn from a given time series dataset. It is also enriched with the LSTM layer's capability, which means it can identify long-term and short-term dependencies. The experimental evaluation has been performed to gauge the performance and suitability of our proposed model among the other well-established time series forecasting models. From the empirical analysis, it is also clear that the use of extra convolutional layers with the LSTM layer may increase the forecasting model's performance. Apart from this, the deep insides of the current situation of medical resource availability across India have been discussed.
Collapse
|
28
|
A Review of Remote Sensing Image Dehazing. SENSORS 2021; 21:s21113926. [PMID: 34200320 PMCID: PMC8201244 DOI: 10.3390/s21113926] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 05/29/2021] [Accepted: 06/02/2021] [Indexed: 11/16/2022]
Abstract
Remote sensing (RS) is one of the data collection technologies that help explore more earth surface information. However, RS data captured by satellite are susceptible to particles suspended during the imaging process, especially for data with visible light band. To make up for such deficiency, numerous dehazing work and efforts have been made recently, whose strategy is to directly restore single hazy data without the need for using any extra information. In this paper, we first classify the current available algorithm into three categories, i.e., image enhancement, physical dehazing, and data-driven. The advantages and disadvantages of each type of algorithm are then summarized in detail. Finally, the evaluation indicators used to rank the recovery performance and the application scenario of the RS data haze removal technique are discussed, respectively. In addition, some common deficiencies of current available methods and future research focus are elaborated.
Collapse
|
29
|
|
30
|
ATES A. Enhanced equilibrium optimization method with fractional order chaotic and application engineering. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-05756-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
31
|
A Fault Diagnosis and Visualization Method for High-Speed Train Based on Edge and Cloud Collaboration. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11031251] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Safety is the most important aspect of railway transportation. To ensure the safety of high-speed trains, various train components are equipped with sensor devices for real-time monitoring. Sensor monitoring data can be used for fast intelligent diagnosis and accurate positioning of train faults. However, existing train fault diagnosis technology based on cloud computing has disadvantages of long processing times and high consumption of computing resources, which conflict with the real-time response requirements of fault diagnosis. Aiming at the problems of train fault diagnosis in the cloud environment, this paper proposes a train fault diagnosis model based on edge and cloud collaboration. The model first utilizes a SAES-DNN (stacked auto-encoders deep neural network) fault recognition method, which can integrate automatic feature extraction and type recognition and complete fault classification over deep hidden features in high-dimensional data, so as to quickly locate faults. Next, to adapt to the characteristics of edge computing, the model applies a SAES-DNN model trained in the cloud and deployed in the edge via the transfer learning strategy and carries out real-time fault diagnosis on the vehicle sensor monitoring data. Using a motor fault as an example, when compared with a similar intelligent learning model, the proposed intelligent fault diagnosis model can greatly improve diagnosis accuracy and significantly reduce training time. Through the transfer learning approach, adaptability of the fault diagnosis algorithm for personalized applications and real-time performance of the fault diagnosis is enhanced. This paper also proposes a visual analysis method of train fault data based on knowledge graphs, which can effectively analyze fault causes and fault correlation.
Collapse
|
32
|
Duan L, Wang Q, Wei H, Wang Z. Multi-type synchronization dynamics of delayed reaction-diffusion recurrent neural networks with discontinuous activations. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.03.040] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
33
|
Kaya K, Gündüz Öğüdücü Ş. Deep Flexible Sequential (DFS) Model for Air Pollution Forecasting. Sci Rep 2020; 10:3346. [PMID: 32098977 PMCID: PMC7042334 DOI: 10.1038/s41598-020-60102-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Accepted: 02/07/2020] [Indexed: 11/08/2022] Open
Abstract
Growing metropolitan areas bring rapid urbanization and air pollution problems. As diseases and mortality rates increase because of the air pollution problem, it becomes a necessity to estimate the air pollution density and inform the public to protect the health. Air pollution problem displays contextual characteristics such as meteorological conditions, industrial and technological developments, traffic problem etc. that change from country to country and also from city to city. In this study, we determined PM[Formula: see text] as the target pollutant and designed a new deep learning based air quality forecasting model, namely DFS (Deep Flexible Sequential). Our study uses real world hourly data from Istanbul, Turkey between 2014 and 2018 to forecast the air pollution 4, 12, and 24 hours before. DFS model is a hybrid & flexible deep model including Long Short Term Memory (LSTM) and Convolutional Neural Network (CNN). The proposed model also is capable of generalization with standard and flexible Dropout layers. Through flexible Dropout layer, the model also obtains flexibility to adapt changing window sizes in sequential modelling. Moreover, this model can be applied to other air pollution time series data problems with small modifications on parameters by taking into account the nature of the data set.
Collapse
Affiliation(s)
- Kıymet Kaya
- Istanbul Technical University, Department of Computer Engineering & ITU AI Research and Application Center, Istanbul, 34467, Turkey.
| | - Şule Gündüz Öğüdücü
- Istanbul Technical University, Department of Computer Engineering & ITU AI Research and Application Center, Istanbul, 34467, Turkey.
| |
Collapse
|
34
|
Brown KA, Brittman S, Maccaferri N, Jariwala D, Celano U. Machine Learning in Nanoscience: Big Data at Small Scales. NANO LETTERS 2020; 20:2-10. [PMID: 31804080 DOI: 10.1021/acs.nanolett.9b04090] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Recent advances in machine learning (ML) offer new tools to extract new insights from large data sets and to acquire small data sets more effectively. Researchers in nanoscience are experimenting with these tools to tackle challenges in many fields. In addition to ML's advancement of nanoscience, nanoscience provides the foundation for neuromorphic computing hardware to expand the implementation of ML algorithms. In this Mini Review, we highlight some recent efforts to connect the ML and nanoscience communities by focusing on three types of interaction: (1) using ML to analyze and extract new insights from large nanoscience data sets, (2) applying ML to accelerate material discovery, including the use of active learning to guide experimental design, and (3) the nanoscience of memristive devices to realize hardware tailored for ML. We conclude with a discussion of challenges and opportunities for future interactions between nanoscience and ML researchers.
Collapse
Affiliation(s)
- Keith A Brown
- Department of Mechanical Engineering, Physics Department, and Division of Materials Science and Engineering , Boston University , Boston , Massachusetts 02215 , United States
| | - Sarah Brittman
- U.S. Naval Research Laboratory , Washington , DC 20375 , United States
| | - Nicolò Maccaferri
- Department of Physics and Materials Science , University of Luxembourg , 162a avenue de la Faïencerie , L-1511 Luxembourg , Luxembourg
| | - Deep Jariwala
- Department of Electrical and Systems Engineering , University of Pennsylvania , Philadelphia , Pennsylvania 19104 , United States
| | - Umberto Celano
- imec , Kapeldreef 75 , B-3001 Heverlee ( Leuven ), Belgium
| |
Collapse
|
35
|
Neural network methodology for real-time modelling of bio-heat transfer during thermo-therapeutic applications. Artif Intell Med 2019; 101:101728. [DOI: 10.1016/j.artmed.2019.101728] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Revised: 07/30/2019] [Accepted: 09/26/2019] [Indexed: 12/26/2022]
|
36
|
Imran M, Siddiqui MK, Baig AQ, Khalid W, Shaker H. Topological properties of cellular neural networks. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-181813] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Muhammad Imran
- Department of Mathematical Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
- School of Natural Sciences(SNS), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | | | - Abdul Qudair Baig
- Department of Mathematics, The University of Lahore, Pakpattan Campus, Pakistan
| | - Waqas Khalid
- Department of Mathematics, COMSATS University Islamabad, Attock Campus, Pakistan
| | - Hani Shaker
- Department of Mathematics, COMSATS University Islamabad, Lahore Campus, Pakistan
| |
Collapse
|
37
|
Gupta S, Upadhyaya V, Singh A, Varshney P, Srivastava S. Modeling of fractional order chaotic systems using artificial bee colony optimization and ant colony optimization. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2018. [DOI: 10.3233/jifs-169816] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Sangeeta Gupta
- Department of Instrumentation and Control Engineering, NSIT, New Delhi, India
| | - Varun Upadhyaya
- Department of Instrumentation and Control Engineering, NSIT, New Delhi, India
| | - Ayush Singh
- Department of Instrumentation and Control Engineering, NSIT, New Delhi, India
| | - Pragya Varshney
- Department of Instrumentation and Control Engineering, NSIT, New Delhi, India
| | - Smriti Srivastava
- Department of Instrumentation and Control Engineering, NSIT, New Delhi, India
| |
Collapse
|
38
|
|
39
|
Askari E, Setarehdan SK, Sheikhani A, Mohammadi MR, Teshnehlab M. Modeling the connections of brain regions in children with autism using cellular neural networks and electroencephalography analysis. Artif Intell Med 2018; 89:40-50. [DOI: 10.1016/j.artmed.2018.05.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2017] [Revised: 05/18/2018] [Accepted: 05/22/2018] [Indexed: 11/26/2022]
|
40
|
Soft tissue deformation modelling through neural dynamics-based reaction-diffusion mechanics. Med Biol Eng Comput 2018; 56:2163-2176. [PMID: 29845488 DOI: 10.1007/s11517-018-1849-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Accepted: 05/18/2018] [Indexed: 10/16/2022]
Abstract
Soft tissue deformation modelling forms the basis of development of surgical simulation, surgical planning and robotic-assisted minimally invasive surgery. This paper presents a new methodology for modelling of soft tissue deformation based on reaction-diffusion mechanics via neural dynamics. The potential energy stored in soft tissues due to a mechanical load to deform tissues away from their rest state is treated as the equivalent transmembrane potential energy, and it is distributed in the tissue masses in the manner of reaction-diffusion propagation of nonlinear electrical waves. The reaction-diffusion propagation of mechanical potential energy and nonrigid mechanics of motion are combined to model soft tissue deformation and its dynamics, both of which are further formulated as the dynamics of cellular neural networks to achieve real-time computational performance. The proposed methodology is implemented with a haptic device for interactive soft tissue deformation with force feedback. Experimental results demonstrate that the proposed methodology exhibits nonlinear force-displacement relationship for nonlinear soft tissue deformation. Homogeneous, anisotropic and heterogeneous soft tissue material properties can be modelled through the inherent physical properties of mass points. Graphical abstract Soft tissue deformation modelling with haptic feedback via neural dynamics-based reaction-diffusion mechanics.
Collapse
|
41
|
Yang G, Yang J, Sheng W, Junior FEF, Li S. Convolutional Neural Network-Based Embarrassing Situation Detection under Camera for Social Robot in Smart Homes. SENSORS 2018; 18:s18051530. [PMID: 29757211 PMCID: PMC5982546 DOI: 10.3390/s18051530] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Revised: 05/02/2018] [Accepted: 05/05/2018] [Indexed: 11/28/2022]
Abstract
Recent research has shown that the ubiquitous use of cameras and voice monitoring equipment in a home environment can raise privacy concerns and affect human mental health. This can be a major obstacle to the deployment of smart home systems for elderly or disabled care. This study uses a social robot to detect embarrassing situations. Firstly, we designed an improved neural network structure based on the You Only Look Once (YOLO) model to obtain feature information. By focusing on reducing area redundancy and computation time, we proposed a bounding-box merging algorithm based on region proposal networks (B-RPN), to merge the areas that have similar features and determine the borders of the bounding box. Thereafter, we designed a feature extraction algorithm based on our improved YOLO and B-RPN, called F-YOLO, for our training datasets, and then proposed a real-time object detection algorithm based on F-YOLO (RODA-FY). We implemented RODA-FY and compared models on our MAT social robot. Secondly, we considered six types of situations in smart homes, and developed training and validation datasets, containing 2580 and 360 images, respectively. Meanwhile, we designed three types of experiments with four types of test datasets composed of 960 sample images. Thirdly, we analyzed how a different number of training iterations affects our prediction estimation, and then we explored the relationship between recognition accuracy and learning rates. Our results show that our proposed privacy detection system can recognize designed situations in the smart home with an acceptable recognition accuracy of 94.48%. Finally, we compared the results among RODA-FY, Inception V3, and YOLO, which indicate that our proposed RODA-FY outperforms the other comparison models in recognition accuracy.
Collapse
Affiliation(s)
- Guanci Yang
- Key Laboratory of Advanced Manufacturing Technology of Ministry of Education, Guizhou University, Guiyang 550025, China.
| | - Jing Yang
- Key Laboratory of Advanced Manufacturing Technology of Ministry of Education, Guizhou University, Guiyang 550025, China.
| | - Weihua Sheng
- School of Electrical and Computer Engineering, Oklahoma State University, Stillwater, OK 74074, USA.
| | | | - Shaobo Li
- Key Laboratory of Advanced Manufacturing Technology of Ministry of Education, Guizhou University, Guiyang 550025, China.
| |
Collapse
|
42
|
Askari E, Setarehdan SK, Sheikhani A, Mohammadi MR, Teshnehlab M. Designing a model to detect the brain connections abnormalities in children with autism using 3D-cellular neural networks. J Integr Neurosci 2018; 17:391-411. [PMID: 29689730 DOI: 10.3233/jin-180075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022] Open
Abstract
In neuropsychological disorders, the significant abnormalities in the brain connections in some regions are observed. This paper presents a novel model to demonstrate the connections between different regions in children with autism. The proposed model first conducts the wavelet decomposition of electroencephalography signals by wavelet transform then the features are extracted, such as relative energy and entropy. These features are fed to the 3D-cellular neural network model as inputs to indicate the brain connections. The results showed that there are significant differences and abnormalities in the left hemisphere, (p<0.05) at the electrodes AF3, F3, P7, T7 and O1 in alpha band, AF3, F7, T7 and O1 in beta band, T7 and P7 in gamma band for children with autism compared with the control children. Also, the evaluation of the obtained connections values between brain regions indicated that there are more abnormalities in the connectivity of frontal and parietal lobes and the relations of the neighboring regions in all three bands especially in gamma band for autistic children. Evaluation of the analysis demonstrated that alpha frequency band had the best distinction level of 96.6% based on the obtained values of the cellular neural network using support vector machine method.
Collapse
Affiliation(s)
- Elham Askari
- Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran. E-mail:
| | - Seyed Kamaledin Setarehdan
- Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran. E-mail:
| | - Ali Sheikhani
- Department of Biomedical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran. E-mail:
| | - Mohammad Reza Mohammadi
- Psychiatry and Psychology Research Center, Roozbeh Hospital, Tehran University of Medical Sciences, Tehran, Iran. E-mail:
| | - Mohammad Teshnehlab
- Department of Control Engineering, K.N. Toosi University of Technology, Tehran, Iran. E-mail:
| |
Collapse
|
43
|
Hu X, Feng G, Duan S, Liu L. A Memristive Multilayer Cellular Neural Network With Applications to Image Processing. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:1889-1901. [PMID: 27187973 DOI: 10.1109/tnnls.2016.2552640] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The memristor has been extensively studied in electrical engineering and biological sciences as a means to compactly implement the synaptic function in neural networks. The cellular neural network (CNN) is one of the most implementable artificial neural network models and capable of massively parallel analog processing. In this paper, a novel memristive multilayer CNN (Mm-CNN) model is presented along with its performance analysis and applications. In this new CNN design, the memristor crossbar circuit acts as the synapse, which realizes one signed synaptic weight with a pair of memristors and performs the synaptic weighting compactly and linearly. Moreover, the complex weighted summation is executed in an efficient way with a proper design of Mm-CNN cell circuits. The proposed Mm-CNN has several merits, such as compactness, nonvolatility, versatility, and programmability of synaptic weights. Its performance in several image processing applications is illustrated through simulations.
Collapse
|
44
|
|
45
|
Tsompanas MA, Adamatzky A, Ieropoulos I, Phillips N, Sirakoulis GC, Greenman J. Cellular non-linear network model of microbial fuel cell. Biosystems 2017; 156-157:53-62. [DOI: 10.1016/j.biosystems.2017.04.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2017] [Revised: 04/10/2017] [Accepted: 04/12/2017] [Indexed: 01/09/2023]
|
46
|
Hayashida Y, Kudo Y, Ishida R, Okuno H, Yagi T. Retinal Circuit Emulator With Spatiotemporal Spike Outputs at Millisecond Resolution in Response to Visual Events. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2017; 11:597-611. [PMID: 28489548 DOI: 10.1109/tbcas.2017.2662659] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
To gain insights on how visual information of the real world is filtered, compressed, and encoded by the vertebrate retinas, emulating in silico the spatiotemporal patterns of the graded and action potentials of neuronal responses to natural visual scenes on biological time scale is a feasible approach. As a basic platform for such an emulation, we here developed a compact hardware system comprising an analog silicon retina and a field-programmable gate array module. With utilizing the Izhikevich formalism, a retinal circuit model that emulates spiking of ganglion cells was implemented in this system. The emulated spike timing had the resolution of about 2 ms relative to the stimulus onset and was little affected by timings of the synchronous frame sampling in the silicon retina. Thus, the emulator can mimic the event-driven spike outputs of biological retinas. The system was useful for simultaneously visualizing neural images of both the graded potentials and the spikes in response to real live visual scenes. Since our emulator system is reconfigurable, it provides a flexible platform for investigating visual functions of retinal circuits under natural visual environment.
Collapse
Affiliation(s)
- Yuki Hayashida
- Graduate School of Engineering, Osaka University, Suita, Japan
| | - Yuka Kudo
- Graduate School of Engineering, Osaka University, Suita, Japan
| | - Ryoya Ishida
- Graduate School of Engineering, Osaka University, Suita, Japan
| | - Hirotsugu Okuno
- Graduate School of Engineering, Osaka University, Suita, Japan
| | - Tetsuya Yagi
- Graduate School of Engineering, Osaka University, Suita, Japan
| |
Collapse
|
47
|
Bilotta E, Pantano P, Vena S. Speeding Up Cellular Neural Network Processing Ability by Embodying Memristors. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:1228-1232. [PMID: 26863675 DOI: 10.1109/tnnls.2015.2511818] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Cellular neural networks (CNNs) are an efficient tool for image analysis and pattern recognition. Based on elementary cells connected to neighboring units, they are easy to install in hardware, carrying out massively parallel processes. This brief presents a new model of CNN with memory devices, which enhances further CNN performance. By introducing a memristive element in basic cells, we carry out different experiments, allowing the analysis of the functions traditionally carried out by the standard CNN. Without modifying the templates considered by the scientific literature, this simple variation originates a significant improvement in ∼ 30 % of performances in pattern recognition and image processing. These progresses were experimentally calculated on the time the system requires to reach a fixed point. Moreover, the different role that each parameter has in the developed method was also analyzed to better understand the complex processing ability of these systems.
Collapse
|
48
|
Sutton B, Camsari KY, Behin-Aein B, Datta S. Intrinsic optimization using stochastic nanomagnets. Sci Rep 2017; 7:44370. [PMID: 28295053 PMCID: PMC5353626 DOI: 10.1038/srep44370] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Accepted: 02/07/2017] [Indexed: 11/09/2022] Open
Abstract
This paper draws attention to a hardware system which can be engineered so that its intrinsic physics is described by the generalized Ising model and can encode the solution to many important NP-hard problems as its ground state. The basic constituents are stochastic nanomagnets which switch randomly between the ±1 Ising states and can be monitored continuously with standard electronics. Their mutual interactions can be short or long range, and their strengths can be reconfigured as needed to solve specific problems and to anneal the system at room temperature. The natural laws of statistical mechanics guide the network of stochastic nanomagnets at GHz speeds through the collective states with an emphasis on the low energy states that represent optimal solutions. As proof-of-concept, we present simulation results for standard NP-complete examples including a 16-city traveling salesman problem using experimentally benchmarked models for spin-transfer torque driven stochastic nanomagnets.
Collapse
Affiliation(s)
- Brian Sutton
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | - Kerem Yunus Camsari
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | | | - Supriyo Datta
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
49
|
Zhang W, Li J, Ding C, Xing K. $${\varvec{p}}$$ p th Moment Exponential Stability of Hybrid Delayed Reaction–Diffusion Cohen–Grossberg Neural Networks. Neural Process Lett 2016. [DOI: 10.1007/s11063-016-9572-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
50
|
Krebs WK, McCarley JS, Kozek T, Miller GM, Sinai MJ, Werblin FS. An Evaluation of a Sensor Fusion System to Improve Drivers' Nighttime Detection of Road Hazards. ACTA ACUST UNITED AC 2016. [DOI: 10.1177/154193129904302315] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Objective: To determine whether combination of two spectral bands within a single sensor-fused image can improve drivers' detection of road hazards. Methods: Images were collected with visible and short wave infrared sensors, and were combined by an image fusion algorithm derived from a computational model of human retinal processing (Werblin et al., 1997). Test stimuli were single-band and fused images of a nighttime scene, collected with sensors mounted atop a vehicle and facing down a stretch of road into an opposing vehicle's headlights. The intensity of the opposing headlights was varied to produce images of low and of high glare. The subjects' task was to detect the presence of a pedestrian within each image. Results: Sensor-fused imagery reliably produced performance better than or equivalent to that produced by either format of single-band imagery. Conclusions: Sensor fusion may provide an effective method of facilitating the detection of road hazards under low visibility conditions.
Collapse
Affiliation(s)
- WK Krebs
- Naval Postgraduate School, Department of Operations Research, Monterey, CA
| | - JS McCarley
- Naval Postgraduate School, Department of Operations Research, Monterey, CA
| | - T Kozek
- TeraOps Corporation, Berkeley, CA
| | - GM Miller
- PVP Advanced EO Systems Inc, Orange, CA
| | - MJ Sinai
- Naval Postgraduate School, Department of Operations Research, Monterey, CA
| | - FS Werblin
- TeraOps Corporation, Berkeley, CA
- University of California, Berkeley, Molecular Cell Biology Department, CA
| |
Collapse
|