1
|
Fei B, Luo T, Yang W, Liu L, Zhang R, He Y. Curriculumformer: Taming Curriculum Pre-Training for Enhanced 3-D Point Cloud Understanding. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:7316-7330. [PMID: 38870001 DOI: 10.1109/tnnls.2024.3406587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
Learning universal representations of 3-D point clouds is essential for reducing the need for manual annotation of large-scale and irregular point cloud datasets. The current modus operandi for representative learning is self-supervised learning, which has shown great potential for improving point cloud understanding. Nevertheless, it remains an open problem how to employ auto-encoding for learning universal 3-D representations of irregularly structured point clouds, as previous methods focus on either global shapes or local geometries. To this end, we present a cascaded self-supervised point cloud representation learning framework, dubbed Curriculumformer, aiming to tame curriculum pre-training for enhanced point cloud understanding. Our main idea lies in devising a progressive pre-training strategy, which trains the Transformer in an easy-to-hard manner. Specifically, we first pre-train the Transformer using an upsampling strategy, which allows it to learn global information. Then, we follow up with a completion strategy, which enables the Transformer to gain insight into local geometries. Finally, we propose a Multi-Modal Multi-Modality Contrastive Learning (M4CL) strategy to enhance the ability of representation learning by enriching the Transformer with semantic information. In this way, the pre-trained Transformer can be easily transferred to a wide range of downstream applications. We demonstrate the superior performance of Curriculumformer on various discriminant and generative tasks, outperforming state-of-the-art methods. Moreover, Curriculumformer can also be integrated into other off-the-shelf methods to promote their performance. Our code is available at https://github.com/Fayeben/Curriculumformer.
Collapse
|
2
|
Sun K, Zhang J, Xu S, Zhao Z, Zhang C, Liu J, Hu J. CACNN: Capsule Attention Convolutional Neural Networks for 3D Object Recognition. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:4091-4102. [PMID: 37934641 DOI: 10.1109/tnnls.2023.3326606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2023]
Abstract
Recently, view-based approaches, which recognize a 3D object through its projected 2-D images, have been extensively studied and have achieved considerable success in 3D object recognition. Nevertheless, most of them use a pooling operation to aggregate viewwise features, which usually leads to the visual information loss. To tackle this problem, we propose a novel layer called capsule attention layer (CAL) by using attention mechanism to fuse the features expressed by capsules. In detail, instead of dynamic routing algorithm, we use an attention module to transmit information from the lower level capsules to higher level capsules, which obviously improves the speed of capsule networks. In particular, the view pooling layer of multiview convolutional neural network (MVCNN) becomes a special case of our CAL when the trainable weights are chosen on some certain values. Furthermore, based on CAL, we propose a capsule attention convolutional neural network (CACNN) for 3D object recognition. Extensive experimental results on three benchmark datasets demonstrate the efficiency of our CACNN and show that it outperforms many state-of-the-art methods.
Collapse
|
3
|
Lee S, Heo S, Lee S. DMESH: A Structure-Preserving Diffusion Model for 3-D Mesh Denoising. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:4385-4399. [PMID: 38412085 DOI: 10.1109/tnnls.2024.3367327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
Denoising diffusion models have shown a powerful capacity for generating high-quality image samples by progressively removing noise. Inspired by this, we present a diffusion-based mesh denoiser that progressively removes noise from mesh. In general, the iterative algorithm of diffusion models attempts to manipulate the overall structure and fine details of target meshes simultaneously. For this reason, it is difficult to apply the diffusion process to a mesh denoising task that removes artifacts while maintaining a structure. To address this, we formulate a structure-preserving diffusion process. Instead of diffusing the mesh vertices to be distributed as zero-centered isotopic Gaussian distribution, we diffuse each vertex into a specific noise distribution, in which the entire structure can be preserved. In addition, we propose a topology-agnostic mesh diffusion model by projecting the vertex into multiple 2-D viewpoints to efficiently learn the diffusion using a deep network. This enables the proposed method to learn the diffusion of arbitrary meshes that have an irregular topology. Finally, the denoised mesh can be obtained via refinement based on 2-D projections obtained from reverse diffusion. Through extensive experiments, we demonstrate that our method outperforms the state-of-the-art mesh denoising methods in both quantitative and qualitative evaluations.
Collapse
|
4
|
Ganapathi II, Dharejo FA, Javed S, Ali SS, Werghi N. Unsupervised Dual Transformer Learning for 3-D Textured Surface Segmentation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:5020-5031. [PMID: 38466603 DOI: 10.1109/tnnls.2024.3365515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Analysis of the 3-D texture is indispensable for various tasks, such as retrieval, segmentation, classification, and inspection of sculptures, knit fabrics, and biological tissues. A 3-D texture represents a locally repeated surface variation (SV) that is independent of the overall shape of the surface and can be determined using the local neighborhood and its characteristics. Existing methods mostly employ computer vision techniques that analyze a 3-D mesh globally, derive features, and then utilize them for classification or retrieval tasks. While several traditional and learning-based methods have been proposed in the literature, only a few have addressed 3-D texture analysis, and none have considered unsupervised schemes so far. This article proposes an original framework for the unsupervised segmentation of 3-D texture on the mesh manifold. The problem is approached as a binary surface segmentation task, where the mesh surface is partitioned into textured and nontextured regions without prior annotation. The proposed method comprises a mutual transformer-based system consisting of a label generator (LG) and a label cleaner (LC). Both models take geometric image representations of the surface mesh facets and label them as texture or nontexture using an iterative mutual learning scheme. Extensive experiments on three publicly available datasets with diverse texture patterns demonstrate that the proposed framework outperforms standard and state-of-the-art unsupervised techniques and performs reasonably well compared to supervised methods.
Collapse
|
5
|
Chen J, Jiao L, Liu X, Liu F, Li L, Yang S. Multiresolution Interpretable Contourlet Graph Network for Image Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:17716-17729. [PMID: 37747859 DOI: 10.1109/tnnls.2023.3307721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
Modeling contextual relationships in images as graph inference is an interesting and promising research topic. However, existing approaches only perform graph modeling of entities, ignoring the intrinsic geometric features of images. To overcome this problem, a novel multiresolution interpretable contourlet graph network (MICGNet) is proposed in this article. MICGNet delicately balances graph representation learning with the multiscale and multidirectional features of images, where contourlet is used to capture the hyperplanar directional singularities of images and multilevel sparse contourlet coefficients are encoded into graph for further graph representation learning. This process provides interpretable theoretical support for optimizing the model structure. Specifically, first, the superpixel-based region graph is constructed. Then, the region graph is applied to code the nonsubsampled contourlet transform (NSCT) coefficients of the image, which are considered as node features. Considering the statistical properties of the NSCT coefficients, we calculate the node similarity, i.e., the adjacency matrix, using Mahalanobis distance. Next, graph convolutional networks (GCNs) are employed to further learn more abstract multilevel NSCT-enhanced graph representations. Finally, the learnable graph assignment matrix is designed to get the geometric association representations, which accomplish the assignment of graph representations to grid feature maps. We conduct comparative experiments on six publicly available datasets, and the experimental analysis shows that MICGNet is significantly more effective and efficient than other algorithms of recent years.
Collapse
|
6
|
Lei H, Akhtar N, Shah M, Mian A. Mesh Convolution With Continuous Filters for 3-D Surface Parsing. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:14863-14877. [PMID: 37310827 DOI: 10.1109/tnnls.2023.3281871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Geometric feature learning for 3-D surfaces is critical for many applications in computer graphics and 3-D vision. However, deep learning currently lags in hierarchical modeling of 3-D surfaces due to the lack of required operations and/or their efficient implementations. In this article, we propose a series of modular operations for effective geometric feature learning from 3-D triangle meshes. These operations include novel mesh convolutions, efficient mesh decimation, and associated mesh (un)poolings. Our mesh convolutions exploit spherical harmonics as orthonormal bases to create continuous convolutional filters. The mesh decimation module is graphics processing unit (GPU)-accelerated and able to process batched meshes on-the-fly, while the (un)pooling operations compute features for upsampled/downsampled meshes. We provide an open-source implementation of these operations, collectively termed Picasso. Picasso supports heterogeneous mesh batching and processing. Leveraging its modular operations, we further contribute a novel hierarchical neural network for perceptual parsing of 3-D surfaces, named PicassoNet++. It achieves highly competitive performance for shape analysis and scene segmentation on prominent 3-D benchmarks. The code, data, and trained models are available at https://github.com/EnyaHermite/Picasso.
Collapse
|
7
|
Khodadad M, Shiraee Kasmaee A, Mahyar H, Rezanejad M. MLGCN: an ultra efficient graph convolutional neural model for 3D point cloud analysis. Front Artif Intell 2024; 7:1439340. [PMID: 39372661 PMCID: PMC11449895 DOI: 10.3389/frai.2024.1439340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 08/29/2024] [Indexed: 10/08/2024] Open
Abstract
With the rapid advancement of 3D acquisition technologies, 3D sensors such as LiDARs, 3D scanners, and RGB-D cameras have become increasingly accessible and cost-effective. These sensors generate 3D point cloud data that require efficient algorithms for tasks such as 3D model classification and segmentation. While deep learning techniques have proven effective in these areas, existing models often rely on complex architectures, leading to high computational costs that are impractical for real-time applications like augmented reality and robotics. In this work, we propose the Multi-level Graph Convolutional Neural Network (MLGCN), an ultra-efficient model for 3D point cloud analysis. The MLGCN model utilizes shallow Graph Neural Network (GNN) blocks to extract features at various spatial locality levels, leveraging precomputed KNN graphs shared across GCN blocks. This approach significantly reduces computational overhead and memory usage, making the model well-suited for deployment on low-memory and low-CPU devices. Despite its efficiency, MLGCN achieves competitive performance in object classification and part segmentation tasks, demonstrating results comparable to state-of-the-art models while requiring up to a thousand times fewer floating-point operations and significantly less storage. The contributions of this paper include the introduction of a lightweight, multi-branch graph-based network for 3D shape analysis, the demonstration of the model's efficiency in both computation and storage, and a thorough theoretical and experimental evaluation of the model's performance. We also conduct ablation studies to assess the impact of different branches within the model, providing valuable insights into the role of specific components.
Collapse
Affiliation(s)
- Mohammad Khodadad
- Department of Computational Science and Engineering, McMaster University, Hamilton, ON, Canada
| | - Ali Shiraee Kasmaee
- Department of Computational Science and Engineering, McMaster University, Hamilton, ON, Canada
| | - Hamidreza Mahyar
- Department of Computational Science and Engineering, McMaster University, Hamilton, ON, Canada
| | - Morteza Rezanejad
- Rosalind and Morris Goodman Cancer Institute, McGill University, Montreal, QC, Canada
| |
Collapse
|
8
|
Zhu C, Zhang C, Shang T, Zhang C, Zhai S, Cao L, Xu Z, Su Z, Song Y, Su A, Li C, Duan H. GAPS: a geometric attention-based network for peptide binding site identification by the transfer learning approach. Brief Bioinform 2024; 25:bbae297. [PMID: 38990514 PMCID: PMC11238429 DOI: 10.1093/bib/bbae297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 04/28/2024] [Accepted: 06/07/2024] [Indexed: 07/12/2024] Open
Abstract
Protein-peptide interactions (PPepIs) are vital to understanding cellular functions, which can facilitate the design of novel drugs. As an essential component in forming a PPepI, protein-peptide binding sites are the basis for understanding the mechanisms involved in PPepIs. Therefore, accurately identifying protein-peptide binding sites becomes a critical task. The traditional experimental methods for researching these binding sites are labor-intensive and time-consuming, and some computational tools have been invented to supplement it. However, these computational tools have limitations in generality or accuracy due to the need for ligand information, complex feature construction, or their reliance on modeling based on amino acid residues. To deal with the drawbacks of these computational algorithms, we describe a geometric attention-based network for peptide binding site identification (GAPS) in this work. The proposed model utilizes geometric feature engineering to construct atom representations and incorporates multiple attention mechanisms to update relevant biological features. In addition, the transfer learning strategy is implemented for leveraging the protein-protein binding sites information to enhance the protein-peptide binding sites recognition capability, taking into account the common structure and biological bias between proteins and peptides. Consequently, GAPS demonstrates the state-of-the-art performance and excellent robustness in this task. Moreover, our model exhibits exceptional performance across several extended experiments including predicting the apo protein-peptide, protein-cyclic peptide and the AlphaFold-predicted protein-peptide binding sites. These results confirm that the GAPS model is a powerful, versatile, stable method suitable for diverse binding site predictions.
Collapse
Affiliation(s)
- Cheng Zhu
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Chaowang Road, Gongshu District, Hangzhou 310014, China
| | - Chengyun Zhang
- AI Department, Shanghai Highslab Therapeutics. Inc, Zhangheng Road, Pudong New Area, Shanghai 201203, China
| | - Tianfeng Shang
- AI Department, Shanghai Highslab Therapeutics. Inc, Zhangheng Road, Pudong New Area, Shanghai 201203, China
| | - Chenhao Zhang
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Chaowang Road, Gongshu District, Hangzhou 310014, China
| | - Silong Zhai
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Chaowang Road, Gongshu District, Hangzhou 310014, China
| | - Lujing Cao
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Chaowang Road, Gongshu District, Hangzhou 310014, China
| | - Zhenyu Xu
- AI Department, Shanghai Highslab Therapeutics. Inc, Zhangheng Road, Pudong New Area, Shanghai 201203, China
| | - Zhihao Su
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Chaowang Road, Gongshu District, Hangzhou 310014, China
| | - Ying Song
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Chaowang Road, Gongshu District, Hangzhou 310014, China
| | - An Su
- College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road, Gongshu District, Hangzhou 310014, China
| | - Chengxi Li
- College of Chemical and Biological Engineering, Zhejiang University, Yuhangtang Road, Xihu District, Hangzhou 310027, China
| | - Hongliang Duan
- Faculty of Applied Sciences, Macao Polytechnic University, R. de Luís Gonzaga Gomes, Macao 999078, China
| |
Collapse
|
9
|
Xia Q. DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification. Heliyon 2023; 9:e21515. [PMID: 38027921 PMCID: PMC10665673 DOI: 10.1016/j.heliyon.2023.e21515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Revised: 09/20/2023] [Accepted: 10/23/2023] [Indexed: 12/01/2023] Open
Abstract
Fast and effective algorithms for deep learning on 3D shapes are keys to innovate mechanical and electronic engineering design workflow. In this paper, an efficient 3D shape to 2D images projection algorithm and a shallow 2.5D convolutional neural network architecture is proposed. A smaller convolutional neural network (CNN) model is achieved by information enrichment at the preprocessing stage, i.e. 3D geometry is compressed into 2D "thickness view" and "depth view". Fusing the depth view and thickness view (DTV) from the same projection view into a dual-channel grayscale image, can improve information locality for geometry and topology feature extraction. This approach bridges the gap between mature image deep learning technologies to the applications of 3D shape. Enhanced by several essential scalar geometry properties and only 3 projection views, a mixed CNN and multiple linear parameter (MLP) neural network model achives a validation accuracy of 92 % for ModelNet10 mesh-based dataset, while the training time is one order of magnitude less than the original multi-view CNN approach. This study also creates new 3D shape datasets from 2 open source CAD projects. Higher validation accuracy is obtained for realistic CAD datasets, i.e. 97 % for FreeCAD's mechanical part library and 95 % for KiCAD electronic part library. The training cost reduces to tens of minutes on a laptop CPU, given the smaller input data size and shallow neural network design. It is expected that this approach can be adapted for other machine learning scenarios involved in CAD geometry.
Collapse
Affiliation(s)
- Qingfeng Xia
- Culham Centre for Fusion Energy, United Kingdom Atomic Energy Authority, OX14 3DB, United Kingdom
| |
Collapse
|
10
|
Hu X. The role of deep learning in the innovation of smart classroom teaching mode under the background of internet of things and fuzzy control. Heliyon 2023; 9:e18594. [PMID: 37576291 PMCID: PMC10415824 DOI: 10.1016/j.heliyon.2023.e18594] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 07/17/2023] [Accepted: 07/21/2023] [Indexed: 08/15/2023] Open
Abstract
Electronic components are rapidly updated in the context of expanding application requirements, and communication protocols used in combination with various electronic devices are also emerging. On this basis, IoT technology has developed a variety of sensor devices and gateways, which are widely used in cities. In the field of wisdom, applying IoT technology to classrooms can effectively improve the deficiencies of traditional teaching models. Fuzzy control theory is usually based on fuzzy sets in mathematics, and is combined with neural network, genetic and probability algorithms to form a calculation method. Fuzzy calculation has the ability to simplify the system input of a variety of complex variables, and its applications in the field of education are mainly: provide evaluation of teachers' teaching effectiveness. The advancement of science and technology has promoted the change and updating of the teaching mode. With the continuous advancement of basic education curriculum reform and the continuous deepening of classroom teaching reform, classroom teaching is also in urgent need of reform, from traditional classrooms to smart classrooms. Smart classrooms combine advanced technology with teachers' teaching. Through the dynamic data, the analysis instantly understands the student's learning situation, and then integrates it into education and teaching in a targeted manner. This paper conducts a questionnaire survey on the current situation of smart classroom teaching, and summarizes the current teaching problems. Then, combining the Internet of Things, fuzzy control and deep learning technology, from the two aspects of school teachers and students, it is proposed for smart classroom to promote students' learning effect. With its novel and new-style teaching advantages, smart classroom has gradually entered the public's vision and gained the attention and support of the majority of educators. Taking Grand Wisdom Classroom as an example, it uses the "Internet +" way of thinking and the new generation of information technology such as big data and cloud computing to create intelligent and efficient classrooms, realizing the whole process of application before, during and after class, and promoting the development of students' wisdom. Under the mobile Internet model, students and teachers can communicate anytime and anywhere. Combined with the analysis and application of our big data technology, data-based precision teaching becomes possible. In a real sense, learning before teaching can be realized and teaching can be determined by learning.
Collapse
Affiliation(s)
- Xiaoyan Hu
- Nantong Normal College, Nantong, 226000, China
| |
Collapse
|
11
|
He X, Shan W, Zhang R, Heidari AA, Chen H, Zhang Y. Improved Colony Predation Algorithm Optimized Convolutional Neural Networks for Electrocardiogram Signal Classification. Biomimetics (Basel) 2023; 8:268. [PMID: 37504156 PMCID: PMC10377160 DOI: 10.3390/biomimetics8030268] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/18/2023] [Accepted: 06/18/2023] [Indexed: 07/29/2023] Open
Abstract
Recently, swarm intelligence algorithms have received much attention because of their flexibility for solving complex problems in the real world. Recently, a new algorithm called the colony predation algorithm (CPA) has been proposed, taking inspiration from the predatory habits of groups in nature. However, CPA suffers from poor exploratory ability and cannot always escape solutions known as local optima. Therefore, to improve the global search capability of CPA, an improved variant (OLCPA) incorporating an orthogonal learning strategy is proposed in this paper. Then, considering the fact that the swarm intelligence algorithm can go beyond the local optimum and find the global optimum solution, a novel OLCPA-CNN model is proposed, which uses the OLCPA algorithm to tune the parameters of the convolutional neural network. To verify the performance of OLCPA, comparison experiments are designed to compare with other traditional metaheuristics and advanced algorithms on IEEE CEC 2017 benchmark functions. The experimental results show that OLCPA ranks first in performance compared to the other algorithms. Additionally, the OLCPA-CNN model achieves high accuracy rates of 97.7% and 97.8% in classifying the MIT-BIH Arrhythmia and European ST-T datasets.
Collapse
Affiliation(s)
- Xinxin He
- School of Emergency Management, Institute of Disaster Prevention, Sanhe 065201, China
| | - Weifeng Shan
- School of Emergency Management, Institute of Disaster Prevention, Sanhe 065201, China
| | - Ruilei Zhang
- School of Emergency Management, Institute of Disaster Prevention, Sanhe 065201, China
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran 1417935840, Iran
| | - Huiling Chen
- Institute of Big Data and Information Technology, Wenzhou University, Wenzhou 325000, China
| | - Yudong Zhang
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK
| |
Collapse
|
12
|
K V, Trojovský P, Hubálovský Š. VIOLA jones algorithm with capsule graph network for deepfake detection. PeerJ Comput Sci 2023; 9:e1313. [PMID: 37346538 PMCID: PMC10280569 DOI: 10.7717/peerj-cs.1313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 03/06/2023] [Indexed: 06/23/2023]
Abstract
DeepFake is a forged image or video created using deep learning techniques. The present fake content of the detection technique can detect trivial images such as barefaced fake faces. Moreover, the capability of current methods to detect fake faces is minimal. Many recent types of research have made the fake detection algorithm from rule-based to machine-learning models. However, the emergence of deep learning technology with intelligent improvement motivates this specified research to use deep learning techniques. Thus, it is proposed to have VIOLA Jones's (VJ) algorithm for selecting the best features with Capsule Graph Neural Network (CN). The graph neural network is improved by capsule-based node feature extraction to improve the results of the graph neural network. The experiment is evaluated with CelebDF-FaceForencics++ (c23) datasets, which combines FaceForencies++ (c23) and Celeb-DF. In the end, it is proved that the accuracy of the proposed model has achieved 94.
Collapse
Affiliation(s)
- Venkatachalam K
- Department of Applied Cybernetics, Faculty of Science, University of Hradec Králová, Hradec Králová, Czech Republic
| | - Pavel Trojovský
- Department of Mathematics, Faculty of Science, University of Hradec Kralove, Hradec Kralove, Czech Republic
| | - Štěpán Hubálovský
- Department of Applied Cybernetics, Faculty of Science, University of Hradec Králová, Hradec Králová, Czech Republic
| |
Collapse
|
13
|
Kumar Pandey R, Gandomkar A, Vaferi B, Kumar A, Torabi F. Supervised deep learning-based paradigm to screen the enhanced oil recovery scenarios. Sci Rep 2023; 13:4892. [PMID: 36966250 PMCID: PMC10039950 DOI: 10.1038/s41598-023-32187-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 03/23/2023] [Indexed: 03/27/2023] Open
Abstract
High oil prices and concern about limited oil reserves lead to increase interest in enhanced oil recovery (EOR). Selecting the most efficient development plan is of high interest to optimize economic cost. Hence, the main objective of this study is to construct a novel deep-learning classifier to select the best EOR method based on the reservoir's rock and fluid properties (depth, porosity, permeability, gravity, viscosity), and temperature. Our deep learning-based classifier consists of a one-dimensional (1D) convolutional neural network, long short-term memory (LSTM), and densely connected neural network layers. The genetic algorithm has been applied to tune the hyperparameters of this hybrid classifier. The proposed classifier is developed and tested using 735 EOR projects on sandstone, unconsolidated sandstone, carbonate, and conglomerate reservoirs in more than 17 countries. Both the numerical and graphical investigations approve that the structure-tuned deep learning classifier is a reliable tool to screen the EOR scenarios and select the best one. The designed model correctly classifies training, validation, and testing examples with an accuracy of 96.82%, 84.31%, and 82.61%, respectively. It means that only 30 out of 735 available EOR projects are incorrectly identified by the proposed deep learning classifier. The model also demonstrates a small categorical cross-entropy of 0.1548 for the classification of the involved enhanced oil recovery techniques. Such a powerful classifier is required to select the most suitable EOR candidate for a given oil reservoir with limited field information.
Collapse
Affiliation(s)
- Rakesh Kumar Pandey
- Department of Petroleum and Energy Studies, School of Engineering and Technology, DIT University, Dehradun, India
| | - Asghar Gandomkar
- Department of Chemical Engineering, Shiraz Branch, Islamic Azad University, Shiraz, Iran
| | - Behzad Vaferi
- Department of Chemical Engineering, Shiraz Branch, Islamic Azad University, Shiraz, Iran.
- Department of Advanced Calculations, Chemical, Petroleum, and Polymer Engineering Research Center, Shiraz Branch, Islamic Azad University, Shiraz, Iran.
| | - Anil Kumar
- Director, Tula's Institute, Dehradun, 248001, India
| | - Farshid Torabi
- Faculty of Engineering and Applied Science, University of Regina, Regina, SK, S4S 0A2, Canada
| |
Collapse
|
14
|
Jakkaladiki SP, Maly F. An efficient transfer learning based cross model classification (TLBCM) technique for the prediction of breast cancer. PeerJ Comput Sci 2023; 9:e1281. [PMID: 37346575 PMCID: PMC10280457 DOI: 10.7717/peerj-cs.1281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 02/16/2023] [Indexed: 06/23/2023]
Abstract
Breast cancer has been the most life-threatening disease in women in the last few decades. The high mortality rate among women is due to breast cancer because of less awareness and a minimum number of medical facilities to detect the disease in the early stages. In the recent era, the situation has changed with the help of many technological advancements and medical equipment to observe breast cancer development. The machine learning technique supports vector machines (SVM), logistic regression, and random forests have been used to analyze the images of cancer cells on different data sets. Although the particular technique has performed better on the smaller data set, accuracy still needs to catch up in most of the data, which needs to be fairer to apply in the real-time medical environment. In the proposed research, state-of-the-art deep learning techniques, such as transfer learning, based cross model classification (TLBCM), convolution neural network (CNN) and transfer learning, residual network (ResNet), and Densenet proposed for efficient prediction of breast cancer with the minimized error rating. The convolution neural network and transfer learning are the most prominent techniques for predicting the main features in the data set. The sensitive data is protected using a cyber-physical system (CPS) while using the images virtually over the network. CPS act as a virtual connection between human and networks. While the data is transferred in the network, it must monitor using CPS. The ResNet changes the data on many layers without compromising the minimum error rate. The DenseNet conciliates the problem of vanishing gradient issues. The experiment is carried out on the data sets Breast Cancer Wisconsin (Diagnostic) and Breast Cancer Histopathological Dataset (BreakHis). The convolution neural network and the transfer learning have achieved a validation accuracy of 98.3%. The results of these proposed methods show the highest classification rate between the benign and the malignant data. The proposed method improves the efficiency and speed of classification, which is more convenient for discovering breast cancer in earlier stages than the previously proposed methodologies.
Collapse
|
15
|
Xiang Q, Huang T, Zhang Q, Li Y, Tolba A, Bulugu I. A novel sentiment analysis method based on multi-scale deep learning. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:8766-8781. [PMID: 37161221 DOI: 10.3934/mbe.2023385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
As the college students have been a most active user group in various social media, it remains significant to make effective sentiment analysis for college public opinions. Capturing the direction of public opinion in the student community in a timely manner and guiding students to develop the right values can help in the ideological management of universities. Universally, the recurrent neural networks have been the mainstream technology in terms of sentiment analysis. Nevertheless, the existing research works more emphasized semantic characteristics in vertical direction, yet failing to capture sematic characteristics in horizonal direction. In other words, it is supposed to increase more balance into sentiment analysis models. To remedy such gap, this paper presents a novel sentiment analysis method based on multi-scale deep learning for college public opinions. To fit for bidirectional semantic characteristics, a typical sequential neural network with two propagation paths is selected as the backbone. It is then extended with more layers in horizonal direction. Such design is able to balance both model depth and model breadth. At last, some experiments on a real-world social media dataset are conducted for evaluation, well acknowledging efficiency of the proposed analysis model.
Collapse
Affiliation(s)
- Qiao Xiang
- School of Artificial Intelligence, Chongqing Technology and Business University, Chongqing 400067, China
| | - Tianhong Huang
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331-5501, USA
| | - Qin Zhang
- School of Artificial Intelligence, Chongqing Technology and Business University, Chongqing 400067, China
| | - Yufeng Li
- School of Mechanical Engineering, Chongqing Technology and Business University, Chongqing 400067, China
| | - Amr Tolba
- Computer Science Department, Community College, King Saud University, Riyadh 11437, Saudi Arabia
| | - Isack Bulugu
- Department of Electronics and Telecommunications Engineering, College of ICT, University of Dar es Salaam, Dar es Salaam, Tanzania
| |
Collapse
|
16
|
Thakur GK, Garg SK, Singh T, Ali MS, Arora TK. Non-fragile synchronization of BAM neural networks with randomly occurring controller gain fluctuation. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:7302-7315. [PMID: 37161153 DOI: 10.3934/mbe.2023317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
In this research, a non-fragile synchronization of bidirectional association memory (BAM) delayed neural networks is taken into consideration. The controller gain fluctuation seems in a very random manner, that obeys sure Bernoulli distributed noise sequences. Delay dependent criteria are derived to confirm the asymptotic stability of the BAM delayed neural networks. The non-fragile controller are often obtained by determination a collection of linear matrix inequalities (LMIs). A simulation example is used to demonstrate the efficiency of the developed control.
Collapse
Affiliation(s)
- Ganesh Kumar Thakur
- Department of Applied sciences, ABES Engineering College, Ghaziabad, UP-201007, India
| | - Sudesh Kumar Garg
- Department of Applied Science, G L BAJAJ Institute of Technology and Management, Greater Noida, Uttar Pradesh, India
| | - Tej Singh
- Department of Applied sciences, ABES Engineering College, Ghaziabad, UP-201007, India
| | - M Syed Ali
- Department of Mathematics, Thiruvalluvar University, Vellore-632115, Tamilndau, Inida
| | - Tarun Kumar Arora
- Department of Applied sciences, ABES Engineering College, Ghaziabad, UP-201007, India
| |
Collapse
|
17
|
Chen ZY, Meng YH, Wang RY, Chen T. Neural Based Grey Nonlinear Control for Real-World Example of Mechanical Systems. Neural Process Lett 2023. [DOI: 10.1007/s11063-022-11109-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
18
|
Ye F, Zhou Z, Wu Y, Enkhtur B. Application of convolutional neural network in fusion and classification of multi-source remote sensing data. Front Neurorobot 2022; 16:1095717. [PMID: 36620484 PMCID: PMC9815026 DOI: 10.3389/fnbot.2022.1095717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
Introduction Through remote sensing images, we can understand and observe the terrain, and its application scope is relatively large, such as agriculture, military, etc. Methods In order to achieve more accurate and efficient multi-source remote sensing data fusion and classification, this study proposes DB-CNN algorithm, introduces SVM algorithm and ELM algorithm, and compares and verifies their performance through relevant experiments. Results From the results, we can find that for the dual branch CNN network structure, hyperspectral data and laser mines joint classification of data can achieve higher classification accuracy. On different data sets, the global classification accuracy of the joint classification method is 98.46%. DB-CNN model has the highest training accuracy and fastest speed in training and testing. In addition, the DB-CNN model has the lowest test error, about 0.026, 0.037 lower than the ELM model and 0.056 lower than the SVM model. The AUC value corresponding to the ROC curve of its model is about 0.922, higher than that of the other two models. Discussion It can be seen that the method used in this paper can significantly improve the effect of multi-source remote sensing data fusion and classification, and has certain practical value.
Collapse
Affiliation(s)
- Fanghong Ye
- Land Satellite Remote Sensing Application Center, Ministry of Natural Resources of People's Republic of China, Beijing, China,School of Resource and Environmental Sciences, Wuhan University, Wuhan, China,*Correspondence: Fanghong Ye ✉
| | - Zheng Zhou
- Ecology and Environment Monitoring and Scientific Research Center, Ministry of Ecology and Environment of the People's Republic of China, Wuhan, China,Zheng Zhou ✉
| | - Yue Wu
- Department of Natural Resources of Heilongjiang Province, Heilongjiang Provincial Institute of Land and Space Planning, Harbin, China
| | - Bayarmaa Enkhtur
- Geospatial Information and Technology Department, Agency for Land Administration and Management, Geodesy and Cartography, Ulaanbaatar, Mongolia
| |
Collapse
|
19
|
Zhao W, Wang Y, Qu Y, Ma H, Wang S. Binary Classification Quantum Neural Network Model Based on Optimized Grover Algorithm. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1783. [PMID: 36554188 PMCID: PMC9777537 DOI: 10.3390/e24121783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 11/23/2022] [Accepted: 11/29/2022] [Indexed: 06/17/2023]
Abstract
We focus on the problem that the Grover algorithm is not suitable for the completely unknown proportion of target solutions. Considering whether the existing quantum classifier used by the current quantum neural network (QNN) to complete the classification task can solve the problem of the classical classifier, this paper proposes a binary quantum neural network classifical model based on an optimized Grover algorithm based on partial diffusion. Trial and error is adopted to extend the partial diffusion quantum search algorithm with the known proportion of target solutions to the unknown state, and to apply the characteristics of the supervised learning of the quantum neural network to binary classify the classified data. Experiments show that the proposed method can effectively retrieve quantum states with similar features. The test accuracy of BQM retrieval under the depolarization noise at the 20th period can reach 97% when the depolarization rate is 0.1. It improves the retrieval accuracy by about 4% and 10% compared with MSE and BCE in the same environment.
Collapse
Affiliation(s)
- Wenlin Zhao
- School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266520, China
| | - Yinuo Wang
- School of Science, Qingdao University of Technology, Qingdao 266520, China
| | - Yingjie Qu
- School of Science, Qingdao University of Technology, Qingdao 266520, China
| | - Hongyang Ma
- School of Science, Qingdao University of Technology, Qingdao 266520, China
| | - Shumei Wang
- School of Science, Qingdao University of Technology, Qingdao 266520, China
| |
Collapse
|
20
|
Sesyuk A, Ioannou S, Raspopoulos M. A Survey of 3D Indoor Localization Systems and Technologies. SENSORS (BASEL, SWITZERLAND) 2022; 22:9380. [PMID: 36502083 PMCID: PMC9735771 DOI: 10.3390/s22239380] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 11/25/2022] [Accepted: 11/27/2022] [Indexed: 06/17/2023]
Abstract
Indoor localization has recently and significantly attracted the interest of the research community mainly due to the fact that Global Navigation Satellite Systems (GNSSs) typically fail in indoor environments. In the last couple of decades, there have been several works reported in the literature that attempt to tackle the indoor localization problem. However, most of this work is focused solely on two-dimensional (2D) localization, while very few papers consider three dimensions (3D). There is also a noticeable lack of survey papers focusing on 3D indoor localization; hence, in this paper, we aim to carry out a survey and provide a detailed critical review of the current state of the art concerning 3D indoor localization including geometric approaches such as angle of arrival (AoA), time of arrival (ToA), time difference of arrival (TDoA), fingerprinting approaches based on Received Signal Strength (RSS), Channel State Information (CSI), Magnetic Field (MF) and Fine Time Measurement (FTM), as well as fusion-based and hybrid-positioning techniques. We provide a variety of technologies, with a focus on wireless technologies that may be utilized for 3D indoor localization such as WiFi, Bluetooth, UWB, mmWave, visible light and sound-based technologies. We critically analyze the advantages and disadvantages of each approach/technology in 3D localization.
Collapse
Affiliation(s)
- Andrey Sesyuk
- School of Engineering, University of Central Lancashire, Preston PR12HE, UK
| | - Stelios Ioannou
- School of Sciences, University of Central Lancashire Cyprus, Larnaca 7080, Cyprus
| | - Marios Raspopoulos
- School of Sciences, University of Central Lancashire Cyprus, Larnaca 7080, Cyprus
| |
Collapse
|
21
|
Zhai R, Song J, Hou S, Gao F, Li X. Self-Supervised Learning for Point-Cloud Classification by a Multigrid Autoencoder. SENSORS (BASEL, SWITZERLAND) 2022; 22:8115. [PMID: 36365813 PMCID: PMC9658469 DOI: 10.3390/s22218115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 10/20/2022] [Accepted: 10/20/2022] [Indexed: 06/16/2023]
Abstract
It has become routine to directly process point clouds using a combination of shared multilayer perceptrons and aggregate functions. However, this practice has difficulty capturing the local information of point clouds, leading to information loss. Nevertheless, several recent works have proposed models that establish point-to-point relationships based on this procedure. However, to address the information loss, in this study we use self-supervised methods to enhance the network's understanding of point clouds. Our proposed multigrid autoencoder (MA) constrains the encoder part of the classification network so that it gains an understanding of the point cloud as it reconstructs it. With the help of self-supervised learning, we find the original network improves performance. We validate our model on PointNet++, and the experimental results show that our method improves overall classification accuracy by 2.0% and 4.7% with ModelNet40 and ScanObjectNN datasets, respectively.
Collapse
Affiliation(s)
- Ruifeng Zhai
- State Key Laboratory of Integrated Optoelectronics, College of Electronic Science and Engineering, Jilin University, Changchun 130012, China
| | - Junfeng Song
- State Key Laboratory of Integrated Optoelectronics, College of Electronic Science and Engineering, Jilin University, Changchun 130012, China
- Peng Cheng Laboratory, Shenzhen 518000, China
| | - Shuzhao Hou
- State Key Laboratory of Integrated Optoelectronics, College of Electronic Science and Engineering, Jilin University, Changchun 130012, China
| | - Fengli Gao
- State Key Laboratory of Integrated Optoelectronics, College of Electronic Science and Engineering, Jilin University, Changchun 130012, China
| | - Xueyan Li
- State Key Laboratory of Integrated Optoelectronics, College of Electronic Science and Engineering, Jilin University, Changchun 130012, China
| |
Collapse
|