1
|
Yang J, Yuan X, Lu X, Yan Tang Y. Adjustable Jacobi-Fourier Moment for Image Representation. IEEE TRANSACTIONS ON CYBERNETICS 2025; 55:207-220. [PMID: 39466856 DOI: 10.1109/tcyb.2024.3482352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]
Abstract
The widely adopted Jacobi-Fourier moment (JFM) is limited by its inability to effectively capture spatial information. Although fractional-order JFM (FOJFM) introduces spatial information through a fractional-order parameter, the control of spatial information remains inadequate. This limitation stems from the insufficient control over zeros distribution associated with the used moment's radial kernel. To address this issue, we generalize both JFM and FOJFM into a transformed JFM. A transformed function with four parameters is designed, and adjustable JFM (AJFM) is proposed. Two parameters correlate to increasing velocities on the left and right parts of the transformed functions, enabling zeros quantities of radial kernel fall in the left and right parts of the interval. The other two parameters segment the transformed function, adjusting regions where different quantities of zeros fall in. This refined control over the radial kernel's zero distribution enhances the versatility of feature extraction by the AJFM, governed by the introduced parameters. Experimental results demonstrate that AJFM, with properly chosen parameters, can emphasize specific regions within an image more effectively.
Collapse
|
2
|
Chen J, Guo Z, Li H, Chen CLP. Regularizing Scale-Adaptive Central Moment Sharpness for Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:6452-6466. [PMID: 36215387 DOI: 10.1109/tnnls.2022.3210045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In deep learning, finding flat minima of loss function is a hot research topic in improving generalization. The existing methods usually find flat minima by sharpness minimization algorithms. However, these methods suffer from insufficient flexibility for optimization and generalization due to their ignorance of loss value. This article theoretically and experimentally explores the sharpness minimization algorithms for neural networks. First, a novel scale-invariant sharpness which is called scale-adaptive central moment sharpness (SA-CMS) is proposed. This sharpness is not only scale-invariant but can characterize the nature of loss surface clearly. Based on the proposed sharpness, this article further derives a new regularization term by integrating the different orders of the sharpness. Particularly, a host of sharpness minimization functions such as local entropy can be covered by this regularization term. Then the central moment sharpness generating function is introduced as a new objective function. Moreover, theoretical analyses indicate that the new objective function has a smoother landscape and prefer converging to flat local minima. Furthermore, a computationally efficient two-stage algorithm is developed to minimize the objective function. Compared with other algorithms, the two-stage loss-sharpness minimization (TSLSM) algorithm offers a more flexible optimization target for different training stages. On a variety of learning tasks with both small and large batch sizes, this algorithm is more universal and effective, and meanwhile achieves or surpasses the generalization performance of the state-of-the-art sharpness minimization algorithms.
Collapse
|
3
|
Bera A, Wharton Z, Liu Y, Bessis N, Behera A. SR-GNN: Spatial Relation-aware Graph Neural Network for Fine-Grained Image Categorization. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022; PP:6017-6031. [PMID: 36103441 DOI: 10.1109/tip.2022.3205215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Over the past few years, a significant progress has been made in deep convolutional neural networks (CNNs)-based image recognition. This is mainly due to the strong ability of such networks in mining discriminative object pose and parts information from texture and shape. This is often inappropriate for fine-grained visual classification (FGVC) since it exhibits high intra-class and low inter-class variances due to occlusions, deformation, illuminations, etc. Thus, an expressive feature representation describing global structural information is a key to characterize an object/ scene. To this end, we propose a method that effectively captures subtle changes by aggregating context-aware features from most relevant image-regions and their importance in discriminating fine-grained categories avoiding the bounding-box and/or distinguishable part annotations. Our approach is inspired by the recent advancement in self-attention and graph neural networks (GNNs) approaches to include a simple yet effective relation-aware feature transformation and its refinement using a context-aware attention mechanism to boost the discriminability of the transformed feature in an end-to-end learning process. Our model is evaluated on eight benchmark datasets consisting of fine-grained objects and human-object interactions. It outperforms the state-of-the-art approaches by a significant margin in recognition accuracy.
Collapse
|
4
|
Lai Q, Vong CM, Wong PK, Wang ST, Yan T, Choi IC, Yu HH. Multi-scale Multi-instance Multi-feature Joint Learning Broad Network (M3JLBN) for gastric intestinal metaplasia subtype classification. Knowl Based Syst 2022; 249:108960. [DOI: 10.1016/j.knosys.2022.108960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
5
|
A Novel Hybrid Convolutional Neural Network Approach for the Stomach Intestinal Early Detection Cancer Subtype Classification. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7325064. [PMID: 35785096 PMCID: PMC9249475 DOI: 10.1155/2022/7325064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Revised: 06/05/2022] [Accepted: 06/10/2022] [Indexed: 11/18/2022]
Abstract
There may be different types of cancer that cause fatal effects in the human body. In general, cancer is nothing but the unnatural growth of blood cells in different parts of the body and is named accordingly. It may be skin cancer, breast cancer, uterus cancer, intestinal cancer, stomach cancer, etc. However, every type of cancer consists of unwanted blood cells which cause issues in the body starting from the minor to death. Cancer cells have the common features in them, and these common features we have used in our work for the processing. Cancer has a significant death rate; however, it is frequently curable with simple surgery if detected in its early stages. A quick and correct diagnosis may be extremely beneficial to both doctors and patients. In several medical domains, the latest deep-learning-based model’s performance is comparable to or even exceeds that of human specialists. We have proposed a novel methodology based on a convolutional neural network that may be used for almost all types of cancer detection. We have collected different datasets of different types of common cancer from different sources and used 90% of the sample data for the training purpose, then we reduced it by 10%, and an 80% image set was used for the validation of the model. After that for testing purposes, we fed a sample dataset and obtain the results. The final output clearly shows that the proposed model outperforms the previous model when we compared our methodology with the existing work.
Collapse
|
6
|
Wang Y, Tan YP, Tang YY, Chen H, Zou C, Li L. Generalized and Discriminative Collaborative Representation for Multiclass Classification. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:2675-2686. [PMID: 33001820 DOI: 10.1109/tcyb.2020.3021712] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This article presents a generalized collaborative representation-based classification (GCRC) framework, which includes many existing representation-based classification (RC) methods, such as collaborative RC (CRC) and sparse RC (SRC) as special cases. This article also advances the GCRC theory by exploring theoretical conditions on the general regularization matrix. A key drawback of CRC and SRC is that they fail to use the label information of training data and are essentially unsupervised in computing the representation vector. This largely compromises the discriminative ability of the learned representation vector and impedes the classification performance. Guided by the GCRC theory, we propose a novel RC method referred to as discriminative RC (DRC). The proposed DRC method has the following three desirable properties: 1) discriminability: DRC can leverage the label information of training data and is supervised in both representation and classification, thus improving the discriminative ability of the representation vector; 2) efficiency: it has a closed-form solution and is efficient in computing the representation vector and performing classification; and 3) theory: it also has theoretical guarantees for classification. Experimental results on benchmark databases demonstrate both the efficacy and efficiency of DRC for multiclass classification.
Collapse
|
7
|
Zeng C, Li G, Chen Q, Xiao Q. Lightweight global-locally connected distillation network for single image super-resolution. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03454-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
8
|
Liu Z, Lu H, Pan X, Xu M, Lan R, Luo X. Diagnosis of Alzheimer’s disease via an attention-based multi-scale convolutional neural network. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107942] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
9
|
Gou J, Xiong X, Wu H, Du L, Zeng S, Yuan Y, Ou W. Locality-constrained weighted collaborative-competitive representation for classification. INT J MACH LEARN CYB 2021. [DOI: 10.1007/s13042-021-01461-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
10
|
Lan R, Zhu Y, Lu H, Liu Z, Luo X. A Two-Phase Learning-Based Swarm Optimizer for Large-Scale Optimization. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:6284-6293. [PMID: 32149665 DOI: 10.1109/tcyb.2020.2968400] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In this article, a simple yet effective method, called a two-phase learning-based swarm optimizer (TPLSO), is proposed for large-scale optimization. Inspired by the cooperative learning behavior in human society, mass learning and elite learning are involved in TPLSO. In the mass learning phase, TPLSO randomly selects three particles to form a study group and then adopts a competitive mechanism to update the members of the study group. Then, we sort all of the particles in the swarm and pick out the elite particles that have better fitness values. In the elite learning phase, the elite particles learn from each other to further search for more promising areas. The theoretical analysis of TPLSO exploration and exploitation abilities is performed and compared with several popular particle swarm optimizers. Comparative experiments on two widely used large-scale benchmark datasets demonstrate that the proposed TPLSO achieves better performance on diverse large-scale problems than several state-of-the-art algorithms.
Collapse
|
11
|
|
12
|
Discriminative Codebook Hashing for Supervised Video Retrieval. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:5845094. [PMID: 34512743 PMCID: PMC8433008 DOI: 10.1155/2021/5845094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 08/12/2021] [Indexed: 11/27/2022]
Abstract
In recent years, hashing learning has received increasing attention in supervised video retrieval. However, most existing supervised video hashing approaches design hash functions based on pairwise similarity or triple relationships and focus on local information, which results in low retrieval accuracy. In this work, we propose a novel supervised framework called discriminative codebook hashing (DCH) for large-scale video retrieval. The proposed DCH encourages samples within the same category to converge to the same code word and maximizes the mutual distances among different categories. Specifically, we first propose the discriminative codebook via a predefined distance among intercode words and Bernoulli distributions to handle each hash bit. Then, we use the composite Kullback–Leibler (KL) divergence to align the neighborhood structures between the high-dimensional space and the Hamming space. The proposed DCH is optimized via the gradient descent algorithm. Experimental results on three widely used video datasets verify that our proposed DCH performs better than several state-of-the-art methods.
Collapse
|
13
|
Gou J, Song J, Du L, Zeng S, Zhan Y, Yi Z. Class mean‐weighted discriminative collaborative representation for classification. INT J INTELL SYST 2021. [DOI: 10.1002/int.22411] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Jianping Gou
- School of Computer Science and Communication Engineering, Jiangsu Key Laboratory of Security Tech. for Industrial Cyberspace Jiangsu University Zhenjiang China
| | - Jun Song
- School of Computer Science and Communication Engineering, Jiangsu Key Laboratory of Security Tech. for Industrial Cyberspace Jiangsu University Zhenjiang China
| | - Lan Du
- Faculty of Information Technology Monash University Melbourne Australia
| | - Shaoning Zeng
- School of Information Science and Technology Huizhou University Huizhou China
| | - Yongzhao Zhan
- School of Computer Science and Communication Engineering, Jiangsu Key Laboratory of Security Tech. for Industrial Cyberspace Jiangsu University Zhenjiang China
| | - Zhang Yi
- School of Computer Science Sichuan University Chengdu China
| |
Collapse
|
14
|
Lan R, Sun L, Liu Z, Lu H, Pang C, Luo X. MADNet: A Fast and Lightweight Network for Single-Image Super Resolution. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:1443-1453. [PMID: 32149667 DOI: 10.1109/tcyb.2020.2970104] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Recently, deep convolutional neural networks (CNNs) have been successfully applied to the single-image super-resolution (SISR) task with great improvement in terms of both peak signal-to-noise ratio (PSNR) and structural similarity (SSIM). However, most of the existing CNN-based SR models require high computing power, which considerably limits their real-world applications. In addition, most CNN-based methods rarely explore the intermediate features that are helpful for final image recovery. To address these issues, in this article, we propose a dense lightweight network, called MADNet, for stronger multiscale feature expression and feature correlation learning. Specifically, a residual multiscale module with an attention mechanism (RMAM) is developed to enhance the informative multiscale feature representation ability. Furthermore, we present a dual residual-path block (DRPB) that utilizes the hierarchical features from original low-resolution images. To take advantage of the multilevel features, dense connections are employed among blocks. The comparative results demonstrate the superior performance of our MADNet model while employing considerably fewer multiadds and parameters.
Collapse
|
15
|
Zhang L, Song L, Du B, Zhang Y. Nonlocal Low-Rank Tensor Completion for Visual Data. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:673-685. [PMID: 31021816 DOI: 10.1109/tcyb.2019.2910151] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this paper, we propose a novel nonlocal patch tensor-based visual data completion algorithm and analyze its potential problems. Our algorithm consists of two steps: the first step is initializing the image with triangulation-based linear interpolation and the second step is grouping similar nonlocal patches as a tensor then applying the proposed tensor completion technique. Specifically, with treating a group of patch matrices as a tensor, we impose the low-rank constraint on the tensor through the recently proposed tensor nuclear norm. Moreover, we observe that after the first interpolation step, the image gets blurred and, thus, the similar patches we have found may not exactly match the reference. We name the problem "Patch Mismatch," and then in order to avoid the error caused by it, we further decompose the patch tensor into a low-rank tensor and a sparse tensor, which means the accepted horizontal strips in mismatched patches. Furthermore, our theoretical analysis shows that the error caused by Patch Mismatch can be decomposed into two components, one of which can be bounded by a reasonable assumption named local patch similarity, and the other part is lower than that using matrix completion. Extensive experimental results on real-world datasets verify our method's superiority to the state-of-the-art tensor-based image inpainting methods.
Collapse
|
16
|
Sun Z, Wang Y, Cai Z, Liu T, Tong X, Jiang N. A two‐stage privacy protection mechanism based on blockchain in mobile crowdsourcing. INT J INTELL SYST 2021. [DOI: 10.1002/int.22371] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Zice Sun
- The School of Computer and Control Engineering, Yantai University Yantai Shandong China
| | - Yingjie Wang
- The School of Computer and Control Engineering, Yantai University Yantai Shandong China
| | - Zhipeng Cai
- The Department of Computer Science Georgia State University Atlanta Georgia USA
| | - Tianen Liu
- The School of Computer and Control Engineering, Yantai University Yantai Shandong China
| | - Xiangrong Tong
- The School of Computer and Control Engineering, Yantai University Yantai Shandong China
| | - Nan Jiang
- The Department of Internet of Things East China Jiaotong University Nanchang China
| |
Collapse
|
17
|
Li ZQ, Sun J, Wu XJ, Yin HF. Multiplication fusion of sparse and collaborative-competitive representation for image classification. INT J MACH LEARN CYB 2020. [DOI: 10.1007/s13042-020-01123-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
18
|
Lan R, Hu X, Pang C, Liu Z, Luo X. Multi-scale single image rain removal using a squeeze-and-excitation residual network. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
19
|
Li B, Lai YK, Rosin PL. Sparse Graph Regularized Mesh Color Edit Propagation. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; 29:5408-5419. [PMID: 32203021 DOI: 10.1109/tip.2020.2980962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Mesh color edit propagation aims to propagate the color from a few color strokes to the whole mesh, which is useful for mesh colorization, color enhancement and color editing, etc. Compared with image edit propagation, luminance information is not available for 3D mesh data, so the color edit propagation is more difficult on 3D meshes than images, with far less research carried out. This paper proposes a novel solution based on sparse graph regularization. Firstly, a few color strokes are interactively drawn by the user, and then the color will be propagated to the whole mesh by minimizing a sparse graph regularized nonlinear energy function. The proposed method effectively measures geometric similarity over shapes by using a set of complementary multiscale feature descriptors, and effectively controls color bleeding via a sparse ℓ1 optimization rather than quadratic minimization used in existing work. The proposed framework can be applied for the task of interactive mesh colorization, mesh color enhancement and mesh color editing. Extensive qualitative and quantitative experiments show that the proposed method outperforms the state-of-the-art methods.
Collapse
|
20
|
Abstract
Small scale face detection is a very difficult problem. In order to achieve a higher detection accuracy, we propose a novel method, termed SE-IYOLOV3, for small scale face in this work. In SE-IYOLOV3, we improve the YOLOV3 first, in which the anchorage box with a higher average intersection ratio is obtained by combining niche technology on the basis of the k-means algorithm. An upsampling scale is added to form a face network structure that is suitable for detecting dense small scale faces. The number of prediction boxes is five times more than the YOLOV3 network. To further improve the detection performance, we adopt the SENet structure to enhance the global receptive field of the network. The experimental results on the WIDERFACEdataset show that the IYOLOV3 network embedded in the SENet structure can significantly improve the detection accuracy of dense small scale faces.
Collapse
|