1
|
Zhu Y, Wang Y, Chen H, Guo Z, Huang Q. Large-Scale Image Retrieval with Deep Attentive Global Features. Int J Neural Syst 2023; 33:2350013. [PMID: 36846979 DOI: 10.1142/s0129065723500132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
How to obtain discriminative features has proved to be a core problem for image retrieval. Many recent works use convolutional neural networks to extract features. However, clutter and occlusion will interfere with the distinguishability of features when using convolutional neural network (CNN) for feature extraction. To address this problem, we intend to obtain high-response activations in the feature map based on the attention mechanism. We propose two attention modules, a spatial attention module and a channel attention module. For the spatial attention module, we first capture the global information and model the relation between channels as a region evaluator, which evaluates and assigns new weights to local features. For the channel attention module, we use a vector with trainable parameters to weight the importance of each feature map. The two attention modules are cascaded to adjust the weight distribution for the feature map, which makes the extracted features more discriminative. Furthermore, we present a scale and mask scheme to scale the major components and filter out the meaningless local features. This scheme can reduce the disadvantages of the various scales of the major components in images by applying multiple scale filters, and filter out the redundant features with the MAX-Mask. Exhaustive experiments demonstrate that the two attention modules are complementary to improve performance, and our network with the three modules outperforms the state-of-the-art methods on four well-known image retrieval datasets.
Collapse
Affiliation(s)
- Yingying Zhu
- College of Computer Science and Software Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, Guangdong 518060, P. R. China
| | - Yinghao Wang
- College of Computer Science and Software Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, Guangdong 518060, P. R. China
| | - Haonan Chen
- College of Computer Science and Software Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, Guangdong 518060, P. R. China
| | - Zemian Guo
- College of Computer Science and Software Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, Guangdong 518060, P. R. China
| | - Qiang Huang
- College of Computer Science and Software Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, Guangdong 518060, P. R. China
| |
Collapse
|
2
|
Maisano R, Foresti GL. A Sentiment Analysis Anomaly Detection System for Cyber Intelligence. Int J Neural Syst 2023; 33:2350003. [PMID: 36585854 DOI: 10.1142/s012906572350003x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Considering the 2030 United Nations intent of world connection, Cyber Intelligence becomes the main area of the human dimension able of inflicting changes in geopolitical dynamics. In cyberspace, the new battlefield is the mind of people including new weapons like abuse of social media with information manipulation, deception by activists and misinformation. In this paper, a Sentiment Analysis system with Anomaly Detection (SAAD) capability is proposed. The system, scalable and modular, uses an OSINT-Deep Learning approach to investigate on social media sentiment in order to predict suspicious anomaly trend in Twitter posts. Anomaly detection is investigated with a new semi-supervised process that is able to detect potentially dangerous situations in critical areas. The main contributions of the paper are the system suitability for working in different areas and domains, the anomaly detection procedure in sentiment context and a time-dependent confusion matrix to address model evaluation with unbalanced dataset. Real experiments and tests were performed on Sahel Region. The detected anomalies in negative sentiment have been checked by experts of Sahel area, proving true links between the models results and real situations observable from the tweets.
Collapse
Affiliation(s)
- Roberta Maisano
- Computer Science Centre, University of Messina, Piazza Antonello, 2, 98122 Messina, Italy
| | - Gian Luca Foresti
- Department of Mathematics, Computer Science and Physics, University of Udine, Viale delle Scienze, 206, 33100 Udine, Italy
| |
Collapse
|
3
|
Selcuk Nogay H, Adeli H. Diagnostic of autism spectrum disorder based on structural brain MRI images using, grid search optimization, and convolutional neural networks. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
4
|
|
5
|
Developing an Anomaly Detection System for Automatic Defective Products’ Inspection. Processes (Basel) 2022. [DOI: 10.3390/pr10081476] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Since unqualified products cause enterprise revenue losses, product inspection is essential for maintaining manufacturing quality. An automated optical inspection (AOI) system is an efficient tool for product inspection, providing a convenient interface for users to view their products of interest. Specifically, in the screw manufacturing industry, the conventional methods are the human visual inspection of the product and for the inspector to view the product image displayed on the dashboard of the AOI system. However, despite the inspector and the approach used, inspection results strongly depend on the inspector’s experience. Moreover, machine learning algorithms could improve the efficiency of human visual inspection, thus addressing the above problem. Based on these facts, we improved anomaly detection efficiency during product inspection, using product image data from the AOI system to obtain valuable information. This study notably used the visual geometry group network, Inception V3, and Xception algorithms to detect qualified and unqualified products during product image analytics. Therefore, we considered that the analyzed results could be integrated into a proposed cloud system for human–machine interaction. Thus, administrators can receive reminders concerning the anomaly-inspected notification through the proposed cloud system, comprising a message queuing telemetry transport protocol, an application programming interface, and a cloud dashboard. From the experimental results, the above-mentioned algorithms had more than 93% accuracy, especially Xception, which had a better performance during the defective type classification. From our study, the proposed system can successfully apply the obtained data in data communication, anomaly dashboards, and anomaly notifications.
Collapse
|
6
|
De Nardin A, Mishra P, Foresti GL, Piciarelli C. Masked Transformer for image Anomaly Localization. Int J Neural Syst 2022; 32:2250030. [DOI: 10.1142/s0129065722500307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
7
|
Avola D, Cascio M, Cinque L, Fagioli A, Foresti GL. Human Silhouette and Skeleton Video Synthesis Through Wi-Fi signals. Int J Neural Syst 2022; 32:2250015. [PMID: 35209810 DOI: 10.1142/s0129065722500150] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The increasing availability of wireless access points (APs) is leading toward human sensing applications based on Wi-Fi signals as support or alternative tools to the widespread visual sensors, where the signals enable to address well-known vision-related problems such as illumination changes or occlusions. Indeed, using image synthesis techniques to translate radio frequencies to the visible spectrum can become essential to obtain otherwise unavailable visual data. This domain-to-domain translation is feasible because both objects and people affect electromagnetic waves, causing radio and optical frequencies variations. In the literature, models capable of inferring radio-to-visual features mappings have gained momentum in the last few years since frequency changes can be observed in the radio domain through the channel state information (CSI) of Wi-Fi APs, enabling signal-based feature extraction, e.g. amplitude. On this account, this paper presents a novel two-branch generative neural network that effectively maps radio data into visual features, following a teacher-student design that exploits a cross-modality supervision strategy. The latter conditions signal-based features in the visual domain to completely replace visual data. Once trained, the proposed method synthesizes human silhouette and skeleton videos using exclusively Wi-Fi signals. The approach is evaluated on publicly available data, where it obtains remarkable results for both silhouette and skeleton videos generation, demonstrating the effectiveness of the proposed cross-modality supervision strategy.
Collapse
Affiliation(s)
- Danilo Avola
- Department of Computer Science, Sapienza University of Rome Via Salaria, 113, Rome, 00198, Italy
| | - Marco Cascio
- Department of Computer Science, Sapienza University of Rome Via Salaria, 113, Rome, 00198, Italy
| | - Luigi Cinque
- Department of Computer Science, Sapienza University of Rome Via Salaria, 113, Rome, 00198, Italy
| | - Alessio Fagioli
- Department of Computer Science, Sapienza University of Rome Via Salaria, 113, Rome, 00198, Italy
| | - Gian Luca Foresti
- Department of Computer Science, Mathematics and Physics, University of Udine, Via delle Scienze 206, Udine, 33100, Italy
| |
Collapse
|
8
|
Machine learning techniques for diagnosis of alzheimer disease, mild cognitive disorder, and other types of dementia. Biomed Signal Process Control 2022. [DOI: 10.1016/j.bspc.2021.103293] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
9
|
Abstract
In this paper, we introduce VPNet, a novel model-driven neural network architecture based on variable projection (VP). Applying VP operators to neural networks results in learnable features, interpretable parameters, and compact network structures. This paper discusses the motivation and mathematical background of VPNet and presents experiments. The VPNet approach was evaluated in the context of signal processing, where we classified a synthetic dataset and real electrocardiogram (ECG) signals. Compared to fully connected and one-dimensional convolutional networks, VPNet offers fast learning ability and good accuracy at a low computational cost of both training and inference. Based on these advantages and the promising results obtained, we anticipate a profound impact on the broader field of signal processing, in particular on classification, regression and clustering problems.
Collapse
Affiliation(s)
- Péter Kovács
- Department of Numerical Analysis, Eötvös Loránd University, Pázmány Péter stny. 1/C, Budapest 1117, Hungary
| | - Gergő Bognár
- Department of Numerical Analysis, Eötvös Loránd University, Pázmány Péter stny. 1/C, Budapest 1117, Hungary.,Institute of Signal Processing, Johannes Kepler University Linz, Altenberger str. 69, Linz 4040, Austria.,JKU LIT SAL eSPML Lab, Silicon Austria Labs, Altenberger str. 69, Linz 4040, Austria
| | - Christian Huber
- Embedded AI Research Group, Silicon Austria Labs GmbH, Altenberger str. 69, Linz 4040, Austria
| | - Mario Huemer
- Institute of Signal Processing, Johannes Kepler University Linz, Altenberger str. 69, Linz 4040, Austria.,JKU LIT SAL eSPML Lab, Silicon Austria Labs, Altenberger str. 69, Linz 4040, Austria
| |
Collapse
|
10
|
Zhu J, Tan C, Yang J, Yang G, Lio' P. Arbitrary Scale Super-Resolution for Medical Images. Int J Neural Syst 2021; 31:2150037. [PMID: 34304719 DOI: 10.1142/s0129065721500374] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Single image super-resolution (SISR) aims to obtain a high-resolution output from one low-resolution image. Currently, deep learning-based SISR approaches have been widely discussed in medical image processing, because of their potential to achieve high-quality, high spatial resolution images without the cost of additional scans. However, most existing methods are designed for scale-specific SR tasks and are unable to generalize over magnification scales. In this paper, we propose an approach for medical image arbitrary-scale super-resolution (MIASSR), in which we couple meta-learning with generative adversarial networks (GANs) to super-resolve medical images at any scale of magnification in [Formula: see text]. Compared to state-of-the-art SISR algorithms on single-modal magnetic resonance (MR) brain images (OASIS-brains) and multi-modal MR brain images (BraTS), MIASSR achieves comparable fidelity performance and the best perceptual quality with the smallest model size. We also employ transfer learning to enable MIASSR to tackle SR tasks of new medical modalities, such as cardiac MR images (ACDC) and chest computed tomography images (COVID-CT). The source code of our work is also public. Thus, MIASSR has the potential to become a new foundational pre-/post-processing step in clinical image analysis tasks such as reconstruction, image quality enhancement, and segmentation.
Collapse
Affiliation(s)
- Jin Zhu
- Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK
| | - Chuan Tan
- Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK
| | - Junwei Yang
- Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK
| | - Guang Yang
- Cardiovascular Research Centre, Royal Brompton Hospital, London, SW3 6NP, UK.,National Heart and Lung Institute, Imperial College London, London, SW7 2AZ, UK
| | - Pietro Lio'
- Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK
| |
Collapse
|
11
|
Zhang J, Li D, Wang L, Zhang L. One-Shot Neural Architecture Search by Dynamically Pruning Supernet in Hierarchical Order. Int J Neural Syst 2021; 31:2150029. [PMID: 34128778 DOI: 10.1142/s0129065721500295] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Neural Architecture Search (NAS), which aims at automatically designing neural architectures, recently draw a growing research interest. Different from conventional NAS methods, in which a large number of neural architectures need to be trained for evaluation, the one-shot NAS methods only have to train one supernet which synthesizes all the possible candidate architectures. As a result, the search efficiency could be significantly improved by sharing the supernet's weights during the candidate architectures' evaluation. This strategy could greatly speed up the search process but suffer a challenge that the evaluation based on sharing weights is not predictive enough. Recently, pruning the supernet during the search has been proven to be an efficient way to alleviate this problem. However, the pruning direction in complex-structured search space remains unexplored. In this paper, we revisited the role of path dropout strategy, which drops the neural operations instead of the neurons, in supernet training, and several interesting characters of the supernet trained with dropout are found. Based on the observations, a Hierarchically-Ordered Pruning Neural Architecture Search (HOPNAS) algorithm is proposed by dynamically pruning the supernet with a proper pruning direction. Experimental results indicate that our method is competitive with state-of-the-art approaches on CIFAR10 and ImageNet.
Collapse
Affiliation(s)
- Jianwei Zhang
- College of Computer Science, Sichuan University, Section 4, Southern 1st Ring Rd, Chengdu, Sichuan 610065, P. R. China
| | - Dong Li
- College of Computer Science, Sichuan University, Section 4, Southern 1st Ring Rd, Chengdu, Sichuan 610065, P. R. China
| | - Lituan Wang
- College of Computer Science, Sichuan University, Section 4, Southern 1st Ring Rd, Chengdu, Sichuan 610065, P. R. China
| | - Lei Zhang
- College of Computer Science, Sichuan University, Section 4, Southern 1st Ring Rd, Chengdu, Sichuan 610065, P. R. China
| |
Collapse
|
12
|
Kalina J, Neoral A, Vidnerová P. Effective Automatic Method Selection for Nonlinear Regression Modeling. Int J Neural Syst 2021; 31:2150020. [PMID: 33787471 DOI: 10.1142/s0129065721500209] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Metalearning, an important part of artificial intelligence, represents a promising approach for the task of automatic selection of appropriate methods or algorithms. This paper is interested in recommending a suitable estimator for nonlinear regression modeling, particularly in recommending either the standard nonlinear least squares estimator or one of such available alternative estimators, which is highly robust with respect to the presence of outliers in the data. The authors hold the opinion that theoretical considerations will never be able to formulate such recommendations for the nonlinear regression context. Instead, metalearning is explored here as an original approach suitable for this task. In this paper, four different approaches for automatic method selection for nonlinear regression are proposed and computations over a training database of 643 real publicly available datasets are performed. Particularly, while the metalearning results may be harmed by the imbalanced number of groups, an effective approach yields much improved results, performing a novel combination of supervised feature selection by random forest and oversampling by synthetic minority oversampling technique (SMOTE). As a by-product, the computations bring arguments in favor of the very recent nonlinear least weighted squares estimator, which turns out to outperform other (and much more renowned) estimators in a quite large percentage of datasets.
Collapse
Affiliation(s)
- Jan Kalina
- The Czech Academy of Sciences, Institute of Computer Science, Pod Vodárenskou věží 2, 182 07 Prague 8, Czech Republic.,Charles University, Faculty of Mathematics and Physics, Sokolovská 83, 186 75 Prague 8, Czech Republic
| | - Aleš Neoral
- The Czech Academy of Sciences, Institute of Computer Science, Pod Vodárenskou věží 2, 182 07 Prague 8, Czech Republic
| | - Petra Vidnerová
- The Czech Academy of Sciences, Institute of Computer Science, Pod Vodárenskou věží 2, 182 07 Prague 8, Czech Republic
| |
Collapse
|
13
|
Piciarelli C, Mishra P, Foresti GL. Supervised Anomaly Detection with Highly Imbalanced Datasets Using Capsule Networks. INT J PATTERN RECOGN 2021. [DOI: 10.1142/s0218001421520108] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Detecting anomalous patterns in data is a relevant task in many practical applications, such as defective items detection in industrial inspection systems, cancer identification in medical images, or attacker detection in network intrusion detection systems. This paper focuses on detection of anomalous images, this is images that visually deviate from a reference set of regular data. While anomaly detection has been widely studied in the context of classical machine learning, the application of modern deep learning techniques in this field is still limited. We here propose a capsule-based network for anomaly detection in an extremely imbalanced fully supervised context: we assume that anomaly samples are available, but their amount is limited if compared to regular data. By using a variant of the standard CapsNet architecture, we achieved state-of-the-art results on the MNIST, F-MNIST and K-MNIST datasets.
Collapse
Affiliation(s)
| | - Pankaj Mishra
- University of Udine, Via delle Scienze 206, 33100 Udine, Italy
| | | |
Collapse
|