Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Torralba A, Fergus R, Freeman WT. 80 million tiny images: a large data set for nonparametric object and scene recognition. IEEE Trans Pattern Anal Mach Intell 2008;30:1958-1970. [PMID: 18787244 DOI: 10.1109/tpami.2008.128] [Citation(s) in RCA: 313] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

For:	Torralba A, Fergus R, Freeman WT. 80 million tiny images: a large data set for nonparametric object and scene recognition. IEEE Trans Pattern Anal Mach Intell 2008;30:1958-1970. [PMID: 18787244 DOI: 10.1109/tpami.2008.128] [Citation(s) in RCA: 313] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Number

Cited by Other Article(s)

Yang J, Lai S, Wang X, Wang Y, Qian X. Diversity-Learning Block: Conquer Feature Homogenization of Multibranch. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024;35:7563-7576. [PMID: 36322499 DOI: 10.1109/tnnls.2022.3214993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Wu S, Lv X, Liu Y, Jiang M, Li X, Jiang D, Yu J, Gong Y, Jiang R. Enhanced SSD framework for detecting defects in cigarette appearance using variational Bayesian inference under limited sample conditions. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024;21:3281-3303. [PMID: 38454728 DOI: 10.3934/mbe.2024145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]

Wang S, Zhao J, Cai Y, Li Y, Qi X, Qiu X, Yao X, Tian Y, Zhu Y, Cao W, Zhang X. A method for small-sized wheat seedlings detection: from annotation mode to model construction. PLANT METHODS 2024;20:15. [PMID: 38287423 PMCID: PMC10826033 DOI: 10.1186/s13007-024-01147-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 01/23/2024] [Indexed: 01/31/2024]

Affiliation(s)

Suwan Wang National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
Jianqing Zhao College of Geography, Jiangsu Second Normal University, Nanjing, 211200, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
Yucheng Cai National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
Yan Li National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
Xuerui Qi National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
Xiaolei Qiu National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China Jiangsu Key Laboratory for Information Agriculture, Nanjing, 210095, China
Xia Yao National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China Jiangsu Key Laboratory for Information Agriculture, Nanjing, 210095, China
Yongchao Tian National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Jiangsu Collaborative Innovation Center for Modern Crop Production, Nanjing, 210095, China
Yan Zhu National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
Weixing Cao National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
Xiaohu Zhang National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China. Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China. Jiangsu Collaborative Innovation Center for Modern Crop Production, Nanjing, 210095, China.

Collapse

Gao R, Ma Y, Zhao Z, Li B, Zhang J. Real-Time Detection of an Undercarriage Based on Receptive Field Blocks and Coordinate Attention. SENSORS (BASEL, SWITZERLAND) 2023;23:9861. [PMID: 38139707 PMCID: PMC10747497 DOI: 10.3390/s23249861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 11/24/2023] [Accepted: 12/11/2023] [Indexed: 12/24/2023]

Abstract

Currently, aeroplane images captured by camera sensors are characterized by their small size and intricate backgrounds, posing a challenge for existing deep learning algorithms in effectively detecting small targets. This paper incorporates the RFBNet (a coordinate attention mechanism) and the SIOU loss function into the YOLOv5 algorithm to address this issue. The result is developing the model for aeroplane and undercarriage detection. The primary goal is to synergize camera sensors with deep learning algorithms, improving image capture precision. YOLOv5-RSC enhances three aspects: firstly, it introduces the receptive field block based on the backbone network, increasing the size of the receptive field of the feature map, enhancing the connection between shallow and deep feature maps, and further improving the model's utilization of feature information. Secondly, the coordinate attention mechanism is added to the feature fusion network to assist the model in more accurately locating the targets of interest, considering attention in the channel and spatial dimensions. This enhances the model's attention to key information and improves detection precision. Finally, the SIoU bounding box loss function is adopted to address the issue of IoU's insensitivity to scale and increase the speed of model bounding box convergence. Subsequently, the Basler camera experimental platform was constructed for experimental verification. The results demonstrate that the AP values of the YOLOv5-RSC detection model for aeroplane and undercarriage are 92.4% and 80.5%, respectively. The mAP value is 86.4%, which is 2.0%, 5.4%, and 3.7% higher than the original YOLOv5 algorithm, respectively, with a detection speed reaching 89.2 FPS. These findings indicate that the model exhibits high detection precision and speed, providing a valuable reference for aeroplane undercarriage detection.

Collapse

Nadler EO, Darragh-Ford E, Desikan BS, Conaway C, Chu M, Hull T, Guilbeault D. Divergences in color perception between deep neural networks and humans. Cognition 2023;241:105621. [PMID: 37716312 DOI: 10.1016/j.cognition.2023.105621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 06/23/2023] [Accepted: 09/09/2023] [Indexed: 09/18/2023]

Abstract

Deep neural networks (DNNs) are increasingly proposed as models of human vision, bolstered by their impressive performance on image classification and object recognition tasks. Yet, the extent to which DNNs capture fundamental aspects of human vision such as color perception remains unclear. Here, we develop novel experiments for evaluating the perceptual coherence of color embeddings in DNNs, and we assess how well these algorithms predict human color similarity judgments collected via an online survey. We find that state-of-the-art DNN architectures - including convolutional neural networks and vision transformers - provide color similarity judgments that strikingly diverge from human color judgments of (i) images with controlled color properties, (ii) images generated from online searches, and (iii) real-world images from the canonical CIFAR-10 dataset. We compare DNN performance against an interpretable and cognitively plausible model of color perception based on wavelet decomposition, inspired by foundational theories in computational neuroscience. While one deep learning model - a convolutional DNN trained on a style transfer task - captures some aspects of human color perception, our wavelet algorithm provides more coherent color embeddings that better predict human color judgments compared to all DNNs we examine. These results hold when altering the high-level visual task used to train similar DNN architectures (e.g., image classification versus image segmentation), as well as when examining the color embeddings of different layers in a given DNN architecture. These findings break new ground in the effort to analyze the perceptual representations of machine learning algorithms and to improve their ability to serve as cognitively plausible models of human vision. Implications for machine learning, human perception, and embodied cognition are discussed.

Collapse

Kim H, Lee W, Lee S, Lee J. Bridged adversarial training. Neural Netw 2023;167:266-282. [PMID: 37666185 DOI: 10.1016/j.neunet.2023.08.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 06/07/2023] [Accepted: 08/13/2023] [Indexed: 09/06/2023]

Sun C, Chen J, Li Y, Wang W, Ma T. Random pruning: channel sparsity by expectation scaling factor. PeerJ Comput Sci 2023;9:e1564. [PMID: 37705629 PMCID: PMC10495938 DOI: 10.7717/peerj-cs.1564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 08/13/2023] [Indexed: 09/15/2023]

Wei M, Zhou Y, Li Z, Xu X. Class-imbalanced complementary-label learning via weighted loss. Neural Netw 2023;166:555-565. [PMID: 37586256 DOI: 10.1016/j.neunet.2023.07.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 06/17/2023] [Accepted: 07/23/2023] [Indexed: 08/18/2023]

Xia K, Lv Z, Liu K, Lu Z, Zhou C, Zhu H, Chen X. Global contextual attention augmented YOLO with ConvMixer prediction heads for PCB surface defect detection. Sci Rep 2023;13:9805. [PMID: 37328545 DOI: 10.1038/s41598-023-36854-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 06/11/2023] [Indexed: 06/18/2023] Open

Xue P, Lu Y, Chang J, Wei X, Wei Z. IR$$^2$$Net: information restriction and information recovery for accurate binary neural networks. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08495-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]

Fan J, Zeng Y. Challenging deep learning models with image distortion based on the abutting grating illusion. PATTERNS (NEW YORK, N.Y.) 2023;4:100695. [PMID: 36960449 PMCID: PMC10028432 DOI: 10.1016/j.patter.2023.100695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 11/07/2022] [Accepted: 02/01/2023] [Indexed: 03/06/2023]

Picot M, Messina F, Boudiaf M, Labeau F, Ayed IB, Piantanida P. Adversarial Robustness Via Fisher-Rao Regularization. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023;45:2698-2710. [PMID: 35552150 DOI: 10.1109/tpami.2022.3174724] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Image synthesis: a review of methods, datasets, evaluation metrics, and future outlook. Artif Intell Rev 2023. [DOI: 10.1007/s10462-023-10434-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]

Chen X, Li Y, Chen C. An Online Hashing Algorithm for Image Retrieval Based on Optical-Sensor Network. SENSORS (BASEL, SWITZERLAND) 2023;23:2576. [PMID: 36904780 PMCID: PMC10007520 DOI: 10.3390/s23052576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 02/20/2023] [Accepted: 02/24/2023] [Indexed: 06/18/2023]

Regularization-based pruning of irrelevant weights in deep neural architectures. APPL INTELL 2023. [DOI: 10.1007/s10489-022-04353-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Texture and material classification with multi-scale ternary and septenary patterns. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2022. [DOI: 10.1016/j.jksuci.2022.12.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Tuggener L, Schmidhuber J, Stadelmann T. Is it enough to optimize CNN architectures on ImageNet? FRONTIERS IN COMPUTER SCIENCE 2022. [DOI: 10.3389/fcomp.2022.1041703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Mi JX, Wang XD, Zhou LF, Cheng K. Adversarial Examples based on Object Detection tasks: A Survey. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.10.046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

WSAGrad: a novel adaptive gradient based method. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04205-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Naeem A, Anees T, Ahmed KT, Naqvi RA, Ahmad S, Whangbo T. Deep learned vectors’ formation using auto-correlation, scaling, and derivations with CNN for complex and huge image retrieval. COMPLEX INTELL SYST 2022. [DOI: 10.1007/s40747-022-00866-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/10/2022]

Lu Y, Zhang Z, Lu G, Zhou Y, Li J, Zhang D. Addi-Reg: A Better Generalization-Optimization Tradeoff Regularization Method for Convolutional Neural Networks. IEEE TRANSACTIONS ON CYBERNETICS 2022;52:10827-10842. [PMID: 33750731 DOI: 10.1109/tcyb.2021.3062881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Abstract

In convolutional neural networks (CNNs), generating noise for the intermediate feature is a hot research topic in improving generalization. The existing methods usually regularize the CNNs by producing multiplicative noise (regularization weights), called multiplicative regularization (Multi-Reg). However, Multi-Reg methods usually focus on improving generalization but fail to jointly consider optimization, leading to unstable learning with slow convergence. Moreover, Multi-Reg methods are not flexible enough since the regularization weights are generated from a definite manual-design distribution. Besides, most popular methods are not universal enough, because these methods are only designed for the residual networks. In this article, we, for the first time, experimentally and theoretically explore the nature of generating noise in the intermediate features for popular CNNs. We demonstrate that injecting noise in the feature space can be transformed to generating noise in the input space, and these methods regularize the networks in a Mini-batch in Mini-batch (MiM) sampling manner. Based on these observations, this article further discovers that generating multiplicative noise can easily degenerate the optimization due to its high dependence on the intermediate feature. Based on these studies, we propose a novel additional regularization (Addi-Reg) method, which can adaptively produce additional noise with low dependence on intermediate feature in CNNs by employing a series of mechanisms. Particularly, these well-designed mechanisms can stabilize the learning process in training, and our Addi-Reg method can pertinently learn the noise distributions for every layer in CNNs. Extensive experiments demonstrate that the proposed Addi-Reg method is more flexible and universal, and meanwhile achieves better generalization performance with faster convergence against the state-of-the-art Multi-Reg methods.

Collapse

Schwarz Schuler JP, Also SR, Puig D, Rashwan H, Abdel-Nasser M. An Enhanced Scheme for Reducing the Complexity of Pointwise Convolutions in CNNs for Image Classification Based on Interleaved Grouped Filters without Divisibility Constraints. ENTROPY (BASEL, SWITZERLAND) 2022;24:1264. [PMID: 36141151 PMCID: PMC9497893 DOI: 10.3390/e24091264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Revised: 09/01/2022] [Accepted: 09/05/2022] [Indexed: 06/16/2023]

Abstract

In image classification with Deep Convolutional Neural Networks (DCNNs), the number of parameters in pointwise convolutions rapidly grows due to the multiplication of the number of filters by the number of input channels that come from the previous layer. Existing studies demonstrated that a subnetwork can replace pointwise convolutional layers with significantly fewer parameters and fewer floating-point computations, while maintaining the learning capacity. In this paper, we propose an improved scheme for reducing the complexity of pointwise convolutions in DCNNs for image classification based on interleaved grouped filters without divisibility constraints. The proposed scheme utilizes grouped pointwise convolutions, in which each group processes a fraction of the input channels. It requires a number of channels per group as a hyperparameter Ch. The subnetwork of the proposed scheme contains two consecutive convolutional layers K and L, connected by an interleaving layer in the middle, and summed at the end. The number of groups of filters and filters per group for layers K and L is determined by exact divisions of the original number of input channels and filters by Ch. If the divisions were not exact, the original layer could not be substituted. In this paper, we refine the previous algorithm so that input channels are replicated and groups can have different numbers of filters to cope with non exact divisibility situations. Thus, the proposed scheme further reduces the number of floating-point computations (11%) and trainable parameters (10%) achieved by the previous method. We tested our optimization on an EfficientNet-B0 as a baseline architecture and made classification tests on the CIFAR-10, Colorectal Cancer Histology, and Malaria datasets. For each dataset, our optimization achieves a saving of 76%, 89%, and 91% of the number of trainable parameters of EfficientNet-B0, while keeping its test classification accuracy.

Collapse

Auditory Speech Based Alerting System for Detecting Dummy Number Plate via Video Processing Data sets. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022;2022:4423744. [PMID: 36093477 PMCID: PMC9462979 DOI: 10.1155/2022/4423744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/29/2022] [Accepted: 07/26/2022] [Indexed: 11/17/2022]

Elephant motorbikes and too many neckties: epistemic spatialization as a framework for investigating patterns of bias in convolutional neural networks. AI & SOCIETY 2022. [DOI: 10.1007/s00146-022-01542-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]

Mohamed E, Sirlantzis K, Howells G, Hoque S. Optimisation of Deep Learning Small-Object Detectors with Novel Explainable Verification. SENSORS 2022;22:s22155596. [PMID: 35898097 PMCID: PMC9330345 DOI: 10.3390/s22155596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 07/13/2022] [Accepted: 07/18/2022] [Indexed: 11/16/2022]

Abstract

In this paper, we present a novel methodology based on machine learning for identifying the most appropriate from a set of available state-of-the-art object detectors for a given application. Our particular interest is to develop a road map for identifying verifiably optimal selections, especially for challenging applications such as detecting small objects in a mixed-size object dataset. State-of-the-art object detection systems often find the localisation of small-size objects challenging since most are usually trained on large-size objects. These contain abundant information as they occupy a large number of pixels relative to the total image size. This fact is normally exploited by the model during training and inference processes. To dissect and understand this process, our approach systematically examines detectors’ performances using two very distinct deep convolutional networks. The first is the single-stage YOLO V3 and the second is the double-stage Faster R-CNN. Specifically, our proposed method explores and visually illustrates the impact of feature extraction layers, number of anchor boxes, data augmentation, etc., utilising ideas from the field of explainable Artificial Intelligence (XAI). Our results, for example, show that multi-head YOLO V3 detectors trained using augmented data produce better performance even with a fewer number of anchor boxes. Moreover, robustness regarding the detector’s ability to explain how a specific decision was reached is investigated using different explanation techniques. Finally, two new visualisation techniques are proposed, WS-Grad and Concat-Grad, for identifying explanation cues of different detectors. These are applied to specific object detection tasks to illustrate their reliability and transparency with respect to the decision process. It is shown that the proposed techniques can result in high resolution and comprehensive heatmaps of the image areas, significantly affecting detector decisions as compared to the state-of-the-art techniques tested.

Collapse

Approximate Nearest Neighbor Search Using Enhanced Accumulative Quantization. ELECTRONICS 2022. [DOI: 10.3390/electronics11142236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Tripathy PK, Shrivastava A, Agarwal V, Shah DU, L. CSR, Akilandeeswari S. Federated learning algorithm based on matrix mapping for data privacy over edge computing. INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS 2022. [DOI: 10.1108/ijpcc-03-2022-0113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

SFCC: Data Augmentation with Stratified Fourier Coefficients Combination for Time Series Classification. Neural Process Lett 2022. [DOI: 10.1007/s11063-022-10965-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]

Hofmann M, Mader P. Synaptic Scaling-An Artificial Neural Network Regularization Inspired by Nature. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022;33:3094-3108. [PMID: 33502984 DOI: 10.1109/tnnls.2021.3050422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Zhang L, Su G, Yin J, Li Y, Lin Q, Zhang X, Shao L. Bioinspired Scene Classification by Deep Active Learning With Remote Sensing Applications. IEEE TRANSACTIONS ON CYBERNETICS 2022;52:5682-5694. [PMID: 33635802 DOI: 10.1109/tcyb.2020.2981480] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Abstract

Accurately classifying sceneries with different spatial configurations is an indispensable technique in computer vision and intelligent systems, for example, scene parsing, robot motion planning, and autonomous driving. Remarkable performance has been achieved by the deep recognition models in the past decade. As far as we know, however, these deep architectures are incapable of explicitly encoding the human visual perception, that is, the sequence of gaze movements and the subsequent cognitive processes. In this article, a biologically inspired deep model is proposed for scene classification, where the human gaze behaviors are robustly discovered and represented by a unified deep active learning (UDAL) framework. More specifically, to characterize objects' components with varied sizes, an objectness measure is employed to decompose each scenery into a set of semantically aware object patches. To represent each region at a low level, a local-global feature fusion scheme is developed which optimally integrates multimodal features by automatically calculating each feature's weight. To mimic the human visual perception of various sceneries, we develop the UDAL that hierarchically represents the human gaze behavior by recognizing semantically important regions within the scenery. Importantly, UDAL combines the semantically salient region detection and the deep gaze shifting path (GSP) representation learning into a principled framework, where only the partial semantic tags are required. Meanwhile, by incorporating the sparsity penalty, the contaminated/redundant low-level regional features can be intelligently avoided. Finally, the learned deep GSP features from the entire scene images are integrated to form an image kernel machine, which is subsequently fed into a kernel SVM to classify different sceneries. Experimental evaluations on six well-known scenery sets (including remote sensing images) have shown the competitiveness of our approach.

Collapse

Mo Y, Wu Y, Yang X, Liu F, Liao Y. Review the state-of-the-art technologies of semantic segmentation based on deep learning. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.01.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Algan G, Ulusoy I. MetaLabelNet: Learning to Generate Soft-Labels From Noisy-Labels. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022;31:4352-4362. [PMID: 35731778 DOI: 10.1109/tip.2022.3183841] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Xu M, Zhang T, Li Z, Zhang D. InfoAT: Improving Adversarial Training Using the Information Bottleneck Principle. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022;PP:1255-1264. [PMID: 35731762 DOI: 10.1109/tnnls.2022.3183095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Lu Y, Zhang L, Yang X, Zhou Y. Efficient Harmonic Neural Networks With Compound Discrete Cosine Transform Filters and Shared Reconstruction Filters. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022;PP:693-707. [PMID: 35622805 DOI: 10.1109/tnnls.2022.3176611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

A Novel Hierarchical Adaptive Feature Fusion Method for Meta-Learning. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12115458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]

REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets. Int J Comput Vis 2022. [DOI: 10.1007/s11263-022-01625-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Sagong MC, Yeo YJ, Shin YG, Ko SJ. Conditional Convolution Projecting Latent Vectors on Condition-Specific Space. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022;PP:1386-1393. [PMID: 35584073 DOI: 10.1109/tnnls.2022.3172512] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Yeo YJ, Shin YG, Park S, Ko SJ. Simple Yet Effective Way for Improving the Performance of GAN. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022;33:1811-1818. [PMID: 33385312 DOI: 10.1109/tnnls.2020.3045000] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Convy I, Huggins W, Liao H, Whaley KB. Mutual Information Scaling for Tensor Network Machine Learning. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022;3:015017. [PMID: 35211672 PMCID: PMC8862112 DOI: 10.1088/2632-2153/ac44a9] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Self-distribution binary neural networks. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03348-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Kataoka H, Okayasu K, Matsumoto A, Yamagata E, Yamada R, Inoue N, Nakamura A, Satoh Y. Pre-Training Without Natural Images. Int J Comput Vis 2022. [DOI: 10.1007/s11263-021-01555-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Abstract AbstractIs it possible to use convolutional neural networks pre-trained without any natural images to assist natural image understanding? The paper proposes a novel concept, Formula-driven Supervised Learning (FDSL). We automatically generate image patterns and their category labels by assigning fractals, which are based on a natural law. Theoretically, the use of automatically generated images instead of natural images in the pre-training phase allows us to generate an infinitely large dataset of labeled images. The proposed framework is similar yet different from Self-Supervised Learning because the FDSL framework enables the creation of image patterns based on any mathematical formulas in addition to self-generated labels. Further, unlike pre-training with a synthetic image dataset, a dataset under the framework of FDSL is not required to define object categories, surface texture, lighting conditions, and camera viewpoint. In the experimental section, we find a better dataset configuration through an exploratory study, e.g., increase of #category/#instance, patch rendering, image coloring, and training epoch. Although models pre-trained with the proposed Fractal DataBase (FractalDB), a database without natural images, do not necessarily outperform models pre-trained with human annotated datasets in all settings, we are able to partially surpass the accuracy of ImageNet/Places pre-trained models. The FractalDB pre-trained CNN also outperforms other pre-trained models on auto-generated datasets based on FDSL such as Bezier curves and Perlin noise. This is reasonable since natural objects and scenes existing around us are constructed according to fractal geometry. Image representation with the proposed FractalDB captures a unique feature in the visualization of convolutional layers and attentions. Collapse

Milde MB, Afshar S, Xu Y, Marcireau A, Joubert D, Ramesh B, Bethi Y, Ralph NO, El Arja S, Dennler N, van Schaik A, Cohen G. Neuromorphic Engineering Needs Closed-Loop Benchmarks. Front Neurosci 2022;16:813555. [PMID: 35237122 PMCID: PMC8884247 DOI: 10.3389/fnins.2022.813555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Accepted: 01/24/2022] [Indexed: 12/02/2022] Open

Large-Scale Data Clustering Using Manifold-Regularized Ensemble of Posterior in GAN. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2022. [DOI: 10.1007/s13369-021-05809-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

PConv: simple yet effective convolutional layer for generative adversarial network. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06846-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Ashraf M, Robles WRQ, Kim M, Ko YS, Yi MY. A loss-based patch label denoising method for improving whole-slide image analysis using a convolutional neural network. Sci Rep 2022;12:1392. [PMID: 35082315 PMCID: PMC8791954 DOI: 10.1038/s41598-022-05001-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 01/05/2022] [Indexed: 12/24/2022] Open

Boundary-Aware Hashing for Hamming Space Retrieval. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12010508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Salari A, Djavadifar A, Liu XR, Najjaran H. Object recognition datasets and challenges: A review. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.01.022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Mantegazza D, Giusti A, Gambardella LM, Guzzi J. An Outlier Exposure Approach to Improve Visual Anomaly Detection Performance for Mobile Robots. IEEE Robot Autom Lett 2022. [DOI: 10.1109/lra.2022.3192794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Szadkowski R, Drchal J, Faigl J. Continually trained life-long classification. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06154-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Singh R, Dubey AK, Kapoor R. Deep Neural Network Regularization (DNNR) on Denoised Image. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES 2022. [DOI: 10.4018/ijiit.309584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]