1. Hu Y, Jiang X, Liu X, Luo X, Hu Y, Cao X, Zhang B, Zhang J. Hierarchical Self-Distilled Feature Learning for Fine-Grained Visual Categorization. IEEE Transactions on Neural Networks and Learning Systems 2025; 36:4005-4018. [PMID: 34780336] [DOI: 10.1109/tnnls.2021.3124135]
Abstract
Fine-grained visual categorization (FGVC) relies on hierarchical features extracted by deep convolutional neural networks (CNNs) to recognize highly similar objects. In particular, shallow-layer features containing rich spatial details are vital for specifying subtle differences between objects but are usually inadequately optimized due to gradient vanishing during backpropagation. In this article, hierarchical self-distillation (HSD) is introduced to generate well-optimized CNN features for accurate fine-grained categorization. HSD inherits from the widely applied deep supervision and implements multiple intermediate losses for reinforced gradients. Beyond that, we observe that the hard (one-hot) labels adopted for intermediate supervision hurt FGVC performance by enforcing overly strict supervision. As a solution, HSD performs self-distillation, where soft predictions generated by deeper layers of the network are hierarchically exploited to supervise shallower parts. Moreover, a self-information entropy loss (SIELoss) is designed in HSD to adaptively soften intermediate predictions and facilitate better convergence. In addition, a gradient-detached fusion (GDF) module is incorporated to produce an ensemble result from multiscale features via effective feature fusion. Extensive experiments on four challenging fine-grained datasets show that, with a negligible parameter increase, the proposed HSD framework and the GDF module both bring significant performance gains over different backbones and achieve state-of-the-art classification performance.
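As an illustration of the core idea, below is a minimal PyTorch sketch of hierarchical self-distillation in which each shallower auxiliary classifier is supervised by the softened (temperature-scaled) prediction of the next deeper one, while only the deepest head sees hard labels. The function names, temperature, and loss weighting are illustrative assumptions; the paper's SIELoss softening and GDF module are not reproduced here.

```python
import torch
import torch.nn.functional as F

def self_distillation_loss(shallow_logits, deep_logits, temperature=3.0):
    """KL divergence between a shallow classifier's prediction and the softened
    prediction of a deeper classifier (the teacher is detached so gradients
    only reinforce the shallow branch)."""
    teacher = F.softmax(deep_logits.detach() / temperature, dim=1)
    student = F.log_softmax(shallow_logits / temperature, dim=1)
    return F.kl_div(student, teacher, reduction="batchmean") * temperature ** 2

def hierarchical_hsd_loss(logits_per_stage, labels, temperature=3.0, alpha=0.5):
    """logits_per_stage: list of [B, C] logits ordered shallow -> deep.
    The deepest stage is trained with hard labels; each shallower stage is
    supervised by the softened prediction of the stage above it."""
    loss = F.cross_entropy(logits_per_stage[-1], labels)
    for i in range(len(logits_per_stage) - 1):
        loss = loss + alpha * self_distillation_loss(
            logits_per_stage[i], logits_per_stage[i + 1], temperature)
    return loss
```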
2. Tian G, Sun Y, Liu Y, Zeng X, Wang M, Liu Y, Zhang J, Chen J. Adding Before Pruning: Sparse Filter Fusion for Deep Convolutional Neural Networks via Auxiliary Attention. IEEE Transactions on Neural Networks and Learning Systems 2025; 36:3930-3942. [PMID: 34487502] [DOI: 10.1109/tnnls.2021.3106917]
Abstract
Filter pruning is a significant feature selection technique for shrinking existing feature fusion schemes (especially convolution computation and model size), which helps to develop more efficient feature fusion models while maintaining state-of-the-art performance. In addition, it reduces the storage and computation requirements of deep neural networks (DNNs) and dramatically accelerates inference. Existing methods mainly rely on manual constraints such as normalization to select filters. A typical pipeline comprises two stages: first pruning the original neural network and then fine-tuning the pruned model. However, choosing a manual criterion can be somewhat tricky and stochastic. Moreover, directly regularizing and modifying filters in the pipeline is sensitive to the choice of hyperparameters, making the pruning procedure less robust. To address these challenges, we propose to handle filter pruning in a single stage: an attention-based architecture that adaptively fuses filter selection with filter learning in a unified network. Specifically, we present a pruning method named adding before pruning (ABP) that makes the model focus on the filters of higher significance through training rather than man-made criteria such as norm or rank. First, we add an auxiliary attention layer to the original model and constrain the significance scores in this layer to be binary. Furthermore, to propagate gradients through the auxiliary attention layer, we design a specific gradient estimator and prove its effectiveness for convergence in the computation graph via mathematical derivation. Finally, to relieve the dependence on complicated prior knowledge for designing a thresholding criterion, we simultaneously prune and train the filters to automatically eliminate network redundancy with recoverability. Extensive experimental results on two typical image classification benchmarks, CIFAR-10 and ILSVRC-2012, illustrate that the proposed approach performs favorably against previous state-of-the-art filter pruning algorithms.
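The binary significance scores and custom gradient estimator described above can be illustrated with a straight-through-style sketch: the gate is binarized in the forward pass and gradients are passed through unchanged. This is a generic surrogate, not the paper's exact estimator; the module and parameter names are assumptions.

```python
import torch
import torch.nn as nn

class BinaryGate(torch.autograd.Function):
    """Binarize significance scores in the forward pass; pass gradients straight
    through in the backward pass (a common surrogate for thresholding)."""
    @staticmethod
    def forward(ctx, scores):
        return (scores > 0).float()

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output  # straight-through gradient estimate

class FilterAttention(nn.Module):
    """Auxiliary per-filter gate inserted after a convolution: filters whose
    learned significance drops below zero are masked and can later be pruned."""
    def __init__(self, num_filters):
        super().__init__()
        self.significance = nn.Parameter(torch.ones(num_filters))  # start open

    def forward(self, feature_map):            # feature_map: [B, C, H, W]
        mask = BinaryGate.apply(self.significance)
        return feature_map * mask.view(1, -1, 1, 1)
```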
3. Hao S, Zhou Y, Guo Y, Hong R, Cheng J, Wang M. Real-Time Semantic Segmentation via Spatial-Detail Guided Context Propagation. IEEE Transactions on Neural Networks and Learning Systems 2025; 36:4042-4053. [PMID: 35259119] [DOI: 10.1109/tnnls.2022.3154443]
Abstract
Nowadays, vision-based computing tasks play an important role in various real-world applications. However, many vision tasks, e.g., semantic segmentation, are computationally expensive, posing a challenge to computing systems that are resource-constrained but require fast response times. It is therefore valuable to develop accurate, real-time vision models that require only limited computational resources. To this end, we propose the spatial-detail guided context propagation network (SGCPNet) for real-time semantic segmentation. SGCPNet adopts a strategy of spatial-detail guided context propagation: it uses the spatial details of shallow layers to guide the propagation of low-resolution global contexts, so that the lost spatial information can be effectively reconstructed. In this way, the need to maintain high-resolution features throughout the network is removed, which largely improves model efficiency, while the effective reconstruction of spatial details preserves segmentation accuracy. In the experiments, we validate the effectiveness and efficiency of the proposed SGCPNet. On the Cityscapes dataset, for example, SGCPNet achieves 69.5% mIoU segmentation accuracy at 178.5 FPS on 768 × 1536 images on a GeForce GTX 1080 Ti GPU. In addition, SGCPNet is very lightweight, containing only 0.61 M parameters. The code will be released at https://github.com/zhouyuan888888/SGCPNet.
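A hedged sketch of what detail-guided context propagation can look like in PyTorch: low-resolution context features are upsampled and modulated by a guidance map predicted from high-resolution shallow details. This is a conceptual illustration only; SGCPNet's actual block design is not specified by the abstract, and the layer choices below are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DetailGuidedFusion(nn.Module):
    """Upsample low-resolution context features and modulate them with a
    guidance map predicted from high-resolution shallow detail features."""
    def __init__(self, detail_ch, context_ch, out_ch):
        super().__init__()
        self.guide = nn.Sequential(
            nn.Conv2d(detail_ch, out_ch, 3, padding=1), nn.Sigmoid())
        self.proj_detail = nn.Conv2d(detail_ch, out_ch, 1)
        self.proj_context = nn.Conv2d(context_ch, out_ch, 1)

    def forward(self, detail, context):        # detail: high-res, context: low-res
        context = F.interpolate(self.proj_context(context),
                                size=detail.shape[-2:], mode="bilinear",
                                align_corners=False)
        g = self.guide(detail)                  # spatial-detail guidance in [0, 1]
        return self.proj_detail(detail) + g * context
```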
4. Zhang L, Liu Z, Zhu X, Song Z, Yang X, Lei Z, Qiao H. Weakly Aligned Feature Fusion for Multimodal Object Detection. IEEE Transactions on Neural Networks and Learning Systems 2025; 36:4145-4159. [PMID: 34437075] [DOI: 10.1109/tnnls.2021.3105143]
Abstract
To achieve accurate and robust object detection in real-world scenarios, various forms of images are incorporated, such as color, thermal, and depth. However, multimodal data often suffer from the position shift problem, i.e., the image pair is not strictly aligned, so that one object has different positions in different modalities. For deep learning methods, this problem makes it difficult to fuse multimodal features and complicates convolutional neural network (CNN) training. In this article, we propose a general multimodal detector named aligned region CNN (AR-CNN) to tackle the position shift problem. First, a region feature (RF) alignment module with an adjacent similarity constraint is designed to consistently predict the position shift between two modalities and adaptively align the cross-modal RFs. Second, we propose a novel region of interest (RoI) jitter strategy to improve robustness to unexpected shift patterns. Third, we present a new multimodal feature fusion method that selects the more reliable feature and suppresses the less useful one via feature reweighting. In addition, by locating bounding boxes in both modalities and building their relationships, we provide novel multimodal labeling named KAIST-Paired. Extensive experiments on 2-D and 3-D object detection, RGB-T, and RGB-D datasets demonstrate the effectiveness and robustness of our method.
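The reweighting-based fusion mentioned above can be sketched as a small gating module that predicts per-modality reliability weights and fuses the features by a weighted sum. The gating design below is a generic assumption for illustration, not AR-CNN's exact module.

```python
import torch
import torch.nn as nn

class ModalityReweighting(nn.Module):
    """Predict per-modality reliability weights from the concatenated features
    and fuse the two modalities by a weighted sum."""
    def __init__(self, channels):
        super().__init__()
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * channels, 2, kernel_size=1),
            nn.Softmax(dim=1))

    def forward(self, feat_rgb, feat_thermal):   # both [B, C, H, W]
        w = self.gate(torch.cat([feat_rgb, feat_thermal], dim=1))  # [B, 2, 1, 1]
        return w[:, 0:1] * feat_rgb + w[:, 1:2] * feat_thermal
```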
5. Gao F, Leng J, Gan J, Gao X. RC-DETR: Improving DETRs in crowded pedestrian detection via rank-based contrastive learning. Neural Netw 2025; 182:106911. [PMID: 39612687] [DOI: 10.1016/j.neunet.2024.106911]
Abstract
The variants of DEtection TRansformer (DETRs) have achieved impressive performance in general object detection. However, they suffer notable performance degradation in crowded pedestrian detection. This decline primarily arises during the training phase, where DETRs are constrained solely by pedestrian labels. This limitation leads to indistinguishable image features between visually similar pedestrians and background elements, resulting in incorrect detections. To address this issue, this paper introduces a rank-based contrastive learning method, which constructs an additional, specific constraint for each indistinguishable training sample to produce distinguishable image features. Unlike previous methods that rely solely on pedestrian labels to achieve a consistent confidence score, our approach relies on multiple constraints and aims to ensure the correct rank of detection results, with the confidence scores of pedestrians consistently surpassing those of background elements. Specifically, we first filter out training samples that could interfere with our delineation of indistinguishable and distinguishable training samples. Then, based on the confidence score rank, we divide the remaining training samples into distinguishable positive and negative samples and indistinguishable positive and negative samples. Finally, we combine these training samples into multiple positive and negative pairs and use these sample pairs to train DETRs via contrastive learning. Our method can be plugged into any DETR and adds no inference overhead. Extensive experiments on three DETRs show that our method achieves superior performance. In particular, on the CrowdHuman dataset, our method achieves a state-of-the-art 38.9% MR.
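A minimal sketch of the ranking idea: each pedestrian sample is paired with background samples and a pairwise softmax encourages the pedestrian confidence to win. The classifier head and pairing scheme here are assumptions; RC-DETR's actual sample filtering and contrastive pair construction are more elaborate.

```python
import torch
import torch.nn.functional as F

def rank_contrastive_loss(pos_emb, neg_emb, classifier, temperature=0.1):
    """For each (pedestrian, background) pair, encourage the pedestrian
    embedding to score higher than the background one via a two-way softmax.
    pos_emb: [P, D] pedestrian query features; neg_emb: [N, D] background
    query features; classifier: any module mapping D -> 1."""
    pos_scores = classifier(pos_emb).squeeze(-1) / temperature   # [P]
    neg_scores = classifier(neg_emb).squeeze(-1) / temperature   # [N]
    # Every pedestrian sample is paired with every background sample.
    pair_logits = torch.stack(
        [pos_scores.unsqueeze(1).expand(-1, neg_scores.numel()),
         neg_scores.unsqueeze(0).expand(pos_scores.numel(), -1)], dim=-1)  # [P, N, 2]
    target = torch.zeros(pair_logits.shape[:2], dtype=torch.long,
                         device=pair_logits.device)  # index 0 = pedestrian wins
    return F.cross_entropy(pair_logits.reshape(-1, 2), target.reshape(-1))
```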
Affiliation(s)
- Feng Gao
- Chongqing Key Laboratory of Image Cognition, College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
- Jiaxu Leng
- Chongqing Key Laboratory of Image Cognition, College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
- Ji Gan
- Chongqing Key Laboratory of Image Cognition, College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
- Xinbo Gao
- Chongqing Key Laboratory of Image Cognition, College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
6. He L, Li M, Wang X, Wu X, Yue G, Wang T, Zhou Y, Lei B, Zhou G. Morphology-based deep learning enables accurate detection of senescence in mesenchymal stem cell cultures. BMC Biol 2024; 22:1. [PMID: 38167069] [PMCID: PMC10762950] [DOI: 10.1186/s12915-023-01780-2]
Abstract
BACKGROUND Cell senescence is a sign of aging and plays a significant role in the pathogenesis of age-related disorders. For cell therapy, senescence may compromise the quality and efficacy of cells, posing potential safety risks. Mesenchymal stem cells (MSCs) are currently undergoing extensive research for cell therapy, necessitating the development of effective methods to evaluate senescence. Senescent MSCs exhibit a distinctive morphology that can be used for detection. However, morphological assessment during MSC production is often subjective and uncertain. New tools are required for the reliable evaluation of senescent single cells on a large scale in live imaging of MSCs. RESULTS We have developed a morphology-based Cascade region-based convolutional neural network (Cascade R-CNN) system for detecting senescent MSCs, which can automatically locate single cells of different sizes and shapes in multicellular images and assess their senescence state. Additionally, we tested the applicability of the Cascade R-CNN system to MSC senescence and examined the correlation between morphological changes and other senescence indicators. CONCLUSIONS This deep learning approach has been applied for the first time to detect senescent MSCs, showing promising performance in both chronic and acute MSC senescence. The system can be a labor-saving and cost-effective option for screening MSC culture conditions and anti-aging drugs, as well as a powerful tool for non-invasive and real-time morphological image analysis integrated into cell production.
Affiliation(s)
- Liangge He
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical School, 1066 Xueyuan Avenue, Shenzhen, 518060, China
- Department of Medical Cell Biology and Genetics, Shenzhen Key Laboratory of Anti-Aging and Regenerative Medicine, Shenzhen Engineering Laboratory of Regenerative Technologies for Orthopedic Diseases, Shenzhen University Medical School, Shenzhen, 518060, China
- Mingzhu Li
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical School, 1066 Xueyuan Avenue, Shenzhen, 518060, China
- Xinglie Wang
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical School, 1066 Xueyuan Avenue, Shenzhen, 518060, China
- Xiaoyan Wu
- Department of Dermatology, Shenzhen Institute of Translational Medicine, Shenzhen Second People's Hospital, The First Affiliated Hospital of Shenzhen University, Shenzhen, 518035, China
- Guanghui Yue
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical School, 1066 Xueyuan Avenue, Shenzhen, 518060, China
- Tianfu Wang
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical School, 1066 Xueyuan Avenue, Shenzhen, 518060, China
- Yan Zhou
- Department of Medical Cell Biology and Genetics, Shenzhen Key Laboratory of Anti-Aging and Regenerative Medicine, Shenzhen Engineering Laboratory of Regenerative Technologies for Orthopedic Diseases, Shenzhen University Medical School, Shenzhen, 518060, China
- Lungene Biotech Ltd., Shenzhen, 18000, China
- Baiying Lei
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical School, 1066 Xueyuan Avenue, Shenzhen, 518060, China.
- Guangqian Zhou
- Department of Medical Cell Biology and Genetics, Shenzhen Key Laboratory of Anti-Aging and Regenerative Medicine, Shenzhen Engineering Laboratory of Regenerative Technologies for Orthopedic Diseases, Shenzhen University Medical School, Shenzhen, 518060, China.
7. Li T, Sun G, Yu L, Zhou K. HRBUST-LLPED: A Benchmark Dataset for Wearable Low-Light Pedestrian Detection. Micromachines 2023; 14:2164. [PMID: 38138333] [PMCID: PMC10745713] [DOI: 10.3390/mi14122164]
Abstract
Detecting pedestrians in low-light conditions is challenging, especially on wearable platforms. Infrared cameras have been employed to enhance detection capabilities, whereas low-light cameras capture more intricate pedestrian features. With this in mind, we introduce a low-light pedestrian detection dataset (HRBUST-LLPED) by capturing pedestrian data on campus with wearable low-light cameras. Most of the data were gathered under starlight-level illumination. The dataset annotates 32,148 pedestrian instances in 4269 keyframes, with a high pedestrian density of more than seven people per image. We provide four lightweight low-light pedestrian detection models based on the advanced YOLOv5 and YOLOv8. By training the models on public datasets and fine-tuning them on HRBUST-LLPED, our model obtains 69.90% AP@0.5:0.95 with an inference time of 1.6 ms. The experiments demonstrate that our work can help advance pedestrian detection research with low-light cameras on wearable devices.
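A hedged sketch of the pretrain-then-fine-tune recipe using the Ultralytics YOLOv8 API; the dataset YAML name and hyperparameters below are placeholders for illustration, not values from the paper.

```python
from ultralytics import YOLO

# Start from a model pretrained on a public dataset (COCO weights here),
# then fine-tune on the low-light pedestrian data.
model = YOLO("yolov8n.pt")

model.train(
    data="hrbust_llped.yaml",   # hypothetical dataset config: image paths + one "pedestrian" class
    epochs=100,
    imgsz=640,
    lr0=0.001,                  # lower initial learning rate for fine-tuning
)

metrics = model.val()           # reports mAP@0.5:0.95 among other metrics
print(metrics.box.map)
```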
Affiliation(s)
- Guanglu Sun
- School of Computer Science and Technology, Harbin University of Science and Technology, No. 52 Xuefu Road, Nangang District, Harbin 150080, China; (T.L.)
8. Li D, Tian Y, Li J. SODFormer: Streaming Object Detection With Transformer Using Events and Frames. IEEE Transactions on Pattern Analysis and Machine Intelligence 2023; 45:14020-14037. [PMID: 37494161] [DOI: 10.1109/tpami.2023.3298925]
Abstract
The DAVIS camera, which streams two complementary sensing modalities (asynchronous events and frames), has increasingly been used to address major object detection challenges (e.g., fast motion blur and low light). However, how to effectively leverage rich temporal cues and fuse two heterogeneous visual streams remains a challenging endeavor. To address this challenge, we propose a novel streaming object detector with Transformer, namely SODFormer, which first integrates events and frames to continuously detect objects in an asynchronous manner. Technically, we first build a large-scale multimodal neuromorphic object detection dataset (i.e., PKU-DAVIS-SOD) with over 1080.1k manual labels. Then, we design a spatiotemporal Transformer architecture to detect objects via an end-to-end sequence prediction problem, where a novel temporal Transformer module leverages rich temporal cues from the two visual streams to improve detection performance. Finally, an asynchronous attention-based fusion module is proposed to integrate the two heterogeneous sensing modalities and exploit the complementary advantages of each; it can be queried at any time to locate objects, breaking through the limited output frequency of synchronized frame-based fusion strategies. The results show that the proposed SODFormer outperforms four state-of-the-art methods and our eight baselines by a significant margin. We also show that our unifying framework works well even in cases where the conventional frame-based camera fails, e.g., high-speed motion and low-light conditions. Our dataset and code are available at https://github.com/dianzl/SODFormer.
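The attention-based fusion of the two streams can be illustrated with a generic cross-attention block in which frame tokens query event tokens; this is a simplified stand-in for SODFormer's asynchronous fusion module, and the dimensions and layer choices are assumptions.

```python
import torch
import torch.nn as nn

class EventFrameFusion(nn.Module):
    """Fuse frame tokens and event tokens with cross-attention: frame tokens
    act as queries, event tokens as keys/values, plus a residual connection."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, frame_tokens, event_tokens):   # [B, Nf, D], [B, Ne, D]
        fused, _ = self.attn(query=frame_tokens, key=event_tokens,
                             value=event_tokens)
        return self.norm(frame_tokens + fused)
```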
9. Yan J, Zhao J, Cai Y, Wang S, Qiu X, Yao X, Tian Y, Zhu Y, Cao W, Zhang X. Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis. Plant Methods 2023; 19:46. [PMID: 37179312] [PMCID: PMC10183117] [DOI: 10.1186/s13007-023-01020-2]
Abstract
BACKGROUND Detecting and counting wheat spikes is essential for predicting and measuring wheat yield. However, current wheat spike detection research often directly applies new network structures, and few studies combine prior knowledge of wheat spike size characteristics to design a suitable detection model. It remains unclear whether the complex detection layers of the network play their intended role. RESULTS This study proposes an interpretive analysis method for quantitatively evaluating the role of the three-scale detection layers in a deep learning-based wheat spike detection model. The attention scores in each detection layer of the YOLOv5 network are calculated using the Gradient-weighted Class Activation Mapping (Grad-CAM) algorithm, which compares the labeled wheat spike bounding boxes with the attention areas of the network. By refining the multi-scale detection layers using the attention scores, a better wheat spike detection network is obtained. Experiments on the Global Wheat Head Detection (GWHD) dataset show that the large-scale detection layer performs poorly, while the medium-scale detection layer performs best among the three. Consequently, the large-scale detection layer is removed, a micro-scale detection layer is added, and the feature extraction ability of the medium-scale detection layer is enhanced. The refined model increases detection accuracy and reduces network complexity by decreasing the number of network parameters. CONCLUSION The proposed interpretive analysis method can evaluate the contribution of different detection layers in a wheat spike detection network and provide a sound network improvement scheme. The findings of this study offer a useful reference for future applications of deep network refinement in this field.
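A minimal sketch of the interpretive analysis: compute a Grad-CAM map for one detection layer and score how much of its attention mass falls inside the labeled spike boxes. The exact scoring formula used in the paper may differ; the helper names and normalization below are assumptions.

```python
import torch
import torch.nn.functional as F

def grad_cam(activations, gradients):
    """Grad-CAM for one detection layer: channel weights are the spatially
    averaged gradients; the map is a ReLU-ed weighted sum of activations."""
    weights = gradients.mean(dim=(2, 3), keepdim=True)         # [B, C, 1, 1]
    cam = F.relu((weights * activations).sum(dim=1))            # [B, H, W]
    cam = cam / (cam.amax(dim=(1, 2), keepdim=True) + 1e-6)     # normalize to [0, 1]
    return cam

def layer_attention_score(cam, boxes, image_size):
    """Fraction of the attention mass that falls inside labeled spike boxes.
    cam: [H, W] map for one image; boxes: [N, 4] as (x1, y1, x2, y2) in image
    coordinates; image_size: (H_img, W_img)."""
    h, w = cam.shape
    mask = torch.zeros_like(cam)
    sx, sy = w / image_size[1], h / image_size[0]
    for x1, y1, x2, y2 in boxes:
        mask[int(y1 * sy):int(y2 * sy), int(x1 * sx):int(x2 * sx)] = 1.0
    return (cam * mask).sum() / (cam.sum() + 1e-6)
```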
Affiliation(s)
- Jiawei Yan
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Jianqing Zhao
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Yucheng Cai
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Suwan Wang
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Xiaolei Qiu
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Xia Yao
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Jiangsu Key Laboratory for Information Agriculture, Nanjing, 210095, China
- Yongchao Tian
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Jiangsu Collaborative Innovation Center for Modern Crop Production, Nanjing, 210095, China
- Yan Zhu
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Weixing Cao
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China
- Xiaohu Zhang
- National Engineering and Technology Center for Information Agriculture, Nanjing Agricultural University, Nanjing, 210095, China.
- Key Laboratory for Crop System Analysis and Decision Making, Ministry of Agriculture and Rural Affairs, Nanjing, 210095, China.
- Jiangsu Collaborative Innovation Center for Modern Crop Production, Nanjing, 210095, China.
10. Lin Z, Pei W, Chen F, Zhang D, Lu G. Pedestrian Detection by Exemplar-Guided Contrastive Learning. IEEE Transactions on Image Processing 2023; 32:2003-2016. [PMID: 35839180] [DOI: 10.1109/tip.2022.3189803]
Abstract
Typical methods for pedestrian detection focus on either tackling mutual occlusions between crowded pedestrians or dealing with the various scales of pedestrians. Detecting pedestrians with substantial appearance diversity, such as different silhouettes, viewpoints, or clothing, remains a crucial challenge. Instead of learning each of these diverse pedestrian appearance features individually, as most existing methods do, we propose to perform contrastive learning to guide feature learning such that the semantic distance between pedestrians with different appearances in the learned feature space is minimized to eliminate the appearance diversity, while the distance between pedestrians and background is maximized. To facilitate the efficiency and effectiveness of contrastive learning, we construct an exemplar dictionary with representative pedestrian appearances as prior knowledge to build effective contrastive training pairs and thus guide contrastive learning. Besides, the constructed exemplar dictionary is further leveraged to evaluate the quality of pedestrian proposals during inference by measuring the semantic distance between a proposal and the exemplar dictionary. Extensive experiments on both daytime and nighttime pedestrian detection validate the effectiveness of the proposed method.
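A hedged sketch of the inference-time use of an exemplar dictionary: each proposal is scored by its similarity to the nearest exemplar, which can then be combined with the detector confidence. The function is illustrative only; the paper's distance measure and blending scheme are not specified in the abstract.

```python
import torch
import torch.nn.functional as F

def exemplar_similarity_score(proposal_feats, exemplar_dict):
    """Score each proposal by its maximum cosine similarity to the exemplar
    dictionary. proposal_feats: [N, D] RoI features; exemplar_dict: [K, D]
    representative pedestrian appearance features."""
    p = F.normalize(proposal_feats, dim=1)
    e = F.normalize(exemplar_dict, dim=1)
    sim = p @ e.t()                      # [N, K] cosine similarities
    return sim.max(dim=1).values         # nearest-exemplar similarity per proposal
```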
11. Er MJ, Chen J, Zhang Y, Gao W. Research Challenges, Recent Advances, and Popular Datasets in Deep Learning-Based Underwater Marine Object Detection: A Review. Sensors (Basel) 2023; 23:1990. [PMID: 36850584] [PMCID: PMC9966468] [DOI: 10.3390/s23041990]
Abstract
Underwater marine object detection, one of the most fundamental techniques in the marine science and engineering community, has shown tremendous potential for exploring the oceans in recent years. It has been widely applied in practical applications, such as monitoring of underwater ecosystems, exploration of natural resources, and management of commercial fisheries. However, due to the complexity of the underwater environment, the characteristics of marine objects, and the limitations imposed by exploration equipment, detection performance in terms of speed, accuracy, and robustness can degrade dramatically when conventional approaches are used. Deep learning has had a significant impact on a variety of applications, including marine engineering. In this context, we offer a review of deep learning-based underwater marine object detection techniques. Underwater object detection can be performed by different sensors, such as acoustic sonar or optical cameras. In this paper, we focus on vision-based object detection due to its several significant advantages. To facilitate a thorough understanding of this subject, we organize the research challenges of vision-based underwater object detection into four categories: image quality degradation, small object detection, poor generalization, and real-time detection. We review recent advances in underwater marine object detection and highlight the advantages and disadvantages of existing solutions for each challenge. In addition, we provide a detailed critical examination of the most extensively used datasets. Finally, we present comparative studies with previous reviews, notably those approaches that leverage artificial intelligence, as well as future trends related to this hot topic.
12. Lim J, Baskaran VM, Lim JMY, Wong K, See J, Tistarelli M. ERNet: An Efficient and Reliable Human-Object Interaction Detection Network. IEEE Transactions on Image Processing 2023; 32:964-979. [PMID: 37022006] [DOI: 10.1109/tip.2022.3231528]
Abstract
Human-Object Interaction (HOI) detection recognizes how persons interact with objects, which is advantageous in autonomous systems such as self-driving vehicles and collaborative robots. However, current HOI detectors are often plagued by model inefficiency and unreliability when making predictions, which limits their potential for real-world scenarios. In this paper, we address these challenges by proposing ERNet, an end-to-end trainable convolutional-transformer network for HOI detection. The proposed model employs efficient multi-scale deformable attention to effectively capture vital HOI features. We also put forward a novel detection attention module to adaptively generate semantically rich instance and interaction tokens. These tokens undergo pre-emptive detections to produce initial region and vector proposals that also serve as queries, which enhances the feature refinement process in the transformer decoders. Several impactful enhancements are also applied to improve the HOI representation learning. Additionally, we utilize a predictive uncertainty estimation framework in the instance and interaction classification heads to quantify the uncertainty behind each prediction. By doing so, we can accurately and reliably predict HOIs even under challenging scenarios. Experimental results on the HICO-Det, V-COCO, and HOI-A datasets demonstrate that the proposed model achieves state-of-the-art performance in detection accuracy and training efficiency. Code is publicly available at https://github.com/Monash-CyPhi-AI-Research-Lab/ernet.
13. An objective method for pedestrian occlusion level classification. Pattern Recognit Lett 2022. [DOI: 10.1016/j.patrec.2022.10.028]
14. Occluded pedestrian detection through bi-center prediction in anchor-free network. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.08.026]
15. Cao J, Pang Y, Xie J, Khan FS, Shao L. From Handcrafted to Deep Features for Pedestrian Detection: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 2022; 44:4913-4934. [PMID: 33929956] [DOI: 10.1109/tpami.2021.3076733]
Abstract
Pedestrian detection is an important but challenging problem in computer vision, especially in human-centric tasks. Over the past decade, significant improvement has been witnessed with the help of handcrafted features and deep features. Here we present a comprehensive survey of recent advances in pedestrian detection. First, we provide a detailed review of single-spectral pedestrian detection that covers handcrafted-feature-based methods and deep-feature-based approaches. For handcrafted-feature-based methods, we present an extensive review and find that handcrafted features with large degrees of freedom in shape and space achieve better performance. In the case of deep-feature-based approaches, we split them into pure CNN-based methods and those employing both handcrafted and CNN-based features. We give a statistical analysis and the tendencies of these methods, where feature-enhanced, part-aware, and post-processing methods have attracted the most attention. In addition to single-spectral pedestrian detection, we also review multi-spectral pedestrian detection, which provides more robust features under illumination variance. Furthermore, we introduce related datasets and evaluation metrics and provide an in-depth experimental analysis. We conclude this survey by emphasizing open problems that need to be addressed and highlighting various future directions. Researchers can track an up-to-date list at https://github.com/JialeCao001/PedSurvey.
16. Li W, Chen Z, Li B, Zhang D, Yuan Y. HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection. IEEE Transactions on Image Processing 2021; 30:9456-9469. [PMID: 34780326] [DOI: 10.1109/tip.2021.3126423]
Abstract
Decoupling the sibling head has recently shown great potential in relieving the inherent task-misalignment problem in two-stage object detectors. However, existing works design similar structures for classification and regression, ignoring task-specific characteristics and feature demands. Besides, the shared knowledge that may benefit the two branches is neglected, leading to potential excessive decoupling and semantic inconsistency. To address these two issues, we propose the Heterogeneous Task Decoupling (HTD) framework for object detection, which utilizes a Progressive Graph (PGraph) module and a Border-aware Adaptation (BA) module for task decoupling. Specifically, we first devise a Semantic Feature Aggregation (SFA) module to aggregate global semantics with image-level supervision, serving as the shared knowledge for the task-decoupled framework. Then, the PGraph module performs progressive graph reasoning, including local spatial aggregation and global semantic interaction, to enhance the semantic representations of region proposals for classification. The proposed BA module integrates multi-level features adaptively, focusing on low-level border activation to obtain representations with spatial and border perception for regression. Finally, we utilize the aggregated knowledge from SFA to keep the instance-level semantic consistency (ISC) of the decoupled framework. Extensive experiments demonstrate that HTD outperforms existing detection works by a large margin and achieves single-model 50.4% AP and 33.2% APs on the COCO test-dev set using a ResNet-101-DCN backbone, which is the best entry among state-of-the-art methods under the same configuration. Our code is available at https://github.com/CityU-AIM-Group/HTD.
17. Zhao Z, Liu Y, Sun X, Liu J, Yang X, Zhou C. Composited FishNet: Fish Detection and Species Recognition From Low-Quality Underwater Videos. IEEE Transactions on Image Processing 2021; 30:4719-4734. [PMID: 33905330] [DOI: 10.1109/tip.2021.3074738]
Abstract
The automatic detection and identification of fish from underwater videos is of great significance for fishery resource assessment and ecological environment monitoring. However, due to the poor quality of underwater images and unconstrained fish movement, traditional hand-designed feature extraction methods and convolutional neural network (CNN)-based object detection algorithms cannot meet the detection requirements of real underwater scenes. Therefore, to realize fish recognition and localization in a complex underwater environment, this paper proposes a novel composite fish detection framework based on a composite backbone and an enhanced path aggregation network, called Composited FishNet. By improving the residual network (ResNet), a new composite backbone network (CBresnet) is designed to learn scene change information (source domain style) caused by differences in image brightness, fish orientation, seabed structure, aquatic plant movement, and fish species shape and texture. Thus, the interference of underwater environmental information with the object characteristics is reduced, and the output of the main network for the object information is strengthened. In addition, to better integrate the high- and low-level feature information output by CBresnet, an enhanced path aggregation network (EPANet) is designed to address the insufficient utilization of semantic information caused by linear upsampling. The experimental results show that the average precision AP0.5:0.95 and AP50 and the average recall ARmax=10 of the proposed Composited FishNet are 75.2%, 92.8%, and 81.1%, respectively. The composite backbone network enhances the characteristic information output of the detected object and improves the utilization of characteristic information. This method can be used for fish detection and identification in complex underwater environments such as oceans and aquaculture.
18. Li Y, Pang Y, Cao J, Shen J, Shao L. Improving Single Shot Object Detection With Feature Scale Unmixing. IEEE Transactions on Image Processing 2021; 30:2708-2721. [PMID: 33417552] [DOI: 10.1109/tip.2020.3048630]
Abstract
Due to the advantages of real-time detection and improved performance, single-shot detectors have gained great attention recently. To handle complex scale variations, single-shot detectors make scale-aware predictions based on multiple pyramid layers. Typically, small objects are detected on shallow layers while large objects are detected on deep layers. However, the features in the pyramid are not scale-aware enough, which limits detection performance. Two common problems in single-shot detectors caused by object scale variations can be observed: (1) the false negative problem, i.e., small objects are easily missed due to weak features; (2) the part false positive problem, i.e., the salient part of a large object is sometimes detected as an object. With this observation, a new Neighbor Erasing and Transferring (NET) mechanism is proposed in this paper for feature scale-unmixing to explore scale-aware features. In NET, a Neighbor Erasing Module (NEM) is designed to erase the salient features of large objects and emphasize the features of small objects in shallow layers. A Neighbor Transferring Module (NTM) is introduced to transfer the erased features and highlight large objects in deep layers. With this mechanism, a single-shot network called NETNet is constructed for scale-aware object detection. In addition, we propose to aggregate nearest neighboring pyramid features to enhance NET. Experiments on the MS COCO and UAVDT datasets demonstrate the effectiveness of our method. NETNet obtains 38.5% AP at 27 FPS and 32.0% AP at 55 FPS on the MS COCO dataset, achieving a better trade-off between real-time and accurate object detection.