1
|
Wang H, Gao J. SF-DETR: A Scale-Frequency Detection Transformer for Drone-View Object Detection. SENSORS (BASEL, SWITZERLAND) 2025; 25:2190. [PMID: 40218703 PMCID: PMC11991380 DOI: 10.3390/s25072190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2025] [Revised: 03/27/2025] [Accepted: 03/27/2025] [Indexed: 04/14/2025]
Abstract
Drone-based object detection faces critical challenges, including tiny objects, complex urban backgrounds, dramatic scale variations, and high-frequency detail loss during feature propagation. Current detection methods struggle to address these challenges while maintaining computational efficiency effectively. We propose Scale-Frequency Detection Transformer (SF-DETR), a novel end-to-end framework for drone-view scenarios. SF-DETR introduces a lightweight ScaleFormerNet backbone with Dual Scale Vision Transformer modules, a Bilateral Interactive Feature Enhancement Module, and a Multi-Scale Frequency-Fused Feature Enhancement Network. Extensive experiments on the VisDrone2019 dataset demonstrate SF-DETR's superior performance, achieving 51.0% mAP50 and 31.8% mAP50:95, surpassing state-of-the-art methods like YOLOv9m and RTDETR-r18 by 6.2% and 4.0%, respectively. Further validation of the HIT-UAV dataset confirms the model's generalization capability. Our work establishes a new benchmark for drone-view object detection and provides lightweight architecture suitable for embedded device deployment in real-world aerial surveillance applications.
Collapse
Affiliation(s)
| | - Junwei Gao
- The School of Automation Engineering, Qingdao University, Qingdao 266071, China;
| |
Collapse
|
2
|
Guo L, Li R, Jiang B. An Ensemble Broad Learning Scheme for Semisupervised Vehicle Type Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:5287-5297. [PMID: 34086583 DOI: 10.1109/tnnls.2021.3083508] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Nowadays vehicle type classification is a fundamental part of intelligent transportation systems (ITSs) and is widely used in various applications like traffic flow monitoring, security enforcement, and autonomous driving, etc. However, vehicle classification is usually used in supervised learning, which greatly limits the applicability for real ITS. This article proposes a semisupervised vehicle type classification scheme via ensemble broad learning for ITS. This presented method contains two main parts. In the first part, a collection of base broad learning system (BLS) classifiers is trained by semisupervised learning to avoid time-consuming training process and alleviate the increasingly unlabeled samples burden. In the second part, a dynamic ensemble structure constructed by trained classifier groups with different characteristics obtains the highest type probability and determine which the vehicle belongs, so as to achieve superior generalization performance than a single base classifier. Several experiments conducted on the pubic BIT-Vehicle dataset and MIO-TCD dataset demonstrate that the proposed method outperforms single BLS classifier and some mainstream methods on effectiveness and efficiency.
Collapse
|
3
|
Sun F, Fang F, Wang R, Wan B, Guo Q, Li H, Wu X. An Impartial Semi-Supervised Learning Strategy for Imbalanced Classification on VHR Images. SENSORS 2020; 20:s20226699. [PMID: 33238513 PMCID: PMC7700671 DOI: 10.3390/s20226699] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 11/20/2020] [Accepted: 11/21/2020] [Indexed: 11/20/2022]
Abstract
Imbalanced learning is a common problem in remote sensing imagery-based land-use and land-cover classifications. Imbalanced learning can lead to a reduction in classification accuracy and even the omission of the minority class. In this paper, an impartial semi-supervised learning strategy based on extreme gradient boosting (ISS-XGB) is proposed to classify very high resolution (VHR) images with imbalanced data. ISS-XGB solves multi-class classification by using several semi-supervised classifiers. It first employs multi-group unlabeled data to eliminate the imbalance of training samples and then utilizes gradient boosting-based regression to simulate the target classes with positive and unlabeled samples. In this study, experiments were conducted on eight study areas with different imbalanced situations. The results showed that ISS-XGB provided a comparable but more stable performance than most commonly used classification approaches (i.e., random forest (RF), XGB, multilayer perceptron (MLP), and support vector machine (SVM)), positive and unlabeled learning (PU-Learning) methods (PU-BP and PU-SVM), and typical synthetic sample-based imbalanced learning methods. Especially under extremely imbalanced situations, ISS-XGB can provide high accuracy for the minority class without losing overall performance (the average overall accuracy achieves 85.92%). The proposed strategy has great potential in solving the imbalanced classification problems in remote sensing.
Collapse
Affiliation(s)
- Fei Sun
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, China; (F.S.); (F.F.); (B.W.); (H.L.); (X.W.)
- Academy of Computer, Huanggang Normal University, No. 146 Xinggang 2nd Road, Huanggang 438000, China
| | - Fang Fang
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, China; (F.S.); (F.F.); (B.W.); (H.L.); (X.W.)
- National Engineering Research Center for Geographic Information System, China University of Geosciences, Wuhan 430078, China
| | - Run Wang
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, China; (F.S.); (F.F.); (B.W.); (H.L.); (X.W.)
- Key Laboratory of Geological Survey and Evaluation of Ministry of Education, China University of Geosciences, Wuhan 430078, China
- Correspondence: ; Tel.: +86-027-6788-3728
| | - Bo Wan
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, China; (F.S.); (F.F.); (B.W.); (H.L.); (X.W.)
- National Engineering Research Center for Geographic Information System, China University of Geosciences, Wuhan 430078, China
| | - Qinghua Guo
- State Key Laboratory of Vegetation and Environmental Change, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China;
| | - Hong Li
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, China; (F.S.); (F.F.); (B.W.); (H.L.); (X.W.)
- National Engineering Research Center for Geographic Information System, China University of Geosciences, Wuhan 430078, China
| | - Xincai Wu
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, China; (F.S.); (F.F.); (B.W.); (H.L.); (X.W.)
- National Engineering Research Center for Geographic Information System, China University of Geosciences, Wuhan 430078, China
| |
Collapse
|
4
|
Shokravi H, Shokravi H, Bakhary N, Heidarrezaei M, Rahimian Koloor SS, Petrů M. A Review on Vehicle Classification and Potential Use of Smart Vehicle-Assisted Techniques. SENSORS (BASEL, SWITZERLAND) 2020; 20:E3274. [PMID: 32521806 PMCID: PMC7309154 DOI: 10.3390/s20113274] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2020] [Revised: 06/01/2020] [Accepted: 06/03/2020] [Indexed: 12/04/2022]
Abstract
Vehicle classification (VC) is an underlying approach in an intelligent transportation system and is widely used in various applications like the monitoring of traffic flow, automated parking systems, and security enforcement. The existing VC methods generally have a local nature and can classify the vehicles if the target vehicle passes through fixed sensors, passes through the short-range coverage monitoring area, or a hybrid of these methods. Using global positioning system (GPS) can provide reliable global information regarding kinematic characteristics; however, the methods lack information about the physical parameter of vehicles. Furthermore, in the available studies, smartphone or portable GPS apparatuses are used as the source of the extraction vehicle's kinematic characteristics, which are not dependable for the tracking and classification of vehicles in real time. To deal with the limitation of the available VC methods, potential global methods to identify physical and kinematic characteristics in real time states are investigated. Vehicular Ad Hoc Networks (VANETs) are networks of intelligent interconnected vehicles that can provide traffic parameters such as type, velocity, direction, and position of each vehicle in a real time manner. In this study, VANETs are introduced for VC and their capabilities, which can be used for the above purpose, are presented from the available literature. To the best of the authors' knowledge, this is the first study that introduces VANETs for VC purposes. Finally, a comparison is conducted that shows that VANETs outperform the conventional techniques.
Collapse
Affiliation(s)
- Hoofar Shokravi
- Faculty of Engineering, School of Civil Engineering, Universiti Teknologi Malaysia, Skudai, Johor 81310, Malaysia;
| | - Hooman Shokravi
- Department of Civil Engineering, Islamic Azad University, Tabriz 5157944533, Iran;
| | - Norhisham Bakhary
- Faculty of Engineering, School of Civil Engineering, Universiti Teknologi Malaysia, Skudai, Johor 81310, Malaysia;
- Institute of Noise and Vibration, Universiti Teknologi Malaysia, City Campus, Jalan Semarak, Kuala Lumpur 54100, Malaysia
| | - Mahshid Heidarrezaei
- Faculty of Engineering, Universiti Teknologi Malaysia, UTM Skudai, Johor Bahru, Johor 81310, Malaysia;
| | - Seyed Saeid Rahimian Koloor
- Institute for Nanomaterials, Advanced Technologies and Innovation (CXI), Technical University of Liberec (TUL), Studentska 2, 461 17 Liberec, Czech Republic; (S.S.R.K.); (M.P.)
| | - Michal Petrů
- Institute for Nanomaterials, Advanced Technologies and Innovation (CXI), Technical University of Liberec (TUL), Studentska 2, 461 17 Liberec, Czech Republic; (S.S.R.K.); (M.P.)
| |
Collapse
|