1
|
Ren J, Zhao Y, Zhang W, Sun C. Zero-shot incremental learning using spatial-frequency feature representations. Sci Rep 2025; 15:10932. [PMID: 40157998 PMCID: PMC11954858 DOI: 10.1038/s41598-024-83649-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2024] [Accepted: 12/16/2024] [Indexed: 04/01/2025] Open
Abstract
Zero-shot incremental learning aims to enable a model to generalize to new classes without forgetting previously learned classes. However, the semantic gap between old and new sample classes can lead to catastrophic forgetting. Additionally, existing algorithms lack the ability to capture significant information from each sample image domain. Therefore, this paper proposes a novel spatial-frequency feature representation network (SFFRNet) that contains a spatial feature extraction (SFE) module and a frequency feature extraction (FFE) module to improve the zero-shot translation for the class incremental learning algorithm. The proposed SFFRNet has the ability to effectively extract spatial-frequency feature representation from input images, improve the accuracy of image classification, and fundamentally alleviate catastrophic forgetting. Extensive experiments on the CUB 200-2011 and CIFAR-100 datasets demonstrate that our proposed algorithm outperforms state-of-the-art incremental learning algorithms.
Collapse
Affiliation(s)
- Jie Ren
- Xi'an Polytechnic University, Xi'an, Shaanxi, China
| | - Yang Zhao
- Xi'an Polytechnic University, Xi'an, Shaanxi, China
| | - Weichuan Zhang
- Shaanxi University of Science and Technology, Xi 'an, Shaanxi, China.
| | - Changming Sun
- CSIRO Data61, PO Box 76, Epping, NSW, 1710, Australia
| |
Collapse
|
2
|
Feng Z, Shi R, Jiang Y, Han Y, Ma Z, Ren Y. A Multiscale Gradient Fusion Method for Color Image Edge Detection Using CBM3D Filtering. SENSORS (BASEL, SWITZERLAND) 2025; 25:2031. [PMID: 40218544 PMCID: PMC11991500 DOI: 10.3390/s25072031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2025] [Revised: 03/13/2025] [Accepted: 03/15/2025] [Indexed: 04/14/2025]
Abstract
In this paper, we present a novel color edge detection method that integrates collaborative filtering with multiscale gradient fusion. The Block-Matching and 3D (BM3D) filter is utilized to enhance sparse representations in the transform domain, effectively reducing noise. The multiscale gradient fusion technique compensates for the loss of detail in single-scale edge detection, thereby improving both edge resolution and overall quality. RGB images from the dataset are converted into the XYZ color space through mathematical transformations. The Colored Block-Matching and 3D (CBM3D) filter is applied to the sparse images to reduce noise. Next, the vector gradients of the color image and anisotropic Gaussian directional derivatives for two scale parameters are computed. These are then averaged pixel-by-pixel to generate a refined edge strength map. To enhance the edge features, the image undergoes normalization and non-maximum suppression. This is followed by edge contour extraction using double-thresholding and a novel morphological refinement technique. Experimental results on the edge detection dataset demonstrate that the proposed method offers robust noise resistance and superior edge quality, outperforming traditional methods such as Color Sobel, Color Canny, SE, and Color AGDD, as evidenced by performance metrics including the PR curve, AUC, PSNR, MSE, and FOM.
Collapse
Affiliation(s)
- Zhunruo Feng
- School of Electronics and Information, Xi’an Polytechnic University, Xi’an 710048, China; (Z.F.); (Y.H.); (Z.M.)
| | - Ruomeng Shi
- School of International Business School Suzhou, Xi’an Jiaotong Liverpool University, Suzhou 215123, China;
| | - Yuhan Jiang
- School of the Arts, Universitiy Sains Malaysia, Penang 11700, Malaysia;
| | - Yiming Han
- School of Electronics and Information, Xi’an Polytechnic University, Xi’an 710048, China; (Z.F.); (Y.H.); (Z.M.)
| | - Zeyang Ma
- School of Electronics and Information, Xi’an Polytechnic University, Xi’an 710048, China; (Z.F.); (Y.H.); (Z.M.)
| | - Yuheng Ren
- School of Digital Industry, Jimei University, Xiamen 361021, China
| |
Collapse
|
3
|
Zhou H, Wang K, Nie C, Deng J, Chen Z, Zhang K, Zhao X, Liang J, Huang D, Zhao L, Jang HS, Kong J. Quantitative Analysis of Perovskite Morphologies Employing Deep Learning Framework Enables Accurate Solar Cell Performance Prediction. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2025:e2408528. [PMID: 40109130 DOI: 10.1002/smll.202408528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 03/03/2025] [Indexed: 03/22/2025]
Abstract
In perovskite solar cells, grain boundaries are considered one of the major structural defect sites, and consequently affect solar cell performance. Therefore, a precise edge detection of perovskite grains may enable to predict resulting solar cell performance. Herein, a deep learning model, Self-UNet, is developed to extract and quantify morphological information such as grain boundary length (GBL), the number of grains (NG), and average grain surface area (AGSA) from scanning elecron microscope (SEM) images. The Self-UNet excels conventional Canny and UNet models in edge extraction; the Dice coefficient and F1-score exhibit as high as 91.22% and 93.58%, respectively. The high edge detection accuracy of Self-UNet allows for not only identifying tiny grains stuck between relatively large grains, but also distinguishing actual grain boundaries from grooves on grain surface from low quality SEM images, avoiding under- or over-estimation of grain information. Moreover, the gradient boosted decision tree (GBDT) regression integrated to the Self-UNet exhibits high accuracy in predicting solar cell efficiency with relative errors of less than 10% compared to the experimentally measured efficiencies, which is corroborated by results from the literature and the experiments. Additionally, the GBL can be verified in multiple ways as a new morphological feature.
Collapse
Affiliation(s)
- Haixin Zhou
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Kuo Wang
- Department of Physics, Gyeongsang National University, Jinju, 52828, Republic of Korea
- Materials Digitalization Center, Korea Institute of Ceramic Engineering & Technology (KICET), Jinju, 52851, Republic of Korea
| | - Cong Nie
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Jiahao Deng
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Ziye Chen
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Kang Zhang
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Xiaojie Zhao
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Jiaojiao Liang
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Di Huang
- College of Railway Transportation, Hunan University of Technology, Zhuzhou, 412008, China
| | - Ling Zhao
- Shandong Provinical Key Laboratory of Optical Communication Science and Technology, School of Physical Science and Information Technology, Liaocheng University, Liaocheng, 252059, China
| | - Hun Soo Jang
- Materials Digitalization Center, Korea Institute of Ceramic Engineering & Technology (KICET), Jinju, 52851, Republic of Korea
| | - Jeamin Kong
- Department of Physics, Gyeongsang National University, Jinju, 52828, Republic of Korea
| |
Collapse
|
4
|
Xie W, Chen P, Li Z, Wang X, Wang C, Zhang L, Wu W, Xiang J, Wang Y, Zhong D. A Two stage deep learning network for automated femoral segmentation in bilateral lower limb CT scans. Sci Rep 2025; 15:9198. [PMID: 40097821 PMCID: PMC11914536 DOI: 10.1038/s41598-025-94180-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Accepted: 03/12/2025] [Indexed: 03/19/2025] Open
Abstract
This study presents the development of a deep learning-based two-stage network designed for the efficient and precise segmentation of the femur in full lower limb CT images. The proposed network incorporates a dual-phase approach: rapid delineation of regions of interest followed by semantic segmentation of the femur. The experimental dataset comprises 100 samples obtained from a hospital, partitioned into 85 for training, 8 for validation, and 7 for testing. In the first stage, the model achieves an average Intersection over Union of 0.9671 and a mean Average Precision of 0.9656, effectively delineating the femoral region with high accuracy. During the second stage, the network attains an average Dice coefficient of 0.953, sensitivity of 0.965, specificity of 0.998, and pixel accuracy of 0.996, ensuring precise segmentation of the femur. When compared to the single-stage SegResNet architecture, the proposed two-stage model demonstrates faster convergence during training, reduced inference times, higher segmentation accuracy, and overall superior performance. Comparative evaluations against the TransUnet model further highlight the network's notable advantages in accuracy and robustness. In summary, the proposed two-stage network offers an efficient, accurate, and autonomous solution for femur segmentation in large-scale and complex medical imaging datasets. Requiring relatively modest training and computational resources, the model exhibits significant potential for scalability and clinical applicability, making it a valuable tool for advancing femoral image segmentation and supporting diagnostic workflows.
Collapse
Affiliation(s)
- Wenqing Xie
- Deparment of Orthopedics, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changshan, 410008, Huna, China
| | - Peng Chen
- Deparment of Orthopedics, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changshan, 410008, Huna, China
| | - Zhigang Li
- Deparment of Orthopedics, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changshan, 410008, Huna, China
| | - Xiaopeng Wang
- Deparment of Orthopedics, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changshan, 410008, Huna, China
| | - Chenggong Wang
- Deparment of Orthopedics, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changshan, 410008, Huna, China
| | - Lin Zhang
- Changzhou Jinse Medical Information Technology Co., Ltd, Changzhou, 213000, Jiangsu, China
| | - Wenhao Wu
- Changzhou Jinse Medical Information Technology Co., Ltd, Changzhou, 213000, Jiangsu, China
| | - Junjie Xiang
- Changzhou Jinse Medical Information Technology Co., Ltd, Changzhou, 213000, Jiangsu, China
| | - Yiping Wang
- Changzhou Jinse Medical Information Technology Co., Ltd, Changzhou, 213000, Jiangsu, China.
| | - Da Zhong
- Deparment of Orthopedics, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China.
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changshan, 410008, Huna, China.
| |
Collapse
|
5
|
DiMattina C, Sterk EE, Arena MG, Monteferrante FE. Local cues enable classification of image patches as surfaces, object boundaries, or illumination changes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.26.640416. [PMID: 40060556 PMCID: PMC11888429 DOI: 10.1101/2025.02.26.640416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/15/2025]
Abstract
To correctly parse the visual scene, one must detect edges and determine their underlying cause. Previous work has demonstrated that image-computable neural networks trained to differentiate natural shadow and occlusion edges exhibited sensitivity to boundary sharpness and texture differences. Although these models showed a strong correlation with human performance on an edge classification task, this previous study did not directly investigate whether humans actually make use of boundary sharpness and texture cues when classifying edges as shadows or occlusions. Here we directly investigated this using synthetic image patch stimuli formed by quilting together two different natural textures, allowing us to parametrically manipulate boundary sharpness, texture modulation, and luminance modulation. In a series of initial "training" experiments, observers were trained to correctly identify the cause of natural image patches taken from one of three categories (occlusion, shadow, uniform texture). In a subsequent series of "test" experiments, these same observers then classified 5 sets of synthetic boundary images defined by varying boundary sharpness, luminance modulation, and texture modulation cues using a set of novel parametric stimuli. These three visual cues exhibited strong interactions to determine categorization probabilities. For sharp edges, increasing luminance modulation made it less likely the patch would be classified as a texture and more likely it would be classified as an occlusion, whereas for blurred edges, increasing luminance modulation made it more likely the patch would be classified as a shadow. Boundary sharpness had a profound effect, so that in the presence of luminance modulation increasing sharpness decreased the likelihood of classification as a shadow and increased the likelihood of classification as an occlusion. Texture modulation had little effect on categorization, except in the case of a sharp boundary with zero luminance modulation. Results were consistent across all 5 stimulus sets, showing these effects are not due to the idiosyncrasies of the particular texture pairs. Human performance was found to be well explained by a simple linear multinomial logistic regression model defined on luminance, texture and sharpness cues, with slightly improved performance for a more complicated nonlinear model taking multiplicative parameter combinations into account. Our results demonstrate that human observers make use of the same cues as our previous machine learning models when detecting edges and determining their cause, helping us to better understand the neural and perceptual mechanisms of scene parsing.
Collapse
Affiliation(s)
- Christopher DiMattina
- Computational Perception Laboratory, Department of Psychology, Florida Gulf Coast University, Fort Myers FL 33965
| | - Eden E Sterk
- Computational Perception Laboratory, Department of Psychology, Florida Gulf Coast University, Fort Myers FL 33965
- Florida Southwestern State College, Fort Myers FL, 33919
| | - Madelyn G Arena
- Computational Perception Laboratory, Department of Psychology, Florida Gulf Coast University, Fort Myers FL 33965
| | - Francesca E Monteferrante
- Computational Perception Laboratory, Department of Psychology, Florida Gulf Coast University, Fort Myers FL 33965
| |
Collapse
|
6
|
Li H, Xu K. Innovative adaptive edge detection for noisy images using wavelet and Gaussian method. Sci Rep 2025; 15:5838. [PMID: 39966553 PMCID: PMC11836388 DOI: 10.1038/s41598-025-86860-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Accepted: 01/14/2025] [Indexed: 02/20/2025] Open
Abstract
Edge detection is a crucial task in image processing and remote sensing, particularly for accurately identifying and separating shapes in noisy digital images. To enhance robustness and detail in edge detection, this study presents an innovative edge detection method, which integrates a denoising module and an adaptive thresholding technique to effectively address challenges associated with Gaussian noise in images. The proposed denoising module employs wavelet and Gaussian denoising functions to decompose, filter, and reconstruct the image, thereby reducing the impact of noise and enhancing image quality. For edge detection, an adaptive thresholding method based on a modified OTSU method is utilized. Comprehensive experiments validate the proposed framework by comparing detected edges against ground truth across various levels of Gaussian noise (0.1%, 10%, 20%, and 30%). The median thresholding function is chosen for its stability and convenience, while hard thresholding is avoided due to its tendency to introduce artifacts. Objective metrics, including Mean Squared Error (MSE), Accuracy, and Peak Signal-to-Noise Ratio (PSNR), are employed for evaluation. Comparative results indicate that the proposed method outperforms traditional methods, such as Canny and Roberts, showcasing its effectiveness in edge detection.
Collapse
Affiliation(s)
- Huanxu Li
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo, 454000, China
| | - Keke Xu
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo, 454000, China.
| |
Collapse
|
7
|
Yue P, Xia X, Hu Y, Wang X, He P, Qin X. Unmanned roller lateral positioning method for asphalt road construction. Sci Rep 2025; 15:418. [PMID: 39747545 PMCID: PMC11696228 DOI: 10.1038/s41598-024-84575-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2024] [Accepted: 12/24/2024] [Indexed: 01/04/2025] Open
Abstract
Unmanned rollers are typically equipped with satellite-based positioning systems for positional monitoring. However, satellite-based positioning systems may result in unmanned rollers driving out of the specified compaction areas during asphalt road construction, which affects the compaction quality and has potential safety hazards. Additionally, satellite-based positioning systems may encounter signal interference and cannot locate unmanned rollers. To solve this problem, a lateral positioning method for unmanned rollers is proposed to realize the positioning of unmanned rollers relative to asphalt road. First, we captured images from different perspectives and developed a dataset for asphalt road construction. Second, a method for boundary extraction of asphalt road is proposed to accurately locate pixels of asphalt road boundary. Subsequently, the lateral distances are measured by the designed lateral positioning methods. Finally, field validation experiments are conducted to evaluate the effectiveness of the proposed lateral positioning method. The results indicate that the method excels in extracting the asphalt road boundary. Furthermore, the proposed lateral positioning method shows excellent performance, with a mean relative error of 3.40% and a frequency of 6.25 Hz. The proposed lateral positioning method meets the performance requirements for lateral positioning in both accuracy and real-time in asphalt road construction for unmanned rollers.
Collapse
Affiliation(s)
- Pengju Yue
- Key Laboratory of Road Construction Technology and Equipment of MOE, Chang'an University, Xi'an, 710064, China
| | - Xiaohua Xia
- Key Laboratory of Road Construction Technology and Equipment of MOE, Chang'an University, Xi'an, 710064, China.
| | - Yongbiao Hu
- Key Laboratory of Road Construction Technology and Equipment of MOE, Chang'an University, Xi'an, 710064, China
| | - Xuebin Wang
- Key Laboratory of Road Construction Technology and Equipment of MOE, Chang'an University, Xi'an, 710064, China.
| | - Pengcheng He
- Key Laboratory of Road Construction Technology and Equipment of MOE, Chang'an University, Xi'an, 710064, China
| | - Xufang Qin
- Key Laboratory of Road Construction Technology and Equipment of MOE, Chang'an University, Xi'an, 710064, China
| |
Collapse
|
8
|
Zhao Y, Wang Z, Li H, Wang C, Zhang J, Zhu J, Liu X. Material Visual Perception and Discharging Robot Control for Baijiu Fermented Grains in Underground Tank. SENSORS (BASEL, SWITZERLAND) 2024; 24:8215. [PMID: 39771949 PMCID: PMC11679939 DOI: 10.3390/s24248215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2024] [Revised: 12/15/2024] [Accepted: 12/20/2024] [Indexed: 01/11/2025]
Abstract
Addressing the issue of excessive manual intervention in discharging fermented grains from underground tanks in traditional brewing technology, this paper proposes an intelligent grains-out strategy based on a multi-degree-of-freedom hybrid robot. The robot's structure and control system are introduced, along with analyses of kinematics solutions for its parallel components and end-effector speeds. According to its structural characteristics and working conditions, a visual-perception-based motion control method of discharging fermented grains is determined. The enhanced perception of underground tanks' positions is achieved through improved Canny edge detection algorithms, and a YOLO-v7 neural network is employed to train an image segmentation model for fermented grains' surface, integrating depth information to synthesize point clouds. We then carry out the downsampling and three-dimensional reconstruction of these point clouds, then match the underground tank model with the fermented grain surface model to replicate the tank's interior space. Finally, a digging motion control method is proposed and experimentally validated for feasibility and operational efficiency.
Collapse
Affiliation(s)
- Yan Zhao
- School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083, China; (Y.Z.); (J.Z.); (J.Z.)
| | - Zhongxun Wang
- School of Mechanical Engineering, Hebei University of Technology, Tianjin 300401, China; (Z.W.)
| | - Hui Li
- School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083, China; (Y.Z.); (J.Z.); (J.Z.)
| | - Chang Wang
- School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083, China; (Y.Z.); (J.Z.); (J.Z.)
| | - Jianhua Zhang
- School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083, China; (Y.Z.); (J.Z.); (J.Z.)
| | - Jingyuan Zhu
- School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083, China; (Y.Z.); (J.Z.); (J.Z.)
| | - Xuan Liu
- School of Mechanical Engineering, Hebei University of Technology, Tianjin 300401, China; (Z.W.)
| |
Collapse
|
9
|
Wang J, Lu J, Yang J, Wang M, Zhang W. An Unbiased Feature Estimation Network for Few-Shot Fine-Grained Image Classification. SENSORS (BASEL, SWITZERLAND) 2024; 24:7737. [PMID: 39686274 DOI: 10.3390/s24237737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Revised: 11/22/2024] [Accepted: 12/02/2024] [Indexed: 12/18/2024]
Abstract
Few-shot fine-grained image classification (FSFGIC) aims to classify subspecies with similar appearances under conditions of very limited data. In this paper, we observe an interesting phenomenon: different types of image data augmentation techniques have varying effects on the performance of FSFGIC methods. This indicates that there may be biases in the features extracted from the input images. The bias of the acquired feature may cause deviation in the calculation of similarity, which is particularly detrimental to FSFGIC tasks characterized by low inter-class variation and high intra-class variation, thus affecting the classification accuracy. To address the problems mentioned, we propose an unbiased feature estimation network. The designed network has the capability to significantly optimize the quality of the obtained feature representations and effectively reduce the feature bias from input images. Furthermore, our proposed architecture can be easily integrated into any contextual training mechanism. Extensive experiments on the FSFGIC tasks demonstrate the effectiveness of the proposed algorithm, showing a notable improvement in classification accuracy.
Collapse
Affiliation(s)
- Jiale Wang
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710000, China
| | - Jin Lu
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710000, China
| | - Junpo Yang
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710000, China
| | - Meijia Wang
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710000, China
| | - Weichuan Zhang
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710000, China
| |
Collapse
|
10
|
Wang W, Du W, Song X, Chen S, Zhou H, Zhang H, Zou Y, Zhu J, Cheng C. DRA-UNet for Coal Mining Ground Surface Crack Delineation with UAV High-Resolution Images. SENSORS (BASEL, SWITZERLAND) 2024; 24:5760. [PMID: 39275672 PMCID: PMC11397846 DOI: 10.3390/s24175760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Revised: 09/02/2024] [Accepted: 09/04/2024] [Indexed: 09/16/2024]
Abstract
Coal mining in the Loess Plateau can very easily generate ground cracks, and these cracks can immediately result in ventilation trouble under the mine shaft, runoff disturbance, and vegetation destruction. Advanced UAV (Unmanned Aerial Vehicle) high-resolution mapping and DL (Deep Learning) are introduced as the key methods to quickly delineate coal mining ground surface cracks for disaster prevention. Firstly, the dataset named the Ground Cracks of Coal Mining Area Unmanned Aerial Vehicle (GCCMA-UAV) is built, with a ground resolution of 3 cm, which is suitable to make a 1:500 thematic map of the ground crack. This GCCMA-UAV dataset includes 6280 images of ground cracks, and the size of the imagery is 256 × 256 pixels. Secondly, the DRA-UNet model is built effectively for coal mining ground surface crack delineation. This DRA-UNet model is an improved UNet DL model, which mainly includes the DAM (Dual Dttention Dechanism) module, the RN (residual network) module, and the ASPP (Atrous Spatial Pyramid Pooling) module. The DRA-UNet model shows the highest recall rate of 77.29% when the DRA-UNet was compared with other similar DL models, such as DeepLabV3+, SegNet, PSPNet, and so on. DRA-UNet also has other relatively reliable indicators; the precision rate is 84.92% and the F1 score is 78.87%. Finally, DRA-UNet is applied to delineate cracks on a DOM (Digital Orthophoto Map) of 3 km2 in the mining workface area, with a ground resolution of 3 cm. There were 4903 cracks that were delineated from the DOM in the Huojitu Coal Mine Shaft. This DRA-UNet model effectively improves the efficiency of crack delineation.
Collapse
Affiliation(s)
- Wei Wang
- Shendong Coal Branch, China Shenhua Energy Co., Ltd., Yulin 719000, China
| | - Weibing Du
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
| | - Xiangyang Song
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
| | - Sushe Chen
- Shendong Coal Branch, China Shenhua Energy Co., Ltd., Yulin 719000, China
| | - Haifeng Zhou
- Shendong Coal Branch, China Shenhua Energy Co., Ltd., Yulin 719000, China
| | - Hebing Zhang
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
| | - Youfeng Zou
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
| | - Junlin Zhu
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
| | - Chaoying Cheng
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
| |
Collapse
|
11
|
Fu X, Tang L, Bai Y. Image reconstruction in graphic design based on Global residual Network optimized compressed sensing model. PeerJ Comput Sci 2024; 10:e2227. [PMID: 39678295 PMCID: PMC11639127 DOI: 10.7717/peerj-cs.2227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Accepted: 07/10/2024] [Indexed: 12/17/2024]
Abstract
The article aims to address the challenges of information degradation and distortion in graphic design, focusing on optimizing the traditional compressed sensing (CS) model. This optimization involves creating a co-reconstruction group derived from compressed observations of local image blocks. Following an initial reconstruction of compressed observations within similar groups, an initially reconstructed image block co-reconstruction group is obtained, featuring degraded reconstructed images. These images undergo channel stitching and are input into a global residual network. This network is composed of a non-local feature adaptive interaction module stacked with the aim of fusion to enhance local feature reconstruction. Results indicate that the solution space constraint for reconstructed images is achieved at a low sampling rate. Moreover, high-frequency information within the images is effectively reconstructed, improving image reconstruction accuracy.
Collapse
Affiliation(s)
- Xinxin Fu
- Department of Integrated Industrial Design, Hanseo University, Seosan, Republic of South Korea
| | - Lujing Tang
- Department of Integrated Industrial Design, Hanseo University, Seosan, Republic of South Korea
| | - Yingjie Bai
- School of Design, Guangxi Normal University, Guilin, China
| |
Collapse
|
12
|
Gao M, Shu F, Zhou W, Li H, Wu Y, Wang Y, Zhao S, Song Z. A Rapid Nanofocusing Method for a Deep-Sea Gene Sequencing Microscope Based on Critical Illumination. SENSORS (BASEL, SWITZERLAND) 2024; 24:5010. [PMID: 39124058 PMCID: PMC11314998 DOI: 10.3390/s24155010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 07/31/2024] [Accepted: 08/01/2024] [Indexed: 08/12/2024]
Abstract
In the deep-sea environment, the volume available for an in-situ gene sequencer is severely limited. In addition, optical imaging systems are subject to real-time, large-scale defocusing problems caused by ambient temperature fluctuations and vibrational perturbations. To address these challenges, we propose an edge detection algorithm for defocused images based on grayscale gradients and establish a defocus state detection model with nanometer resolution capabilities by relying on the inherent critical illumination light field. The model has been applied to a prototype deep-sea gene sequencing microscope with a 20× objective. It has demonstrated the ability to focus within a dynamic range of ±40 μm with an accuracy of 200 nm by a single iteration within 160 ms. By increasing the number of iterations and exposures, the focusing accuracy can be refined to 78 nm within a dynamic range of ±100 μm within 1.2 s. Notably, unlike conventional photoelectric hill-climbing, this method requires no additional hardware and meets the wide dynamic range, speed, and high-accuracy autofocusing requirements of deep-sea gene sequencing in a compact form factor.
Collapse
Affiliation(s)
- Ming Gao
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| | - Fengfeng Shu
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| | - Wenchao Zhou
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| | - Huan Li
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| | - Yihui Wu
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| | - Yue Wang
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| | - Shixun Zhao
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| | - Zihan Song
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- State Key Laboratory of Applied Optics, Changchun 130033, China
- Key Laboratory of Optical System Advanced Manufacturing Technology, Chinese Academy of Sciences, Changchun 130033, China
| |
Collapse
|
13
|
Li C, Narayanan A, Ghobakhlou A. Overlapping Shoeprint Detection by Edge Detection and Deep Learning. J Imaging 2024; 10:186. [PMID: 39194975 DOI: 10.3390/jimaging10080186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/04/2024] [Accepted: 07/30/2024] [Indexed: 08/29/2024] Open
Abstract
In the field of 2-D image processing and computer vision, accurately detecting and segmenting objects in scenarios where they overlap or are obscured remains a challenge. This difficulty is worse in the analysis of shoeprints used in forensic investigations because they are embedded in noisy environments such as the ground and can be indistinct. Traditional convolutional neural networks (CNNs), despite their success in various image analysis tasks, struggle with accurately delineating overlapping objects due to the complexity of segmenting intertwined textures and boundaries against a background of noise. This study introduces and employs the YOLO (You Only Look Once) model enhanced by edge detection and image segmentation techniques to improve the detection of overlapping shoeprints. By focusing on the critical boundary information between shoeprint textures and the ground, our method demonstrates improvements in sensitivity and precision, achieving confidence levels above 85% for minimally overlapped images and maintaining above 70% for extensively overlapped instances. Heatmaps of convolution layers were generated to show how the network converges towards successful detection using these enhancements. This research may provide a potential methodology for addressing the broader challenge of detecting multiple overlapping objects against noisy backgrounds.
Collapse
Affiliation(s)
- Chengran Li
- School of Engineering, Computer and Mathematical Sciences, Auckland University of Technology, Auckland 1010, New Zealand
| | - Ajit Narayanan
- School of Engineering, Computer and Mathematical Sciences, Auckland University of Technology, Auckland 1010, New Zealand
| | - Akbar Ghobakhlou
- School of Engineering, Computer and Mathematical Sciences, Auckland University of Technology, Auckland 1010, New Zealand
| |
Collapse
|
14
|
Nithiyanandham E, Srutha Keerthi B. Image edge detection enhancement using coefficients of Sakaguchi type functions mapped onto petal shaped domain. Heliyon 2024; 10:e31430. [PMID: 38826709 PMCID: PMC11141366 DOI: 10.1016/j.heliyon.2024.e31430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 05/15/2024] [Indexed: 06/04/2024] Open
Abstract
This research introduces a new approach to elevate the precision of image edge detection through a new algorithm rooted in the coefficients derived from the subclass SC t , ρ (CSKP model). Our method employs convolution operations on input image pixels, utilizing the CSKP mask window in eight distinct directions, fostering a comprehensive and multi-directional analysis of edge features. To gauge the efficacy of our algorithm, image quality is assessed through perceptually significant metrics, including contrast, correlation, energy, homogeneity, and entropy. The study aims to contribute a valuable tool for diverse applications such as computer vision and medical imaging by presenting a robust and innovative solution to enhance image edge detection. The results demonstrate notable improvements, affirming the potential of the proposed algorithm to advance the current state-of-the-art in image processing.
Collapse
Affiliation(s)
- E.K. Nithiyanandham
- Department of Mathematics, School of Advanced Sciences, Vellore Institute of Technology Chennai Campus, Chennai - 600 127, India
| | - B. Srutha Keerthi
- Department of Mathematics, School of Advanced Sciences, Vellore Institute of Technology Chennai Campus, Chennai - 600 127, India
| |
Collapse
|
15
|
Wang J. Optimizing support vector machine (SVM) by social spider optimization (SSO) for edge detection in colored images. Sci Rep 2024; 14:9136. [PMID: 38644440 PMCID: PMC11033277 DOI: 10.1038/s41598-024-59811-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 04/15/2024] [Indexed: 04/23/2024] Open
Abstract
Edge detection in images is a vital application of image processing in fields such as object detection and identification of lesion regions in medical images. This problem is more complex in the domain of color images due to the combination of color layer information and the need to achieve a unified edge boundary across these layers, which increases the complexity of the problem. In this paper, a simple and effective method for edge detection in color images is proposed using a combination of support vector machine (SVM) and the social spider optimization (SSO) algorithm. In the proposed method, the input color image is first converted to a grayscale image, and an initial estimation of the image edges is performed based on it. To this end, the proposed method utilizes an SVM with a Radial Basis Function (RBF) kernel, in which the model's hyperparameters are tuned using the SSO algorithm. After the formation of initial image edges, the resulting edges are compared with pairwise combinations of color layers, and an attempt is made to improve the edge localization using the SSO algorithm. In this step, the optimization algorithm's task is to refine the image edges in a way that maximizes the compatibility with pairwise combinations of color layers. This process leads to the formation of prominent image edges and reduces the adverse effects of noise on the final result. The performance of the proposed method in edge detection of various color images has been evaluated and compared with similar previous strategies. According to the obtained results, the proposed method can successfully identify image edges more accurately, as the edges identified by the proposed method have an average accuracy of 93.11% for the BSDS500 database, which is an increase of at least 0.74% compared to other methods.
Collapse
Affiliation(s)
- Jianfei Wang
- Suzhou Chien-Shiung Institute of Technology, Taicang, 215411, China.
| |
Collapse
|
16
|
Lee Y, Yun J, Lee S, Lee C. Image Data-Centric Visual Feature Selection on Roll-to-Roll Slot-Die Coating Systems for Edge Wave Coating Defect Detection. Polymers (Basel) 2024; 16:1156. [PMID: 38675075 PMCID: PMC11054432 DOI: 10.3390/polym16081156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/28/2024] Open
Abstract
Roll-to-roll (R2R) manufacturing depends on a system's capability to deposit high-quality coatings with precise thickness, width, and uniformity. Therefore, consistent maintenance requires the immediate and accurate detection of coating defects. This study proposes a primary color selection (PCS) method to detect edge defects in R2R systems. This method addresses challenges associated with training data demands, complexity, and defect adaptability through a vision data-centric approach, ensuring precise edge coating defect detection. Using color information, high accuracy was achieved while minimizing data capacity requirements and processing time. Precise edge detection was facilitated by accurately distinguishing coated and noncoated regions by selecting the primary color channel based on color variability. The PCS method achieved superior accuracy (95.8%), outperforming the traditional weighted sum method (78.3%). This method is suitable for real-time detection in manufacturing systems and mitigates edge coating defects, thus facilitating quality control and production optimization.
Collapse
Affiliation(s)
- Yoonjae Lee
- Department of Mechanical Design and Production Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05029, Republic of Korea; (Y.L.); (J.Y.); (S.L.)
| | - Junyoung Yun
- Department of Mechanical Design and Production Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05029, Republic of Korea; (Y.L.); (J.Y.); (S.L.)
| | - Sangbin Lee
- Department of Mechanical Design and Production Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05029, Republic of Korea; (Y.L.); (J.Y.); (S.L.)
| | - Changwoo Lee
- Department of Mechanical Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05029, Republic of Korea
| |
Collapse
|
17
|
Li M, Liu Y, Chen D, Chen L, Liu S. Transformer with difference convolutional network for lightweight universal boundary detection. PLoS One 2024; 19:e0302275. [PMID: 38626177 PMCID: PMC11020957 DOI: 10.1371/journal.pone.0302275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 03/29/2024] [Indexed: 04/18/2024] Open
Abstract
Although deep-learning methods can achieve human-level performance in boundary detection, their improvements mostly rely on larger models and specific datasets, leading to significant computational power consumption. As a fundamental low-level vision task, a single model with fewer parameters to achieve cross-dataset boundary detection merits further investigation. In this study, a lightweight universal boundary detection method was developed based on convolution and a transformer. The network is called a "transformer with difference convolutional network" (TDCN), which implies the introduction of a difference convolutional network rather than a pure transformer. The TDCN structure consists of three parts: convolution, transformer, and head function. First, a convolution network fused with edge operators is used to extract multiscale difference features. These pixel difference features are then fed to the hierarchical transformer as tokens. Considering the intrinsic characteristics of the boundary detection task, a new boundary-aware self-attention structure was designed in the transformer to provide inductive bias. By incorporating the proposed attention loss function, it introduces the direction of the boundary as strongly supervised information to improve the detection ability of the model. Finally, several head functions with multiscale feature inputs were trained using a bidirectional additive strategy. In the experiments, the proposed method achieved competitive performance on multiple public datasets with fewer model parameters. A single model was obtained to realize universal prediction even for different datasets without retraining, demonstrating the effectiveness of the method. The code is available at https://github.com/neulmc/TDCN.
Collapse
Affiliation(s)
- Mingchun Li
- College of Information Engineering, Shenyang University, Shenyang, China
| | - Yang Liu
- College of Information Engineering, Shenyang University, Shenyang, China
| | - Dali Chen
- College of Information Science and Engineering, Northeastern University, Shenyang, China
| | - Liangsheng Chen
- College of Information Engineering, Shenyang University, Shenyang, China
| | - Shixin Liu
- College of Information Science and Engineering, Northeastern University, Shenyang, China
| |
Collapse
|
18
|
Shi T, Zhang H, Cui S, Liu J, Gu Z, Wang Z, Yan X, Liu Q. Stochastic neuro-fuzzy system implemented in memristor crossbar arrays. SCIENCE ADVANCES 2024; 10:eadl3135. [PMID: 38517972 PMCID: PMC10959402 DOI: 10.1126/sciadv.adl3135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 02/16/2024] [Indexed: 03/24/2024]
Abstract
Neuro-symbolic artificial intelligence has garnered considerable attention amid increasing industry demands for high-performance neural networks that are interpretable and adaptable to previously unknown problem domains with minimal reconfiguration. However, implementing neuro-symbolic hardware is challenging due to the complexity in symbolic knowledge representation and calculation. We experimentally demonstrated a memristor-based neuro-fuzzy hardware based on TiN/TaOx/HfOx/TiN chips that is superior to its silicon-based counterpart in terms of throughput and energy efficiency by using array topological structure for knowledge representation and physical laws for computing. Intrinsic memristor variability is fully exploited to increase robustness in knowledge representation. A hybrid in situ training strategy is proposed for error minimizing in training. The hardware adapts easier to a previously unknown environment, achieving ~6.6 times faster convergence and ~6 times lower error than deep learning. The hardware energy efficiency is over two orders of magnitude greater than field-programmable gate arrays. This research greatly extends the capability of memristor-based neuromorphic computing systems in artificial intelligence.
Collapse
Affiliation(s)
- Tuo Shi
- State Key Lab of Fabrication Technologies for Integrated Circuits, Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, China
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Hui Zhang
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Shiyu Cui
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Jinchang Liu
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Zixi Gu
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Zhanfeng Wang
- Key Laboratory of Brain-like Neuromorphic Devices and Systems of Hebei Province, Hebei University, Baoding 071002, P. R. China
| | - Xiaobing Yan
- Key Laboratory of Brain-like Neuromorphic Devices and Systems of Hebei Province, Hebei University, Baoding 071002, P. R. China
| | - Qi Liu
- Frontier Institute of Chip and System, Fudan University, Shanghai 200433, China
| |
Collapse
|
19
|
Lin EY, Chen JC, Lien JJJ. Intelligent Tapping Machine: Tap Geometry Inspection. SENSORS (BASEL, SWITZERLAND) 2023; 23:8005. [PMID: 37766059 PMCID: PMC10537247 DOI: 10.3390/s23188005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 09/17/2023] [Accepted: 09/17/2023] [Indexed: 09/29/2023]
Abstract
Currently, the majority of industrial metal processing involves the use of taps for cutting. However, existing tap machines require relocation to specialized inspection stations and only assess the condition of the cutting edges for defects. They do not evaluate the quality of the cutting angles and the amount of removed material. Machine vision, a key component of smart manufacturing, is commonly used for visual inspection. Taps are employed for processing various materials. Traditional tap replacement relies on the technician's accumulated empirical experience to determine the service life of the tap. Therefore, we propose the use of visual inspection of the tap's external features to determine whether replacement or regrinding is needed. We examined the bearing surface of the tap and utilized single images to identify the cutting angle, clearance angle, and cone angles. By inspecting the side of the tap, we calculated the wear of each cusp. This inspection process can facilitate the development of a tap life system, allowing for the estimation of the durability and wear of taps and nuts made of different materials. Statistical analysis can be employed to predict the lifespan of taps in production lines. Experimental error is 16 μm. Wear from tapping 60 times is equivalent to 8 s of electric grinding. We have introduced a parameter, thread removal quantity, which has not been proposed by anyone else.
Collapse
Affiliation(s)
- En-Yu Lin
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan
| | - Ju-Chin Chen
- Department of Computer Science and Information Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 807, Taiwan
| | - Jenn-Jier James Lien
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 701, Taiwan
| |
Collapse
|
20
|
Xu B, Sun Y, Li J, Deng Z, Li H, Zhang B, Liu K. Accurate Detection for Zirconium Sheet Surface Scratches Based on Visible Light Images. SENSORS (BASEL, SWITZERLAND) 2023; 23:7291. [PMID: 37631827 PMCID: PMC10458122 DOI: 10.3390/s23167291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 08/06/2023] [Accepted: 08/17/2023] [Indexed: 08/27/2023]
Abstract
Zirconium sheet has been widely used in various fields, e.g., chemistry and aerospace. The surface scratches on the zirconium sheets caused by complex processing environment have a negative impact on the performance, e.g., working life and fatigue fracture resistance. Therefore, it is necessary to detect the defect of zirconium sheets. However, it is difficult to detect such scratch images due to lots of scattered additive noise and complex interlaced structural texture. Hence, we propose a framework for adaptively detecting scratches on the surface images of zirconium sheets, including noise removing and texture suppressing. First, the noise removal algorithm, i.e., an optimized threshold function based on dual-tree complex wavelet transform, uses selected parameters to remove scattered and numerous noise. Second, the texture suppression algorithm, i.e., an optimized relative total variation enhancement model, employs selected parameters to suppress interlaced texture. Finally, by connecting disconnection based on two types of connection algorithms and replacing the Gaussian filter in the standard Canny edge detection algorithm with our proposed framework, we can more robustly detect the scratches. The experimental results show that the proposed framework is of higher accuracy.
Collapse
Affiliation(s)
- Bin Xu
- School of Mechanical Engineering, Sichuan University, Chengdu 610065, China; (B.X.); (Y.S.); (J.L.)
| | - Yuanhaoji Sun
- School of Mechanical Engineering, Sichuan University, Chengdu 610065, China; (B.X.); (Y.S.); (J.L.)
| | - Jinhua Li
- School of Mechanical Engineering, Sichuan University, Chengdu 610065, China; (B.X.); (Y.S.); (J.L.)
| | - Zhiyong Deng
- Nuclear Fuel and Material Institute, Nuclear Power Institute of China, Chengdu 610213, China; (Z.D.); (H.L.)
| | - Hongyu Li
- Nuclear Fuel and Material Institute, Nuclear Power Institute of China, Chengdu 610213, China; (Z.D.); (H.L.)
| | - Bo Zhang
- School of Mechanical Engineering, Sichuan University, Chengdu 610065, China; (B.X.); (Y.S.); (J.L.)
| | - Kai Liu
- College of Electrical Engineering, Sichuan University, Chengdu 610065, China;
| |
Collapse
|
21
|
Li M, Chen D, Liu S. Beta network for boundary detection under nondeterministic labels. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
|
22
|
Lu J, Zhang W, Zhao Y, Sun C. Image local structure information learning for fine-grained visual classification. Sci Rep 2022; 12:19205. [PMID: 36357665 PMCID: PMC9649701 DOI: 10.1038/s41598-022-23835-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 11/07/2022] [Indexed: 11/12/2022] Open
Abstract
Learning discriminative visual patterns from image local salient regions is widely used for fine-grained visual classification (FGVC) tasks such as plant or animal species classification. A large number of complex networks have been designed for learning discriminative feature representations. In this paper, we propose a novel local structure information (LSI) learning method for FGVC. Firstly, we indicate that the existing FGVC methods have not properly considered how to extract LSI from an input image for FGVC. Then an LSI extraction technique is introduced which has the ability to properly depict the properties of different local structure features in images. Secondly, a novel LSI learning module is proposed to be added into a given backbone network for enhancing the ability of the network to find salient regions. Thirdly, extensive experiments show that our proposed method achieves better performance on six image datasets. Particularly, the proposed method performs far better on datasets with a limited number of images.
Collapse
Affiliation(s)
- Jin Lu
- grid.454711.20000 0001 1942 5509School of Electronic Information and Artificial Intelligence, Shaanxi University of Science & Technology, Xi’an, 710021 China
| | - Weichuan Zhang
- grid.1022.10000 0004 0437 5432The Institute for Integrated and Intelligent Systems, Griffith University, Brisbane, QLD Australia
| | - Yali Zhao
- grid.464495.e0000 0000 9192 5439School of Electronics and Information, Xi’an Polytechnic University, Xi’an, 710000 China
| | | |
Collapse
|