1
|
Suh S. Optimal surface defect detector design based on deep learning for 3D geometry. Sci Rep 2025; 15:5527. [PMID: 39952973 PMCID: PMC11829014 DOI: 10.1038/s41598-025-88112-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 01/24/2025] [Indexed: 02/17/2025] Open
Abstract
Steel-manufacturing sites are extremely harsh and dangerous environments. Visibility is reduced by dust, water vapor, oil, and low light conditions. Additionally, the steel products from hot furnaces have surface temperatures ranging from 1200 to 800 degrees celsius and weigh several to tens of tons. It is extremely dangerous for humans to visually inspect these steel products in such environments. Therefore, the use of automatic inspection equipment for steel surfaces is essential. Initially, image-processing methods were used, but with recent advances in deep learning, deep-learning-based methods are also being applied in this field. However, the method currently widely used involves utilizing existing models through transfer learning, which inevitably causes curvature of the input image data and limits performance. Furthermore, previous studies have focused on 2D sheet metal products, and there has been no research on three-dimensional geometric products. In this study, we propose dataset generation through geometric transformations that parameterize the structure of the steel surface defect detector hardware, along with a performance-based model optimization algorithm. In validation experiments, an average F1 score of 0.932 and an average area under curve of 0.99 were obtained, implying that the proposed algorithm has near-ideal performance.
Collapse
Affiliation(s)
- Sangmin Suh
- Department of Information and Telecommunication Engineering, Gangneung-Wonju National University, Wonju-si, Gangwon-do, 26403, Republic of Korea.
- Scalable-ai, Wonju-si, Gangwon-do, 26403, Republic of Korea.
| |
Collapse
|
2
|
Belila D, Khaldi B, Aiadi O. Wavelet Texture Descriptor for Steel Surface Defect Classification. MATERIALS (BASEL, SWITZERLAND) 2024; 17:5873. [PMID: 39685309 DOI: 10.3390/ma17235873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2024] [Revised: 11/21/2024] [Accepted: 11/26/2024] [Indexed: 12/18/2024]
Abstract
The accurate and efficient classification of steel surface defects is critical for ensuring product quality and minimizing production costs. This paper proposes a novel method based on wavelet transform and texture descriptors for the robust and precise classification of steel surface defects. By leveraging the multiscale analysis capabilities of wavelet transforms, our method extracts both broad and fine-grained textural features. It involves decomposing images using multi-level wavelet transforms, extracting a series set of statistical and textural features from the resulting coefficients, and employing Recursive Feature Elimination (RFE) to select the most discriminative features. A comprehensive series of experiments was conducted on two datasets, NEU-CLS and X-SDD, to evaluate the proposed method. The results highlight the effectiveness of the method in accurately classifying steel surface defects, outperforming the state-of-the-art techniques. Our method achieved an accuracy of 99.67% for the NEU-CLS dataset and 98.24% for the X-SDD dataset. Furthermore, we demonstrate the robustness of our method in scenarios with limited data, maintaining high accuracy, making it well-suited for practical industrial applications where obtaining large datasets can be challenging.
Collapse
Affiliation(s)
- Djilani Belila
- Department of Computer Science and Information Technologies, University of Kasdi Merbah, Ouargla 30000, Algeria
- Laboratoire d'Intelligence Artificiel et des Technologies de l'Information (LINATI), Ouargla 30000, Algeria
| | - Belal Khaldi
- Department of Computer Science and Information Technologies, University of Kasdi Merbah, Ouargla 30000, Algeria
- Laboratoire d'Intelligence Artificiel et des Technologies de l'Information (LINATI), Ouargla 30000, Algeria
| | - Oussama Aiadi
- Department of Computer Science and Information Technologies, University of Kasdi Merbah, Ouargla 30000, Algeria
- Laboratoire d'Intelligence Artificiel et des Technologies de l'Information (LINATI), Ouargla 30000, Algeria
| |
Collapse
|
3
|
Wu J, Zhang X. Tunnel Crack Detection Method and Crack Image Processing Algorithm Based on Improved Retinex and Deep Learning. SENSORS (BASEL, SWITZERLAND) 2023; 23:9140. [PMID: 38005528 PMCID: PMC10674256 DOI: 10.3390/s23229140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 11/07/2023] [Accepted: 11/11/2023] [Indexed: 11/26/2023]
Abstract
Tunnel cracks are the main factors that cause damage and collapse of tunnel structures. How to detect tunnel cracks efficiently and avoid safety accidents caused by tunnel cracks effectively is a research hotspot at present. In order to meet the need for efficient detection of tunnel cracks, the tunnel crack detection method based on improved Retinex and deep learning is proposed in this paper. The tunnel crack images collected by optical imaging equipment are used to improve the contrast information of tunnel crack images using the image enhancement algorithm, and this image enhancement algorithm has the function of multi-scale Retinex decomposition with improved central filtering. An improved VGG19 network model is constructed to achieve efficient segmentation of tunnel crack images through deep learning methods and then form the segmented binary image. The Zhang-Suen fast parallel-thinning method is used to obtain the skeleton map of the single-layer pixel, and the length and width information of the tunnel cracks are obtained. The feasibility and effectiveness of the proposed method are verified by experiments. Compared with other methods in the literature, the maximum deviation in the length of the tunnel crack is about 5 mm, and the maximum deviation in the width of the tunnel crack is about 0.8 mm. The experimental results show that the proposed method has a shorter detection time and higher detection accuracy. The research results of this paper can provide a strong basis for the health evaluation of tunnels.
Collapse
Affiliation(s)
- Jie Wu
- School of Defense, Xi’an Technological University, Xi’an 710021, China
| | - Xiaoqian Zhang
- School of Electronic and Information Engineering, Xi’an Technological University, Xi’an 710021, China;
| |
Collapse
|
4
|
Yi C, Chen Q, Xu B, Huang T. Steel Strip Defect Sample Generation Method Based on Fusible Feature GAN Model under Few Samples. SENSORS (BASEL, SWITZERLAND) 2023; 23:3216. [PMID: 36991931 PMCID: PMC10054326 DOI: 10.3390/s23063216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 03/07/2023] [Accepted: 03/14/2023] [Indexed: 06/19/2023]
Abstract
Due to the shortage of defect samples and the high cost of labelling during the process of hot-rolled strip production in the metallurgical industry, it is difficult to obtain a large quantity of defect data with diversity, which seriously affects the identification accuracy of different types of defects on the steel surface. To address the problem of insufficient defect sample data in the task of strip steel defect identification and classification, this paper proposes the Strip Steel Surface Defect-ConSinGAN (SDE-ConSinGAN) model for strip steel defect identification which is based on a single-image model trained by the generative adversarial network (GAN) and which builds a framework of image-feature cutting and splicing. The model aims to reduce training time by dynamically adjusting the number of iterations for different training stages. The detailed defect features of training samples are highlighted by introducing a new size-adjustment function and increasing the channel attention mechanism. In addition, real image features will be cut and synthesized to obtain new images with multiple defect features for training. The emergence of new images is able to richen generated samples. Eventually, the generated simulated samples can be directly used in deep-learning-based automatic classification of surface defects in cold-rolled thin strips. The experimental results show that, when SDE-ConSinGAN is used to enrich the image dataset, the generated defect images have higher quality and more diversity than the current methods do.
Collapse
Affiliation(s)
- Cancan Yi
- Key Laboratory of Metallurgical Equipment and Control Technology (Wuhan University of Science and Technology), Ministry of Education, Wuhan 430081, China
- Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering (Wuhan University of Science and Technology), Wuhan 430081, China
- Precision Manufacturing Institute (Wuhan University of Science and Technology), Wuhan 430081, China
| | - Qirui Chen
- Key Laboratory of Metallurgical Equipment and Control Technology (Wuhan University of Science and Technology), Ministry of Education, Wuhan 430081, China
- Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering (Wuhan University of Science and Technology), Wuhan 430081, China
- Precision Manufacturing Institute (Wuhan University of Science and Technology), Wuhan 430081, China
| | - Biao Xu
- Key Laboratory of Metallurgical Equipment and Control Technology (Wuhan University of Science and Technology), Ministry of Education, Wuhan 430081, China
- Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering (Wuhan University of Science and Technology), Wuhan 430081, China
- Precision Manufacturing Institute (Wuhan University of Science and Technology), Wuhan 430081, China
| | - Tao Huang
- Key Laboratory of Metallurgical Equipment and Control Technology (Wuhan University of Science and Technology), Ministry of Education, Wuhan 430081, China
- Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering (Wuhan University of Science and Technology), Wuhan 430081, China
- Precision Manufacturing Institute (Wuhan University of Science and Technology), Wuhan 430081, China
| |
Collapse
|
5
|
Das HS, Das A, Neog A, Mallik S, Bora K, Zhao Z. Breast cancer detection: Shallow convolutional neural network against deep convolutional neural networks based approach. Front Genet 2023; 13:1097207. [PMID: 36685963 PMCID: PMC9846574 DOI: 10.3389/fgene.2022.1097207] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Accepted: 12/15/2022] [Indexed: 01/06/2023] Open
Abstract
Introduction: Of all the cancers that afflict women, breast cancer (BC) has the second-highest mortality rate, and it is also believed to be the primary cause of the high death rate. Breast cancer is the most common cancer that affects women globally. There are two types of breast tumors: benign (less harmful and unlikely to become breast cancer) and malignant (which are very dangerous and might result in aberrant cells that could result in cancer). Methods: To find breast abnormalities like masses and micro-calcifications, competent and educated radiologists often examine mammographic images. This study focuses on computer-aided diagnosis to help radiologists make more precise diagnoses of breast cancer. This study aims to compare and examine the performance of the proposed shallow convolutional neural network architecture having different specifications against pre-trained deep convolutional neural network architectures trained on mammography images. Mammogram images are pre-processed in this study's initial attempt to carry out the automatic identification of BC. Thereafter, three different types of shallow convolutional neural networks with representational differences are then fed with the resulting data. In the second method, transfer learning via fine-tuning is used to feed the same collection of images into pre-trained convolutional neural networks VGG19, ResNet50, MobileNet-v2, Inception-v3, Xception, and Inception-ResNet-v2. Results: In our experiment with two datasets, the accuracy for the CBIS-DDSM and INbreast datasets are 80.4%, 89.2%, and 87.8%, 95.1% respectively. Discussion: It can be concluded from the experimental findings that the deep network-based approach with precise tuning outperforms all other state-of-the-art techniques in experiments on both datasets.
Collapse
Affiliation(s)
- Himanish Shekhar Das
- Department of Computer Science and Information Technology, Cotton University, Guwahati, India
| | - Akalpita Das
- Department of Computer Science and Engineering, GIMT Guwahati, Guwahati, India
| | - Anupal Neog
- Department of AI and Machine Learning COE, IQVIA, Bengaluru, Karnataka, India
| | - Saurav Mallik
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, United States
- Department of Pharmacology and Toxicology, University of Arizona, Tucson, AZ, United States
| | - Kangkana Bora
- Department of Computer Science and Information Technology, Cotton University, Guwahati, India
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States
- Department of Pathology and Laboratory Medicine, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, United States
| |
Collapse
|
6
|
Akhyar F, Liu Y, Hsu CY, Shih TK, Lin CY. FDD: a deep learning-based steel defect detectors. THE INTERNATIONAL JOURNAL, ADVANCED MANUFACTURING TECHNOLOGY 2023; 126:1093-1107. [PMID: 37073280 PMCID: PMC9988608 DOI: 10.1007/s00170-023-11087-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Accepted: 02/08/2023] [Indexed: 05/03/2023]
Abstract
Surface defects are a common issue that affects product quality in the industrial manufacturing process. Many companies put a lot of effort into developing automated inspection systems to handle this issue. In this work, we propose a novel deep learning-based surface defect inspection system called the forceful steel defect detector (FDD), especially for steel surface defect detection. Our model adopts the state-of-the-art cascade R-CNN as the baseline architecture and improves it with the deformable convolution and the deformable RoI pooling to adapt to the geometric shape of defects. Besides, our model adopts the guided anchoring region proposal to generate bounding boxes with higher accuracies. Moreover, to enrich the point of view of input images, we propose the random scaling and the ultimate scaling techniques in the training and inference process, respectively. The experimental studies on the Severstal steel dataset, NEU steel dataset, and DAGM dataset demonstrate that our proposed model effectively improved the detection accuracy in terms of the average recall (AR) and the mean average precision (mAP) compared to state-of-the-art defect detection methods. We expect our innovation to accelerate the automation of industrial manufacturing process by increasing the productivity and by sustaining high product qualities.
Collapse
Affiliation(s)
- Fityanul Akhyar
- School of Electrical Engineering, Telkom University, Bandung, West Java 40257 Indonesia
| | - Ying Liu
- Department of Computer Science & Engineering, Santa Clara University, Santa Clara, CA 95053 USA
| | - Chao-Yung Hsu
- Automation & Instrumentation System Development Sec, China Steel Corporation, Kaohsiung, 81233 Taiwan
| | - Timothy K. Shih
- Department of Computer Science & Information Engineering, National Central University, Taoyuan, 320317 Taiwan
| | - Chih-Yang Lin
- Department of Mechanical Engineering, National Central University, Taoyuan, 320317 Taiwan
| |
Collapse
|
7
|
Enhancing Precision with an Ensemble Generative Adversarial Network for Steel Surface Defect Detectors (EnsGAN-SDD). SENSORS 2022; 22:s22114257. [PMID: 35684877 PMCID: PMC9185267 DOI: 10.3390/s22114257] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 05/26/2022] [Accepted: 05/31/2022] [Indexed: 02/01/2023]
Abstract
Defects are the primary problem affecting steel product quality in the steel industry. The specific challenges in developing detect defectors involve the vagueness and tiny size of defects. To solve these problems, we propose incorporating super-resolution technique, sequential feature pyramid network, and boundary localization. Initially, the ensemble of enhanced super-resolution generative adversarial networks (ESRGAN) was proposed for the preprocessing stage to generate a more detailed contour of the original steel image. Next, in the detector section, the latest state-of-the-art feature pyramid network, known as De-tectoRS, utilized the recursive feature pyramid network technique to extract deeper multi-scale steel features by learning the feedback from the sequential feature pyramid network. Finally, Side-Aware Boundary Localization was used to precisely generate the output prediction of the defect detectors. We named our approach EnsGAN-SDD. Extensive experimental studies showed that the proposed methods improved the defect detector's performance, which also surpassed the accuracy of state-of-the-art methods. Moreover, the proposed EnsGAN achieved better performance and effectiveness in processing time compared with the original ESRGAN. We believe our innovation could significantly contribute to improved production quality in the steel industry.
Collapse
|
8
|
Strip Steel Surface Defects Classification Based on Generative Adversarial Network and Attention Mechanism. METALS 2022. [DOI: 10.3390/met12020311] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In a complex industrial environment, it is difficult to obtain hot rolled strip steel surface defect images. Moreover, there is a lack of effective identification methods. In response to this, this paper implements accurate classification of strip steel surface defects based on generative adversarial network and attention mechanism. Firstly, a novel WGAN model is proposed to generate new surface defect images from random noises. By expanding the number of samples from 1360 to 3773, the generated images can be further used for training classification algorithm. Secondly, a Multi-SE-ResNet34 model integrating attention mechanism is proposed to identify defects. The accuracy rate on the test set is 99.20%, which is 6.71%, 4.56%, 1.88%, 0.54% and 1.34% higher than AlexNet, VGG16, ShuffleNet v2 1×, ResNet34, and ResNet50, respectively. Finally, a visual comparison of the features extracted by different models using Grad-CAM reveals that the proposed model is more calibrated for feature extraction. Therefore, it can be concluded that the proposed methods provide a significant reference for data augmentation and classification of strip steel surface defects.
Collapse
|
9
|
Karacı A. VGGCOV19-NET: automatic detection of COVID-19 cases from X-ray images using modified VGG19 CNN architecture and YOLO algorithm. Neural Comput Appl 2022; 34:8253-8274. [PMID: 35095212 PMCID: PMC8785935 DOI: 10.1007/s00521-022-06918-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Accepted: 01/04/2022] [Indexed: 01/09/2023]
Abstract
X-ray images are an easily accessible, fast, and inexpensive method of diagnosing COVID-19, widely used in health centers around the world. In places where there is a shortage of specialist doctors and radiologists, there is need for a system that can direct patients to advanced health centers by pre-diagnosing COVID-19 from X-ray images. Also, smart computer-aided systems that automatically detect COVID-19 positive cases will support daily clinical applications. The study aimed to classify COVID-19 via X-ray images in high precision ratios with pre-trained VGG19 deep CNN architecture and the YOLOv3 detection algorithm. For this purpose, VGG19, VGGCOV19-NET models, and the original Cascade models were created by feeding these models with the YOLOv3 algorithm. Cascade models are the original models fed with the lung zone X-ray images detected with the YOLOv3 algorithm. Model performances were evaluated using fivefold cross-validation according to recall, specificity, precision, f1-score, confusion matrix, and ROC analysis performance metrics. While the accuracy of the Cascade VGGCOV19-NET model was 99.84% for the binary class (COVID vs. no-findings) data set, it was 97.16% for the three-class (COVID vs. no-findings vs. pneumonia) data set. The Cascade VGGCOV19-NET model has a higher classification performance than VGG19, Cascade VGG19, VGGCOV19-NET and previous studies. Feeding the CNN models with the YOLOv3 detection algorithm decreases the training test time while increasing the classification performance. The results indicate that the proposed Cascade VGGCOV19-NET architecture was highly successful in detecting COVID-19. Therefore, this study contributes to the literature in terms of both YOLO-aided deep architecture and classification success.
Collapse
Affiliation(s)
- Abdulkadir Karacı
- Faculty of Engineering and Architecture, Computer Engineering, Kastamonu University, 37200 Kastamonu, Turkey
| |
Collapse
|
10
|
Flatness Prediction of Cold Rolled Strip Based on Deep Neural Network with Improved Activation Function. SENSORS 2022; 22:s22020656. [PMID: 35062616 PMCID: PMC8780483 DOI: 10.3390/s22020656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Revised: 12/23/2021] [Accepted: 01/11/2022] [Indexed: 11/17/2022]
Abstract
With the improvement of industrial requirements for the quality of cold rolled strips, flatness has become one of the most important indicators for measuring the quality of cold rolled strips. In this paper, the strip production data of a 1250 mm tandem cold mill in a steel plant is modeled by an improved deep neural network (the improved DNN) to improve the accuracy of strip shape prediction. Firstly, the type of activation function is analyzed, and the monotonicity of the activation function is deemed independent of the convexity of the loss function in the deep network. Regardless of whether the activation function is monotonic, the loss function is not strictly convex. Secondly, the non-convex optimization of the loss functionextended from the deep linear network to the deep nonlinear network, is discussed, and the critical point of the deep nonlinear network is identified as the global minimum point. Finally, an improved Swish activation function based on batch normalization is proposed, and its performance is evaluated on the MNIST dataset. The experimental results show that the loss of an improved Swish function is lower than that of other activation functions. The prediction accuracy of a deep neural network (DNN) with an improved Swish function is 0.38% more than that of a deep neural network (DNN) with a regular Swish function. For the DNN with the improved Swish function, the mean square error of the prediction for the flatness of cold rolled strip is reduced to 65% of the regular DNN. The accuracy of the improved DNN is up to and higher than the industrial requirements. The shape prediction of the improved DNN will assist and guide the industrial production process, reducing the scrap yield and industrial cost.
Collapse
|
11
|
Impact of Image Compression on the Performance of Steel Surface Defect Classification with a CNN. JOURNAL OF SENSOR AND ACTUATOR NETWORKS 2021. [DOI: 10.3390/jsan10040073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Machine vision is increasingly replacing manual steel surface inspection. The automatic inspection of steel surface defects makes it possible to ensure the quality of products in the steel industry with high accuracy. However, the optimization of inspection time presents a great challenge for the integration of machine vision in high-speed production lines. In this context, compressing the collected images before transmission is essential to save bandwidth and energy, and improve the latency of vision applications. The aim of this paper was to study the impact of quality degradation resulting from image compression on the classification performance of steel surface defects with a CNN. Image compression was applied to the Northeastern University (NEU) surface-defect database with various compression ratios. Three different models were trained and tested with these images to classify surface defects using three different approaches. The obtained results showed that trained and tested models on the same compression qualities maintained approximately the same classification performance for all used compression grades. In addition, the findings clearly indicated that the classification efficiency was affected when the training and test datasets were compressed using different parameters. This impact was more obvious when there was a large difference between these compression parameters, and for models that achieved very high accuracy. Finally, it was found that compression-based data augmentation significantly increased the classification precision to perfect scores (98–100%), and thus improved the generalization of models when tested on different compression qualities. The importance of this work lies in exploiting the obtained results to successfully integrate image compression into machine vision systems, and as appropriately as possible.
Collapse
|