Introduction

Printed Circuit Boards (PCBs) are the foundation supporting most electronic products. They are usually made of fibreglass and composite epoxies with laminated materials1. Any fabrication defect at the PCB level can lead to fatal flaws at the product level. Thus, PCBs must be manufactured with the highest degree of precision to ensure optimal operation and product reliability. With the growing worldwide demand for electronic products, it is essential to detect fabrication defects both efficiently and accurately. As part of the Industry 4.0 revolution, new data- and machine learning-driven technologies can be implemented to improve product and process quality2. The Zero Defect Manufacturing (ZDM) paradigm also aims to improve manufacturing sustainability by leveraging data-driven methods to ensure no defective products pass through the production process3. The approach combines detection, repair, prediction and prevention4. While traditional quality improvement (QI) methods focus on detection and repair, manufacturing industries are now migrating towards a prediction-prevention paradigm using data-driven methods to predict manufacturing defects5. The PCB industry invests massively to train and maintain a large workforce dedicated to quality inspection using traditional inspection tools6. This process often introduces unwanted latency in the manufacturing process. Moreover, physically inspecting parts is expensive and arduous. Thus, most manufacturing companies rely on batch inspection. However, batch inspection does not enable manufacturers to comply with the ZDM principle of zero defects at the end of the manufacturing process. With the growing importance of product customization, defect rates are increasing due to smaller production batch sizes7. In Virtual Metrology (VM), a sub-field of ZDM, data-driven methods help estimate and predict the quality of a product8. These methods leverage low-cost quality metrics to derive more complex metrics and achieve a significant improvement in cost efficiency8. Emerging machine learning-based computer vision techniques have helped researchers apply Virtual Metrology to quality inspection9.

PCB manufacturing defect types

Different types of defects in the copper pattern can afflict PCBs. They can be fatal defects, immediately rendering the device non-functional. They can also be potential defects, hindering the performance of the device and reducing its operating lifetime10. During the etching and plating processes, anomalies may result in excess or missing copper. An incomplete process can also result in the unwanted deposition of conductive materials, forming defects like shorts or spurs. On the other hand, excessive processing can lead to missing holes, open circuits and mouse bites. Faulty tooling can also produce missing holes. Incorrect timing can lead to mechanical mis-registrations, dirt contamination, or air bubbles from the electrolysis present in bare PCB boards. The literature gives an extensive summary of the most common PCB manufacturing defects and their origins11.

Inspection techniques

PCB defect detection techniques can be broadly divided into contact and non-contact methods11. Contact methods usually rely on flying probes to detect defects causing electrical shorts and open circuits. They can be expensive to implement and maintain. They also have certain limitations which may allow defective products to pass10. Defects like spurious copper or burrs, which can cause a slower degradation and failure of the board, are often missed by electrical contact methods10.

Non-contact inspection methods use several techniques, such as X-ray imaging, scanned-beam lithography, ultrasonic imaging, thermal imaging and Automatic Optical Inspection (AOI)10. As circuits become denser and more complex, the detection of manufacturing defects becomes increasingly challenging and costly. This prevents the manufacturing process from complying with ZDM standards4.

In this work, we seek to improve AOI systems for defect detection in bare-board PCBs using emerging deep-learning techniques. AOI systems can detect defects on bare boards, missing components, and soldering and padding defects. They can also detect potential defects such as burrs, spurious copper and mouse bites, which may be missed by contact methods. Moreover, AOI systems avoid mechanical damage and can easily scale with increases in production capacity.

This paper presents a complete framework to improve PCB defect identification using AOI tools. The main contributions of this paper are as follows:

  1. Proposing an improved object detection model for PCB defect detection.

  2. Benchmarking its performance against several state-of-the-art object detection models.

  3. Increasing the overall accuracy (mAP[IoU = 0.5]) by 3.2%, reaching up to a 5.6% improvement for the spur-defect class.

  4. Achieving all of the above while using only 7 million (7M) parameters, roughly 35% of the more than 20M parameters used by the current state-of-the-art methods (YOLOv5m for low-resolution images and FRCNN-ResNet FPN for high-resolution images).

In time, we firmly believe such real-time machine learning-assisted monitoring will help rapidly identify and locate manufacturing defects, accurately pinpoint their origin, and provide timely adjustments to the manufacturing processes.

The paper is organized as follows: in “Related work”, we survey the literature with particular attention to the implementation of deep learning for defect detection. In “Overview of object detection models”, the relevant deep learning models are explained. This section also introduces the reader to our proposed model architecture and how it differs from the state-of-the-art. In “Experimentation”, we describe the dataset and the testing methodology. In “Results”, we present the experimental results and ablation studies. The “Discussion” section compares the results of our model and the state-of-the-art, while the “Conclusion” summarizes the key findings and introduces future research directions.

Related work

Here, we present a brief, yet representative, overview of the field. Zero Defect Manufacturing is a key part of Industry 4.0. The zero-defect concept was introduced in 1965 as a quality and reliability program implemented by the US Army12. Researchers have explored different techniques to make manufacturing processes compliant with ZDM. In this work, we primarily focus on the application of deep learning for defect detection. Two years ago, researchers implemented an Extended Deep Belief Network (EDBN)-based fault classifier for chemical processes using a combination of raw data and hidden features13. However, such an architecture is complex and requires a longer time to process the data. Others proposed a Stacked Quality-Driven Autoencoder (SQAE), which captures quality-relevant features and neglects the irrelevant ones for soft-sensing applications14. Transfer convolutional neural networks (CNNs) combine online CNNs and smaller offline shallow CNN networks15. This approach shows that pre-training the shallow networks and transferring the knowledge to the online network can significantly improve the accuracy of the models15. However, such transfer learning methods tend to introduce unwanted biases in the models, which prevents them from generalizing across different samples16. A central assumption of deep learning-based methods is that the test data and the training data are drawn from the same distribution17. Concretely, this assumes no change in environmental conditions. Deep transfer networks can relax this assumption by achieving better domain adaptation18. Indeed, such CNN-based networks were previously used for crack detection on surfaces19.

In 2018, Vafeidas et al. performed a comparative analysis of the performance of classical machine learning algorithms to detect faulty component placement on PCB boards20. A combination of computer vision algorithms was used to extract the features. At the time, Support Vector Machines achieved the highest classification accuracy20. In 2020, an architecture based on 3D convolutional neural networks (3DCNN) was used to simulate the changes in shape and volume of glue drops deposited on Liquid Crystal Polymer substrates before the attachment of integrated circuits21.

Virtual Metrology exploits available information from sensors or visual inputs to assess parameters which are difficult or expensive to measure8,9. Based on the same paradigm, autoencoders were trained on defect-free semiconductor chips and then used for anomaly detection22. A similar approach was also used for wafer fault monitoring23. The authors showed that the model is able to extract noise-tolerant features. However, this process also flags any anomalous sample as defective.

In recent years, researchers have used multiple methods to streamline and automate the detection of defects in bare PCBs. Wavelet-based algorithms were first implemented for defect detection24. The major limitation of these methods is their poor generalization25. Classical machine learning algorithms such as support vector machines and decision trees have been implemented to classify defect types26,27. These methods require extensive feature engineering to process the raw data and identify meaningful features for the model. In turn, this leads to unintentional expert-induced bias in the results16. This approach requires additional pre-processing steps, increasing the processing time and reducing the scalability. Our proposed deep-learning model seeks to extract the features directly from the images themselves. Indeed, several researchers have used a combination of computer vision methods and deep learning for detecting and classifying defects28,29,30,31. This also creates pre-processing pipelines, leading to delays and scalability issues. Last year, DETR models removed the need for many hand-designed components for PCB defect detection32. Moreover, transfer learning is a popular technique for applying complex deep learning models to small datasets. Recently, a few groups have implemented transfer learning for defect detection33,34,35. However, these models tend to incorporate biases from the pre-training16. This year, Generative Adversarial Networks (GANs) were first implemented to improve the quality of the data, which in turn improves the accuracy of the models36.

Object detection models can simultaneously perform multiple tasks on a single image: (i) detecting multiple objects, (ii) classifying the defects and (iii) localizing them37. This makes them ideal deep learning models for fault diagnostics. Several research teams are currently exploring object detection models to detect small objects38,39. Two-stage object detection models such as FRCNN combine two networks and, thus, have a large number of parameters and incur longer image-processing delays. This makes it very difficult to apply them in a high-speed manufacturing line. Single-stage object detectors are faster and, thus, more suitable candidates for near real-time deployment. Only last year, researchers started using one-stage object detection models for localization and classification of defects40,41. Also, optical inspection requires high-resolution images and equipment42. This makes it difficult for small-scale productions to adopt such technologies. To the best of our knowledge, this is the first extensive study to examine and improve the performance of such models on low-resolution images.

Overview of object detection models

This section gives a brief overview of the various models used in the paper. The literature gives a comprehensive overview of the various deep learning methods used for object detection37. In this work, we focus on the state-of-the-art You-Only-Look-Once (YOLO), RetinaNet and Faster R-CNN models for comparison.

You-only-look-once (YOLO)

You-Only-Look-Once (YOLO)43 is a single-stage object detection model. It contains three main components, namely the backbone, the neck and the head. The backbone is a convolutional neural network that takes images of different sizes as input and forms the overall features of the images. The neck represents a series of network layers that can fuse the features to enrich the information. The processed features are fed to the prediction layer, where the classifier obtains the class of the objects and generates the final coordinates of the bounding box.

The network divides the image into grid regions and predicts rectangular bounding boxes in each region. The base model for YOLO is similar to GoogLeNet44 with the inception module replaced by 1 × 1 and 3 × 3 convolutional layers. The final prediction is produced by two fully connected layers over the whole convolutional feature map. The block diagram in Fig. 1 captures the network structure of YOLO.

Figure 1. Structure of the YOLO network.

The loss function for the network consists of two parts, the localization loss for the prediction of the bounding box offsets and the classification loss for conditional class probabilities. The losses are computed as the sum of the squared errors. Most of the bounding boxes have no instance of the object, thus it is important to down-weight the loss from the background boxes. Two weight parameters are used to balance between bounding box coordinates and confidence score prediction for boxes without objects.
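To make the balancing concrete, here is a minimal PyTorch sketch of this two-part sum-squared-error loss (the classification term is omitted for brevity). The weights λ_coord = 5 and λ_noobj = 0.5 are the defaults from the original YOLO paper43; all tensor names and shapes are illustrative.

```python
import torch

def yolo_v1_style_loss(pred_boxes, true_boxes, pred_conf, true_conf,
                       obj_mask, lambda_coord=5.0, lambda_noobj=0.5):
    """Sum-squared-error localization + confidence loss, YOLO-style.

    obj_mask: 1.0 where a box is responsible for an object, 0.0 otherwise.
    pred_boxes/true_boxes: (..., 4) tensors of bounding-box offsets.
    """
    noobj_mask = 1.0 - obj_mask
    # Localization loss: only boxes responsible for an object contribute.
    loc = (obj_mask.unsqueeze(-1) * (pred_boxes - true_boxes) ** 2).sum()
    # Confidence loss: background boxes are down-weighted by lambda_noobj,
    # since most grid cells contain no object instance.
    conf_obj = (obj_mask * (pred_conf - true_conf) ** 2).sum()
    conf_noobj = (noobj_mask * (pred_conf - true_conf) ** 2).sum()
    return lambda_coord * loc + conf_obj + lambda_noobj * conf_noobj
```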

YOLO v545 is built on a similar structure to YOLO. As such, it runs detection directly over dense location sites. The backbone first extracts the dominant features from the input images. In YOLO v5, the Cross Stage Partial (CSP) network is used for the backbone. The neck is then used to generate feature pyramids. They help the model recognize the same object at different scales and sizes. YOLO v5 uses PANet as the neck46. Finally, the head performs the detection. It applies anchor boxes on the extracted features and produces output vectors with class probabilities, object scores and bounding boxes. YOLO v5 uses Leaky-ReLU for the hidden layers and Sigmoid activation for the output layer. For large models, the stochastic gradient descent (SGD) optimizer is preferred with a Binary Cross Entropy (BCE) loss47. Earlier this year, researchers applied YOLO-based object detection methods for fault diagnostics48,49.

Faster region based convolutional neural network (FRCNN)

Faster R-CNNs (FRCNNs) combine two networks50. First, a Region Proposal Network (RPN) generates region proposals. In turn, a detector network relies on these proposals for object detection. The Faster R-CNN is a significant improvement over its predecessor, the Fast R-CNN model51, as it uses the RPN instead of a selective search method to generate the region proposals. The RPN ranks the region box anchors and proposes those most likely to contain objects. These anchors play a vital role in the Faster R-CNN models. Typical FRCNNs use nine anchors at each position of an image. The RPN outputs a set of proposals to be further examined by a classifier and a regressor to check for object occurrences. Thus, the RPN predicts the probability of an anchor being a meaningful object or part of the background and then refines the anchor. It labels as foreground the anchors with the greatest overlap with the ground-truth boxes. In contrast, the anchors with low overlaps are labelled as background. The regressor calculates the L1 loss using the position of the bounding box and the positive anchors. The default configuration uses the center position, height and width as the input. However, we observed that using the top-left and bottom-right coordinates gives a marginally better result. After the RPN, the model proposes regions of different sizes. A Region of Interest (ROI) pooling layer then splits the input feature map into a fixed number of equally sized regions and applies max pooling on each region to ensure the same region sizes irrespective of the input. We have used ResNet50 with a Feature Pyramid Network as the backbone. The SGD optimizer with a decaying learning rate gives the best results.
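To illustrate the ROI pooling step, the short sketch below uses torchvision's roi_pool to map proposals of different sizes onto a fixed 7 × 7 grid; the feature map and proposal coordinates are dummy values (real proposals would come from the RPN).

```python
import torch
from torchvision.ops import roi_pool

# Dummy backbone feature map: batch of 1, 256 channels, 50 x 50 spatial grid.
features = torch.randn(1, 256, 50, 50)

# Two region proposals in (batch_index, x1, y1, x2, y2) format,
# expressed in feature-map coordinates.
rois = torch.tensor([[0., 4., 4., 24., 30.],
                     [0., 10., 12., 40., 44.]])

# Max-pool each proposal into a fixed 7 x 7 grid regardless of its size,
# so the downstream classifier always sees the same input shape.
pooled = roi_pool(features, rois, output_size=(7, 7), spatial_scale=1.0)
print(pooled.shape)  # torch.Size([2, 256, 7, 7])
```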

RetinaNet

RetinaNet52 is a composite network using a backbone together with classification and regression subnets. The typical backbone uses a ResNet with a Feature Pyramid Network (FPN)53, using two laterally-interconnected pathways. The bottom-up pathway uses the output of the final feature map from a set of consecutive convolutional layers. The top-down pathway uses nearest-neighbour up-sampling to expand the last feature map to the same size as the preceding penultimate layer. These layers are merged by element-wise addition. The process iterates until the feature maps from the bottom-up pathway each find a corresponding feature map through the lateral connections. This renders the model scale-invariant. The classification subnet uses a convolutional network (CNN) attached to each FPN level. It typically uses four 3 × 3 convolution layers with 256 filters, each followed by a ReLU activation. Then, another 3 × 3 convolution layer is followed by a sigmoid activation. The classification loss used is a variant of the focal loss52. The regression subnet is attached to the FPN’s feature maps in parallel to the classification subnet, with an architecture akin to the classification network. The RetinaNet typically picks the 1k anchor boxes with the highest confidence score from each FPN level. To prevent redundancy, non-maximum suppression (NMS) can be applied independently to each class, choosing the anchor box with the highest confidence score and removing overlapping anchor boxes with an Intersection-over-Union (IoU) greater than 0.554. Finally, the regressor performs an offset prediction to refine the anchor selection and return a bounding box prediction.
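The per-class NMS step can be sketched with torchvision's batched_nms, which, within each class, keeps the highest-scoring box and suppresses any box overlapping it with IoU above the threshold; the boxes, scores and labels below are dummy values.

```python
import torch
from torchvision.ops import batched_nms

# Dummy detections: boxes in (x1, y1, x2, y2) format, confidence scores
# and class labels for three candidate boxes.
boxes = torch.tensor([[10., 10., 50., 50.],
                      [12., 12., 52., 52.],     # overlaps the first box
                      [100., 100., 150., 150.]])
scores = torch.tensor([0.90, 0.80, 0.75])
labels = torch.tensor([0, 0, 1])

# batched_nms runs NMS independently per class: the second box has
# IoU ~0.82 with the higher-scoring first box, so it is suppressed.
keep = batched_nms(boxes, scores, labels, iou_threshold=0.5)
print(keep)  # tensor([0, 2])
```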

Proposed network structure

The proposed model is a variation of YOLO v5. Our proposed network is a combination of CNNs and transformers. Figure 2 shows the model structure, which includes the three main blocks, namely the backbone, the neck and the head. The transformer module is included at the junction of the neck and the backbone. It provides multi-level features with global information for detection. This enlarges the receptive field of the convolutional network. The CNN network extracts the underlying geometrical features of the images. These feature maps usually constitute the key points, lines and some basic geometrical patterns55. Both global dependence and locality modelling are important for a better representation of the image56. Unlike standard transformer networks used for data sequences, our model directly processes feature maps generated by the convolutional network. As such, our model can benefit from the merits of both CNNs and transformers to model long-range dependencies and to learn scale- and shift-invariant locality representations48,57.
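Purely as an illustration of this mechanism, the simplified sketch below treats each spatial position of a CNN feature map as a token and runs a standard transformer encoder over the resulting sequence. This is not the exact module of Fig. 2; all dimensions and layer counts are illustrative.

```python
import torch
import torch.nn as nn

class FeatureMapTransformer(nn.Module):
    """Run a transformer encoder over a CNN feature map by treating each
    spatial position as one token (channel dimension = embedding dim)."""

    def __init__(self, channels=256, heads=8, layers=1):
        super().__init__()
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=channels, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=layers)

    def forward(self, x):                      # x: (B, C, H, W)
        b, c, h, w = x.shape
        tokens = x.flatten(2).transpose(1, 2)  # (B, H*W, C): one token per cell
        tokens = self.encoder(tokens)          # global self-attention
        return tokens.transpose(1, 2).reshape(b, c, h, w)

fmap = torch.randn(1, 256, 20, 20)             # feature map from the backbone
print(FeatureMapTransformer()(fmap).shape)     # torch.Size([1, 256, 20, 20])
```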

We present a comparison between the different models analyzed in this paper in the supplementary information section.

Figure 2. Structure of the proposed model. The structure combines the merits of both transformer and convolutional networks. As such, it can exploit global dependencies and locality information.

Data augmentation

Data augmentation is an umbrella term for techniques that generate additional training samples by slightly modifying the existing training data or creating synthetic data. It prevents over-fitting and helps the model generalize58. Shorten et al. provide a comprehensive survey of the different data augmentation techniques used for computer vision59. The YOLOv5 model uses a combination of Mosaic, Mixup, HSV and classical methods for data augmentation. Classical methods primarily involve rotation, re-scaling, vertical and horizontal flipping, translation, adding noise, cropping and zooming.

The mosaic data augmentation technique combines four training images into one single image. This technique was introduced in YOLOv460. It enables the model to learn to identify objects at different scales. The CutMix technique combines images by cutting parts from one image and pasting them onto the augmented image. This improves the robustness of the model by changing a part of the input image61. Along similar lines, image occlusion replaces regions of the images with random values. This behaves like a regularization technique. Another common method is to adjust the Saturation (S) and Value (V) components of the HSV color space.
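As a concrete example, the OpenCV sketch below applies a random HSV perturbation by scaling the three channels; the gain values are illustrative (they mirror commonly used YOLOv5 defaults) and the exact augmentation pipeline differs in detail.

```python
import cv2
import numpy as np

def augment_hsv(img_bgr, h_gain=0.015, s_gain=0.7, v_gain=0.4):
    """Randomly scale the H, S and V channels of a BGR image."""
    r = np.random.uniform(-1, 1, 3) * [h_gain, s_gain, v_gain] + 1
    h, s, v = cv2.split(cv2.cvtColor(img_bgr, cv2.COLOR_BGR2HSV))
    h = ((h * r[0]) % 180).astype(np.uint8)        # OpenCV hue wraps at 180
    s = np.clip(s * r[1], 0, 255).astype(np.uint8)
    v = np.clip(v * r[2], 0, 255).astype(np.uint8)
    return cv2.cvtColor(cv2.merge((h, s, v)), cv2.COLOR_HSV2BGR)
```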

Experimentation

We have used a Tesla T4 graphics processing unit (GPU) to run our experiments. The framework was implemented using the PyTorch library62 and the Ultralytics YOLOv545 implementation.

The overall architecture involves an AOI camera, an image capture device for storing the images and a processing unit with our proposed model to accurately detect, classify and localize multiple defects in the bare-board PCB in real time. Figure 3 presents a high-level schematic of the overall architecture.

Figure 3. Overall architecture of the model.

HRIPCB dataset

For this project, we use the public HRIPCB dataset to train, test and validate the object detection model29. The dataset contains 1386 labeled images with six different families of manufacturing defects (missing hole, mouse bite, open circuit, short, spur, spurious copper). It is based on 10 different PCB board images augmented using the six defect types29. The dataset provides “XML” files with the labels of the defects and their types. To meet the input requirements of the models, these files are converted to “TXT” label files. Figure 4 shows the different defect types in the dataset.
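As an illustration of this conversion, the sketch below assumes Pascal VOC-style XML tags (size, object, bndbox) and the six class names listed above; the exact tag spellings and class ordering in the dataset files may differ.

```python
import xml.etree.ElementTree as ET
from pathlib import Path

# Assumed class names and ordering; adjust to match the dataset labels.
CLASSES = ["missing_hole", "mouse_bite", "open_circuit",
           "short", "spur", "spurious_copper"]

def voc_xml_to_yolo_txt(xml_path, txt_path):
    """Convert one VOC-style XML annotation to a YOLO TXT label file:
    one `class x_center y_center width height` line per defect,
    with all coordinates normalized to [0, 1]."""
    root = ET.parse(xml_path).getroot()
    img_w = float(root.find("size/width").text)
    img_h = float(root.find("size/height").text)
    lines = []
    for obj in root.iter("object"):
        cls = CLASSES.index(obj.find("name").text)
        box = obj.find("bndbox")
        x1, y1 = float(box.find("xmin").text), float(box.find("ymin").text)
        x2, y2 = float(box.find("xmax").text), float(box.find("ymax").text)
        xc, yc = (x1 + x2) / 2 / img_w, (y1 + y2) / 2 / img_h
        w, h = (x2 - x1) / img_w, (y2 - y1) / img_h
        lines.append(f"{cls} {xc:.6f} {yc:.6f} {w:.6f} {h:.6f}")
    Path(txt_path).write_text("\n".join(lines))
```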

Figure 4. Annotated extracted defect contours (EDCs)29.

Evaluation metrics

In the object detection task, the model outputs the possible defect positions in terms of prediction boxes. To determine whether the prediction box and the ground truth match, the Intersection over Union (IoU) metric is used in the literature. The definition of IoU is given in Eq. (1):

$$IoU = \frac{Area\,of\,overlap}{Area\,of\,union}$$
(1)

Most researchers set the IoU threshold to 0.5. This means that when the overlap ratio reaches 0.5, the prediction box is considered correct. In addition, Precision and Recall are used to evaluate the results of the classification. Precision is defined in Eq. (2) and Recall in Eq. (3).

$$Precision = \frac{True\,Positives}{True\,Positives+False\,Positives}$$
(2)
$$Recall = \frac{True\,Positives}{True\, Positives+False\,Negatives}$$
(3)

The Precision value indicates the model’s positive predictive value (or the ability to avoid false positives). Meanwhile, the Recall value indicates the model’s true positive rate or sensitivity (or the ability to avoid false negatives). Balancing the Precision and Recall in the context of a specific application is one of the main challenges facing any machine-learning developer.
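The three metrics can be computed directly from their definitions. The following minimal Python sketch implements Eqs. (1)–(3) for axis-aligned boxes in (x1, y1, x2, y2) format; the example boxes are illustrative.

```python
def iou(box_a, box_b):
    """IoU of two boxes given as (x1, y1, x2, y2), per Eq. (1)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

def precision(tp, fp):
    return tp / (tp + fp)   # Eq. (2)

def recall(tp, fn):
    return tp / (tp + fn)   # Eq. (3)

# A prediction counts as correct only when its IoU with the ground
# truth reaches the 0.5 threshold.
print(iou((0, 0, 10, 10), (5, 0, 15, 10)))  # ~0.333 -> rejected at 0.5
```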

Methodology

We first divide the dataset into training (70%), validation (20%) and test (10%) sets. We used bi-cubic downsampling63 to generate the low-resolution images. The high-resolution images were also generated using the same method. Bi-cubic sampling does not require any additional learnable parameters. Thus, it does not increase the complexity of the system. The stochastic gradient descent (SGD) optimizer was used for most of the experiments. Additional experiments were performed using the Adam and AdamW optimizers. However, we found that SGD consistently produced the best results. The same behaviour has been previously reported in the literature47. The hyperparameters were fine-tuned for each model and the detailed list of hyperparameters associated with each model is provided in the supplementary information. For fine-tuning the hyperparameters of our proposed model, we have used a genetic algorithm-based approach64.
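As a sketch of this downsampling step, the snippet below uses Pillow's bi-cubic filter; the scale factor shown is illustrative, not the exact factor used in our experiments.

```python
from PIL import Image

def downsample_bicubic(src_path, dst_path, scale=4):
    """Write a low-resolution copy of an image using bi-cubic
    interpolation (no learnable parameters involved)."""
    img = Image.open(src_path)
    low_res = img.resize((img.width // scale, img.height // scale),
                         resample=Image.BICUBIC)
    low_res.save(dst_path)
```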

The YOLO v5 algorithm uses a combination of different methods for data augmentation. Our proposed model uses a combination of image HSV augmentation, translation, rotation, mixup and mosaic. A detailed description of each augmentation method is provided in the section “Data augmentation”. The Faster R-CNN model is trained using a ResNet50 and a MobileNetv265 backbone for extracting the features. The RetinaNet model is trained using a ResNet50 feature extractor. We experimented with ResNet50 and GhostNet backbones for YOLOv5s. However, we found that the Cross Stage Partial (CSP) backbone gives the best results. For the neck, we ran experiments with BiFPN, FPN and PANet models. The outcomes of these experiments are presented in the section “Results”.

The key improvement in our model is the addition of the transformer model. The network structure is described in the section “Proposed network structure”. We also ran additional experiments using different activation functions. We found that the Swish activation function performs better than Mish, ReLU and Leaky-ReLU. Figure 5 shows the training and validation curves for our model. It demonstrates that our model is able to converge within 100 epochs for low-resolution images.

Results

Experimental results

Object detection performance is evaluated with the mean Average Precision (mAP) at a given IoU between the ground-truth and predicted bounding boxes. To validate the performance of our proposed network, multiple experiments are performed in this study. To benchmark the performance of different object detection models against our proposed model, we coded the models as per the respective references and then ran the experiments on the HRIPCB dataset.

Figure 5 shows that the performance of the model is the same across training and validation, which suggests that the model does not over-fit the data. We observe that the box loss and the classification loss decrease sharply for the first 50 epochs before saturating. Also, we observe that the model has both a high precision and a high recall, which suggests that it produces few false positives and few false negatives.

Figure 5. The training and validation curves for our model using low-resolution images. Plots (a–c) show the box loss, object loss and classification loss for the training sample. Plots (f–h) capture the same metrics for the validation dataset. Plots (d) and (e) show the precision and recall of the model. Plots (i) and (j) capture the mAP at IoU = 0.5 and IoU = 0.5:0.95, respectively. For each plot, the x-axis represents the number of epochs.

Experimental results reported in Table 1 are the average of multiple experiments. We performed two sets of experiments to compare model performances: one using high-resolution images and the other using low-resolution images. We use bi-cubic sampling63 to generate the low-resolution images. The results in Table 1 show that our method can achieve an overall Mean Average Precision (mAP[IoU = 0.5]) of 98.1% on low-resolution images. We observe that the two-stage FRCNN object detection model with a ResNet backbone is able to achieve a high performance. However, the model has a large number of parameters and is significantly slower than single-stage object detectors38. Amongst the single-stage models, we observe that the YOLOv5 models outperform RetinaNet and YOLOv3. As expected, the medium YOLOv5m model performs better than the smaller YOLOv5s model. We also observe that our model outperforms the state-of-the-art YOLOv5m medium model by 3.2% in the overall mAP[IoU = 0.5]. The mAP for IoU = 0.5:0.95 is also higher for our model than for all other models. This means that the model is able to accurately detect defects across IoU thresholds from 0.5 to 0.95 with a step of 0.05.

Table 1 Mean average precision (mAP) for PCB defect classification across models.

To be useful, a detection system should be able to generalize and accurately detect all types of defects. Indeed, we observe that other YOLO-based models are ill-suited for detecting the spur, mouse bite and open circuit defects. The smaller YOLOv5s model achieves only 88.5% mAP for spur and 97.4% for open circuit. The state-of-the-art YOLOv5m medium model achieves only 91.3% mAP for spur defect detection and 91.6% mAP for open circuit. Our model outperforms the YOLOv5m model, achieving 96.9% mAP on spur defects and 99.5% on open circuit defects. While achieving significantly better performances, our model uses roughly 3× fewer parameters (7.02M) than the state-of-the-art FRCNN-ResNet FPN (23.59M) and YOLOv5m (20.08M).

Table 2 presents a comparative overview of the results achieved using the state-of-the-art YOLO 5-based models and our model.

Table 2 Mean average precision (mAP) achieved with IoU = 0.50 for PCB manufacturing defect classification across defect types.

In the example shown in Fig. 6, we observe that the bare-board PCB has two spur defects. YOLOv3 fails to identify either of the spur defects, while the YOLOv5s model predicts an additional false positive. Our proposed model correctly identifies both spurs.

Figure 6. Defects detected on the bare-board PCB using three different models. The studied PCB has two spur defects. YOLOv3 fails to identify either of the spur defects, while YOLOv5s predicts an additional incorrect defect. Our proposed model correctly identifies the two spurs.

The next section presents a complete ablation study of the different components of the model.

Ablation studies

Data augmentation

The YOLOv5 model uses a combination of augmentation methods for improving its generalization. These methods are discussed in the section “Data augmentation”. We observe that removing the mosaic method does not impact the overall precision. However, it slightly reduces the precision for the spurs and spurious copper defects. If we remove the HSV augmentation, we observe a significant decrease (7%) in the detection precision for spur defects. Furthermore, removing the scale augmentation only has a marginal effect on the overall precision. However, removing all data augmentation drastically reduces the overall mAP to 84.7% and the average precision for detecting spur defects to 73.6%. Thus, we see that data augmentation has a significant impact on the performance of the model.

Neck

We removed the Path Aggregation module and used a BiFPN model for comparison. We observe that the BiFPN model yields similar results. The overall accuracy increases by 0.2% and we observe an increase of 1.4% for the spur defects. Thus, BiFPN marginally improves the performance of the system at the cost of a slight increase in the number of parameters. The waterfall chart in Fig. 7 captures the effect of each model change on the average precision.

Figure 7. Mean average precision (mAP) waterfall chart showing a significant improvement with each change in the model architecture. The largest improvement can be attributed to the transformer module. “Our model” in the chart refers to the proposed model with the BiFPN neck.

Activation functions

We have also compared the performance of the model using different activation functions, changing the activation function of the convolutional layers. We observe that using a ReLU activation function reduces the accuracy, as the ReLU function cannot recover after getting stuck in the negative region. The Leaky ReLU performs slightly better. However, we see that the Swish activation function outperforms the other activation functions. The waterfall chart presented in Fig. 8 captures the effect of using the different activation functions on the model’s mean average precision (mAP).
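For reference, the four activation functions compared here can be written in a few lines; unlike ReLU, Swish and Mish retain a small gradient for negative inputs, so units can recover instead of “dying” in the negative region.

```python
import torch
import torch.nn.functional as F

def relu(x):       return torch.clamp(x, min=0.0)
def leaky_relu(x): return torch.where(x > 0, x, 0.01 * x)  # small negative slope
def swish(x):      return x * torch.sigmoid(x)             # also known as SiLU
def mish(x):       return x * torch.tanh(F.softplus(x))

x = torch.tensor([-2.0, -0.5, 0.0, 1.5])
for fn in (relu, leaky_relu, swish, mish):
    print(fn.__name__, fn(x))
```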

Figure 8. Mean average precision (mAP) waterfall chart showing the effect of changing the activation functions on the model. We find a significant improvement when we use the Mish activation function. The Swish activation function yields the best improvement in our model.

Regression loss functions

The regression loss function is a key factor in the training and optimization process of object detection. The most widely used regression loss function is the smooth Ln-norm66. There are some limitations associated with the Ln-norm loss. For example, it cannot combine the parameters of the bounding box67. The IoU loss offers a significant improvement over the Ln-norm loss68. It is based on the intersection-over-union ratio between the predicted bounding box and the ground truth. Thus, it is scale-invariant. However, when the IoU score of the ground truth and the bounding box is zero, the loss suffers from a vanishing gradient problem. Rezatofighi et al. proposed the Generalized IoU (GIoU) loss to address the limitations of IoU69. GIoU adds a penalty term that removes the vanishing gradient for non-overlapping boxes, although it tends to keep expanding the size of the predicted box until it overlaps with the target box. Hence, GIoU can suffer from slow convergence. Zheng et al. show that directly minimizing the normalized distance between the predicted box and the target box helps the algorithm converge much faster67. Moreover, it also considers the vertical and horizontal orientations. This loss function is referred to as the Distance Intersection over Union (DIoU) loss. However, DIoU is unable to capture the consistency of aspect ratios for bounding boxes. The Complete IoU (CIoU)67 regression loss incorporates all geometric factors. CIoU works by adding a penalty factor \(\alpha \cdot V\) to DIoU, where V represents the aspect ratio consistency. We have compared the effect of these different regression loss functions and find that CIoU significantly outperforms all the others. Figure 9 compares the model’s performance (mAP) using the different loss functions.
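A minimal sketch of the CIoU loss follows, combining the plain IoU, the normalized center-distance term of DIoU and the aspect-ratio penalty \(\alpha \cdot V\) for boxes in (x1, y1, x2, y2) format. In practice \(\alpha\) is usually treated as a constant with respect to the gradient; the example boxes are illustrative.

```python
import math
import torch

def ciou_loss(pred, target, eps=1e-7):
    """CIoU loss for (x1, y1, x2, y2) boxes: 1 - (IoU - rho^2/c^2 - alpha*V)."""
    # Plain IoU.
    ix1 = torch.max(pred[..., 0], target[..., 0])
    iy1 = torch.max(pred[..., 1], target[..., 1])
    ix2 = torch.min(pred[..., 2], target[..., 2])
    iy2 = torch.min(pred[..., 3], target[..., 3])
    inter = (ix2 - ix1).clamp(0) * (iy2 - iy1).clamp(0)
    w1, h1 = pred[..., 2] - pred[..., 0], pred[..., 3] - pred[..., 1]
    w2, h2 = target[..., 2] - target[..., 0], target[..., 3] - target[..., 1]
    union = w1 * h1 + w2 * h2 - inter + eps
    iou = inter / union
    # Squared diagonal of the smallest enclosing box (c^2) and squared
    # distance between box centers (rho^2): together, the DIoU term.
    cw = torch.max(pred[..., 2], target[..., 2]) - torch.min(pred[..., 0], target[..., 0])
    ch = torch.max(pred[..., 3], target[..., 3]) - torch.min(pred[..., 1], target[..., 1])
    c2 = cw ** 2 + ch ** 2 + eps
    rho2 = ((pred[..., 0] + pred[..., 2] - target[..., 0] - target[..., 2]) ** 2
            + (pred[..., 1] + pred[..., 3] - target[..., 1] - target[..., 3]) ** 2) / 4
    # Aspect-ratio consistency term V and its trade-off weight alpha.
    v = (4 / math.pi ** 2) * (torch.atan(w2 / (h2 + eps))
                              - torch.atan(w1 / (h1 + eps))) ** 2
    alpha = (v / (1 - iou + v + eps)).detach()
    return 1 - (iou - rho2 / c2 - alpha * v)

pred = torch.tensor([[0., 0., 10., 10.]])
target = torch.tensor([[2., 2., 12., 12.]])
print(ciou_loss(pred, target))  # tensor([~0.557])
```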

Figure 9. Regression loss comparison (RLC) showing the model’s precision (mAP) using the different regression loss functions. The CIoU loss clearly outperforms the others.

In addition to comparing the mainstream loss functions, we also propose a new loss function named AIoU, which enables the model to optimize the height and width of the predicted bounding box with respect to the ground truth. We present a detailed analysis of this loss function in the supplementary information.

Transformer

Finally, we run another ablation study by removing the transformer module from the architecture. We observe an overall reduction in accuracy of around 3.4%. However, the deterioration in performance is much more drastic for certain defect types. Compared to the YOLOv5s model, we observe a deterioration of around 9.8% in the mAP for spur defect detection and a deterioration of 5.4% for spurious copper. The waterfall chart in Fig. 7 captures the effect of each model change on the average precision. It confirms that the transformer module is a key component of the framework.

Discussion

In this section, we discuss the results from multiple perspectives. The results show that our model is able to accurately detect, localize and classify multiple occurrences of defects in images of bare-board PCBs. The model accurately predicts the location of each defect, classifies the type of defect and provides a probability score.

To understand the potential impact of this work, one must understand how the testing and rework process is performed in most traditional industrial PCB manufacturing lines. A direct-contact (flying probe) test station provides a pass/fail diagnostic on each individual PCB. When it fails, a PCB is sent to the rework station, where a technician will first look for defects under a digital microscope and perform manual rework if possible. If the reworked PCB still fails, it is sent for a more advanced diagnostic (using tools like X-ray tomography imaging). Some randomly-selected PCB samples with a pass diagnostic are also sent to rework inspection and advanced diagnostic for quality control. This is done to detect those non-fatal potential defects which can impact long-term device operation and lifetime. With a high-performance and trustworthy model such as the one described in this work, the rework technician could receive a faulty PCB knowing exactly which defects are present and where they are located on the PCB. Using more advanced analysis methods, one could also potentially pinpoint the origin of those defects in the manufacturing process. As such, a high probability score validated in the field could help manufacturing companies bolster their confidence in the machine learning-enhanced diagnostic tool.

Firstly, we would like to emphasize that the FRCNN with the ResNet backbone can achieve reasonable performances. However, it is a two-stage object detector which takes a much longer time to process the images and uses around 23.5M parameters38. Thus, we believe it might be difficult to implement this network in high-speed manufacturing lines. While achieving higher precision, the proposed model uses only 7.02M parameters. This is roughly 3× fewer than the 23.5M parameters used by the two-stage detector or the 20.8M parameters used by the state-of-the-art YOLOv5m model.

Secondly, we would like to highlight that the proposed model maintains high detection and identification accuracy with low-resolution images. This allows the technology to be easily adopted across multiple industries as it does not require expensive imaging technology. Moreover, our model yields a 3.2% overall improvement in the mAP[IoU = 0.5] compared to the standard YOLOv5m model for the same duration of training.

Most importantly, the YOLO v5m model achieves only 91.3% mAP for the spur defect detection. Meanwhile, our model achieves 96.9% mAP on spur defects. We believe that this shows that our model is able to accurately predict defects which are difficult to classify. This might be attributed to the ability of our model to exploit both global dependence and shift/scale invariance in extracted features.

The ablation studies show that the combination of data augmentation techniques helps the model generalize and improves its mAP. This prevents the model from over-fitting the data. The ability to work in high-speed environments with small datasets is crucial for large-scale deployment of the technology. Thus, we prefer the model with PANet, even though it has a marginally lower overall performance compared to BiFPN. Furthermore, the model with BiFPN has a lower accuracy than PANet for spurious copper, which is a common defect during the etching process. The ablation studies also shed light on the importance of using the correct activation and loss functions to maximize the performance of the model for accurate defect detection. Table 3 presents a comparative overview of the models.

Table 3 Comparative overview of key properties across models.

Conclusion

In this paper, we propose an end-to-end framework to detect manufacturing defects in PCB boards. In an optimal manufacturing process, quality should be integrated into the process. While most fabrication defects stem from the manufacturing process itself, traditional pass/fail methods focus on reworking those defects on the failing PCBs instead of identifying their precise origins in order to rapidly correct the process. Our proposed framework will enable fabrication units to make in-operando adjustments to the manufacturing process and move closer to a zero defect manufacturing paradigm. Our network uses a combination of current techniques in deep learning: transformers, multi-level feature fusion, data augmentation and object detection. The results show that our model is able to successfully detect, classify and localize multiple defects in low-resolution bare-board PCB images. Our model is lightweight, low-resolution compatible and provides a 3.2% overall improvement in the mAP[IoU = 0.5] compared to the standard YOLOv5m model. Based on these initial results, we believe integrating deep learning-based models into fully-automated optical inspection (AOI) tools is crucial for early detection of PCB manufacturing defects and could potentially lead to important productivity gains coupled with significant cost reductions.

The presented approach rests on the strong assumption that the test and training data are sampled from the same distribution. An interesting area of future work would be to investigate the performance of the method on out-of-distribution samples. In the current work, we have only considered single-layer bare-board PCBs. Another interesting area of development would be extending the analysis to detect defects in multi-layered PCBs.