Introduction

Underground engineering disturbs the stress state of the rock mass, leading to a large number of microseismic events1. By post-processing these records (e.g., P-wave arrival picking2, event location3, and source parameter calculation4,5,6), the mechanical state of the corresponding rock mass can be adequately characterized, which is especially beneficial for disaster early warning in underground mining7,8,9. However, during underground mining, the microseismic monitoring system often receives interference from blasting operations, ore extraction, mechanical operations, high-voltage cables, and magnetic fields10. Therefore, quickly and accurately identifying microseismic records among a large number of suspicious records is a crucial task. Currently, the classification of suspicious microseismic records depends on the visual scanning of waveforms by experienced analysts11. However, manual classification of microseismic records is a time-consuming, tedious task that is prone to subjective bias. For these reasons, automatic classification of microseismic records is urgently needed.

Over the years, many automatic classification methods have been proposed to address the abovementioned problems in the seismic and microseismic fields. Scarpetta et al.12 established a specialized neural discrimination method for low-magnitude seismic events, quarry blasts, underwater explosions, and thunder sources at Mt. Vesuvius Volcano, Italy. Langer13, Esposito14, and Curilem15 used machine learning to classify seismic records at the Soufriere Hills volcano (Montserrat), Stromboli island (southern Italy), and the Villarrica volcano (Chile), respectively. Malovichko16 utilized a set of seismic characteristics and a multivariate maximum-likelihood Gaussian classifier to quantify the probability that a particular event belongs to a population of blasts. Vallejos and McKinnon17 presented an approach to the classification of seismic records from two mines in Ontario, Canada, using logistic regression and neural network classification techniques. Hammer et al.18 attempted to automatically classify seismic signals from scratch by utilizing a hidden Markov model and 30 features extracted from waveforms. Ma et al.4 realized the discrimination of mine microseismic events through Bayes discriminant analysis. Dong et al.19,20 proposed a discrimination method for seismic and blasting events based on a Fisher classifier, a naive Bayesian method, and logistic regression; this method uses the logarithm of the seismic moment, the logarithm of the seismic energy, and the probability density function of the arrival time between adjacent sources as features.

Although these studies have advanced research in this field, the automatic identification of complex microseismic records in actual production still cannot be realized. In recent years, the deep learning approach has demonstrated superior performance in various research fields, and deep learning techniques are increasingly used in seismology. Shang et al.21 established a classifier to distinguish microseismic records from quarry blasts by using Principal Component Analysis (PCA) and Artificial Neural Networks (ANN); ANN is considered the basis of the deep learning approach. Serdar Kuyuk and Ohno Susumu22 trained a deep learning Long Short-Term Memory (LSTM) network for the classification of near-source waveforms based on data from seismic events recorded by 305 three-component accelerometers in Japan between 2000 and 2018; the LSTM network was tested on the 2018 Northern Osaka earthquake (M 6.1) as an example. Manuel Titos et al.23 proposed a novel approach in the field of volcano seismology to classify volcano-seismic events based on fully connected DNNs; the DNN model was trained on 9,332 volcanic earthquake events to classify seven types of events, and good experimental results were obtained. Bi Lin et al.24 proposed a method combining Convolutional Neural Networks (CNN) with a Support Vector Machine (SVM) to identify multi-channel microseismic waveforms automatically; they used 30,000 signal samples for CNN training and 3,960 event samples for SVM training, and achieved a classification accuracy of 98.18%. These new technologies and methods are encouraging because they effectively improve the accuracy and reliability of microseismic or seismic event classification.

However, deep learning methods require a large amount of data to support model training. Hence, in actual applications, a large number of manually labeled samples is required, which prevents rapid deployment in the newly built microseismic monitoring system of a mine, as the features of microseismic records vary greatly between mines. Consequently, achieving reliable real-time classification using limited samples is of great interest. We therefore concentrate on an approach with superior accuracy and stability for automatically classifying multi-class microseismic records in underground mining using only limited samples. In this paper, we propose an approach for establishing an automatic classifier for multi-class microseismic records with limited samples using the Capsule Network (CapsNet). This approach allows most current mines, both old and new, to adopt deep learning for the automatic classification of microseismic records as early as possible and with reliable results. The proposed method is described in detail in the following sections and is then applied to field datasets to demonstrate the efficiency and reliability of the classification of limited microseismic data.

Results

We analyze and discuss the proposed method based on the actual application process of the automatic classification method. The accuracy and reliability of CapsNet, CNN, and other methods are compared. Figure 1 shows the actual application process of the automatic classification method in the mine.

Figure 1

The actual application process of the automatic classification method in the mine.

Training process

Based on the microseismic records from the Huangtupo Copper and Zinc Mine, five training sets of different sizes were constructed, containing 400, 800, 1,200, 1,600, and 2,000 microseismic records, respectively. In each training set, 20% of the records are used as the validation set. In addition, a universal test set of 3,200 microseismic records (800 of each type), with no overlap with any training set, was prepared. Training sets of different sizes correspond to different training processes, which are summarized in Table 1. The purpose of the different training processes is to test the performance and reliability of CapsNet and CNN under limited samples.
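
As a rough illustration of this data partitioning, the following Python sketch reserves a disjoint 3,200-record test set and carves out training sets of 400–2,000 records, each with 20% held out for validation. The pool size, array shapes, and random features are placeholders, not the actual mine data.

```python
import numpy as np

rng = np.random.default_rng(seed=0)

# Placeholder pool of labelled records: one 21 x 33 feature matrix plus a class label each.
n_records = 6_000
features = rng.standard_normal((n_records, 21, 33))
labels = rng.integers(0, 4, size=n_records)   # 4 classes: microseismic, blast, ore extraction, noise

perm = rng.permutation(n_records)
test_idx = perm[:3_200]                       # universal test set, disjoint from all training sets
pool_idx = perm[3_200:]

training_sets = {}
for size in (400, 800, 1_200, 1_600, 2_000):
    idx = pool_idx[:size]
    n_val = size // 5                         # 20% of each training set used for validation
    training_sets[size] = {"train": idx[n_val:], "val": idx[:n_val]}

print({k: (len(v["train"]), len(v["val"])) for k, v in training_sets.items()})
```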

Table 1 Settings of the different training processes.

With the parameters and architectures of the CapsNet and CNN shown in Fig. 2, we trained the two networks in the different training processes (Table 1). The CapsNet consists of 2 convolution layers, a max-pooling layer, 2 ReLU layers, and a unique dynamic routing layer; the CNN consists of 2 convolution layers, a max-pooling layer, 5 ReLU layers, 5 batch normalization layers, 3 fully connected layers, 2 dropout layers, a softmax layer, and a classification layer. The minibatch size in all training processes is 10, and training ends after 30 epochs. The minibatch accuracy, validation accuracy, minibatch loss, and validation loss during training were recorded, and the training processes are shown in Fig. 3.
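
For readers who wish to reproduce a comparable baseline, a minimal PyTorch sketch of a CNN of the kind described above follows. The channel counts, kernel sizes, and hidden-layer widths are illustrative assumptions; the actual architecture and parameters are those shown in Fig. 2.

```python
import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    """Sketch of a CNN baseline: 2 conv layers, batch norm, ReLU, max pooling,
    fully connected layers with dropout; softmax is applied inside the loss."""
    def __init__(self, n_classes: int = 4):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),
            nn.BatchNorm2d(16),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.BatchNorm2d(32),
            nn.ReLU(),
            nn.MaxPool2d(2),                  # 21 x 33 -> 10 x 16
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(32 * 10 * 16, 128),
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(128, 64),
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(64, n_classes),
        )

    def forward(self, x):                     # x: (batch, 1, 21, 33) feature matrices
        return self.classifier(self.features(x))

model = SmallCNN()
logits = model(torch.randn(10, 1, 21, 33))    # minibatch size 10, as in the paper
print(logits.shape)                           # torch.Size([10, 4])
```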

Figure 2

Detailed architecture and parameters of CapsNet and CNN.

Figure 3

The training process of CapsNet and CNN. (a) Training process 5; (b) training process 4; (c) training process 3; (d) training process 2; (e) training process 1. The left column of the figure shows CapsNet, and the right column shows CNN.

From Fig. 3, the training process of CapsNet is stable and converges rapidly; the validation accuracy and loss curves closely match the training curves. For CNN, however, the training curves fluctuate strongly throughout the 30 epochs and end in a poorly converged state, even though a relatively high accuracy is eventually reached. Through the different training processes, we obtained five classification models each for CapsNet and CNN.

Accuracy and comparison

Based on the classification models obtained from the training processes, this section uses the test set to evaluate these models. Moreover, the classification results of the deep learning methods are compared with those of commonly used machine learning methods. The test set consists of 3,200 actual microseismic records from the Huangtupo Copper and Zinc Mine, with 800 records for each category; none of these records appeared during training or validation. Accuracy, Precision, Recall, and F1-Measure are adopted as evaluation metrics25. Accuracy is the proportion of microseismic records in the test set that are correctly classified:

$${\text{Accuracy}} = 1 - \frac{FP(tr) + FN(tr)}{{TP(tr) + TN(tr) + FP(tr) + FN(tr)}}$$
(1)

where TP denotes true positives (records of the current type that are correctly classified), TN denotes true negatives (records of other types that are correctly classified), FP denotes false positives (records of other types misclassified as the current type), and FN denotes false negatives (records of the current type misclassified as other types). Precision is the proportion of predictions that are correct, and Recall is the proportion of microseismic records of a given type that are correctly predicted:

$${\text{Precision } = \text{ }}\frac{TP(tr)}{{TP(tr) + FP(tr)}}$$
(2)
$${\text{Recall}} = \frac{TP(tr)}{{TP(tr) + FN(tr)}}$$
(3)

Moreover, to consider Precision and Recall comprehensively, the weighted harmonic mean evaluation index (F-Measure) is used:

$${\text{F } - \text{ Measure}} = \frac{{(\alpha^{2} + 1) \times {\text{Precision}} \times {\text{Recall}}}}{{\alpha^{2} \times ({\text{Precision}} + {\text{Recall}})}}$$
(4)

When α = 1, Eq. (4) reduces to the most common form, the F1-Measure:

$${\text{F1 } - \text{ measure}} = \frac{{2 \times {\text{Precision}} \times {\text{Recall}}}}{{{\text{Precision}} + {\text{Recall}}}}$$
(5)
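
For concreteness, a compact numpy sketch of these metrics (Eqs. 1–5) is given below; the class indices and toy labels are illustrative only.

```python
import numpy as np

def per_class_metrics(y_true, y_pred, n_classes=4):
    """Overall Accuracy (Eq. 1) and per-class Precision, Recall, F1 (Eqs. 2-5)."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    accuracy = float(np.mean(y_true == y_pred))
    metrics = {}
    for k in range(n_classes):
        tp = np.sum((y_pred == k) & (y_true == k))
        fp = np.sum((y_pred == k) & (y_true != k))
        fn = np.sum((y_pred != k) & (y_true == k))
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        metrics[k] = (precision, recall, f1)
    return accuracy, metrics

acc, m = per_class_metrics([0, 1, 2, 3, 0], [0, 1, 2, 1, 0])
print(acc, m[1])
```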

Figure 4 shows the test results of the trained CapsNet and CNN models; these results demonstrate that the accuracy of CapsNet is always higher than that of CNN. For a more detailed comparison, the abovementioned Precision, Recall, and F1-Measure are calculated for each type of microseismic record.

Figure 4

Accuracy of CapsNet and CNN in the different training processes.

Figure 5a,b show the Precision of each type of microseismic record in the different training processes. From Fig. 5a,b, the Precision of CapsNet is much higher than that of CNN for both microseismic and blasting records, whereas for ore extraction and noise the two are almost identical. This reveals that CapsNet's Precision is superior to CNN's across the different experiments. Similarly, Fig. 5c,d show the Recall of each type of microseismic record in the different training processes. From Fig. 5c,d, the Recall of CapsNet is much higher than that of CNN for both blasting and ore-extraction records, and for microseismic and noise records the gap still exists but is small. This reveals that CapsNet's Recall is superior to CNN's across the different experiments. Through the F1-Measure, we take both indicators into comprehensive consideration. Figure 5e,f show the F1-Measure of each type of microseismic record in the different training processes. It can be seen that the values for the CNN test results are always lower than those for the CapsNet test results. Multiple indicators reveal that CapsNet has clear advantages over CNN in the classification of microseismic records.

Figure 5

Comparison of Precision, Recall, and F1-Measure. (a) The precision of the CapsNet test results. (b) The precision of the CNN test results. (c) The Recall of the CapsNet test results. (d) The Recall of the CNN test results. (e) The F1-Measure of the CapsNet test results. (f) The F1-Measure of the CNN test results.

Moreover, a comparison of the classification performance between the deep learning approaches and traditional machine learning methods is presented. Decision trees and k-nearest neighbors (kNN) are often used to classify microseismic records. Therefore, we tested these models using the same dataset as training process 5 (details in Table 1) and compared their results with those of the deep learning approach proposed herein. Table 2 shows the classification results of the different classification models, including the CapsNet and CNN presented in this paper, using the same dataset and features. In terms of testing accuracy, the CapsNet performed best: its testing accuracy reached 99.2%, whereas the accuracies of the machine learning methods were below 90%. Every index of the CapsNet proposed in this paper outperformed those of the other methods. These findings demonstrate that the CapsNet has excellent efficiency and reliability for the classification of microseismic data.

Table 2 Comparison of different classification models.

Discussion and conclusion

Additionally, to show that CapsNet has clear advantages over CNN in microseismic record classification, we analyze the reliability of the two networks based on their classification probabilities. In deep learning, the final predicted output is composed of the decision probabilities of the corresponding labels, and the label corresponding to the maximum probability value is taken as the predicted class of the input. The probability used in this paper is the maximum probability of the predicted output.
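
As a small illustration (with made-up probabilities), the predicted class and the maximum probability analysed below can be obtained as follows:

```python
import numpy as np

# Hypothetical per-class output probabilities for three test records
# (columns: microseismic, blast, ore extraction, noise).
probs = np.array([[0.92, 0.03, 0.03, 0.02],
                  [0.30, 0.28, 0.22, 0.20],
                  [0.05, 0.88, 0.04, 0.03]])

predicted_class = probs.argmax(axis=1)   # label with the maximum probability
confidence = probs.max(axis=1)           # the "probability" analysed in this paper
print(predicted_class, confidence)
```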

Figure 6 shows the distribution of the classification probabilities for the different training processes and classification results (correct and incorrect). For example, Fig. 6a1,a2 show the probability distributions of test samples whose predicted class matches the label after training process 5, for CapsNet and CNN, respectively; in contrast, Fig. 6a3,a4 show those of incorrectly classified samples.

Figure 6

Distribution of classification probability in different situations. The labels (a)–(e) represent training processes 5 to 1; panels numbered 1 show the probability of each correctly classified sample for CapsNet; panels numbered 2 show the probability of each correctly classified sample for CNN; panels numbered 3 show the probability of each incorrectly classified sample for CapsNet; panels numbered 4 show the probability of each incorrectly classified sample for CNN. For example, (b1) shows the correct classification results of CapsNet in training process 4, whereas (d4) shows the incorrect classification results of CNN in training process 2. Moreover, the light yellow blocks indicate probability values below 0.70, and the light blue blocks indicate probability values above 0.90.

For correct classifications, the results of CapsNet are concentrated at higher probability values, almost always above 0.70, whereas a larger percentage of the CNN results fall below 0.70. Moreover, for incorrect classifications, an excellent classifier should attribute the failure to a hesitant state, that is, the output probabilities of all types should be similar and low. However, the CNN results are concentrated at high probabilities above 0.90, meaning that many samples are strongly misclassified; CapsNet's results are the opposite. Detailed probability distribution comparisons are shown in Fig. 7. In summary, CNN's strong predictions for both correct and incorrect classifications result in lower reliability than CapsNet. CapsNet's strong predictions for correct classifications and weak predictions for incorrect classifications can effectively help inspectors screen the results in specific situations.
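
The statistics plotted in Fig. 7 can be reproduced from the maximum-probability values with a few lines of numpy; the values below are illustrative only, and the 0.70 and 0.90 thresholds are those used in Fig. 6.

```python
import numpy as np

def reliability_summary(confidences):
    """Spread of the max-probability values and the shares below 0.70 / above 0.90."""
    c = np.asarray(confidences)
    return {
        "std": float(c.std()),
        "share_below_0.70": float(np.mean(c < 0.70)),
        "share_above_0.90": float(np.mean(c > 0.90)),
    }

# Toy max-probability values for a set of correctly classified samples.
print(reliability_summary([0.95, 0.88, 0.99, 0.65, 0.93]))
```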

Figure 7

Detailed probability distribution comparisons. (a) The standard deviation of the probability distribution. (b) The proportion of probability value below 0.70 and above 0.90.

To demonstrate the advantages of CapsNet under limited data more intuitively, we designed a set of repeated experiments. We prepared training sets with different amounts of data, containing 400, 800, 1,200, 1,600, 2,000, 4,000, 8,000, 12,000, and 16,000 microseismic records, and we define data volumes of up to 2,000 records as limited training samples. For each amount of data, we trained and tested the models four times; as shown in Fig. 8, four models are therefore trained for classification at each data volume. The experimental results show that, with limited training samples, CapsNet still achieves high accuracy and stability, whereas the accuracy of CNN is lower and varies considerably. Consequently, CapsNet will outperform CNN in accuracy and stability in real applications, where labeled seismic or microseismic data remain scarce. Thus, CapsNet is the better option when little labeled data is at hand.

Figure 8

The results of repetitive experiments.

We propose a deep learning approach based on CapsNet to realize the automatic classification of microseismic records with limited samples in underground mining. CapsNet is a network composed of a series of interconnected capsules. To convert a microseismic record into an input for CapsNet, we divide the record waveform into 33 frames and extract 21 feature parameters from each frame; consequently, a 21 × 33 matrix representing the microseismic record is used as the input of the CapsNet. On this basis, we use training sets of different sizes to train the classification models separately. The trained models are tested on the same test set containing 3,200 microseismic records and compared with CNN. The results show that CapsNet achieves stable convergence faster than CNN with limited training samples. We then use Accuracy, Precision, Recall, and F1-Measure as evaluation indexes; the results show that CapsNet is superior to CNN and traditional machine learning methods on all indicators. Finally, we analyze the reliability of the classification results of CapsNet and CNN, and the results show that CapsNet performs better than CNN in terms of reliability. These results indicate the reliability and practicability of CapsNet for the automatic classification of microseismic records with limited samples in underground mining.

Methods

The principle of the CapsNet

At present, deep learning architectures based on CNNs are widely used in various fields, such as image recognition and autonomous driving26,27,28. However, owing to the convolution operation of a CNN, only the existence of a feature is retained in the recognition process, while the orientation of the feature and its spatial relationships are ignored. Moreover, the downsampling of the max-pooling layer discards much crucial information. Therefore, conventional deep learning methods represented by CNNs require a large amount of data for training29.

The Capsule Network (CapsNet) is a novel type of deep learning architecture that attempts to overcome the abovementioned disadvantages of conventional deep learning. Figure 9 shows a typical CapsNet architecture. The architecture is shallow, with only two convolutional layers (Conv1 and Conv2 in Fig. 9) and one fully connected (FC) layer30. The outputs of these layers are Conv1d, the Primary Capsules (PrimaryCaps), and the Digit Capsules (DigitCaps). CapsNet is robust to complex combinations of features and requires less training data. CapsNet has also led to unique breakthroughs related to the spatial hierarchies between features32. A capsule is a vector that can contain any number of values, each of which represents a feature of the object (such as a picture) that needs to be identified33. In a CNN, each value in a convolutional layer is the result of a convolution operation; since the convolution operation is a linear weighted summation, each value in a convolutional layer is a scalar. In CapsNet, however, each unit of a capsule layer is a vector, which can represent not only the presence of a feature but also the direction and state of the input.

Figure 9

A network architecture for CapsNet, consisting of three layers: two convolutional layers (Conv1 and Conv2) and one fully connected (FC) layer30,31.

Moreover, CapsNet uses the dynamic routing algorithm to transmit data between capsule layers (as shown in Fig. 10), which overcomes the shortcomings of the traditional pooling layer34. In the dynamic routing algorithm, a non-linear "squashing" function (Eq. 6) is used to ensure that short vectors shrink to almost zero length and long vectors shrink to a length slightly below 1.

$${\text{v}}_{j} = \frac{{\left\| {s_{j} } \right\|^{2} }}{{1 + \left\| {s_{j} } \right\|^{2} }}\frac{{s_{j} }}{{\left\| {s_{j} } \right\|}}$$
(6)
Figure 10

Dynamic routing algorithm that completes the transition from the PrimaryCaps layer to the DigitCaps layer.

where vj is the vector output of capsule j and sj is its total input. sj is a weighted sum of the prediction vectors \({\hat{\mathbf{u}}}_{j|i}\) from the previous layer, where \({\hat{\mathbf{u}}}_{j|i}\) is produced by multiplying the output ui by a weight matrix Wij:

$$s_{j} = \sum\limits_{i} {c_{ij} {\hat{\mathbf{u}}}_{j|i} }$$
(7)
$${\hat{\mathbf{u}}}_{j|i} = {\mathbf{W}}_{ij} {\mathbf{u}}_{i}$$
(8)

The cij in Eq. 7 denotes a coupling coefficient that is determined by the iterative dynamic routing process:

$$c_{ij} = \frac{{\exp (b_{ij} )}}{{\sum\nolimits_{k} {\exp (b_{ik} )} }}$$
(9)

where bij and bik are the log prior probabilities between two coupled capsules. bij is updated iteratively:

$$b_{ij} \leftarrow b_{ij} + {\hat{\mathbf{u}}}_{j|i} \cdot {\mathbf{v}}_{j}$$
(10)

The initial value of bij is 0. Therefore, in the forward propagation process of solving sj, the weight matrices Wij are initialized with random values, bij is initialized to 0 to obtain cij, and the dynamic updating of bij then continuously optimizes the coupling coefficients cij. This series of calculations realizes dynamic routing between the two capsule layers35.
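
A minimal numpy sketch of this routing-by-agreement procedure (Eqs. 6–10) is given below, assuming the prediction vectors \({\hat{\mathbf{u}}}_{j|i}\) have already been computed from Eq. (8); the capsule counts and dimensions are arbitrary examples.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-9):
    """Non-linear squashing of Eq. (6): keeps direction, bounds length below 1."""
    norm_sq = np.sum(s ** 2, axis=axis, keepdims=True)
    norm = np.sqrt(norm_sq + eps)
    return (norm_sq / (1.0 + norm_sq)) * (s / norm)

def dynamic_routing(u_hat, n_iter=3):
    """Routing by agreement between two capsule layers (Eqs. 7-10).

    u_hat : prediction vectors u_hat[i, j] of shape (n_in, n_out, dim_out),
            already computed as W_ij @ u_i (Eq. 8).
    """
    n_in, n_out, _ = u_hat.shape
    b = np.zeros((n_in, n_out))                                # log priors b_ij, initialised to 0
    for _ in range(n_iter):
        c = np.exp(b) / np.exp(b).sum(axis=1, keepdims=True)   # Eq. (9): softmax over output capsules
        s = np.sum(c[:, :, None] * u_hat, axis=0)              # Eq. (7): weighted sum
        v = squash(s)                                          # Eq. (6)
        b = b + np.einsum("ijd,jd->ij", u_hat, v)              # Eq. (10): agreement update
    return v

# Toy example: 8 input capsules routed to 4 output capsules of dimension 16.
rng = np.random.default_rng(0)
u_hat = rng.standard_normal((8, 4, 16))
v = dynamic_routing(u_hat)
print(np.linalg.norm(v, axis=-1))                              # output lengths, all below 1
```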

Apart from the coupling coefficients cij, which are updated by dynamic routing, the other convolution parameters of the network and the weight matrices Wij in the CapsNet are updated according to the loss function:

$$L_{k} = T_{k} \max (0,m^{ + } - \left\| {v_{k} } \right\|)^{2} + \lambda (1 - T_{k} )\max (0,\left\| {v_{k} } \right\| - m^{ - } )^{2}$$
(11)

where Tk = 1 when class k is present, and m+ = 0.9 and m− = 0.1 by default. The factor λ down-weights the loss for absent classes and stops the initial learning from shrinking the lengths of the activity vectors of all the capsules30.
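
A numpy sketch of this margin loss (Eq. 11) follows; the choice λ = 0.5 is the conventional default from the original CapsNet paper30, and the capsule lengths and labels below are toy values.

```python
import numpy as np

def margin_loss(v_lengths, targets, m_pos=0.9, m_neg=0.1, lam=0.5):
    """Margin loss of Eq. (11), summed over classes and averaged over the batch.

    v_lengths : (batch, n_classes) lengths of the output capsule vectors.
    targets   : (batch, n_classes) one-hot labels (T_k).
    """
    pos = targets * np.maximum(0.0, m_pos - v_lengths) ** 2
    neg = lam * (1.0 - targets) * np.maximum(0.0, v_lengths - m_neg) ** 2
    return float(np.sum(pos + neg, axis=1).mean())

# Toy example: two records, four classes.
v_lengths = np.array([[0.95, 0.10, 0.05, 0.20],
                      [0.30, 0.80, 0.10, 0.05]])
targets = np.array([[1, 0, 0, 0],
                    [0, 1, 0, 0]], dtype=float)
print(margin_loss(v_lengths, targets))
```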

Dataset

The Huangtupo Copper and Zinc Mine is located southwest of Hami city, Xinjiang Uygur Autonomous Region, China. Two large goaf areas (No.1 and No.2 goafs in Fig. 11) have formed in this mine because of the use of non-pillar sublevel caving. Moreover, as the lower and upper parts of the ore body are mined at the same time, a larger and more unstable goaf area (No.3 goaf in Fig. 11) has formed at the mining junction. The volumes of these three goafs are 120,068.60 m3, 42,633.25 m3, and 183,483.19 m3, respectively. Among them, the No.3 goaf area is much larger than the other two and is also the most dangerous. As shown in Fig. 11, the No.3 goaf area is interconnected with multiple mining routes, which poses a severe hazard.

Figure 11

Distribution and influence of goaf in Huangtupo Copper and Zinc Mine.

To understand the stability of the rock mass, a microseismic system is used to continuously monitor the area around the goafs and stopes. Eight single-component accelerometers with a sensitivity of 10 V/g and a sampling frequency of 10 kHz were installed in the Huangtupo Copper and Zinc Mine. Their coordinates are shown in Fig. 12.

Figure 12

Coordinates of the accelerometers installed in the Huangtupo Copper and Zinc Mine.

Hundreds of events are triggered in the Huangtupo Copper and Zinc Mine every day. Considering our goals of monitoring rock activity and providing early warning, these events are categorized into four types: microseismic events, blasts, ore extraction, and noise. All events triggered between September 2017 and January 2019 were manually labeled and selected as our dataset. An example of each type of event is shown in Fig. 13.

Figure 13

Examples of microseismic records. (a) A microseismic waveform; (b) the waveform of an ore-extraction event; (c) and (d) the waveforms of blasts; (e) and (f) the waveforms of instances of noise.

Pretreatment

The original waveform is segmented every 380 sampling points to form a frame, with an overlap of 80 points between adjacent frames to avoid large differences between them. As a consequence, we obtain 33 frames for each microseismic record, given that each record contains 10,000 sampling points. The purpose of waveform framing is to preserve the characteristics of the time sequence while transforming the waveform. Moreover, to maintain continuity between adjacent frames and attenuate the frequency leakage caused by signal truncation, each frame is multiplied by a Hamming window after the microseismic records are framed10. Assuming that the framed microseismic record is S(n), n = 0, 1, …, N − 1, multiplying the record by the Hamming window w(n) gives

$$S^{\prime}(n) = S(n) \times w(n)$$
(12)

where w(n) gives

$$w(n) = 0.54 - 0.46 \times \cos \left( {\frac{2\pi n}{{N - 1}}} \right), \, 0 \le n \le N - 1$$
(13)

where N is the number of sampling points within each frame.
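
A numpy sketch of this framing and windowing step (frame length 380, overlap 80, Hamming window of Eq. 13) is shown below; the random record stands in for a real 10,000-sample waveform.

```python
import numpy as np

def frame_record(record, frame_len=380, overlap=80):
    """Split a record into overlapping frames and apply a Hamming window (Eqs. 12-13)."""
    hop = frame_len - overlap                     # 300 samples between frame starts
    n_frames = 1 + (len(record) - frame_len) // hop
    window = np.hamming(frame_len)                # 0.54 - 0.46*cos(2*pi*n/(N-1))
    frames = np.empty((n_frames, frame_len))
    for i in range(n_frames):
        start = i * hop
        frames[i] = record[start:start + frame_len] * window
    return frames

record = np.random.default_rng(0).standard_normal(10_000)   # stand-in for one 10,000-sample record
frames = frame_record(record)
print(frames.shape)                               # (33, 380)
```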

Then, we extract time- and frequency-domain features from each frame. Table 3 gives an overview of the 21 frame-level features used in this study, which are among those employed most frequently in the literature. It is worth mentioning that these features were selected by a genetic algorithm (GA)-optimized correlation-based feature selection (CFS) method; for a more detailed implementation of the feature selection, see references35,36. The zero-crossing rate is used to determine whether a microseismic record is present in a frame37. Energy and energy entropy can indicate signal strength, and the strengths of different types of microseismic records show distinct differences38. The spectral centroid, spectral spread, spectral entropy, spectral flux, and spectral rolloff form the low-level spectral features, each of which describes the structure of the frame spectrum with a single quantity39,40; these features can be extracted in either linear or logarithmic frequency domains using spectral amplitudes, power values, logarithmic values, etc. Mel-frequency cepstral coefficients (MFCCs) are an interesting variation of the linear cepstrum that is widely used in signal analysis; they are the most widely used features in signal recognition, mainly because of their ability to concisely represent the signal spectrum41,42. Additionally, the harmonic ratio can be used to indicate the proportion of the signal composed of the non-microseismic part43.

Table 3 Definitions and descriptions of features.

As a consequence, a microseismic record is transformed into a 21 × 33 feature matrix by framing and feature extraction. Figure 14 shows the process and result of this transformation. The 21 × 33 feature matrix is the initial input of the CapsNet.
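
To illustrate how such a feature matrix can be assembled, the sketch below computes a small subset of common frame-level features (zero-crossing rate, energy, spectral centroid, spectral spread, and spectral entropy, using standard formulations that may differ in detail from Table 3) and stacks them frame by frame; extending the function to all 21 features of Table 3 yields the 21 × 33 input matrix.

```python
import numpy as np

def frame_features(frame, fs=10_000):
    """A few illustrative frame-level features (subset of the 21 in Table 3)."""
    zcr = np.mean(np.abs(np.diff(np.sign(frame)))) / 2            # zero-crossing rate
    energy = np.sum(frame ** 2)                                    # short-time energy
    mag = np.abs(np.fft.rfft(frame))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / fs)
    p = mag / (mag.sum() + 1e-12)                                  # normalized magnitude spectrum
    centroid = np.sum(freqs * p)                                   # spectral centroid
    spread = np.sqrt(np.sum(((freqs - centroid) ** 2) * p))        # spectral spread
    entropy = -np.sum(p * np.log2(p + 1e-12))                      # spectral entropy
    return np.array([zcr, energy, centroid, spread, entropy])

# Build a (features x frames) matrix; with all 21 features this becomes 21 x 33.
frames = np.random.default_rng(0).standard_normal((33, 380))       # stand-in for the framing step above
feature_matrix = np.stack([frame_features(f) for f in frames], axis=1)
print(feature_matrix.shape)                                        # (5, 33)
```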

Figure 14

The process of converting the microseismic waveform into the available input to the capsule network.