1 Introduction
Using a real-time monitoring system to collect the operations data of mechanical equipment in nuclear power plants (NPPs) for early warning in the early stage of equipment failure allows troubleshooting, which avoids major safety accidents, reduces unplanned shutdown maintenance of units, and reduces costs [1–3]. The establishment of a data-driven prediction models for mechanical equipment fault prediction has become an important means of predictive maintenance, and research on nuclear power machinery has gradually increased because of the recent and rapid development of artificial intelligence algorithms and big data technology [4–6]. Xie et al. [7] designed an online early warning system to track and predict the critical reaction of a nuclear reactor through two independent online simulation systems. Qian et al. [8] presented a hierarchical multi-dimensional method for fault detection of an NPP main pipeline. Min et al. [9] used a pattern recognition early warning system developed with AAKR technology to demonstrate the effectiveness of a real-time monitoring and early warning system for NPPs. Peng et al. [10] utilized the feature selection ability of association analysis and the depth confidence network (DBN) method to detect faults in nuclear power machinery. Yao et al. [11] introduced a fault diagnosis method for NPP full range simulators based on state information imaging. By using machine learning and image processing technology, historical data and synthetic grey image data are analyzed, and the system learns to achieve image feature extraction and classification to perform fault diagnosis.
A variety of data-driven approaches are used in the fault prediction of large mechanical equipment. Qin et al. [12] employed an approach based on time series and Bayesian discriminant analyses to solve the problems of type identification and diagnosis of concurrent faults without characteristic parameters in rotating machinery. Mehrdad et al. [13] presented a non-parametric single spline regression approach to construct the power curve model of the generator set. Aye and Heyns [14] proposed an optimal Gaussian process regression through the combination of simple mean value and variance to predict a low error rate for the remaining service life of low-speed bearings. The judgment methods for regression model accuracy are continuously developing. Jiang and Yin [15] proposed and applied a recursive total principle component regression-based design and implementation approach for efficient data-driven fault detection for vehicular cyber-physical systems. Gao et al. [16] established a partial least squares-aided data-driven model predictive control approach to improve prediction accuracy. Herp et al. [17] developed a statistical method of online extraction and prediction of turbine state based on Bayesian inference. The residuals of bearing temperature measurement were inferred online, and the prediction probability is calculated by the sample model and the risk function describing the state transition probability to predict the fault state in advance. Li et al. [18] proposed a method for rolling bearing fault identification based on the multifractal and grey system theories, aiming at the non-equilibrium and non-linear characteristics of bearing vibration signals and the complexity of the distribution of state indication information in the signal. Liu et al. [19] presented the thermal component of a fault prediction method based on the convolutional neural network (CNN) to address the disastrous consequences caused by thermal component faults of gas turbines. This method shows that the CNN is a feasible method to resolve thermal component fault detection. Liu and Karimi [20] established two machine learning models based on an artificial neural network and a high-dimensional model representation to predict the operation characteristics of steam turbines and air compressors and provided a basis for continuous health monitoring and fault diagnosis.
Deep learning can directly reflect the characteristics through the training sample data to reduce the influence of assumptions and simplification on calculation results and has recently been widely used for mechanical fault identification [21–23]. The RNN model shows good performance in capturing temporal correlations in data and can store and transmit the sequence information multiple times. Liu et al. [24] developed an RNN-based fault identification approach that uses a denoising auto-encoder based on a gated recursive unit to predict multiple vibration values of rolling bearings in subsequent time series. Hadi and Shahnazari [25] proposed a fault detection and isolation (FDI) method. Based on the RNN, this method models and inverts the nonlinear system, establishes a factory prediction model, and makes use of the residual generated from the model for fault identification. Wang et al. [26] developed an RNN-based algorithm to effectively handle the multi-classification fault diagnosis for wind power systems. Palau et al. [27] employed Weibull time to an event-RNN algorithm for distributed collaborative prognostics. The industrial gas turbine unit data and c-mapps engine degradation data set are used in the experiment. Wang et al. [28] analyzed the motor vibration signal and multi-scale stator current signal and presented a multi-resolution & multi-sensor fusion network model for motor fault diagnosis based on RNN.
The dimension of the data must be reduced, and more refined information must be used to conduct a comprehensive analysis of the collected data to refine and simplify the research. Principal component analysis (PCA) can effectively reduce the dimensionality of compressed data by retaining the original data feature information and solve the multi-variable correlation problem to reduce the complexity of the problem analysis and is widely applied in big data processing, pattern recognition, and image processing fields. Li et al. [29] established an optimized PCA model to perform fault detection of sensors in NPPs and verified that the model can detect and reconstruct the fault sensors well by simulation. Prusty et al. [30] employed PCA to reduce the dimension of a large number of plant signals transmitted by the Prototype Fast Breeder Reactor (PFBR) in a NPP, improving the decision-making capability of the operator in catastrophic conditions. Wu et al. [31] constructed a fault detection model of a pressurized water reactor in a NPP based on the BN-FDD system framework. PCA, fuzzy theory, and data fusion were used to promote data accuracy, and multiple sensor data were combined into one node data. Sharifi and Langari [32] divided the measurement space into several local linear regions associated with a PPCA model and presented a sensor fault diagnosis method for a nonlinear system by considering the data uncertainty. Xiang et al. [33] utilized the PPCA denoising model for rolling bearing fault prediction. In this model, the subspace of the principal component retains the more useful original information and fault signal, and the noise and related linear information are projected into the remaining subspace.
The above literature introduces the analysis methods of fault prediction for different mechanical equipment. The following problems regarding the application of fault detection and early warning for nuclear power rotating mechanical equipment must be improved: 1) imperfections in the source data and multiple variable redundancies; 2) monitoring and early warning during the creep period of equipment failure; and 3) the quantitative evaluation of the reliability of the prediction model.
This study combines pattern recognition technology and deep learning to present a fault prediction approach for steam turbines, pumps, and other mechanical equipment in NPPs based on Bayesian PPCA RNN to improve the aforementioned issues. After wavelet packet threshold denoising, the signal data are dimensionally reduced by using a Bayesian PPCA method. A fully connected RNN prediction model is established and verified by using the goodness of fit and mean square error. The model reliability is quantified by calculating the Bayesian factor and confidence. Combined with the prior information in the historical data set, the proposed method calculates the residual between the prediction and hypothetical health values to find the unit failure during the creep period.
2 Data integration analysis
For the monitoring data of rotating machinery in NPPs, the discrete wavelet packet transform (DWPT) method is first used to denoise the data. The number of monitoring data prediction variables must be reduced to simplify the research and reduce the prediction time and calculation cost. Because the variables with a certain correlation cannot be directly ignored, a Bayesian PPCA method [34] is employed to extract the principal component signal with the highest contribution rate for analysis and prediction. Considering the data uncertainty, PPCA estimates the discarded information as Gaussian noise.
There are N samples with dimension d, where d is the number of variables, and X=(x1,x2…xN); thus, each variable contains N denoised time series points. PPCA assumes the existence of q (q≤d) dimension hidden variables β. The hidden variable model relates the relevant matrix X of the relevant sample data and the matrix β of uncorrelated hidden response variables with the formula (1):
where w is the q×d weight matrix that describes the relationship between
The specific parameters are solved by the maximum likelihood estimation. From formula (2), the prior distribution of the hidden variable
Under the condition of an implicit variable
By substituting formula (4) into the Bayesian formula, the posterior probability density distribution of the hidden variable
where
In formulas (6) and (7), the covariance matrix of sample data
3 Fully Connected RNN Prediction Model
3.1 RNN Model Construction
In this study, an RNN suitable for sequence data modeling is used to predict a time series. The neurons with a cyclic structure retain and apply the state information of the previous moment as memory to current output calculation; thus, the nodes between the same hidden layers are connected. The RNN can transmit information laterally among neurons and partially express correlations within the data. This information transmission mode matches well with the state process of operational nuclear power machinery. The running state at a given moment will have a certain impact on the running state at the next moment, and the collected data also correlates.
According to the embedding dimension m and time delay τ, an RNN structure prediction model is constructed (Fig. 1), where
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F001.jpg)
In the model, the same weight parameters are used at different times, and the activation function uses the rectified linear unit (ReLu) function uniformly. Units with a certain amount of m are connected, and the last unit provides the output value. The hidden layer state
In formula (8),
After constructing the model, a backpropagation through time (BPTT) algorithm [35] is employed to train the RNN model. Based on formula (8), the loss function is established, and the minimum value of the loss function is calculated. The negative log likelihood function is used to establish the loss function:
where
3.2 Model Reliability Verification
The reliability represents the ability of the model to accurately reflect the characteristics of the data set and to predict the data information of future time nodes. To verify the reliability of the model, three methods are introduced: 1) goodness of fit
The model prediction value is assumed as
The null hypothesis is defined as
The Bayesian factor is the primary evaluation index of the Bayesian hypothesis test, which is the ratio of prior and posterior probabilities. If a Bayesian factor is significantly greater than 1, the sample information supports the null hypothesis
The posterior probability of the mean value μ can be further obtained by formula (14):
where
3.3 Fault Prediction
In this study, part of the data is used as a training set to construct the RNN prediction model. The abnormal signal is identified by setting the threshold in advance. The threshold
when
4 Illustration
This study uses rotating speed signal data of a pressure cylinder of a nuclear power turbine in April 2019 to explain the algorithm flow and model building. The rated rotating speed of the sampled turbine is 1500 rpm. The data set consists of two rotor speeds, one bearing group speed, and 720 time points. The data before April 20 is used as the training set to build the model and train the weights, and the data after April 20 is used as the verification set to verify the model reliability.
Figure 2 is a flow chart of fault prediction by the RNN model. After the original three-dimensional rotational speed signal is denoised by DWPT and reduced by PPCA, a one-dimensional time series signal is obtained. The signal data with the delay time and embedding dimension optimized by the enumeration method are set as a training dataset. The RNN model after training characteristic parameters is used to predict the value of the next time point, and the residual between the prediction and monitoring values is calculated for early warning.
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F002.jpg)
4.1 Data Denoising and Dimensional Reduction
Three speed signals are denoised by the DWPT. The time series signal is decomposed into three levels by using the db8 wavelet packet, and the wavelet coefficients of each point are obtained. The wavelet coefficients are filtered according to the Bayesian threshold approach, and the signal is reconstructed. Figure 3 shows the noise and denoising data of the bearing group speed signal. The denoising signal is very similar to the original signal, and the trend is consistent. The feature information of the original signal is retained.
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F003.jpg)
The dimensions of each type of PPCA data are reduced to more than 70% of the cumulative variance contribution rate of the retained dimension. Thus, the proportion of information after dimensional reduction is more than 70%. Three speed signals are dimensionally reduced in this case. Table 1 shows the results of dimensional reduction, where
Weight | PC1 | PC2 | PC3 |
---|---|---|---|
w1 | 0.364 | -0.931 | -0.005 |
w2 | 0.658 | 0.261 | -0.706 |
w3 | 0.659 | 0.254 | 0.708 |
Contribution rate | 0.717 | 0.275 | 0.008 |
Cumulative contribution rate | 0.717 | 0.992 | 1.000 |
4.2 Determining the embedding dimension and time delay of the input layer
The time series data after dimensional reduction by PPCA are prone to chaos; therefore, the input layer must be determined by phase space reconstruction for the prediction model. By using the enumeration method with various time delay and embedding dimension combinations, the changes in the R2 and MSE parameters of the RNN model training are analyzed. Figures 4 and 5 show the changing trends of
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F004.jpg)
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F005.jpg)
4.3 RNN model prediction and verification
The fully connected RNN model is constructed according to the optimized time delay and embedding dimension. For the rotating speed principal component, the time delay is the interval time between two adjacent input data, and the embedding dimension is the number of RNN input units. After model training, the R2, MSE and Bayesian confidence
time delay | Embedding dimension | Training set | Verification set | |||
---|---|---|---|---|---|---|
R2 | MSE | R2 | MSE | λ | ||
τd=2 | m=3 | 0.965 | 0.001855 | 0.935 | 0.003154 | 93.55% |
m=4 | 0.975 | 0.000951 | 0.944 | 0.002177 | 93.64% | |
m=5 | 0.952 | 0.002946 | 0.942 | 0.003242 | 92.37% | |
m=6 | 0.962 | 0.002127 | 0.934 | 0.002755 | 93.27% | |
m=7 | 0.956 | 0.002099 | 0.917 | 0.002046 | 93.45% | |
m=8 | 0.949 | 0.003509 | 0.918 | 0.003326 | 91.93% |
Figure 6 shows the comparison between the monitoring and predictive values of the rotating speed signal under three different conditions. Figure 6a illustrates the results of the proposed model and shows that the two curves coincide well. To analyze the effect of noise reduction, the rotating speed signals without DWPT are trained and predicted (Fig. 6b). The difference between two curves in the figure indicates that noise reduction of the source data is necessary. Furthermore, the traditional artificial neural network (ANN) model is applied for comparative analysis, and the predictive results are described in Fig. 6c. The degree of agreement between the two curves is lower than that of the RNN model in Fig. 6a. The
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F006.jpg)
4.4 Fault prediction of a two-stage impeller
A data set with fault points is applied to further verify the model reliability and test the early warning function for faults with the model. Figure 7 shows cracks (found in late May 2017 during major maintenance) in a two-stage impeller of a turbine unit in an NPP. The flaw detection results show that five blades have cracks, of which the shortest and longest measure approximately 29 and 45 mm, respectively.
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F007.jpg)
The rotating speed and vibration signals from February 10 to March 16, 2017 are extracted from the monitoring system as the experimental data set. Both the speed and vibration datasets consist of 34 days of monitoring data with 24 time points per day and a total of 816 time points. The signal data are divided into training (Feb. 10 to Mar. 2), verification (Mar. 2–4), and testing (after Mar. 5) data sets for modeling and fault prediction. Similar to the previous case, the two data sets are employed for the RNN model prediction after noise reduction and reconstruction of the embedding dimension and time delay. Table 3 lists some model prediction parameters and fault early warning parameters. The results of
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F008.jpg)
-202008/1001-8042-31-08-002/alternativeImage/1001-8042-31-08-002-F009.jpg)
Fault data set (m=4, τd=2) | R2(training) | MSE (training) | Positive threshold | Negative threshold | Systemalarm time | RNN model alarm time |
---|---|---|---|---|---|---|
Rotating speed | 0.967 | 0.001248 | 0.043382 | -0.173924 | 3/7 17:00 | 3/5 05:00 |
Vibration | 0.944 | 0.001619 | 0.139959 | -0.205049 | 3/7 17:00 | 3/5 21:00 |
5 Conclusion
Because of multiple variable redundancies and turbine data imperfections, the Bayesian PPCA method is used to preprocess the DWPT denoising data and obtain a data set with a high signal-to-noise ratio and low dimension. The rotating speed signal is reduced from a three- to one-dimensional principal component, and the contribution rate is more than 70%.
A fully connected RNN prediction model is established. The goodness of fit of each signal data is calculated to be higher than 0.93, and MSE fluctuates on the order of 0.001, which verifies the model reliability. Furthermore, a Bayesian hypothesis testing method, which considers the data uncertainty and prior information of the training set, is employed to quantify the model confidence. The Bayesian confidence values of the verification set under different embedding dimensions are calculated at more than 90%. In the comparison case study of the two-stage impeller cracking and the monitoring system, the RNN prediction model produces alarms 60 and 44 h in advance for the rotating speed and vibration signals, respectively. The prediction results indicate that the RNN model can effectively identify faults during the creep period.
Applications of fault detection and diagnosis methods in nuclear power plants: A review
. Prog. Nucl. Energ. 53, 255-266 (2011). https://doi.org/10.1016/j.pnucene.2010.12.001Nuclear power plant thermocouple sensor-fault detection and classification using deep learning and generalized likelihood ratio test
. IEEE T. Nucl. Sci. 64, 1526-1534 (2017). https://doi.org/10.1109/TNS.2017.2697919Research on fault diagnosis methods for the reactor coolant system of nuclear power plant based on D-S evidence theory
. Ann. Nucl. Energy. 112, 395-399 (2018). https://doi.org/10.1016/j.anucene.2017.10.026Validation of the discharge force analysis method and key influence factors study
. Nuclear Techniques, 43(4), 040007 (2020). https://doi.org/10.11889/j.0253-3219.2020.hjs.43.040007 (in Chinese)Development on calculation code CPAP for radioactive activation product of pressurized water reactor nuclear power plant
. Nuclear Techniques, 43(4), 040005 (2020). https://doi.org/10.11889/j.0253-3219.2020.hjs.43.040005 (in Chinese)A multi-field coupling simulation method for rod dropping behavior based on overset mesh
. Nuclear Techniques. 43(5), 050605 (2020). https://doi.org/10.11889/j.0253-3219.2020.hjs.43.050605 (in Chinese)Research of critical reactivity control online early warning technology in nuclear power plant
.Research on intelligent early-warning system of main pipeline in nuclear power plants based on hierarchical and multidimensional fault identification method
.Demonstration of the validity of the early warning in online monitoring system for nuclear power plants
. Nucl. Eng. Des. 349, 56-62 (2019). https://doi.org/10.1016/j.nucengdes.2019.04.028Research on intelligent fault diagnosis method for nuclear power plant based on correlation analysis and deep belief network
. Prog. Nucl. Energ. 108, 419-427 (2018). https://doi.org/10.1016/j.pnucene.2018.06.003A new approach for fault diagnosis with full-scope simulator based on state information imaging in nuclear power plant
. Ann. Nucl. Energy. 141, 107274 (2020). https://doi.org/10.1016/j.anucene.2019.107274Concurrent fault diagnosis based on bayesian discriminating analysis and time series analysis with dimensionless parameters
. IEEE Sens. J. 19, 2254-2265 (2019). https://doi.org/10.1109/JSEN.2018.2885377Wind turbine power curve modeling for reliable power prediction using monotonic regression
. Renew. Energ. 147, 214-222 (2020). https://doi.org/10.1016/j.renene.2019.08.060An integrated gaussian process regression for prediction of remaining useful life of slow speed bearings based on acoustic emission
. Mech. Syst. Signal Pr. 84, 485-498 (2017). https://doi.org/10.1016/j.ymssp.2016.07.039Recursive total principle component regression based fault detection and its application to vehicular cyber-physical systems
. IEEE T. Ind. Inform. 14, 1415-1423 (2018). https://doi.org/10.1109/TII.2017.2752709A partial least squares aided intelligent model predictive control approach
. IEEE T. Syst. Man. Cy-S. 48, 2013-2021 (2018). https://doi.org/10.1109/TSMC.2017.2723017Bayesian state prediction of wind turbine bearing failure
. Renew. Energ. 116, 164-172 (2018). https://doi.org/10.1016/j.renene.2017.02.069A rolling element bearing fault diagnosis approach based on multifractal theory and gray relation theory
. Plos One 11, e0167587 (2016). https://doi.org/10.1371/journal.pone.0167587Fault detection for gas turbine hot components based on a convolutional neural network
. Energies 11, 2149 (2018). https://doi.org/10.3390/en11082149Gas turbine performance prediction via machine learning
. Energy 192, 116627 (2020). https://doi.org/10.1016/j.energy.2019.116627Deep Learning based approach for bearing fault diagnosis
. IEEE T. Ind. Appl. 53, 3057-3065 (2017). https://doi.org/10.1109/TIA.2017.2661250Anomaly detection and fault analysis of wind turbine components based on deep learning network
. Renew. Energ. 127, 825-834 (2018). https://doi.org/10.1109/TIA.2017.2661250A new deep transfer learning based on sparse auto-encoder for fault diagnosis
. IEEE T. Syst. Man. Cy-S. 49, 136-144 (2019). https://doi.org/10.1109/TSMC.2017.2754287Fault diagnosis of rolling bearings with recurrent neural network-based autoencoders
. ISA T. 77, 167-178 (2018). https://doi.org/10.1016/j.isatra.2018.04.005Fault diagnosis of nonlinear systems using recurrent neural networks
. Chem. Eng. Res. Des. 153, 233-245 (2020). https://doi.org/10.1016/j.cherd.2019.09.026Multi-Fault diagnosis method for wind power generation system based on recurrent neural network
. P. I. Mech. Eng. A-J Pow. 233, 604-615 (2019). https://doi.org/10.1177/0957650919844065An Industrial Multi Agent System for real-time distributed collaborative prognostics
. Eng. Appl. Artif. Intel. 85, 590-606 (2019). https://doi.org/10.1016/j.engappai.2019.07.013Multilevel information fusion for induction motor fault diagnosis
. IEEE-ASME T. Mech. 2019, 24, 2139-2150. https://doi.org/10.1109/TMECH.2019.2928967Fault detection, identification and reconstruction of sensors in nuclear power plant with optimized PCA method
. Ann. Nucl. Energy. 113, 105-117 (2018). https://doi.org/10.1016/j.anucene.2017.11.009Feasibility of ANFIS towards multiclass event classification in PFBR considering dimensionality reduction using PCA
. Ann. Nucl. Energy. 99, 311-320 (2017). https://doi.org/10.1016/j.anucene.2016.09.015Framework for fault diagnosis with multi-source sensor nodes in nuclear power plants based on a bayesian network
. Ann. Nucl. Energy. 122, 297-308 (2018). https://doi.org/10.1016/j.anucene.2018.08.050Nonlinear sensor fault diagnosis using mixture of probabilistic PCA models
. Mech. Syst. Signal Pr. 85, 638-650 (2017). https://doi.org/10.1016/j.ymssp.2016.08.028Rolling element bearing fault detection using PPCA and spectral kurtosis
. Measurement. 75, 180-191 (2015). https://doi.org/10.1016/j.measurement.2015.07.045Mixtures of probabilistic principal component analyzers
. Neural Comput. 11, 443-482 (1999). https://doi.org/10.1162/089976699300016728Gated recurrent units based neural network for time heterogeneous feedback recommendation
. Inform. Sciences 423, 50-65 (2018). https://doi.org/10.1016/j.ins.2017.09.048A quantitative validation method of kriging metamodel for injection mechanism based on bayesian statistical inference
. Metals 9, 493 (2019). https://doi.org/10.3390/met9050493Application of bayesian networks in reliability evaluation, IEEE T
. Ind. Inform. 15, 2146-2157 (2019). https://doi.org/10.1109/TII.2018.2858281Uncertainty analysis of Gaussian plume model based on Bayesian MCMC method
. Nuclear Techniques, 43(4), 040009 (2020). https://doi.org/10.11889/j.0253-3219.2020.hjs.43.040009 (in Chinese)