Optimizing near-carbon-free nuclear energy systems: Advances in reactor operation digital twin through hybrid machine learning algorithms for parameter identification and state estimation

NUCLEAR ENERGY SCIENCE AND ENGINEERING

Optimizing near-carbon-free nuclear energy systems: Advances in reactor operation digital twin through hybrid machine learning algorithms for parameter identification and state estimation

Li-Zhan Hong，

He-Lin Gong ，

Hong-Jun Ji，

Jia-Liang Lu，

Han Li，

Qing Li

Nuclear Science and Techniques

Vol.35, No.8

Article number 135

Published in print Aug 2024

Available online 24 Jul 2024

DOI：10.1007/s41365-024-01494-2

CSTR：32136.14.NST.202408135

971012

Accurate and efficient online parameter identification and state estimation are crucial for leveraging Digital Twin simulations to optimize the operation of near-carbon-free nuclear energy systems. In previous studies, we developed a reactor operation digital twin (RODT). However, non-differentiabilities and discontinuities arise when employing machine-learning-based surrogate forward models, challenging traditional gradient-based inverse methods and their variants. This study investigated deterministic and metaheuristic algorithms and developed hybrid algorithms to address these issues. An efficient modular RODT software framework that incorporates these methods into its post-evaluation module is presented for comprehensive comparison. The methods were rigorously assessed based on convergence profiles, stability with respect to noise, and computational performance. The numerical results show that the hybrid KNNLHS algorithm excels in real-time online applications, balancing accuracy and efficiency with a prediction error rate of only 1% and processing times of less than 0.1 s. Contrastingly, algorithms such as FSA, DE, and ADE, although slightly slower (approximately 1 s), demonstrated higher accuracy with a 0.3% relative L₂ error, which advances RODT methodologies to harness machine learning and system modeling for improved reactor monitoring, systematic diagnosis of off-normal events, and lifetime management strategies. The developed modular software and novel optimization methods presented offer pathways to realize the full potential of RODT for transforming energy engineering practices.

Parameter identificationState estimationReactor operation digital twinReduced order modelInverse problem

Introduction

1.1

Concept of Digital Twins (DTs)

The concept of digital twins (DTs) has gained significant traction in recent years, particularly in the engineering and industrial disciplines [1]. A DT refers to a virtual model that closely mirrors a real-world physical product, system, or process, serving as an effectively indistinguishable digital counterpart for practical purposes such as simulation, integration, testing, monitoring, and maintenance. DTs have been recognized as the fundamental premise of product lifecycle management, encompassing the entire lifecycle of the physical entity they represent, including creation, construction, operation/support, and disposal [2-6].

The value and fidelity of a DT representation depend on the specific use cases, considering that its granularity and complexity are tailored accordingly. Notably, a DT can be conceptualized and utilized even before its physical counterpart exists, allowing for comprehensive modeling and simulation of the intended entity’s lifecycle.

The origin of the DT concept can be traced back to NASA, which pioneered its practical definition in 2010 as part of its efforts to enhance the simulation of spacecraft using physical model representations [2, 6]. Since then, the development of DTs has progressed in tandem with advances in product design and engineering. Traditional manual drafting techniques have given way to computer-aided drafting and design and model-based system engineering approaches, which establish a direct link between the DT and signals from its physical counterparts.

In recent years, DTs have garnered significant attention in various industrial sectors. Notably, their application in the chemical processing industry has demonstrated versatility [7]. In addition, DTs are increasingly utilized in simulation-based vehicle certification and fleet management [8], as well as in addressing the challenges of unpredictable and undesirable behaviors in complex systems [9, 10]. A specific area of growth has been observed in the nuclear plant industry, where digital twin architectures have been developed for the management and monitoring of nuclear plants, reflecting the expanding scope and potential of technology [11, 12].

1.2

Previous Work on Reactor Operation Digital Twins (RODTs)

A significant amount of research has been conducted to explore the potential applications of reactor operation digital twins (RODT) in the nuclear energy domain. RODTs represent a specific instantiation of the DT concept, focusing on the numerical representation of nuclear reactors for real-time prediction, optimization, monitoring, control, and decision-making during their operational stage [13].

A previous study [13] introduced the first prototype of an RODT specifically tailored for the prediction of neutron flux and power fields in the operational stage of the HPR1000 reactor core [14]. The prototype demonstrated the feasibility of using the RODT for online monitoring and prediction of key reactor parameters, thereby enhancing operational understanding and performance assessment.

In [15], the forward model of the RODT was realized through the application of a nonintrusive reduced-order model constructed using an SVD autoencoder to learn the nonlinearity of the field distribution. Machine learning techniques, specifically leveraging k-nearest neighbour (KNN) and decision trees, are employed to establish forward mapping. Additionally, the inverse model adopts a generalized latent assimilation method for accurate estimation of the model parameters.

While demonstrating promise, opportunities remain for improving the efficacy of RODT for practical deployment. In particular, the inverse solver relies on a computationally expensive methodology that limits its potential integration within an online framework. Therefore, the aim of this work [16] is to enhance the key capabilities of the RODT. This includes proposing an advanced differential evolution algorithm to upgrade the inverse solver for improved efficiency and accuracy of parameter estimation. In addition, uncertainty quantification was conducted to validate the RODT’s performance considering noisy observational data. Numerical validation experiments were performed across representative domains for an HPR1000 pressurized water reactor core to demonstrate its potential for engineering applications.

The key steps for constructing an RODT include: (i) training a reduced-order model (ROM) using model order reduction (MOR) [17-20] techniques as well as an efficient forward model and (ii) adaptation and implementation of inverse models to infer the input parameters and the resulting field distribution.

1.3

Introduction to Reduced Order Models (ROMs)

Many mathematical models encountered in real-life processes present computational challenges when employed in numerical simulations because of their inherent complexity and large dimensionality. To address these challenges, MOR techniques have been developed to reduce the computational complexity of these problems, particularly in the context of simulations involving large-scale dynamic and control systems. The ROM was computed as an approximation of the original model by reducing the dimensionality or degrees of freedom associated with the model.

The preparation of an ROM model involves two distinct phases, the offline and online phases, which are integral to the creation of a simplified representation of a high-fidelity full-order numerical model, enabling efficient real-time simulations and analysis. The offline phase focuses on determining the inherent low-dimensional structure of the underlying full-order model and deriving the reduced basis functions from high-fidelity snapshots of the system. Intrusive MOR techniques [21] are employed when a comprehensive understanding of the governing equations and numerical strategies employed in the full-order model is available. This requires access to the detailed numerical framework of the full-order model, which may involve proprietary commercial codes. By projecting a full-order model onto a reduced space, intrusive ROMs provide a concise representation with reduced computational complexity.

However, in practical engineering scenarios [22-24], detailed information regarding a full-order model is often unattainable, and the solvers or codes implementing these models are treated as black-box entities. This limitation poses a challenge to the implementation of traditional intrusive MOR methods. To address this issue, nonintrusive MOR techniques have been developed. Non-intrusive MOR approaches aim to construct ROMs without relying on detailed knowledge of the numerical framework of a full-order model. Instead, these methods establish an input-output mapping between the input parameters and the reduced basis through data-driven techniques such as interpolation, regression, or machine learning, approximating the behavior of the full-order model based on a reduced set of parameters and basis functions.

ROMs prepared through the offline and online phases play a critical role in the development of RODTs. RODTs require the integration of high-fidelity models and solvers with real-time capabilities, and ROM-based emulators provide promising solutions [21-23, 25]. Leveraging ROM-based forward solvers, RODTs can perform both one-to-one forward simulations and one-to-many simulations for inverse problems, encompassing parameter identification [26-29], data assimilation [24, 30-35], sensitivity/uncertainty quantification [36-40], and optimization/control [37, 41-47].

1.4

Forward Model Construction

In line with the framework RODT, as investigated in a previous study [13, 15], we selected KNN [48] to construct the forward model. The choice of KNN is justified by its numerous advantages, which render it a suitable candidate for the forward model in this context.

First, it is a nonparametric algorithm, meaning that it does not assume any specific functional form for the relationship between the inputs and outputs. Second, KNN is relatively simple to implement and computes efficiently, particularly for low-dimensional input problems. The complexity of the algorithm depends primarily on the number of training samples, making it suitable for problems with moderate dataset sizes [49]. In a recent study [50], an investigation was conducted to explore the possibility of utilizing different values of K for each dataset, considering the information provided by the correlation matrix.

1.5

Inverse Model Construction

Generally, the inverse problem can be divided into two categories:

• Parameter Identification. Solving the parameter vector based on the information from observation vectors: In this scenario, we aim to estimate the unknown parameters of the system based on the available observation vectors. The observation vectors contain information about the system’s behavior or response under conditions that are characterized by the parameter vector.

• State Estimation. Solving the field distribution based on the information from observation vectors: In this scenario, the objective is to determine the unknown field distribution of the nuclear system based on the available observation vectors.

1.6

Contribution of this Work

A previous work [13, 15, 16] demonstrated the feasibility of employing the RODT framework in engineering applications. However, a need exists for a systematic and in-depth investigation of the core components of RODT, namely parameter identification and state estimation. In addition, the development of mature and modular software is crucial to facilitating the application of RODT in practical scenarios. This motivated us to develop an efficient and appropriate surrogate inverse model to accurately predict the parameters and states. This also motivated our research on hybrid optimization approaches that can effectively balance coarse- and fine-grid search (coarse–fine-grid search) methods, which are discussed in the following sections.

The contributions of this research are as follows:

• Investigation and implementation of novel gradient-free optimization algorithms specifically tailored to handle the challenges posed by the discontinuous surrogate forward mapping constructed with the KNN forward model of our RODT and comparing the convergence profile, stability, and accuracy-time performance.

• Exploration and evaluation of the proposed coarse–fine-grid search methods within various global optimization approaches, aiming to strike a balance between accuracy and computational efficiency in parameter identification and state estimation.

• Development and deployment of efficient modular RODT software integrated with the aforementioned algorithms into the inverse problem solver, which applies to other sustainable energy systems.

These contributions focus on advancing state-of-the-art RODT by addressing the unique challenges encountered in solving inverse problems in nuclear engineering and proposing novel methodologies for improved parameter identification and state estimation. Furthermore, our work addresses the critical challenge of achieving real-time, high-precision simulations in nuclear energy DT applications, significantly enhancing the capabilities of DTs in the nuclear energy sector and facilitating improved monitoring, control, and optimization of nuclear energy systems.

Methodology

The modeling process of an RODT is classified into offline and online stages.

• Offline phase: training a non-intrusive ROM, which includes (i) preparation of the full order snapshots; (ii) preparation of the reduced basis; (iii) preparation of the coefficient; and (iv) preparation of the surrogate forward model to predict the coefficient with the information of the parameter.

• Online phase: parameter identification and state estimation, which includes (i) building an inverse model to predict the parameter based on the information of the observation data (clean or with observation noise) and (ii) building an inverse model to predict the coefficient based on the information of the observation data (clean or with observation noise).

The methodology is illustrated in Fig. 1.

Fig. 1

Working flow of the RODT schema

2.1

Offline phase: Training a Non-Intrusive ROM

The variation in the power field Φ in a nuclear core is characterized by physical laws that are generally described implicitly by a governing equation [51], such as the neutron transport or diffusion equations, which is written as Equation 1. ${\begin{array}{l} F (Φ (r, μ), μ) = 0 \\ r \in Ω \subset ℝ^{d} \\ μ \in D \subset ℝ^{p} \end{array},$ (2.1) where $Ω \subset ℝ^{d}$ represents the spatial domain of dimension d, with $d \geq 1$ , and $ϕ (μ) = Φ (r, μ)$ is defined in a Hilbert space that is equipped with an inner product $〈 \cdot, \cdot 〉$ , and the induced norm $‖ \cdot ‖ = \sqrt{〈 \cdot, \cdot 〉}$ . $D \subset ℝ^{p}$ represents the p-dimensional feasible parameter domain that covers the operation of the reactor core. In this study, Eq. 1 represents two-group neutron diffusion equations [52].

2.1.1

Preparation of the Full-Order Snapshots

In this study, the numerical solution of Eq. 1 was obtained using the proprietary CORCA-3D code package developed at the Nuclear Power Institute of China (NPIC) [53]. The CORCA-3D code is treated as a black-box solver; it can be substituted with other proprietary codes as needed.

To construct a non-intrusive ROM, it is necessary to gather a set of full-order solution snapshots. We sample μi in the parameter domain 𝒟 to obtain the discrete set $D = {μ_{i} \in ℝ^{p} | μ_{i} \in D, i = 1, \dots, P}$ , representing the entire set 𝒟. Using CORCA-3D, we obtain a solution snapshot set $M = {ϕ (c μ) \in ℝ^{N} | c μ \in D}$ .

The knowledge of this manifold is implicitly dependent on the knowledge of the parameters μ, and for better construction of a non-intrusive model, we endeavor to find a low-rank representation for the solution snapshot set 𝑴. Reduced-basis methods (RB) [54-56] indicates that in state where Φ is sufficiently regular, it can be approximated by an n-dimensional reduced basis {qi}, as follows: $ϕ (μ) ≅ ϕ_{n} (μ) = \sum_{i = 1}^{n} α_{i} (μ) q_{i} .$ (2.2) For simplicity, we denote the approximation of the solution manifold as follows: $ϕ_{n} (μ) = Q α (μ),$ (2.3) where $Q \in ℝ^{N \times n}$ represents the assembled n basis {q₁, q₂, ...., qn}, and $α (μ) \in ℝ^{n}$ denotes the n-dimensional coefficient.

2.1.2

Preparation of the Reduced Basis

For simplicity and time efficiency, we chose the singular value decomposition (SVD) method for generating the reduced basis. The steps are as follows:

Correlation Matrix. The correlation matrix C is computed by taking the inner product of each pair of snapshots, as expressed by the following equation: $\begin{matrix} C_{i, j} = \frac{1}{P} 〈 ϕ (μ_{i}), ϕ (μ_{j}) 〉, & \forall 1 \leq i, j \leq P . \end{matrix}$ (2.4) Here, $〈 ϕ (μ_{i}), ϕ (μ_{j}) 〉$ represent the inner product of the ith and jth snapshots. Subsequently, the eigenvalues λi and corresponding eigenvectors vi of the correlation matrix C are computed.

Proper orthogonal decomposition (POD). The jth POD basis vector qj, which is independent of parameter μ, is obtained as a linear combination of snapshots. $q^{j} = \sum_{i = 1}^{N} v_{i}^{j} ϕ^{i} .$ (2.5) In this equation, $v_{i}^{j}$ represents the ith element of the j-th eigenvector, and ϕi corresponds to the i-th snapshot. The magnitude of the j-th eigenvalue λj provides information about the relative importance of the jth POD basis vector.

Collection of the best basis minimizing the Frobenious error. The solution snapshot set is denoted as $M = {ϕ (μ_{k})}_{k = 1}^{P}$ . When applied to computation, we define the snapshot matrix as $S \in ℝ^{N \times P}$ , which contains snapshots $ϕ (μ_{k})$ as columns. The first n POD bases are assembled into a matrix $Q_{n} = [q_{1}, \dots, q_{n}] \in ℝ^{N \times n}$ . Among all orthonormal bases of size n, the POD basis minimizes the Frobenious norm least squares error (defined in the form of ${‖ • ‖}_{F} = \sqrt{\sum_{i}^{m} \sum_{j}^{n} {| •_{i j} |}^{2}}$ ) of the reconstruction of snapshot matrix 𝑺, which further corresponds to the Echart–Young theorem [57]: $\min_{Q_{n}} {‖ S - Q_{n} Q_{n}^{T} S ‖}_{F}^{2} = \sum_{k = n + 1}^{N} λ_{k},$ (2.6) where λk represents the kth singular value of matrix S. In summary, the POD basis constitutes a set of orthonormal vectors that offers the most effective n-dimensional representation of the given snapshots.

2.1.3

Preparation of the Coefficient

Following the aforementioned procedure, the coefficient is constructed as the projection of the snapshot matrix S onto the vector space spanned by the POD basis $Span {q_{1}, q_{2}, \dots, q_{n}}$

For any $ϕ (μ)$ belonging to the set 𝒮, the n-dimensional approximation of $ϕ (μ)$ is given by Eq. 2. The entries αi of the coefficient $α (μ)$ can be computed using an orthogonal projection as follows: $α_{i} = 〈 q_{i}, ϕ (μ) 〉 .$ (2.7) where qi is the POD basis.

Moreover, in matrix form, the coefficient matrix $A_{n} = [α_{1}, α_{2},..., α_{P}] \in ℝ^{P}$ follows the following relationship: $S ≅ S_{n} = Q_{n} A_{n} = Q_{n} Q_{n}^{T} S .$ (2.8)

2.1.4

Training a Forward Model

The goal of the forward model is to construct an efficient surrogate mapping model between the parameter and coefficient, as follows: $α (μ) ≅ F^{ML} (μ),$ (2.9) where F^ML denotes the surrogate forward mapping constructed by the machine learning approach, such as model-dependent machine learning methods or model-free optimization searching methods.

The response of the ROM model, denoted as $F^{ML} (μ)$ , corresponds to the predicted coefficient data. The image of 𝑫 under the forward map $F^{ML} (μ)$ represents the set of responses for all possible states. The difference $Y_{o} - H Q F^{ML} (μ)$ is referred to as the observation data misfit or the residuals associated with parameter μ.

We design the cost function as Eq. 12 based on the residual. The forward model is used in the search of the optimal approximation for the operator ϕ in the functional set ${Q F^{ML}, F^{ML} \in E (D, ℝ^{n})}$ , where $E (D, ℝ^{n})$ denotes the set of functions mapping from 𝑫 to $ℝ^{n}$ , with n the reduced dimension in the MOR process.

Previous research [13] evaluated various forward model approaches, among which the KNN algorithm demonstrated exceptional accuracy and efficiency. In this study, we specifically selected the KNN algorithm [48] as the forward model to exploit its full potential. To enhance the performance of the KNN algorithm, we focused on two crucial parameters that significantly influence its efficacy.

The first parameter relates to the voting distance in KNN, wherein we investigate the impact of utilizing either the Euclidean distance or Manhattan distance as the metric for determining the nearest parameter choice in the training set. Notably, these two distances can be expressed as Minkowski norms with an exponent of p = 1, 2, corresponding to the Euclidean and Manhattan distances, respectively. The Minkowski distance, a measure of the discrepancy between the testing and training parameters, can be mathematically represented as follows: ${‖ μ^{test} - μ^{train} ‖}_{p} = {(\sum_{i = 1}^{d i m (μ)} {| μ_{i}^{test} - μ_{i}^{train} |}^{p})}^{\frac{1}{p}},$ (2.10) where $d i m (μ)$ denotes the dimension of the parameter μ. Finally, we choose the Euclidean metric.

Another vital parameter is K, which refers to the number of candidates voted for. The optimal choice of K can be determined by plotting the relative L₂ error of the predicted observation and true vectors, which is expressed as follows.

From Fig. 2, we can deduce that when K = 5, our forward model using KNN performs with the best prediction accuracy, and the variation in the error is relatively smooth, which corresponds to the continuous variation in the value of the field at the corresponding position of the field manifold snapshot.

Fig. 2

Finding the optimal choice of K for KNN model (Forward on the left, Inverse on the right).(a) KNN Forward model; (b) KNN Inverse model

In contrast, the stair-shaped observation data predictions in Fig. 3 demonstrate that the KNN forward surrogate model has a step-like progression as neighbors are adjusted between discrete datasets and thus are not continuous or differentiable. In this manner, well-developed state-of-the-art algorithms based on gradient information for solving optimization and inverse problems did not work well in our case. To meet this demand, we investigated other novel approaches, which are discussed in detail in the following sections.

Fig. 3

Non-differentiable stair-shaped observation data prediction via the KNN surrogate forward model (tested when the first parameter St varies from 0 to 615 steps, the other parameters take the midpoint values of the parameter intervals, respectively, 1250 for Bu; 50 for Pw; and 295 for Tin). (a) L₂ norm of predicted observation; (b) 1th vertical level, the 43th assembly; (c) 18th vertical level, the 11th assembly

2.2

Online phase: Parameter Identification and State Estimation

The inverse problem in nuclear engineering refers to the task of determining the model parameters that yield the observed data, as opposed to the forward problem, which involves predicting the data based on the given model parameters. The objective is to determine the model parameters μ^* that approximately satisfy Equation 12.

The nature of the inverse problem depends on whether operator ϕ is linear or nonlinear. In most cases, particularly when solving systems governed by the neutron flux diffusion function, the inverse problem is nonlinear, owing to the nonlinearity of the forward map. Thus, we attempt to develop nonlinear algorithms, gradient-free algorithms, or algorithms based on the initial guess given by fast linear algorithms, such as KNN.

2.2.1

Setup of Inverse Model and Optimization

In the context of optimal control theory, the governing equations that describe the behavior of a physical system are commonly referred to as state equations. However, in many practical scenarios, interest lies not only in the physical state itself but also in its impact on certain objects or quantities. Furthermore, only a limited amount of data can often be obtained from the physical state. To address these considerations, an additional operator denoted by $H$ , known as the observation operator, is introduced. This operator maps the state of the physical system denoted by Φ to the desired observations denoted by $Y_{o} \in ℝ^{m}$ , where m denotes the total number of observations. Given a field Φ, the observation vector Yo is given by the following equation and approximated by an error: $Y_{o} = H Φ = H Q F^{ML} (μ) + e,$ (2.11) where $e \in ℝ^{m}$ represents the observation noise vector that captures the presence of observational errors. The quality of a given set of model parameters μ is evaluated using the L₂ distance between the simulated and practical observations, which is quantified using the following cost function: $J (μ) : = {‖ H Q F^{ML} (μ) - Y_{o} ‖}_{L_{2}}^{2} .$ (2.12) The objective of the inverse problem is to minimize the observation error, which can be formulated as $μ^{*} = \underset{μ \in D}{argmin} J (μ) : = \underset{μ \in D}{argmin} ({‖ H Q F^{ML} (μ) - Y_{o} ‖}_{L_{2}}^{2}) .$ (2.13) It is important to note that the cost function described above is possibly nonlinear and non-differentiable with respect to the discontinuous function F^ML. Therefore, we developed the following algorithms to solve the optimization problem:

Comparison of Different Approaches for Solving Inverse Models

To solve inverse problems, machine learning and optimization methods can be employed to search for the optimal parameters or field distributions that best fit the observed data [58]. Additionally, as mentioned in Sect. 2.1.4, when we employ the KNN algorithm to construct the surrogate forward model, the surrogate forward mapping is non-differentiable and discontinuous. This poses a challenge because the traditional approaches used to solve continuous optimization and inverse problems [59] are not applicable in this context. This necessitates the exploration of novel inverse problem-solving methods suitable for our particular problem. Hence, this study investigates various deterministic optimization algorithms, metaheuristic algorithms, and their hybrids to address this challenge.

3.1

Exhaustive Direct Search with Latin Hypercube Sampling (LHS)

To evaluate and compare the different approaches to solving inverse models, it is essential to establish a benchmark for measuring their performance. In this section, we propose using exhaustive direct search (EDS) with latin hypercube sampling (LHS) as the benchmark method for inverse problems. It combines the accuracy of EDS methods, which have been widely used since the 1960s [60], with the sampling efficiency of LHS.

LHS [61] is a sampling technique that ensures a comprehensive exploration of the solution space while maintaining desirable properties. The range of each variable was divided into equally probable intervals, and the sample points were strategically placed to satisfy the requirements of a Latin hypercube, where each sample was the only one in its corresponding axis-aligned hyperplane. The advantage of LHS lies in its independence from the number of dimensions, which allows for efficient sampling without the need for an increasing number of samples. Furthermore, LHS facilitates sequential sampling, enabling the tracking of the already selected samples.

3.2

Exhaustive Direct Search with 2 Steps Latin Hypercube Sampling (LHS2STEPS)

To enhance the benchmark algorithm and mitigate the uncertainties introduced by the randomness of the sampling process, [13] introduced a two-step sampling strategy. This strategy involves further sampling the neighborhoods of the n₁ best candidates generated during the initial stages of the LHS algorithm. For the detailed pseudocode, please refer to 3.3, excluding the part related to ALHS.

3.3

Exhaustive Direct Search with Assembled 2 Steps Latin Hypercube Sampling (ALHS)

To improve our benchmark algorithm and further reduce the uncertainty caused by the randomness of the sampling process, we introduce a mean mechanism at the end of the classical LHS algorithm to calculate the mean of the n₁ top-rated candidates in the five clones, which are spread around the n₂ top-rated candidates in the first sampling process. The concept is illustrated in Algorithm 1.

Algorithm 1 Exhaustive Direct Search with Assembled 2 Steps Latin Hypercube Sampling
Inputs:
Observations: Y_o
Initial guess: μ_initial
Forward function: F^ML
Transformation operator: H
First stage sampling number: n^{s, 1}
Second stage sampling number: n^{s, 2}
$U^{1} = LHS (number = n^{s,1}, initial = μ_{initial})$
$μ_{1}^{*} = \underset{k = 1.. n^{s}}{argmin} ({‖ H V_{n} F^{ML} (μ^{k}) - Y_{o} ‖}_{2})$
for i from 2 to n₁ do
	$U^{1} = LHS (number = n^{s,1}, initial = μ_{initial})$	▷	n₁=5 in this article.
	$μ_{1}^{*} = \underset{k = 1.. n^{s}}{argmin} ({‖ H V_{n} F^{ML} (μ^{k}) - Y_{o} ‖}_{2})$
end for
for j from 1 to n₁ do
	$U_{j}^{2}$ =LHS (number =n^{s, 2} the initial = $μ_{i}^{*}$ )
end for
for k from 1 to n₂ do		▷	Start of the main part of ALHS, n₂=10 in this article.
	$μ_{k}^{*} = \underset{μ \in \cup U_{j}^{2}}{argmin} ({‖ H V_{n} F^{ML} (μ) - Y_{o} ‖}_{2})$
end for		▷	End of the main part of ALHS.
$\bar{μ} = (\sum_{k = 1}^{10} μ_{k}^{*}) / 10$
Outputs:
Parameter: $\bar{μ}$
Reconstructed field: $V_{n} F^{ML} (μ^{*})$

3.4

Efficient Deterministic Machine Learning Algorithm: KNN

In addition to the benchmark LHS method, another efficient deterministic algorithm that can be considered for solving inverse models is the KNN algorithm. KNN is a nonparametric classification and regression algorithm that can be adapted to solve inverse problems.

The KNN algorithm operates based on the principle of similarity. Given a new input data point, KNN finds the K-nearest neighbors in the training dataset based on the Euclidean distance and then generates the output value by averaging the output values of the K-nearest neighbors. The selection of the optimal value of K can be determined by graphing the relative L₂ error between the predicted observation vector and the true vector, as shown in Fig. 2. The analysis in Fig. 2 reveals that selecting K = 1 yields the optimal outcome in terms of minimizing the L₂ prediction error, thereby emphasizing its advantageous accuracy.

3.5

Metaheuristics Algorithms with Physical Constraints

Recently, the application of metaheuristic algorithms with physical constraints [62-64] has gained significant attention for inverse modeling. These algorithms offer powerful gradient-free optimization techniques that can effectively handle complex inverse problems while satisfying the physical constraints imposed by system dynamics.

In this manner, we implemented and adapted practical algorithms and constructed related solvers that were integrated into the developed RODT framework. These solvers are designed to address the inverse problem presented by Equation 12.

3.5.1

Differential Evolution Algorithm (DE)

The differential evolution (DE) algorithm [65] offers a promising approach for tackling inverse modeling problems. By leveraging its ability to handle nonlinear and nonconvex objective functions, DE can effectively explore the parameter space. Furthermore, DE provides mechanisms to incorporate physical constraints, ensuring that the generated solutions adhere to system dynamics. The concrete procedure of the DE algorithm can be found in a previous work [13], in which we used the best mutation strategy recommended in [66] for the mutation step in DE, and the mutation donor $V_{i, G}$ is given by Eq. 14. $V_{i, G} = μ_{r_{5}, G} + F \cdot (μ_{r_{1}, G} - μ_{r_{2}, G} + μ_{r_{3}, G} - μ_{r_{4}, G}),$ (3.14) where rk, G, $k \in {1,2,3,4,5}$ are randomly chosen from {1,2, ... NP}, NP is the number of populations, μ is a parameter, and F and G denote the scaling factor and the number of generations, respectively.

3.5.2

Particle Swarm Optimization Algorithm (PSO)

Particle swarm optimization (PSO) [67] is a population-based optimization technique inspired by the collective behavior of bird flocking or fish schooling. It involves particles representing potential solutions that update their positions based on the personal and global best positions. The algorithm iterates a set number of times by adjusting the particle velocities and positions. The best solution, represented by the global best position with the lowest objective function value, is returned.

3.5.3

Fast Simulated Annealing Algorithm (FSA)

The simulated annealing (SA) [68] algorithm, which was initially proposed for solving the well-known travelling salesman problem (TSP) [69], is a popular approach for solving optimization problems in various fields [70], including nuclear engineering [71].

SA was inspired by metallurgical annealing, in which a metal is heated and gradually cooled to achieve a more ordered configuration. Similarly, the SA starts from a high-energy state (initial solution) and progressively lowers the temperature until it converges to a state of minimal energy (optimal solution).

The fast simulated annealing (FSA) [72] is an adaptation of classical simulated annealing (CSA) [68], which improves the computational efficiency and convergence speed. CSA utilizes a local sampling approach and controls the fluctuation variance through an artificial cooling temperature, denoted as T_c(t). FSA modifies the cooling schedule to accelerate convergence using a cooling temperature T_f(t) that decreases reciprocally rather than logarithmically with time t.

3.5.4

Cuckoo Search Algorithm (CS)

In the cuckoo search (CS) algorithm [73], nests are used to represent potential solutions, and eggs within the nests symbolize candidate solutions. Through an iterative process, the algorithm generates new solutions by employing Lévy flights and subsequently evaluates their fitness. If a newly generated solution is superior to an existing one within a given nest, it replaces the incumbent solution. To maintain population diversity and improve the overall quality of solutions, the population is sorted based on fitness, and a predetermined number of solutions are replaced with new solutions generated via random walks. The algorithm terminates upon reaching a predefined maximum number of generations, and the best solution, characterized by the lowest objective function value, is returned as the final outcome.

CS utilizes a balanced combination of a local random walk and a global explorative random walk, controlled by a switching parameter pa. A local random walk is defined as follows: $μ_{i}^{t + 1} = μ_{i}^{t} + α \otimes H (p_{a} - ϵ) \otimes (μ_{j}^{t} - μ_{k}^{t}) .$ (3.15) In the above equation, $μ_{j}^{t}$ and $μ_{k}^{t}$ represent two solutions randomly selected through permutations. $H (•)$ denotes the Heaviside function, ⊗ represents point-to-point multiplication, ϵ is a random number drawn from a uniform distribution, and s represents the step size. The global random walk, on the other hand, is conducted using Lévy flights, as follows: $μ_{i}^{t + 1} = μ_{i}^{t} + α L (s, λ),$ (3.16) where L(s, λ) is defined as $\begin{matrix} L (s, λ) = \frac{λ Γ (λ) \sin (π λ / 2)}{π} \frac{1}{s^{1 + λ}}, & (s ≫ s_{0} > 0) . \end{matrix}$ (3.17)

3.5.5

Artificial Neural Network (NN)

The neural network (NN) algorithm [74], inspired by the behavior of the human brain, simulates interconnected artificial neurons to process information, learn from data, and estimate unknown parameters or states. They consist of interconnected artificial neurons organized into layers that receive input signals, process them through weighted connections, and produce output signals using activation functions. NNs can learn from input-output examples by adjusting the connection weights during training, enabling them to capture underlying patterns and correlations, making them suited for solving physical problems [75].

The combination of the Adam and L-BFGS optimizers in an alternating manner has demonstrated efficacy in training NNs for complex problem solutions [76]. This approach leverages the strengths of the two optimizers to enhance training and improve the overall network performance.

However, owing to NN’s continuous property, it cannot handle discrete problems well, particularly when the test data are noisy, which is further discussed in Sect. 5.

3.6

Hybrids of Optimization Methods for Inverse Problem: Balancing Coarse–Fine-Grid Search

To address the challenges posed by the discontinuity of KNN surrogate forward mapping and noisy observations, we propose hybrid algorithms that combine the strengths of deterministic machine-learning methods for coarse-grid search and metaheuristic algorithms for fine-grid search. This methodology can also be found in fields that encompass feature extraction of aerial data [77], automatic free-form optics design [78], and hyperparameter optimization [79]. The objective of a coarse–fine-grid search is to strike a balance between global exploration and local exploitation [80] to achieve accurate, efficient, and less sensitive parameter estimation. The hybrid optimization hub designed accordingly can be further illustrated in Fig. 4.

Fig. 4

Hybrid optimization schema of coarse–fine-grid search

3.6.1

Cuckoo Search with Differential Evolution (CSDE)

In the later stages of the standard CS process, notable issues arise in the form of information waste and slow convergence. These issues stem from the execution of independent evaluations and the lack of sufficient mechanisms for information sharing within the population. Consequently, valuable information fails to propagate effectively, resulting in suboptimal convergence rates and reduced search efficiency.

From an alternative perspective, the underlying biological nature of mutations within the standard DE algorithm was explored. In this context, a novel mathematical elucidation of these mutations is sought. Notably, researchers in the natural sciences have observed that mutations can be understood as sequences of transformations that transpire within a DNA molecule. Furthermore, the transformations pertaining to modifications exhibited a statistical distribution that adhered to the Levy distribution. This finding suggests an analogy between mutations and Levy flight [81]. Drawing inspiration from this observation, it is conceivable to integrate the Levy flight process into the generation of the DE population.

In this regard, we developed the cuckoo search differential evolution (CSDE) algorithm, which lies within the framework of the CS algorithm, integrating the DE mutation operation into the production of the population for each generation. The first step of CSDE is the initialization: Define the objective function $f (•)$ . We set the population size N, problem dimension D, discovery probability Pa, step size α, scaling factor F, crossover probability CR, maximum number of iterations T, and search domain range [μlb, μub]. The steps are presented in Algorithm 2.

Algorithm 2 Cuckoo Search Algorithm with Differential Evolution
Step 1: Population initialization
Initialize the population P and the initial position using a random distribution within the search domain range [μlb, μub]
while $G \leq G_{max}$ do
	for each individual μi in population PG do
		Step 2: Population updating
		Generate random indices r1, r2, r3 (different from i) do
		for each dimension j in problem dimension dim(μ) do
			if random(0,1) < CR then
				$μ_{i, j}^{G} = μ_{i, j}^{G} + α \otimes (μ_{r 2, j}^{G} - μ_{r 3, j}^{G}) \otimes L (s, λ)$
			else
				$μ_{i, j}^{G} = μ_{i, j}^{G}$
			end if
		end for
		Step 3 Mutation Step
		Generate random indices r3, r4, r5 (different from i)
		Generate a donor vector $V_{i}^{G} \in ℝ^{dim (μ)}$ :
		$V_{i}^{G} = μ_{r_{3}}^{G} + F \cdot (μ_{r_{4}}^{G} - μ_{r_{5}}^{G})$
		Step 4 Crossover Step
		Generate a trial vector $U_{i}^{G} \in ℝ^{d i m (μ)}$ :
		for j=1 to dim(μ) do
			if $r a n d o m (0,1) \leq C r$ then
				$U_{i, j}^{G} = V_{i, j}^{G}$
			else
				$U_{i, j}^{G} = μ_{i, j}^{G}$
			end if
		end for
		Step 5 Selection Step
		Evaluate the trial vector $U_{i}^{G}$
		if $f (U_{i}^{G}) < f (μ_{i}^{G})$ then
			$μ_{i}^{G + 1} = U_{i}^{G}$
		else
			$μ_{i}^{G + 1} = μ_{i}^{G}$
		end if
		Step 6 Abandon nests
		Sort the population PG based on the fitness values in ascending order.
		Determine the number of solutions to be replaced, $n_{replace} = rand (p_{a} \cdot N P)$ .
		for i = 1 to $n_{replace}$ do
			Generate a new solution X in the search space with random walk given in Equation 3.15.
			Evaluate the objective function f for X.
			if f(X) < f(worst solution in PG) then
				Replace the worst solution in PG with X.
			end if
		end for
	end for
	G = G +1
end while

3.6.2

Advanced CSDE (ACSDE) and Advanced DE (ADE)

We utilize the SVC algorithm to classify the test parameters and denote the center of the classified hypercube as the initial machine learning estimate μ_SVC; then, we use the CSDE and DE algorithms to further explore the parameter space with the initial population spread around the center μ_SVC. In the SVC process, according to a previous research [15] on the importance rate of the input parameters, the last parameter, temperature Tin, contributes slightly, and we also consider that Tin varies at a relatively small boundary; thus, we only administer the SVC process to the first three parameters. We chose two representative metrics to evaluate the performance of the SVC model.

Confusion Matrix. The confusion matrix [82] provides a comprehensive overview of the binary classification results by displaying the number of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) predictions. This allowed us to assess the accuracy of the model’s by differentiating classes and identifying potential sources of misclassification.

We can infer from Fig. 5 that our SVC classification model predicts the first two parameters with relatively high accuracy while exhibiting confusion concerning the third parameter.

Fig. 5

(Color online) Confusion Matrix Plot that evaluates the efficacy of SVC prediction in the first 3 parameters (St, Bu, Pw, respectively)

F1-score. The F1-score [83], defined in Eq. 3.18, is a popular performance metric for binary classifier models. In Eq. 3.18, TP, FN, FP are the numbers of true positives, false negatives, and false positives classified by the model, respectively. $F 1 = \frac{T P}{T P + \frac{1}{2} (F P + F N)}$ (3.18) The predictive model employed in this study for each parameter is a multi-class SVC utilizing the “one-vs-others" strategy. This approach involves training separate binary classifiers to distinguish between the candidate intervals for a given parameter. Consequently, performance evaluation necessitates the adoption of the Macro F1-score as a metric. The Macro F1-score (Eq. 19) is computed as the average of the F1-scores obtained for each interval, reflecting the overall effectiveness of the classifiers in capturing intra-parameter variations: $F 1_{macro} : = \frac{1}{N} \sum_{i = 0}^{N} F 1_{i},$ (3.19) where i is the interval index, and N is the number of intervals.

Based on the Confusion Matrix shown in Fig. 5 and $F 1_{macro}$ , we designed the grid size of the fine-grid search process as follows:

3.6.3

K-Nearest Neighbour plus Exhaustive Direct Search with Latin Hypercube Sampling (KNNLHS)

Integrating KNN with LHS provides a robust methodology to address inverse problems. To enhance the comprehensiveness of our research, we standardized the grid size for a coarse–fine grid search. This standardization is consistent with the specifications listed in Table 1.

Grid size design for the Coarse–Fine-Grid search based on the SVC prediction evaluation

Parameters	St	Bu	Pw
$F 1_{macro}$	9.98E-01	9.93E-01	5.18E-01
Size coarse-grid (SVC)	41 steps	250 MWd/tU	8% FP
Size fine-grid (DE/CSDE)	41 steps	250 MWd/tU	20% FP
Relative size coarse-grid (SVC)	6.67%	10.00%	10.00%
Relative size fine-grid (DE/CSDE)	6.67%	10.00%	25.00%
Parameter boundary	[0, 615]	[0, 2500]	[20%, 100%]

In conclusion, the k-nearest neighbour plus exhaustive direct search with latin hypercube sampling (KNNLHS) method is a robust and efficient approach compared to similar strategies. This technique combines the exploratory capabilities of KNN in conducting coarse-grid searches with the efficiency and diversity of LHS in sampling local solution spaces. Consequently, the KNNLHS facilitates precise and efficient parameter estimation, accurate navigation of complex objective functions, and the management of noisy datasets. The effectiveness of this approach and its comparative advantages are discussed in Sect. 5.

Application to Nuclear Reactor

4.1

Setup of Reactor Physics Operation Problems

Setup of background information for HPR1000. The objective of constructing the RODT was to accurately predict the power distribution within the HPR1000 reactor core during operation. The core consists of 177 vertical nuclear fuel assemblies, 44 of which are equipped with self-powered neutron detectors (SPNDs) to measure neutronic activity and power fields. Figure 6 provides a visualization of a horizontal slice of the HPR1000 core and an axial slice of an SPND-equipped assembly, with only one-quarter of the core displayed owing to the symmetry along the x and y axes. The gray fuel assemblies represent those containing control rods, whereas the assemblies marked with D indicate the presence of SPNDs. For more detailed information regarding the HPR1000 reactor and generic neutronic physical model, please refer to [14]. Further descriptions of the model for data assimilation and the initial work on RODT can be found in [35] and [13], respectively.

Fig. 6

Quarter of the core in the radial direction (white square: fuel assembly with SPNDs, grey square: fuel assembly with control rods, D: fuel assembly with neutron detectors) [13]

During the normal operation of the HPR1000 reactor, two types of control rods were utilized for regulation. The first type, known as compensation rods, is responsible for coarse control and a substantial reactivity reduction. These compensation rods comprise four subtypes: G1, G2, N1, and N2. The second type, called regulating rods (R rods), are employed for fine adjustments to maintain the desired power or temperature [84]. Power evolution within the reactor is influenced by various factors, including the movement of the control rods, the burnup of nuclear fuel, variations in the power level of the reactor core, and fluctuations in the inlet coolant temperature. This evolution was mathematically modeled using two-group diffusion equations [13] and numerically solved using the CORCA-3D code package [85]. Developed at NPIC, the CORCA-3D code is capable of solving 3D few-group diffusion equations, considering thermal-hydraulic feedback, and performing pin-by-pin power reconstruction.

Setup of parameter domain. In this study, we consider CORCA-3D to be an opaque solver in which we input the parameter vector μ and obtain the output Φ. In particular, the power field Φ is constrained to be dependent on a set of "general" parameters that indicate the stage of the reactor’s life cycle: $μ : = (S_{t}, B_{u}, P_{w}, T_{i n}) .$ (4.20) • St: The control rod insertion steps, ranging from 0 to 615, represent the movement of the compensation rods from all rod clusters being out (ARO) to fully inserted. This range takes into account the overlap steps.

• Bu: The average burnup of the fuel in the entire core indicates the amount of energy extracted from the fuel and increases over time. Its value ranges between 0 (at the beginning of the fuel’s life cycle) and B_{u, max}, which is set to 2500 MWd/tU (the end of the fuel’s life cycle). The specific evolution of Bu depends on the reactor’s operational history.

• Pw: The power level of the reactor core ranges from 0.3 to 1 FP (full power).

• Tin: The temperature of the core coolant at the inlet falls within the range of 290 to 300°C.

In this manner, Φ can be implicitly represented by $ϕ (μ) = ϕ (S_{t}, B_{u}, P_{w}, T_{i n})$ , thanks to CORCA-3D. There are 177 fuel assemblies in HPR1000, and each assembly is numerically represented using 28 vertical levels. Thus, Φ is a vector with dimensions N=4956(=177 × 28). The discrete solution set 𝑴= ${ϕ (μ) \in ℝ^{N} | μ \in D}$ consists of P=18480 solution snapshots with the parameter configuration $D : = B_{u}^{s} \otimes S_{t}^{s} \otimes P_{w}^{s} \otimes T_{i n}^{s}$ , where $S_{t}^{s} = {0, 1, \dots, 615}$ , $B_{u}^{s} = {0, 50, 100, 150, 200, 500, 1000, 1500, 2000, 2500}$ , $P_{w}^{s} = R U 3 (30, 100)$ and $T_{i n}^{s} = R U 3 (290, 300)$ . The operator RU3(a, b) represents three independent and identically uniformly distributed samplings in the closed set [a, b] where a<b. In this study, we selected 90% of 18480 snapshots for training the forward and inverse models, and the remaining 10% of snapshots were used for test purposes.

Setup of observation data and modeling the observation noise. In the context of RODT, the observations denoted by Y_o are used to infer the input parameter vector μ and subsequently determine the resulting power field Φ. The arrangement of these observations, organized node by node, is shown in Fig. 6. Each observation value represents the average value over the corresponding node in which the SPNDs are located. It is important to clarify that the observations used in the analysis were not obtained from direct engineering measurements but were derived from numerical simulations of Φ using CORCA-3D.

The noise associated with the observations can be modeled as either a Gaussian or a uniform distribution, depending on the specific requirements. Specifically, the noise term can be mathematically represented as follows: $e = Y_{o} \cdot N (0, σ, Y_{o}) or Y_{o} \cdot U (- δ, δ, Y_{o})$ In this context, the vector N(0, σ, Y_o) denotes a collection of random variables following a normal distribution. These random variables have the same dimensions as Yo and are characterized by a mean of zero and standard deviation of σ. Similarly, the vector $U (- δ, δ, Y_{o})$ represents a set of random variables uniformly distributed within the interval [-δ, δ]. These variables have the same dimensions as Y_o.

Considering the prevalence of Gaussian noise in real-world nuclear reactors, our subsequent analysis primarily focused on evaluating the performance of the algorithm under Gaussian noise conditions.

Software implementation We integrated our software into the RODT platform using the following modules:

• rodtPro: This module implements data cleaning and preprocessing techniques based on domain expertise.

• rodtROM: This module constructs ROM through techniques like POD, SVD, and RB.

• rodtML: This module focuses on surrogate modeling and inverse model optimization. It also includes hyperparameter optimization techniques.

• rodtPost: This is a post-evaluation module that defines convergence metrics and conducts comparative analyses using evaluation methodologies derived from peer-reviewed literature.

• rodtUI: This module is responsible for visualizing the results, providing an intrinsic review of the methods, showcasing the optimized hyperparameters, and designing user-specific visualizations.

4.2

Practice Metrics for Inverse Model Evaluation

To evaluate the performance of the algorithms, we designed a series of numerical experiments using nuclear reactor data produced by an intrusive program with added noise. We carefully selected metrics related to accuracy and finally defined the normalized reconstruction fields L₂ error (E₂) and L_∞ error (E_∞) to evaluate the performance of the comparative algorithms.

The reconstructed field L₂ error is defined in Eq. 4.21 to evaluate the average estimation of the field, where ${‖ • ‖}_{2} : = {(\sum_{i} {(•_{i})}^{2})}^{1 / 2}$ denotes the L₂norm. $E_{2} : = {‖ Q F^{ML} (μ^{*}) - Φ_{true} ‖}_{2} / {‖ Φ_{true} ‖}_{2}$ (4.21) The reconstructed field L_∞ error is defined in Equation 22 to evaluate the worst field estimation, where $∥ • ∥_{\infty} : = \max_{i} | •_{i} |$ denotes the L_∞norm. $E_{\infty} : = {‖ Q F^{ML} (μ^{*}) - Φ_{true} ‖}_{\infty} / {‖ Φ_{true} ‖}_{\infty}$ (4.22) It should also be noted that the reconstructed observation L₂ error and L_∞ can be generated similarly.

Numerical Results

In this section, we describe the experiments conducted to evaluate the performance of parameter identification and state estimation for the proposed algorithms. The evaluation was performed on a test dataset consisting of 40 specimens from HPR1000, as described in Sect. 4. All the proposed inverse modeling approaches are utilized in the experiments, and it is important to mention that the cost function employed remains consistent with Eq. 2.12. Notably, we focused solely on testing the CSDE method and did not include the CS method. Furthermore, we introduce the proposed accuracy metric, illustrated in Sect. 4.2, to assess its performance.

5.1

Accuracy and Stability Analysis of the Parameter Identification Phase

In this section, we describe the tests conducted to evaluate the accuracy of parameter identification. Initially, we assessed the effectiveness of the proposed hybrid methodology, the coarse–fine–grid search, as discussed in Sect. 3.6, using the KNNLHS method as an illustrative example. To visualize this process, we generated scatter plots depicting the predicted values of the parameters St and Pw, which were identified as the two dominant parameters in a previous study [15] (see Fig. 7 for six cases with different input parameter settings.

Fig. 7

Distribution of generated candidate parameters

μ^{1} = S_{t}

and

μ^{3} = P_{w}

with hybrid algorithms. The solid green points are the true values, while the hollow yellow points stand for the candidate parameter generated by the mere KNN algorithm. The hollow blue triangles represent the five optimal samplings by LHS around the initial guess given by the KNN evaluated in the observation space. The solid red points represent the output of the algorithms, which is the best among the five optimal samplings

Figure 7 shows a visualization of the local search space in the inverse algorithms. The figures in the first row depict scenarios with varying Pw values ranging from relatively low to moderate to relatively high. Correspondingly, the figures in the second row illustrate cases with varying St values, again encompassing relatively low, moderate, and high values. These figures effectively show the considerable improvements achieved using the KNNLHS method, particularly in the simultaneous optimization of both Pw and St. The visualizations underscore the effectiveness of the method in yielding enhanced optimization outcomes.

The parameter prediction deviation is demonstrated in Fig. 8 over the test set. Owing to space limitations, we chose only KNN, KNNLHS, and FSA for illustration and tested clean and noisy (σ= 5%) observation data. Using the LHS algorithm, the KNN prediction for parameter identification became more accurate, and the prediction deviation for the FSA algorithm was concentrated at relatively low intervals, thus proving its excellent accuracy. In general, the accuracy of burnup, the power level, and the control-rod step are acceptable from an engineering perspective.

Fig. 8

Prediction deviation of μ with inverse model based on the clean or noised (σ= 5%) observation data

5.2

Comparison of the Convergence Rate among Metaheuristic Algorithms

The convergence rate is a crucial criterion for evaluating algorithms that are integrated with the generation design; thus, in this section, we compare different metaheuristic algorithms and their hybrids. Their convergence performance in 50 iterations (generation) with clean and noisy (σ=1% and σ=5%) observation data is shown in Fig. 9.

Fig. 9

Convergence rate of the cost function (defined by the L₂ norm) with different metaheuristic optimizers. The x-axis represents the iteration (generation), and we set the maximum iteration (generation) to 50

When the observation data were clean, the FSA exhibited the fastest convergence rate, followed by the advanced differential evolution (ADE). Moreover, when integrated within an SVC coarse-grid search, both the DE and CSDE algorithms demonstrated the ability to escape the local optima in the early stages while incurring a relatively low-cost function, thereby surpassing their standalone performance.

Among the two population-based algorithms, DE outperformed CSDE. This can be attributed to two factors: (i) the adoption of the best mutation method for DE, where we use the optimal mutation strategy for DE while using the classical strategy; and (ii) the relatively straightforward nature of our problem, wherein the inverse problem lacks multiple optima. Consequently, the CS step in each generation of CSDE may lead to unnecessary exploration of the parameter space that is distant from the optimal solution.

In the presence of noise-contaminated observation data, FSA continues to exhibit superior convergence capabilities. However, its advantage diminishes as the level of Gaussian noise reaches 5%, eventually leading to a decline in advantage. Notably, the ADE and advanced cuckoo search differential evolution (ACSDE) failed to deliver satisfactory results under noisy conditions. This is primarily due to the erroneous predictions of SVC, which further leads to inadequate initial estimates provided by population-based algorithms.

5.3

Evaluation of State Estimation Phase by Accuracy

In this section, we evaluate various methods in the state-estimation phase based on accuracy. First, we compared the accuracy and stability under two metrics: the relative error in the L₂ norm defined by Eq. 21 and the L∞ norm defined by Eq. 22 for field Φ, which is presented in Table 2. The ‘Mean’ values in the tables represent the average accuracy across 40 tested specimens, while the ‘STD’ values indicate the standard deviation of errors over the same set of specimens. The optimal values are indicated in bold.

Comparison of reconstructed power field errors using different algorithms

noise level	0	0.01	0.02	0.03	0.04	0.05
Mean L2
KNN	1.0662E-02	1.1128E-02	1.1461E-02	1.1461E-02	1.3296E-02	1.3541E-02
LHS	4.0457E-02	4.0210E-02	4.0823E-02	4.0754E-02	4.1138E-02	4.1790E-02
ALHS	3.4294E-02	3.4510E-02	3.4837E-02	3.4865E-02	3.5218E-02	3.7221E-02
DE	5.3219E-03	6.6446E-03	7.8485E-03	9.2407E-03	1.1051E-02	1.3777E-02
ADE	3.8119E-03	6.8077E-03	7.8185E-03	9.1585E-03	1.1134E-02	1.3526E-02
CSDE	1.7029E-02	1.6581E-02	1.7011E-02	1.7390E-02	1.8874E-02	2.0011E-02
ACSDE	7.6471E-03	4.4970E-02	4.5157E-02	4.6044E-02	4.7066E-02	4.8563E-02
PSO	1.0093E-02	1.1502E-02	1.3355E-02	1.4640E-02	1.6516E-02	1.7939E-02
FSA	3.8796E-03	5.1561E-03	6.5906E-03	8.1864E-03	1.0106E-02	1.2991E-02
NN	2.6737E-02	9.8060E-02	1.3113E-01	1.6029E-01	1.8785E-01	2.1606E-01
LHS2STEPS	3.1108E-02	3.1410E-02	3.1664E-02	3.2919E-02	3.3712E-02	3.4975E-02
KNNLHS	7.0615E-03	9.0129E-03	9.6693E-03	1.0514E-02	1.2406E-02	1.3408E-02
STD L2
KNN	5.3359E-03	5.5301E-03	5.2201E-03	5.2201E-03	5.5587E-03	5.7721E-03
LHS	2.5494E-02	2.5404E-02	2.5938E-02	2.5916E-02	2.5778E-02	2.5591E-02
ALHS	2.3719E-02	2.3621E-02	2.3377E-02	2.3131E-02	2.2688E-02	2.2074E-02
DE	5.3195E-03	4.6991E-03	4.6245E-03	4.6625E-03	5.7638E-03	8.1052E-03
ADE	3.1455E-03	4.6463E-03	4.5439E-03	4.6656E-03	6.0923E-03	7.7264E-03
CSDE	9.7295E-03	7.3634E-03	7.3791E-03	7.2028E-03	7.1571E-03	7.0336E-03
ACSDE	5.2449E-03	2.2811E-02	2.4206E-02	2.4100E-02	2.3438E-02	2.2736E-02
PSO	1.2374E-02	5.3048E-03	6.9606E-03	6.2216E-03	6.6237E-03	7.2507E-03
FSA	6.2602E-03	3.5832E-03	3.5640E-03	3.7438E-03	4.6444E-03	6.0076E-03
NN	3.4667E-02	7.3152E-02	8.3294E-02	9.8081E-02	1.1251E-01	1.2140E-01
LHS2STEPS	2.1742E-02	2.1617E-02	2.1475E-02	2.1528E-02	2.1408E-02	2.0572E-02
KNNLHS	4.9637E-03	5.3025E-03	4.9676E-03	5.8222E-03	5.9288E-03	6.1111E-03
Mean L∞
KNN	4.1213E-02	4.3991E-02	4.4840E-02	4.4840E-02	4.9788E-02	4,8775E-02
LHS	1.3126E-01	1.2567E-01	1.2326E-01	1.1998E-01	1.1904E-01	1,2479E-01
ALHS	1.0932E-01	1.1062E-01	1.1454E-01	1.1529E-01	1.1662E-01	1,2263E-01
DE	2.4302E-02	2.7539E-02	3.2492E-02	3.8239E-02	4.4672E-02	5,2324E-02
ADE	1.5357E-02	2.8079E-02	3.2408E-02	3.7967E-02	4.4269E-02	5,1787E-02
CSDE	5.8757E-02	6.0849E-02	6.3653E-02	6.5595E-02	6.9252E-02	7,2312E-02
ACSDE	3.1167E-02	1.4168E-01	1.4022E-01	1.4177E-01	1.4747E-01	1,4928E-01
PSO	4.6008E-02	4.6052E-02	5.2500E-02	5.6560E-02	6.2328E-02	6,7784E-02
FSA	1.6543E-02	2.2407E-02	2.7837E-02	3.3938E-02	4.0081E-02	4,9525E-02
NN	8.1834E-02	2.3886E-01	3.0443E-01	3.6030E-01	3.8488E-01	4,1158E-01
LHS2STEPS	1.0144E-01	1.0014E-01	1.0167E-01	1.0810E-01	1.1415E-01	1,1607E-01
KNNLHS	2.7072E-02	3.7594E-02	4.0961E-02	4.3185E-02	4.7107E-02	5,0309E-02
STD L∞
KNN	2.7190E-02	3.0633E-02	2.8849E-02	2.8849E-02	2.8941E-02	2.9448E-02
LHS	8.7321E-02	8.2894E-02	7.8177E-02	7.6610E-02	7.7353E-02	8.1824E-02
ALHS	8.8403E-02	8.9334E-02	8.8679E-02	8.8367E-02	8.9894E-02	8.7995E-02
DE	2.2486E-02	1.8169E-02	1.9161E-02	1.9406E-02	2.4870E-02	3.2087E-02
ADE	1.5143E-02	1.8220E-02	1.9221E-02	2.0268E-02	2.5709E-02	3.0895E-02
CSDE	4.0726E-02	2.6558E-02	2.8031E-02	2.8521E-02	2.6937E-02	2.9098E-02
ACSDE	2.1241E-02	6.5362E-02	6.7719E-02	7.0399E-02	6.9278E-02	6.8662E-02
PSO	5.6853E-02	2.4197E-02	2.5349E-02	2.5845E-02	2.8182E-02	3.1462E-02
FSA	2.3181E-02	1.5810E-02	1.5634E-02	1.6278E-02	2.0246E-02	2.6014E-02
NN	9.1032E-02	1.6232E-01	1.8105E-01	1.8648E-01	2.0080E-01	1.9789E-01
LHS2STEPS	7.1699E-02	6.9753E-02	7.1235E-02	7.3554E-02	7.4717E-02	7.2363E-02
KNNLHS	2.2045E-02	2.6180E-02	2.4270E-02	2.8580E-02	2.9683E-02	3.2208E-02

In general, integration of the LHS algorithm enhanced the accuracy of the KNN model. This improvement can be attributed to the LHS sampling approach, which mitigates the discontinuity issue associated with KNN predictions by sampling around the initial estimate.

Furthermore, the NN method exhibited a decline in accuracy when confronted with noise-contaminated observational backgrounds. This is attributed to its reliance on gradients, rendering it less effective in discrete cases.

To visualize the performance of the inverse algorithms directly, we randomly selected one specimen from a test dataset. Figures (11 and 12) depict the relative L₂ error of the field prediction at the 11th vertical level. Readers are referred to Fig. 10 for the true field values.

Fig. 11

(Color online) Reconstructed relative errors of radial power distribution on the 11th axial plane with clean observations

Fig. 12

(Color online) Reconstructed relative errors of radial power distribution on the 11th axial plane with observations of noise level σ= 5%

Fig. 10

(Color online) True value of the power distribution on one specimen in the test dataset over the core in a realistic HPR1000 reactor

To fully demonstrate the reconstruction accuracy of our approach, we examined cases in which the reconstruction was relatively poor, particularly around the control rod interfaces. Reconstruction of the physical fields in locations near the control rod interfaces is challenging because the behavior in these regions is not easily captured with high fidelity. Readers can refer to Fig. 10 for the true field value on this axis, and Figs. 13 and 14 depict the prediction errors of various methods on this axis.

Fig. 13

Predicted relative error for the D01 fuel assembly with clean observations

Fig. 14

Predicted relative error for the D01 fuel assembly with observations of noise level σ= 5%

However, our results also show that even in such difficult cases, our method can achieve good reconstruction in most cases. Although some techniques performed worse around the interfaces, the majority of our proposed methods still facilitated a reconstruction quality comparable to other areas across the domain. Overall, this validation indicates that the proposed approaches hold promise for representing complex reactor physics behaviors with suitable accuracy, even when challenges arise locally because of design complexities, such as control rod insertion points.

5.4

Evaluation of State Estimation Phase by Time Cost

In this section, we present an overview of the time costs (we take the mean of the 40 specimens) associated with each proposed algorithm in the state-estimation phase, which can be found in Table 3. The optimal values are highlighted in bold, and the second optimal value is underlined.

Comparison of time cost for different algorithms

noise level	0	0.01	0.02	0.03	0.04	0.05
KNN	6.3696E-03	3.1396E-03	3.1599E-03	3.1594E-03	3.1915E-03	3.1527E-03
LHS	7.3946E-01	7.3326E-01	7.2510E-01	7.3279E-01	7.2927E-01	7.2180E-01
ALHS	7.1437E-01	7.3674E-01	7.1443E-01	7.1613E-01	7.1730E-01	7.1050E-01
DE	3.0745E+00	2.9459E+00	2.8510E+00	2.9223E+00	2.8299E+00	2.7874E+00
ADE	2.9249E+00	2.9339E+00	2.8915E+00	2.9284E+00	2.9743E+00	3.0290E+00
CSDE	4.7069E+00	4.6679E+00	4.6691E+00	4.6531E+00	4.6615E+00	4.6227E+00
ACSDE	4.8054E+00	4.6563E+00	4.6374E+00	4.6886E+00	4.6303E+00	4.7069E+00
PSO	2.4404E+00	2.5169E+00	2.4969E+00	2.3198E+00	2.3551E+00	2.3151E+00
FSA	2.7848E+00	2.8079E+00	2.7324E+00	2.7212E+00	2.7090E+00	2.6977E+00
NN	4.1338E-04	4.2277E-04	4.0678E-04	4.0900E-04	4.0602E-04	4.0516E-04
LHS2STEPS	7.0137E-01	7.0546E-01	7.0171E-01	7.0508E-01	7.0827E-01	7.0630E-01
KNNLHS	7.0515E-02	6.4439E-02	6.3790E-02	6.4109E-02	6.3136E-02	6.3097E-02

It is worth mentioning that the NN method demonstrated notable speed. However, metaheuristic algorithms incur relatively high time costs. This is primarily attributed to the repeated computation of the cost function in Eq. 12, which occurs 50 times for each population member over 50 iterations/generation cycles, resulting in a total of 2500 cost function evaluations. Moreover, these algorithms involve additional computations and sorting operations for information exchange and identification of the best solutions, contributing to the discrepancy in time cost.

Furthermore, it is important to note that the methods LHS, ALHS, and LHS2STEPS have approximately the same time costs. In the algorithm LHS, we set the number of samples to 1000. In algorithms ALHS and LHS2STEPS, we set the first-stage sampling number to 750, and the second-stage sampling is repeated five times, with each repetition sampling number uniformly set to 50, adding up to 1000 in total. Model-based algorithms, such as KNN and its hybrid KNNLHS, have relatively low time costs, that is, less than one second, which is acceptable in the industry.

5.5

Comprehensive Evaluation of Various Algorithms

In the state-estimation phase, a comparison of the proposed algorithms in terms of time cost and accuracy is presented in Fig. 15. The KNNLHS algorithm demonstrates a commendable level of accuracy, achieving solutions within 0.1 s for both clean and noise-contaminated observation backgrounds. Consequently, KNNLHS can be considered suitable for real-time online inverse modeling in the context of RODT. Moreover, in scenarios in which time constraints are not limiting factors, the FSA, DE, and ADE algorithms provide reasonably accurate solutions.

Fig. 15

Comparison of the synthetic performance for different inverse algorithms

Conclusion and Future Works

In this study, we developed and implemented a series of powerful algorithms for the RODT, aiming to address the challenges in parameter identification and state estimation for nuclear reactor monitoring and optimization applications. Novel hybrid optimization algorithms, including ADE, CSDE, ACSDE, and KNNLHS, were designed and implemented to effectively solve the inverse problems posed by the discontinuous surrogate forward modeling approach. Both deterministic and metaheuristic optimization methods have been investigated for their applicability to nuclear engineering systems characterized by high-dimensional, nonlinear relationships. The integration of coarse and fine grid searching strikes an optimal balance between computational efficiency and solution accuracy.

Through extensive experimentation and evaluation, we conducted a thorough comparative analysis of parameter identification accuracy, field reconstruction accuracy, and convergence profile. FSA exhibited the fastest convergence rate, and the proposed ADE method follows closely behind. Moreover, integrating standard DE or CSDE algorithms within the SVC coarse grid search framework enables these techniques to effectively escape local optima in early iterations while maintaining relatively low-cost function values, surpassing their standalone implementations. Additionally, it is observed that pairing the LHS approach within the KNN surrogate model enhances the accuracy by mitigating discontinuity issues through local sampling around the initial guess. Thus, our approach effectively addresses the challenges of discontinuity and non-differentiability in surrogate forward mapping generated by the KNN model. Conversely, the NN method experiences declining accuracy when faced with noise-contaminated measurements, likely owing to overreliance on gradient information, which proves less effective under such uncertainty.

Our algorithms demonstrated high computational efficiency, with solutions obtained within 1 s and as fast as $\leq 0.08$ s for KNN, KNNLHS, and NN. They also exhibited robustness in handling noise-contaminated backgrounds. Our work contributes to the realization of RODT by providing effective, efficient, and reliable support for real-world nuclear power plant operation and sensor data. These algorithms establish a promising foundation for advanced system modeling and engineering decision support.

By specifically tailoring solutions to nuclear inverse and surrogate modeling challenges, this research advances the state-of-the-art RODT methodology. The improved parameter identification and state estimation facilitate the optimization of reactor monitoring, control, and performance through data-driven DT applications. In the future, novel methods can contribute to our RODT system, and with the same methodology, we can support the development and implementation of these transformative technologies in other industries requiring real-time DTs.

Continued work validating diverse systems and progressively integrating multi-physics modeling holds promise for fully realizing the potential of RODTs in transforming nuclear engineering practice with powerful real-time and decision-support capabilities. Realizing this potential requires ongoing progress, collaboration, and open innovation across expert areas.

References

F. Tao, H. Zhang, A. Liu, et al.,

Digital twin in industry: State-of-the-art

. IEEE Transactions on industrial informatics 15, 2405-2415 (2018).