Many-objective evolutionary algorithms based on reference point-selection strategy for application in reactor radiation-shielding design

NUCLEAR ENERGY SCIENCE AND ENGINEERING

Many-objective evolutionary algorithms based on reference point-selection strategy for application in reactor radiation-shielding design

Cheng-Wei Liu，

Ai-Kou Sun，

Ji-Chong Lei，

Hong-Yu Qu，

Chao Yang，

Tao Yu，

Zhen-Ping Chen

Nuclear Science and Techniques

Vol.36, No.6

Article number 105

Published in print Jun 2025

Available online 22 Apr 2025

DOI：10.1007/s41365-025-01683-7

CSTR：32136.14.NST.2025.06105

107807

In recent years, the development of new types of nuclear reactors, such as transportable, marine, and space reactors, has presented new challenges for the optimization of reactor radiation-shielding design. Shielding structures typically need to be lightweight, miniaturized, and radiation-protected, which is a multi-parameter and multi-objective optimization problem. The conventional multi-objective (two or three objectives) optimization method for radiation-shielding design exhibits limitations for a number of optimization objectives and variable parameters, as well as a deficiency in achieving a global optimal solution, thereby failing to meet the requirements of shielding optimization for newly developed reactors. In this study, genetic and artificial bee-colony algorithms are combined with a reference-point-selection strategy and applied to the many-objective (having four or more objectives) optimal design of reactor radiation shielding. To validate the reliability of the methods, an optimization simulation is conducted on three-dimensional shielding structures and another complicated shielding-optimization problem. The numerical results demonstrate that the proposed algorithms outperform conventional shielding-design methods in terms of optimization performance, and they exhibit their reliability in practical engineering problems. The many-objective optimization algorithms developed in this study are proven to efficiently and consistently search for Pareto-front shielding schemes. Therefore, the algorithms proposed in this study offer novel insights into improving the shielding-design performance and shielding quality of new reactor types.

Many-objective optimization problemEvolutionary algorithmRadiation-shielding designReference point-selection strategy

Introduction

Radiation-shielding designs for reactors aim to minimize the external radiation dose (ALARA) by selecting appropriate shielding materials and structures to meet the radiation-safety requirements of the personnel [1, 2]. Additionally, with the development of new types of nuclear reactors in various fields, the shielding design needs to balance safety standards with the consideration of miniaturization and lightweight design, such as for marine, transportable, and space reactors [2-5]. Consequently, the radiation-shielding design for newly developed reactors presents a typical multi-objective optimization problem, as it involves multiple design objectives and parameters, including the radiation dose, volume, and weight (Fig. 1.

Fig. 1

(Color online) Application of evolutionary algorithms in reactor-shielding design

Conventional shielding-design methods rely heavily on expert knowledge, and require manual iterations to achieve acceptable results. With the development of evolutionary optimization algorithms, some researchers have begun to employ single-objective optimization algorithms, such as the genetic algorithm (GA), particle swarm optimization, and differential evolution, to optimize the radiation dose as a single objective [5-7]. In recent years, studies have also utilized multi-objective (two or three objectives) optimization algorithms, such as the non-dominated sorting genetic algorithm II (NSGA-II) and multi-objective particle swarm optimization, to optimize the volume-dose trade-off as multiple objectives [8-12]. This means that evolutionary algorithms have received considerable attention and can provide new opportunities for complex shielding-optimization problems.

However, in practical shielding-optimization designs, the number of objectives to be optimized often exceeds four, resulting in a typical many-objective problem (four or more objectives to be optimized) [13, 14]. Conventional multi-objective optimization algorithms exhibit suboptimal performance when dealing with many-objective optimization problems [15, 16], failing to meet the requirements of achieving a global optimal solution in the shielding design. Therefore, in this study, we propose a reference point-based many-objective artificial bee-colony algorithm (RP-MOABC) and reference point-based non-dominated sorting algorithm (RP-NSGA) to solve many-objective shielding-optimization problems.

The remainder of this paper is organized as follows. Section 2 presents the mathematical model of the many-objective optimization problem in reactor radiation shielding and discusses the encoding methods for shielding design. In Sect. 3, we provide an overview of the fundamental principles of the proposed genetic-and bee-colony-based optimization algorithms. We also outline a detailed framework for their application in radiation-shielding designs. Section 4 presents the numerical models of reactor-shielding design applications, along with the corresponding numerical results. Finally, the concluding section summarizes the key findings and conclusions of this study.

Many-objective optimization problem in reactor radiation-shielding design

This section provides a concise overview of reactor radiation-shielding design and the associated mathematical model for the many-objective shielding-optimization problem. It begins with a brief explanation of the principles underlying the shielding design for neutrons and gamma rays in reactors. Next, a mathematical model is proposed to address the many-parameter, -objective, and -constraint reactor-shielding design. Finally, a special encoding method is proposed to represent the shielding parameters (such as the number of shielding layers, types of shielding materials, and their compositions), facilitating the optimization process.

2.1

Principles of radiation-shielding design

The primary objective of reactor radiation shielding is to design a shielding system composed of multi-layer and multi-material structures that surround the core facilities within the reactor, ensuring that the personnel are in an environment where the radiation dose meets the specified limits. This is achieved by utilizing suitable shielding materials and geometric configurations combined with multi-layered and multi-component structures to shield the radiation sources. The selection and arrangement of materials in each layer are based on their specific shielding capabilities and nuclear interactions with radiation particles, with the aim of minimizing radiation levels to the maximum possible extent.

The principles governing the design of radiation shielding are based on the interactions between the radiation particles and materials. Radiation predominantly comprises neutrons and gamma rays. Neutron interactions include absorption and scattering, which is further categorized into elastic and inelastic scattering. For neutron shielding, the secondary gamma rays generated during inelastic scattering reactions must be considered. The interactions of gamma rays include Compton scattering, photoelectric absorption, and pair production. Typically, neutron shielding requires materials containing heavy and light isotopes, such as light water, boron carbide, and polyethylene [17, 18]. Conversely, gamma-ray shielding requires materials with high-Z isotopes, including lead, concrete, and steel [19].

2.2

Many-objective mathematical model for radiation-shielding design

Many-objective optimization in practical applications aims to achieve multiple optimized objectives within specific constraints. However, owing to the inherent conflicts between these objectives, optimizing one objective is typically achieved at the expense of deteriorating the others. As a result, a unique optimal solution is elusive; instead, a set of Pareto-front solutions consisting of non-dominated individuals is obtained [20]. Radiation-shielding design presents a typical many-objective optimization problem in nuclear engineering. The goal is to minimize the mass and volume of the total shielding system while ensuring that the external-radiation dose satisfies the ALARA principle. However, weight, volume, and radiation dose are conflicting objectives that cannot be optimized simultaneously. In this study, we focus on minimizing the total weight, total volume, and axial and radial radiation doses outside the shielding layers. These objectives are influenced by various parameters, including the thickness of the shielding layer, materials used, and the composition of the materials used. Additionally, constraints are imposed on the fast-neutron, thermal-neutron, and gamma-ray flux rates outside the shielding layers during the optimization process. Reactor radiation-shielding design involves many objectives, parameters, and constraints. To address these challenges, the mathematical model in Eq. (1) is proposed: ${\begin{array}{l} \min F (X) = {(F_{R} (X), F_{W} (X), F_{V} (X))}^{T} \\ s . t . {\begin{array}{l} F_{R} (X) = R_{N} (X) + R_{P} (X) \leq R_{0} \\ F_{W} (X) = \sum_{m = 1}^{M} V_{m} \cdot ρ_{m} \leq W_{0} \\ F_{V} (X) = \sum_{m = 1}^{M} V_{m} \leq V_{0} \\ Φ_{N} (X) \leq Φ_{N_{0}} \\ Φ_{P} (X) \leq Φ_{P_{0}} \\ X = (x_{1}, x_{2}, x_{3}, \dots, x_{n - 1}, x_{n}), X \in ℜ \\ L_{i} \leq x_{i} \leq U_{i} (i = 1, 2, \dots, n) \\ . \end{array} \end{array}$ (1) Based on the optimization model (1), where X is the decision variable in the shielding design (geometry thickness, material type, material composition); the values $L_{i}$ and $U_{i}$ represent the lower and upper bounds of the variables, respectively; the vector space containing all decision variables is represented by X; $Φ_{N} (X)$ and $Φ_{P} (X)$ represent the neutron and photon flux rates outside the shielding layer, respectively, while $Φ_{N_{0}}$ and $Φ_{P_{0}}$ are their corresponding constraint values; $V_{m}$ represents the volume of the m-th shielding layer, and $ρ_{m}$ represents the corresponding density of the shielding material denoted by $m$ . $W_{m}$ is the mass of the shielding layers; $R_{N}$ and $R_{P}$ represent the neutron and photon dose external to the shielding layer, respectively; $F_{R} (X)$ , $F_{W} (X)$ , and $F_{V} (X)$ represent the radiation dose of the shielding layer and the total weight and total volume of the shielding system, respectively; $R_{0}$ , $W_{0}$ , and $V_{0}$ represent the constraints for the radiation dose, weight, and volume, respectively; $F (X)$ is the radiation-shielding design objective vector, and the minimized value is expected to be optimized for each dimension of the objective vector.

2.3

Encoding methods of radiation-shielding model

In recent years, the development of radiation-shielding designs has progressed significantly, and a series of studies combining evolutionary algorithms with various types of shielding designs have emerged. However, the decision vector in this type of research only covers the shielding structure and shielding-material-type changes, and no material-composition-optimization research has been previously carried out. In this study, the reactor-shielding model is simulated for the many-parameter problem in shielding optimization using both binary encoding and real-number encoding, which can be encoded to characterize the shielding-model geometry, material type and material composition, and coupled with a variety of evolutionary algorithms for calculation.

Fig. 2 illustrates the encoding process for the simple shielding scheme. First, the thicknesses of the shielding layers and the corresponding index numbers for the shielding materials are obtained. The composition is determined for materials with variable compositions, such as borated and lead-borated polyethylene. Second, the above parameters are encoded in real or binary numbers. The composition content, which is subject to fixed-sum constraints, is transformed using spherical-coordinate conversion [21], as in Eq. (2) prior to encoding. Finally, the encoding design is completed for all shielding layers. The purpose of encoding is to combine the shielding model with the evolutionary algorithm such that the model parameters can be optimized by the algorithm, and the final shielding scheme can be obtained for the reference of the designer. $\begin{array}{l} x_{i} = {(\sin θ_{1} \cdot \sin θ_{2} ... \sin θ_{n - 1})}^{2}, i = 1 \\ x_{i} = {(\cos θ_{i - 1} \cdot \sin θ_{i} ... \sin θ_{n - 1})}^{2}, i = 2, ..., n - 1 \\ x_{i} = {(\cos θ_{n - 1})}^{2}, i = n \end{array}$ (2)

Fig. 2

(Color online) Example of the encoding method for a radiation-shielding model

Many-objective evolutionary algorithms for radiation-shielding design

3.1

Scheme of radiation-shielding optimization with many-objective evolutionary algorithm

In this study, shielding-optimization methods are proposed by combining many-objective evolutionary algorithms with particle-transport calculation software that can optimize the reactor primary-shielding structures, material types, and material compositions to obtain optimal shielding-design schemes through an automated process. These methods are embedded into the multi-functional radiation-transport simulation platform (MOSRT) developed by the NEAL team [22, 23].

A schematic of the many-objective optimization of the radiation-shielding design is shown in Fig. 3, and the steps are described as follows.

Fig. 3

(Color online) Schematic of radiation-shielding design with many-objective evolutionary algorithm

(1) Preprocessing of the initial shielding model is performed using the MOSRT software. First, the CAD model of the reactor geometry is modeled with MOSRT, and the model is automatically converted into a particle-transport calculation model. The shielding regions to be optimized are labeled. Finally, the initial running parameters are set, including the maximum number of iterations g, population size N, and optimization-objective number M.

(2) The evolutionary algorithm is selected. In this study, the genetic and ABC algorithms are invoked as evolutionary algorithms to optimize the weight, volume, and region-specific radiation-dose rate of the reactor radiation-shielding model.

(3) The decision-vector encoding method (real-number encoding or binary encoding) is chosen based on the requirements, and the initial parent population $P_{g = 1}$ is generated.

(4) The initial offspring population $Q_{g = 1}$ is generated based on the parent-population characteristics. The GA uses selection, crossover, and mutation to generate the offspring population, and the bee-colony algorithm uses employed, onlooker, and scout bees to update the offspring population. The parent population $P_{g}$ is merged with the offspring population $Q_{g}$ to obtain the g-th generation combined population $C_{g}$ . The population is decoded and converted into a particle-transport input card to solve the objective values. The MOSRT has a built-in shielding-calculation module integrated with the Monte Carlo method (MC) [24].

(5) A fast non-dominated sorting strategy is applied to the merged population, and each algorithm uses the reference-point selection strategy [25] to generate the new parent population $P_{g + 1}$ . A performance-comparison analysis is conducted simultaneously with conventional methods that use crowding-distance strategy selection [26]. Specific strategies are explained in Sect. 3.5.

(6) Steps (4) and (5) are repeated if the maximum number of iterations is not reached, and the parent population $P_{g + 1}$ is continuously updated; otherwise, step (7) is implemented.

(7) The optimization process is completed, and the optimization results of each algorithm are thoroughly compared and analyzed.

3.2

Evolutionary strategy based on genetic algorithm

The GA [27] is a classical evolutionary strategy for simulating inheritance and evolution in nature. It searches for the solution space of optimization problems through operations such as selection, crossover, and mutation. GAs based on different evaluation strategies exhibit significant differences in performance when solving various problems [28]. The following section explains the basic operations of GAs.

Selection: In this study, a binary tournament-selection operation is used, where two individuals (the individual in the paper means a specific shielding scheme) are taken from the current population at a time, the dominance relationship between the two individuals is judged, and the individual in the dominant position is selected for subsequent crossover and mutation. If two individuals do not dominate each other, one individual is randomly selected for the subsequent genetic steps. The determination of the dominance relationship will be explained in Sect. 3.4.1.

Crossover: The crossover operation mimics the process of hybridization in biological evolution. It selects the chromosomal segments (genes) of two or more parent individuals for exchange to generate new offspring. In this study, we use a single-point crossover operation for GAs with binary encoding and a simulated binary crossover operation for GAs with real-number encoding.

Mutation: The mutation operation introduces a degree of randomness to the algorithm by changing some gene values on individual chromosomes to generate new individuals. The mutation operation can increase the diversity of the population and prevent it from falling into a locally optimum solution. The mutation operation can randomly select some genes of an individual to be changed, or mutate each gene on the chromosome with a particular probability. In this study, we use the bit-flip mutation operation for GAs with binary encoding and polynomial mutation operations for GAs with real number encoding.

3.3

Evolutionary strategy based on artificial bee-colony algorithm

The ABC [29, 30] algorithm is a global optimization technique based on swarm intelligence. It draws inspiration from the foraging behavior of bee colonies, where individual bees perform specific tasks and share information to collectively find the optimal solution to a problem. This study builds on the fundamental ABC algorithm and introduces improvements by employing two methods: a differential evolution search [31] and golden sine search [32]. The following sections explain the search strategies of the algorithm.

Employed bee search: In this phase, a number of hired bees consistent with the population size will be dispatched to search for historical food sources (solution space). In this study, the bee population searches using a differential-search method based on an elite strategy. Individuals in the Pareto optimal set (the Pareto optimal set will be explained in Sect. 3.4.2.) are randomly used as bootstrap terms, and the differential evolution operator is used to generate new solutions, as shown in Eq. (3): $X^{t} = X^{t 1} + R (X^{t 1} - X^{t 2}) + R (X^{t 1} - X^{t 3}),$ (3) where $X^{t 1}$ , $X^{t 2}$ , and $X^{t 3}$ are three mutually dissimilar solutions selected from the Pareto optimal set and R are random numbers between [0, 1].

The elite-guided search mechanism is an exploitation operation that unavoidably reduces the diversity of the population, and may cause the algorithm to converge locally. To better balance the global-search and local-exploitation capabilities, this study proposes a solution-space search method based on the golden sine function: $\begin{array}{l} X^{t} = X_{i} | \sin (r_{1}) | + r_{2} \cdot \sin (r_{1}) \cdot | c_{1} X^{t 1} - c_{2} X_{i} |, \\ c_{1} = a g_{s} + b (1 - g_{s}), \\ c_{2} = a (1 - g_{s}) + b g_{s} . \end{array}$ (4) In Eq. (4), X_i represents the i-th individual, $r_{1}$ and $r_{2}$ are random numbers belonging to $[0,2 π]$ and $[0, π]$ respectively, $c_{1}$ and $c_{2}$ are introduced as golden section coefficients in the position-update formula, respectively, and $a$ and $b$ are initial values set to $- π$ and $π$ , respectively. The golden section number $g_{s}$ is $(\sqrt{5} - 1) / 2$ .

The employed bees use Eq. (3) and Eq. (4) for searching with equal probabilities to balance the diversity and local-search capability of the population.

Onlooker bee search: The onlooker bees perform repeated optimization on the excellent food sources based on the roulette-wheel selection. The specific optimization operations are the same as those of the employed bees, and the onlooker-bee evaluation function is as follows: $f i t_{i} = \frac{d o m}{F o o d N u m b e r},$ (5) where $d o m$ represents the number of solutions in which the i-th feasible solution dominates among all feasible solutions, and $F o o d N u m b e r$ represents the total number of food sources.

The roulette-wheel selection probability is defined as $p_{i} = \frac{f i t_{i}}{\sum_{n = 1}^{F o o d N u m b e r} f i t_{n}} .$ (6) Food sources with higher probabilities have a greater probability of being explored in the subsequent bee stage, thereby facilitating the algorithm's efficient exploration of the space for excellent solutions.

Scout bee search: A food source is randomly selected from the last Pareto layer, its decision vector is randomized, and its corresponding objective values are reset.

3.4

Evaluation method of shielding scheme based on Pareto domination

3.4.1

Pareto domination of the shielding schemes

When evaluating the performance of different shielding-design schemes, in the case of a single objective (such as shield weight), a smaller shield weight indicates better performance of the design scheme. However, for multiple objectives, each shielding-design scheme has multidimensional attributes (such as shield weight, volume, and dose rate). The performance of shielding schemes cannot be evaluated simply based on numerical magnitudes.

First, the decision vectors must be decoded and converted into particle-transport program-input files, and the population objective values are calculated using the particle-transport program, as shown in Eq. (7): $\begin{array}{l} D_{pop} = [\begin{array}{l} X_{1} = [x_{1,1} x_{1,2} ......... x_{1, n}] \\ ... \\ ... \\ ... \\ X_{N} = [x_{N,1} x_{N,2} ......... x_{N, n}] \end{array}] \\ ⇓ \\ F_{pop} = [\begin{array}{l} F (X_{1}) = [F_{1,1} F_{1,2} ......... F_{1, M}] \\ ... \\ ... \\ ... \\ F (X_{N}) = [F_{N,1} F_{N,2} ......... F_{N, M}] \end{array}] \end{array} .$ (7) In this study, the performance-evaluation method for the shielding-design schemes is based on the Pareto-domination method. Specifically, when a Pareto-domination relationship exists between the two schemes, the scheme in the dominant position performs better than the other schemes in the dominant position. For any two schemes $X_{u}$ and $X_{v}$ in a set of schemes, $X_{u}$ dominates $X_{v}$ if Eq. (8) is satisfied: $D o m (X_{u}, X_{v}) = if {\begin{array}{l} F (X_{u}) \neq F (X_{v}) \\ W (X_{u}) \leq W (X_{v}) \\ V (X_{u}) \leq V (X_{v}) \\ R (X_{u}) \leq R (X_{v}) \end{array} \Rightarrow X_{u} ≺ X_{v} .$ (8) For constrained optimization problems, we employ the feasibility rule [33] to assess the superiority of the schemes. First, the feasibility of the scheme with respect to the constraints is evaluated. If the constraints are satisfied, then the constraint variable $R e s (X)$ is set to zero; otherwise, it is set to one. When comparing two schemes $X_{u}$ and $X_{v}$ with the same value of $R e s (X)$ , the domination judgment is based on Eq. (8). However, when the two schemes have unequal values of $R e s (X)$ , the scheme with a value of 0 dominates that with a value of 1, as shown in Eq. (9) and (10): $R e s (X) = {\begin{array}{l} 0 if r (X) \leq r_{0} (X) \\ 1 else \end{array}$ (9) $D o m_{res} (X_{u}, X_{v}) = {\begin{array}{l} D o m (X_{u}, X_{v}) if R e s (X_{u}) = R e s (X_{v}) \\ X_{u} ≺ X_{v} else if R e s (X_{u}) < R e s (X_{v}) \\ X_{v} ≺ X_{u} else, \end{array}$ (10) where r₀(X) is the constraint vector and $r (X)$ is the constraint value of the corresponding individual.

3.4.2

Fast non-dominated sorting strategy

Based on the Pareto-domination judgment method, fast non-dominated sorting (Fig. 3(e)) of the populations can be performed to divide the different levels of the non-dominated layers. First, all schemes in the population are judged for domination, and all schemes that are not dominated by others are removed from the population and constitute the non-dominated layer $F_{1}$ . Then, the population that eliminates the schemes in $F_{1}$ is again judged for domination, and all non-dominated schemes are removed from the population to constitute $F_{2}$ . This is repeated until all the schemes are deposited in the non-dominated layer of the corresponding rank.

The schemes at each level of the non-dominated layer are not dominated by each other, and the smaller the level of the non-dominated layer, the better the integrated performance of the schemes in the layer. The $F_{1}$ non-dominated layer is called the Pareto optimal set.

3.5

Pareto-set-selection strategy

When offspring are generated, the parent set $P_{g}$ and resulting offspring set $Q_{g}$ are combined into a set $C_{g}$ . The schemes at the smaller non-dominant level of the set are selected to enter the next generation until all schemes at level $F_{l}$ are selected to $P_{g + 1}$ , such that $P_{g + 1}$ is equal in size to $P_{g}$ . If all the individuals at level $F_{l}$ are selected to $P_{g + 1}$ , the next-generation population $P_{g + 1}$ is larger than $P_{g}$ . Therefore, a certain strategy is required to select individuals at level $F_{l}$ . In this study, crowding distance and reference-point selection strategies are used. The details are as follows.

3.5.1

Selection strategy based on crowding distance

Conventional multi-objective optimization algorithms use the crowding distance to compare schemes in non-dominated layers. Further judgment is made by calculating the CD value for each scheme. The calculation method is given in Eq. (11) and Fig. 3(f): $C D_{i} = {\begin{array}{l} \infty i = (1 or n) \\ \sum_{j = 1}^{M} \frac{F_{i + 1, j} - F_{i - 1, j}}{F_{j}^{\max} - F_{j}^{\min}} i =(2, ..., n - 1) \end{array} j =(1, ..., M) .$ (11) By summing the normalized distance of each scheme in each dimension of the objective space, we can determine whether the scheme has diversity in the current non-dominated layer. A scheme with a larger value of CD represents fewer similar schemes and is more likely to be selected for the next generation.

3.5.2

Selection strategy based on reference point

In the reference-point selection strategy [34, 35], the reference points are first predefined in the normalized hyperplane and the number of reference points should be close to the population size, which is defined by Eq. (12): $R e f C o u n t = (\begin{matrix} H + M - 1 \\ H \end{matrix}) = C_{H + M - 1}^{H},$ (12) where M denotes the number of optimization objectives and H denotes the number of divisions. A schematic of the reference points in the case of three objectives and six divisions is shown in Fig. 4. The purpose of generating reference points is to generate a set of reference vectors in the objective space, through which schemes in the population can be associated to ensure that the diversity of the population is maintained during subsequent evolution.

Fig. 4

(Color online) Reference points are shown on a normalized hyperplane for a three-objective problem with p =6

We define $S_{g}$ as the set consisting of all schemes in the $F_{1} - F_{l}$ layers and adaptively normalize all schemes in $S_{g}$ to the hyperplane. First, we define the ideal point $z = (z_{1}^{\min}, z_{2}^{\min}, ..., z_{M}^{\min})$ , where $z_{i}^{\min}$ is the minimum value of the i-th objective. Next, the objective values of each scheme in $S_{g}$ are transformed with respect to the ideal point as $F_{i}^{'} (X) = F_{i} (X) - z_{i}^{min}$ . Subsequently, the extreme point ( $z_{i}^{\max}$ ) on each (i-th) objective axis is determined, and the hyperplane is formed based on the M extreme points. Finally, the intercept $a_{i}$ of each objective axis is obtained and the normalized objective value is obtained from Eq. (13). $\begin{array}{l} F_{i}^{n} (X) = \frac{F_{i}^{'} (X)}{a_{i}} \end{array}$ (13) The normalized scheme in $S_{g}$ in the hyperplane is associated with a reference point. First, the reference points are connected to the ideal points to form a reference vector. Second, the Euclidean distance between each individual and the reference vector is computed. Finally, each individual is associated with the nearest reference vector.

After associating the reference points with the scheme in $S_{g}$ , the number of times the first l - 1 layers of the schemes are associated with each reference point $ρ_{j}$ is counted. Let $J_{\min} = {j : \arg \min_{j} ρ_{j}}$ be the set of reference points with the smallest $ρ_{j}$ . Select the reference point $\bar{j} \in J_{\min}$ .

If the selected $ρ_{\bar{j}} = 0$ , no scheme is associated with $\bar{j}$ in the first l - 1 layers. At this time, two choices are available. 1) A scheme in $F_{l}$ that is associated to this reference point exists. At this time, the individual with the shortest distance from the reference vector Euclidean is selected to join the population, and at the same time, $ρ_{\bar{j}}$ is increased by 1. 2) If the reference point $\bar{j}$ does not have a scheme associated with it in $F_{l}$ , then this reference point is skipped.

If $ρ_{\bar{j}} > 0$ , then at least one scheme exists in the first l - 1 layers associated with $\bar{j}$ . In this case, the scheme in $\bar{j}$ is chosen randomly to join the population, whereas $ρ_{\bar{j}}$ increases by one. The reference-point strategy ensures that the population is well-distributed and can guide the evolution of the population to a uniform Pareto front.

Numerical evaluation for radiation-shielding optimization problems

Two sets of numerical simulations are conducted, as follows:

(1) The GA and ABC algorithms are combined with Pareto-frontier-selection strategies based on the reference-point strategy or crowding-distance strategy, respectively, to test the optimization of a simple three-dimensional (3D) shielding problem, and the optimization performance of each strategy is compared.

(2) Constrained many-objective optimization of a complex shielding problem is conducted using the proposed optimization method and is compared with the initial shielding scheme.

In the following sections, we refer to the algorithms based on the reference-point-selection strategy as RP-NSGA and RP-MOABC, and those based on the crowding-distance strategy as CD-NSGA and CD-MOABC. The neutron database used in the simulation is ENDF/B-VIII.0 [36], and the photon database is MCPLIB84. The neutron flux-dose rate and photon flux-dose rate conversion factors used are NCRP-38 and ANSI/ANS 1977 [37], respectively. Particle-transport simulations are conducted using the Monte Carlo code MagicMC [38], and the overall optimization process, including evolutionary algorithms and the Monte Carlo code, are integrated into the self-developed software MOSRT (Fig. 5.

Fig. 5

(Color online) Radiation-shielding design optimization based on MOSRT software

4.1

Many-objective optimization for a simple 3D shielding problem

The first problem focuses on the multiple objectives of a simple 3D shielding structure, as shown in Fig. 6(a). The optimization objectives are to determine the total volume and weight of the R1-R10, U1-U5, and L1-L5 shielding layers as well as the radial lateral, axial upward, and axial downward dose rates of the shielding layers. The thickness of each shielding layer in the optimized model ranges from 0.3 cm to 13 cm. The reactor consists of a homogenized core with a height of 167.6 cm and radius of 78.8 cm. The source term is set as a fixed source with a probability distribution based on the Watt-fission spectrum. A comparative analysis of the optimization performance of each algorithm is performed using the average objective value and hypervolume [39] value.

Fig. 6

(Color online) Many-objective optimization results for the simple 3D shielding problem. (a) Schematic of the simple 3D shielding structure. (b) The average values of the volumes for shielding schemes in the Pareto front. (c) The average values of the weights for shielding schemes in the Pareto front. (d) The average values of the axial upward dose rates for shielding schemes in the Pareto front. (e) The average values of the axial downward dose rates for shielding schemes in the Pareto front. (f) The average values of the radial dose rates for shielding schemes in the Pareto front. (g) Hypervolume indicator results

In this problem, GA-based algorithms use binary encoding and ABC-based algorithms use real number encoding. The population size of all algorithms is set to 210, and the number of reference points of the algorithms using the RP strategy is 210 (the number of divisions H=6). The number of MC transported particles is 1.0×10⁷.

For the shielding-optimization process, the time complexity is mainly related to the number of evolutionary generations g and the population size N, where the total time complexity of the GA is $O (g N)$ and that of the ABC algorithm is $O (2 g N)$ . To ensure that the tests are conducted under identical conditions, the number of evolutionary generations of the GA-based algorithms is set to 100, whereas that of ABC-based algorithm is 50.

As observed in the average objective values shown in Fig. 6(b-f), the algorithms based on the reference-point strategy show excellent optimization results in terms of the weight and volume objectives. Compared with the conventional crowding-distance strategy, the average volume and weight values for the shielding schemes in the final generation of RP-NSGA are only 24.5% and 14.5% of those of CD-NSGA, and the average volume and weight values of RP-MOABC are only 17.3% and 9.77% of those of CD-MOABC. Regarding the dose rates in each region, the results in the final generation of each strategy are not significantly better or worse; the distribution of the results is relatively close, and the algorithm with relatively better performance is RP-NSGA. Overall, based on the average objective values, the algorithms using the reference-point strategy exhibit better convergence results.

To further evaluate the comprehensive performance of each algorithm scientifically, the hypervolume (HV) is chosen as the quantified factor to measure the convergence and distribution of the shielding schemes. The HV metric is a widely used performance evaluation method for engineering optimization problems, and does not require a real Pareto front to judge the strengths and weaknesses of a set of population schemes. The HV metric is calculated as shown in Eq. (14): $\begin{array}{l} H V = δ (\cup_{i = 1}^{S_{i}} v_{i}) . \end{array}$ (14) The combined performance of the algorithms can be determined by calculating the sum of the values of the hypervolume $v_{i}$ in space for all the schemes in the population. The larger the combined value of the hypervolume, the better the performance. Fig. 6(g) shows that the optimization performance from high to low follows the order of RP-MOABC, RP-NSGA, CD-MOABC, and CD-NSGA. The algorithms based on the reference-point strategy are comprehensively better than the conventional crowding-distance strategy for many-objective optimization problems.

4.2

Constrained many-objective optimization for a complex 3D shielding problem

The second problem focuses on conducting a constrained many-objective optimization for the complex shielding structures shown in Fig. 7(a). For the constraint objectives, the thermal neutron-flux rate on the outermost shielding layer of the primary shielding system must be less than 1.0×10⁵ n/(cm²·s), the fast neutron-flux rate must be less than 1.0×10³ n/(cm²·s), and the gamma-ray energy-flux rate must be less than 6.0×10⁶ MeV/(cm²·s). The optimization objectives include the total volume and weight of shielding layers R1–R8 and U1–U3, as well as the dose rates in the radial and axial upward directions of the shielding layers. The variable parameters are the shielding-layer thickness, shielding-layer materials, and partial material compositions (boron polyethylene, lead boron polyethylene, tungsten alloy). The thickness of each shielding layer in the optimized model ranges from 15 cm to 80 cm. The reactor consists of a homogenized core with a height of 167.6 cm and radius of 78.8 cm. The source term is set as a fixed source with a probability distribution based on the Watt-fission spectrum.

Fig. 7

(Color online) Constrained many-objective optimization results for complex shielding structure. (a) Schematic of the complex shielding structure. (b) Improvements of the optimized objectives with RP-NSGA. (c) Improvements of the optimized objectives with RP-MOABC. (d) Improvements of the optimized objectives with CD-NSGA. (e) Improvements of the optimized objectives with CD-MOABC

In this problem, GA- and ABC-based algorithms use real-number encoding. The population size of all the algorithms is set to 100, and the number of reference points of the algorithms using the RP strategy is 84 (the number of divisions H=6). The number of MC transported particles is 2.0×10⁸. For computational efficiency, eight AMD EPYC 7H12 CPUs (Dual 128 core, 2.6 GHz) are used for massive parallel computation.

A heat map of the optimization ratio is plotted for the last generation of optimized shielding schemes compared to the initial shielding scheme, where the optimization ratio is defined in Eq. (15): $\begin{array}{l} improvements = \frac{(F_{initial, j} - F_{i, j})}{F_{initial, j}} \times 100 % \\ i = 1, ..., N j = 1, ..., M \end{array} .$ (15) In Eq. (15), i represents the individual index, N represents the population size, j represents the dimension of the objective values, M represents the total number of objective values, and F represents the corresponding objective values. A higher value of the optimization ratio indicates a better optimization of the corresponding objective. Fig. 7(b-e) shows that the optimization effect of each method is obvious in the three objective dimensions of volume, weight, and radial dose rate. However, for the top-side dose rate, some of the schemes show a degree of degradation, but they still achieve a better scheme set compared to the initial shielding scheme. The optimization performances of RP-NSGA and RP-MOABC in each objective dimension are better than those of CD-NSGA and CD-MOABC, which proves that the algorithms proposed in this study are feasible for many-objective optimization problems.

In practical engineering shielding-design problems, the final design scheme can be selected from a Pareto set of optimization schemes based on specific requirements. In this study, the miniaturization schemes obtained by the RP-NSGA algorithm (Shield #1) and RP-MOABC algorithm (Shield #2) are selected for demonstration. The shielding structure is shown in Fig. 8(b-c), the dose field distribution is shown in Fig. 8(d-e), and the specific parameters and optimization effects are listed in Tables 1, 2, 3 and 4. The dose rates are computed with the MagicMC Monte Carlo code using the global variance-reduction method [40-43] with a statistical error of less than 5%.

Fig. 8

(Color online) Schematic of typical shielding schemes after optimization. (a) Initial shielding structure. (b) Miniaturization scheme of RP-NSGA. (c) Miniaturization scheme of RP-MOABC. (d) Dose field distribution of the RP-NSGA miniaturization scheme. (e) Dose field distribution of the RP-MOABC miniaturization scheme

Component ratios of various materials for each optimized shielding scheme

Shielding Scheme No.	Borated Polyethylene		Lead-Borated Polyethylene			Tungsten Alloy
	B₄C (%)	PE (%)	Pb (%)	B₄C (%)	PE (%)	W (%)	Ni (%)	Fe (%)
Initial shield	80.30	19.70	53.17	2.34	44.49	98.66	0.94	0.40
Shield #1	76.85	23.15	80.17	0.99	18.83	90.53	6.63	2.84
Shield #2	30.34	69.66	90.00	0.50	9.50	90.62	6.57	2.81

Objective values and optimization ratios of the optimized shielding schemes

Shielding Scheme No.	Volume		Weight		Radial dose rate		Axial upper dose rate
	Numeric (cm³)	OR (%)	Numeric (g)	OR (%)	Numeric (rem/hr)	OR (%)	Numeric (rem/hr)	OR (%)
Initial shield	1.01×10⁸	-	9.11×10⁸	-	9.90×10⁰	-	9.95E-1	-
Shield #1	8.17×10⁷	18.95	4.44×10⁸	51.19	9.36×10⁰	5.44	7.75E-1	22.09
Shield #2	8.15×10⁷	19.12	6.88×10⁸	24.50	9.77×10⁰	1.32	5.31E-1	46.62

Numerical values of constrained objectives for the optimized shielding schemes

Shielding Scheme No.	Radial outermost shield			Axial topmost shield
	Thermal neutron (n/(cm²·s))	Fast neutron (n/(cm²·s))	Photon (MeV/(cm²·s))	Thermal neutron (n/(cm²·s))	Fast neutron (n/(cm²·s))	Photon (MeV/(cm²·s))
Initial shield	1.06×10²	6.15×10²	3.56×10⁵	1.94×10²	7.39×10¹	5.35×10⁵
Shield #1	6.01E-2	3.57×10⁰	5.28×10⁵	2.53×10⁰	2.53×10⁰	2.88×10⁵
Shield #2	2.34×10¹	3.02×10²	3.95×10⁵	5.96×10⁰	1.81×10²	1.01×10⁵

Design parameters of the optimization schemes

Shielding layer No.	Initial shield		Shield #1		Shield #2
	Thickness (cm)	Material type	Thickness (cm)	Material type	Thickness (cm)	Material type
R1	28.7	B-Steel	28.2	Pb-B-PE	40.0	Pb-B-PE
R2	25.8	Pb-B-PE	33.3	Pb-B-PE	40.0	B₄C
R3	35.4	W-Ni-Fe	34.7	W-Ni-Fe	29.8	H₂O
R4	32.2	Pb-B-PE	31.0	B₄C	15.0	W-Ni-Fe
R5	36.9	W-Ni-Fe	33.2	B-PE	40.0	W-Ni-Fe
R6	21.8	W-Ni-Fe	22.0	Pb-B-PE	15.0	Pb-B-PE
R7	50.0	Pb-B-PE	20.0	W-Ni-Fe	20.0	B-Steel
R8	66.8	H₂O	20.3	H₂O	20.0	B-PE
U1	34.1	W-Ni-Fe	64.3	W-Ni-Fe	74.4	W-Ni-Fe
U2	58.0	H₂O	78.5	H₂O	80.0	W-Ni-Fe
U3	46.4	Air	20.0	Air	20.0	Air

Conclusion

With the development of new nuclear-reactor types, this study introduces two many-objective evolutionary algorithms based on a reference-point-selection strategy, and applies them to many-objective radiation-shielding optimization problems. The basic principles of the proposed evolutionary algorithms for shielding optimization are described in detail. The algorithms efficiently and accurately deliver an optimal set of shielding schemes that are more compact and lightweight and have lower dose rates. This study conducts a performance assessment based on a simple 3D shielding problem, with the results indicating superiority over evolutionary algorithms that rely on crowding-distance strategies. However, for the optimization of a complex shielding problem with multiple constraints for practical applications, the algorithms proposed in this study are able to obtain optimized design schemes with superior objective values compared to the initial scheme. Furthermore, by incorporating the composition of the shielding materials into the optimization process, the algorithms enhance the diversity and creativity of the shielding design. In summary, optimization algorithms can effectively identify excellent shielding schemes in the early stages of radiation-shielding design for nuclear reactors. This study provides significant guidance for radiation-shielding design and provides supplementary data during the conceptual design phase of novel nuclear facilities with limited engineering experience.

References

Y. Chen, B. Yan,

The technology of shielding design for nuclear reactor: a review

. Prog. Nucl. Energ. 161, 104741 (2023). https://doi.org/10.1016/j.pnucene.2023.104741