Bayesian neural network evaluation method on the neutron-induced fission product yields of 232Th

NUCLEAR PHYSICS AND INTERDISCIPLINARY RESEARCH

Bayesian neural network evaluation method on the neutron-induced fission product yields of ²³²Th

Chun-Yuan Qiao，

Ya-Xuan Wang，

Chun-Wang Ma ，

Jun-Chen Pei，

Yong-Jing Chen

Nuclear Science and Techniques

Vol.37, No.3

Article number 41

Published in print Mar 2026

Available online 09 Jan 2026

DOI：10.1007/s41365-025-01858-2

CSTR：32136.14.NST.2026.0341

1200

Research on neutron-induced fission product yields of ²³²Th is crucial for understanding the competition between symmetric and asymmetric fission in actinide nuclei. However, obtaining complete isotopic yield distributions over a wide range of neutron energies remains a challenge. In this study, a Bayesian neural network (BNN) model was developed to predict the independent (IND) and cumulative (CUM) fission yields of ²³²Th under neutron irradiation at various incident energies. To address the limited availability of experimental data for the analysis of IND mass distributions, we substituted mass-number-based yields with the yields of specific isotopes. Furthermore, physical phenomena or quantities, such as the odd-even effect and isospin, were introduced as constraints to enhance the physical consistency of the predictions. The impact of these constraints was evaluated using mass-chain yield distributions and their dependence on energy. Incorporating physical constraints significantly improves the prediction accuracy, yielding more reliable and physically meaningful fission yield data for nuclear physics and reactor design applications.

Bayesian neural network232ThIndependent fission yieldCumulative fission yieldOdd-even effectIsospin

Introduction

Nuclear fission, a fundamental phenomenon in nuclear physics, plays a crucial role in a wide range of applications, including nuclear energy, national defense, and medical isotope production. It is also essential for studies on the synthesis of superheavy nuclei [1, 2], the analysis of reactor antineutrino energy spectra [3-5], and the investigation of nucleosynthesis via the r-process in neutron star mergers [6-8]. Accurate and comprehensive fission yield data are critical for understanding the dynamic evolution of fission process. In 2011, the Chinese Academy of Sciences launched the Strategic Priority Science and Technology Project entitled “Future Advanced Nuclear Fission Energy—Thorium Molten Salt Reactor (TMSR) Nuclear Energy System”, employing ²³²Th as the primary fuel [9]. The experimental TMSR reactor achieved its first criticality in October 2023. In this system, ²³²Th serves as a key component, participating mainly in neutron-induced reactions via the Th-U fuel cycle [10]. In particular, high-energy neutrons may induce the direct fission of ²³²Th. Consequently, for an accurate burn-up credit analysis, special attention must be paid to the fission product yields to reliably assess the reactivity feedback and characteristics of the spent fuel.

Currently, major international evaluated nuclear data libraries (such as JENDL [11], ENDF [12], CENDL [13], and JEFF [14]) provide comprehensive data for neutron-induced fission yields only at specific energies, that is, 0.0253 eV (thermal neutron), 0.5 MeV, and 14 MeV, respectively. However, the measured neutron-induced fission data are often sparse, with incomplete isotopic coverage, significant uncertainties, and notable discrepancies [15]. Obtaining complete isotopic yield distributions across a continuous range of incident neutron energies remains a challenge. Phenomenological models of nuclear fission (such as the Brosa [16] and GEF [17] approaches) rely heavily on experimental data for parameter calibration and validation. When such data are limited or absent, the underlying assumptions and approximations of these models become weakly constrained, resulting in increased parameter uncertainty and diminished predictive reliability. In the case of ²³²Th, the lack of systematic fission yield measurements at various neutron energies further hampers the direct application of such semi-empirical models, highlighting the urgent need for alternative approaches to obtain complete yield data.

With the development of artificial intelligence, machine learning (ML) techniques have found extensive and diverse applications in nuclear physics research [18-20]. These applications encompass a wide range of studies, including the prediction of nuclear masses for advancing nuclear theory [21-24], astrophysical modeling [25-27], and methodological comparisons [28, 29]; the determination of nuclear charge radii [30-34]; the estimation of α-decay and β-decay half-lives [35-37]; the inference of ground-state spin parities [38]; and the prediction of fission product yields [39-43]. Furthermore, in the realm of nuclear reactions, ML has been effectively employed to model cross sections across various mechanisms, such as proton-induced spallation [44-47], projectile fragmentation [48-51] and photonuclear reactions [52, 53]. ML algorithms can uncover hidden patterns in complex datasets and extract the underlying physical correlations [54]. Among these approaches, the Bayesian neural network (BNN) has shown particularly strong potential for fission yield evaluation [39, 55]. The BNN rigorously quantifies uncertainties through the probabilistic treatment of model parameters [56], incorporates prior physical knowledge to enhance learning from limited datasets, and captures nonlinear features in yield distributions, such as the transition between asymmetric and symmetric fission modes [57-59]. Furthermore, its probabilistic framework ensures direct compatibility with modern nuclear data evaluation practices that require covariance data. The capability of BNN to integrate experimental uncertainties and missing data further enhances its robustness and predictive accuracy while mitigating overfitting [57-59]. This makes it a promising tool for reliable ²³²Th fission yield predictions, where experimental data are sparse.

In this study, a Bayesian neural network (BNN) framework was employed to model and predict the fission product yields of ²³²Th [39]. Guided by the findings of Ref. [60], the hyperbolic tangent (tanh) activation function was selected for its superior capability in capturing yield trends and reducing prediction errors. To balance the model complexity and accuracy, a two-hidden-layer architecture was adopted, which offers improved performance on complex fission datasets with a manageable number of trainable parameters. Therefore, a two-hidden-layer BNN architecture with tanh activation functions was adopted in this study. The network structure was further optimized by analyzing the error distributions to determine the appropriate number of neurons in each layer. In scenarios where experimental data are sparse, the inclusion of relevant physical constraints, such as the odd-even effect and isospin, has been shown to significantly enhance the predictive accuracy [61-64]. The BNN predictions incorporating the odd-even effect and isospin showed significant improvement compared to the uncorrected case. Based on the established BNN framework, this study investigated the mass distributions of IND for ²³²Th at incident neutron energies of 1 MeV and 13 MeV, and further explored the energy dependence of CUM for the selected nuclides.

The remainder of this paper is organized as follows. Section 2 introduces the theoretical framework of the BNN method. Section 3 provides detailed descriptions of the network construction and analysis of results, with Sect. 3.1 focusing on IND and Sect. 3.2 providing a comprehensive analysis of the CUM. Finally, Sect. 4 summarizes the key findings and methodological innovations of this study.

BNN model

A Bayesian neural network (BNN) is adopted in this study to predict fission product yields by combining Bayesian inference with the structure of traditional neural networks [65]. Unlike conventional neural networks, in which model parameters are fixed after training and produce a single deterministic output, BNN represents model parameters as probability distributions. This probabilistic formulation enables the BNN to provide both predictions and reliable estimates of the associated uncertainties.

The core of BNN lies in the posterior probability distribution of the model parameters,(1)where is the likelihood function that describes the probability of the observed data given the model parameters θ, and p(θ) is the prior distribution that represents prior beliefs about these parameters. The integral in the denominator serves as a normalization constant to ensure that the posterior distribution is valid. The posterior and predictive distributions are obtained via Markov chain Monte Carlo (MCMC) sampling methods, including Metropolis, Hybrid Monte Carlo, and Slice sampling, which enables rigorous uncertainty quantification of the network parameters. As the training data accumulate, the influence of the prior diminishes in favor of the likelihood.

The likelihood function is given by(2)The discrepancy between the target and predicted values is quantified by the loss function,(3)where t_i is the target value, is the network prediction for input x_i, and is the associated experimental uncertainty or noise term [66]. Minimizing during training corresponds to maximizing the likelihood. This choice of loss function is motivated by its statistical consistency with Gaussian measurement errors and its emphasis on accurately reproducing high-yield regions, such as the bimodal peaks in fission product distributions, which are of primary physical interest in this study.

In Bayesian neural networks, a feed-forward neural network is used,(4)where is the input vector consisting of l characteristics, typically including the proton number (Z), neutron number (N), and mass number (A) of the fissioning nuclei and fission fragments, and the excitation energy (E) of the compound nucleus. represents the values of the model parameters, where dji is the connection weight between the input layer and the hidden layer, indicating the influence of the input characteristics on the hidden neurons. cj is the bias of the hidden layer, which determines the baseline level when there is no contribution from the input layer to the activation function. bj is the connection weight between the hidden layer and the output layer, representing the contribution weight of each hidden neuron to the final output. a is the bias of the output layer. The nonlinear tanh activation function enables an accurate approximation of complex continuous functions. Figure 1 shows the schematic structure of the single-hidden-layer Bayesian neural network used for the fission yield evaluation.

Fig. 1

(Color online) Schematic of the single-hidden-layer Bayesian neural network for fission yield evaluation. The input layer includes the proton and neutron numbers, mass numbers of fissioning nuclei and fission fragments, and excitation energy of the compound nucleus. The hidden layer contains H neurons with tanh activation function. The output layer provides the predicted fission yields. All connection weights (bj, dji) and biases (a, cj) are treated as probability distributions

Given a new input x_n, the network prediction is the expected value over the posterior,(5)Each prediction corresponding to a specific set of parameters is weighted by its posterior probability, and the integration yields the final predictive output of the BNN, which incorporates both the central estimate and uncertainty.

Results and discussion

3.1

Independent fission yield

Based on the fission yield dataset selected in this study, BNN models with both single and double hidden layers were constructed and evaluated. Comparative analysis revealed that the two-hidden-layer architecture provides better generalization performance and results in lower prediction errors than the single-hidden-layer. To further optimize the architecture and prevent overfitting from excessive complexity or underfitting from insufficient model capacity, the number of neurons per layer was systematically examined. For both simplicity and consistency, the same number of neurons was assigned to each of the two hidden layers of the network. This symmetric configuration not only ensures a balanced parameter distribution across the network but also reduces the number of hyperparameters to be tuned during model optimization. The neuron count varied from 16 to 40 in increments of four. Each configuration was trained under identical conditions using a fixed dataset and a hyperparameter set. During training, the total error in the training dataset was monitored across iterations to evaluate the convergence behavior and prediction stability. The resulting error trends are shown in Fig. 2 provides a basis for selecting the optimal network configuration used in subsequent fission yield predictions.

Fig. 2

(Color online) The variation in total training dataset error across different network architectures, as a function of training iterations. The selected networks are all two-hidden-layer networks, with the number of neurons in each layer starting from 16, increasing by 4 each time, up to a maximum of 40

Figure 2 illustrates the variation in the mean square error (MSE) during training for networks with different neuron counts per hidden layer. Under a fixed network architecture, the error gradually decreased with an increase in training iterations and stabilized as the number of iterations approached 10⁵, indicating that convergence had been reached. Within the Bayesian neural network framework, we mitigate the risk of converging to local minima by employing a substantially large number of iterations, allowing the sampling process to thoroughly explore the parameter space. At a given number of training iterations, increasing the number of neurons per hidden layer leads to a noticeable reduction in the prediction error. However, this trend began to plateau when the neuron count reached 28, beyond which the additional reduction in error became marginal. This observation suggests that increasing the model complexity beyond this point does not lead to significant performance gains and may potentially introduce the risk of overfitting. To balance the predictive accuracy and model generalization, a configuration with two hidden layers and 28 neurons per layer was selected as the optimal network architecture.

In IND modeling, the mass distribution of fission product yields is of central importance. However, experimental data directly reporting the yield as a function of the mass number are limited. To address this issue and maximize the use of the available data, the training input was designed to include the neutron number of the fissioning nuclei, proton number and mass number of the fission fragments, excitation energy of the compound nucleus, and corresponding IND of individual nuclides. The dataset used in IND includes 4,128 evaluated data points extracted from the JENDL database [11] and 112 experimental data points obtained from the EXFOR database [15]. The mass-yield data from EXFOR were employed for validation. To further enhance the predictive performance of the BNN, additional physical constraints were added. Specifically, the odd-even effect is incorporated by assigning a value of +0.2 for even-proton-number fragments and -0.2 for odd-proton-number fragments, following the methodology described in Ref. [67]. In addition, isospin symmetry is accounted for by including the third component of isospin, defined as (Z - N)/2, as an input feature of the model. Once the BNN predicts the yields of individual nuclides, the mass yield distribution is reconstructed by summing the predicted yields of nuclides with the same mass numbers. This enables the generation of independent fission product mass distributions induced by incident neutrons on thorium isotopes, even when direct experimental data on mass yields are not available.

In Fig. 3, BNN₀ denotes the baseline model without additional physical constraints; BNN_1-1 incorporates the odd-even effect; BNN_1-2 includes isospin; and BNN₂ incorporates both constraints simultaneously by introducing the two corresponding input columns (odd-even and isospin) into the network. In Fig. 3(a), corresponding to thermal neutron-induced fission of ²²⁷Th, BNN_1-1 reduces the unphysical negative values present in BNN₀ but introduces an abnormal upturn near mass number 170. Similarly, BNN_1-2 suppresses the negative values and upturn but fails to reproduce the fine structure in the light fragment region. In contrast, BNN₂ not only eliminates these unphysical features but also retains the peak structure originally predicted by BNN₀, indicating that the simultaneous inclusion of both constraints results in a more precise representation of the fission mechanism and improves the model predictive accuracy. In Fig. 3(b), for thermal neutron-induced fission of ²²⁹Th, BNN_1-1 and BNN_1-2 exhibit similar behaviors as in Fig. 3(a), whereas BNN₂ successfully reproduces the peak structure of the heavy fragment for the first time, which aligns well with the evaluated data. This suggests that the formation of heavy fission fragments is significantly influenced by the combined effects of the odd-even pattern and isospin. These results underscore the necessity of jointly incorporating both constraints to advance the theoretical modeling of the production of heavy fission fragments. In Fig. 3(c), which shows neutron-induced fission of ²³²Th at 0.5 MeV, BNN₀ exhibits an abnormal upturn around mass number 70 and significantly underestimates the yields in the 138–144 mass range relative to the evaluated data. BNN_1-1 corrects the heavy fragment underestimation, whereas BNN_1-2 mitigates the abnormal upturn and reduces negative yields. However, only BNN₂ resolves all these deficiencies simultaneously, demonstrating the best overall performance in reproducing the fission fragment mass yield distribution. In Fig. 3(d), corresponding to 14 MeV neutron-induced fission of ²³²Th, BNN₂ provides the most accurate description of the symmetric fission region and matches the evaluated data well in the light fragment region (mass 80-100), outperforming the other models in both respects. Across all four reaction systems, compared to BNN₀, BNN_1-1 results in a slightly narrower confidence band, whereas BNN_1-2 shows a modest increase in uncertainty. In contrast, BNN₂ yields a substantial reduction in the confidence band width, indicating significantly improved predictive reliability. This reduction in uncertainty further suggests that the combination of both physical constraints enhances model generalizability. In summary, while the odd-even effect and isospin each contribute to improving the BNN model performance, they have individual limitations. Their simultaneous incorporation into BNN₂ lead to the most substantial improvement in both yield prediction accuracy and uncertainty reduction, closely aligning with the evaluated data and demonstrating the robustness of the combined-constraint model. Despite the incorporation of additional physical constraints, Y_BNN still exhibited unphysical negative values. Reference [60] demonstrated that applying a penalty on negative values improves both the representation of yield peak structures and the overall predictive accuracy. This penalty mechanism will be considered in future network design.

Fig. 3

(Color online) Comparison of independent fission product yield distributions as a function of mass number for thorium isotopes induced by neutrons of different energies, based on various data sources and BNN model predictions. Panels (a-1) to (a-4) show the mass yield distributions for the thermal neutron-induced fission of ²²⁷Th. Panels (b-1) to (b-4) correspond to the thermal neutron-induced fission of ²²⁹Th. Panels (c-1) to (c-4) correspond to the fission of ²³²Th induced by 0.5 MeV neutrons, and panels (d-1) to (d-4) correspond to the fission of ²³²Th induced by 14 MeV neutrons. BNN₀ denotes the predictions that do not incorporate additional physical constraints. BNN_1-1 includes an odd-even effect in the training data. BNN_1-2 incorporates isospin dependence, while BNN₂ includes both the odd-even effect and isospin dependence

Figure 4 shows the predicted fission fragment mass distributions for ²³²Th induced by 1 MeV and 13 MeV neutrons, with both energy points excluded from the training data. are compared with and the Y_GEF [17]. In Fig. 4(a), BNN₂ eliminates the unphysical negative yields observed in BNN₀ around mass numbers A = 150–160. Moreover, in the mass range A = 90–100, shows an improved agreement with Y_GEF. In Fig. 4(b), also suppresses the negative values in the mass range A = 100–120 and produces significantly closer to Y_GEF in the A = 120–140 region. In both panels, the confidence intervals (confidence intervals) of are noticeably narrower than those of , indicating reduced predictive uncertainty and improved generalization performance. These results suggest that the simultaneous inclusion of the odd-even effect and isospin enhances the physical realism of the model and underscores the importance of incorporating both effects in fission yield modeling.

Fig. 4

(Color online) Mass distributions of independent fission yields from ²³²Th induced by 1 MeV (a) and 13 MeV (b) neutrons. The results from BNN₀ without additional physical constraints, and BNN₂ including the odd-even effect and isospin, were compared with the independent yield data from GEF. Shaded bands represent 95% confidence intervals for the BNN predictions

3.2

Cumulative fission yield

The training dataset includes both the proton and neutron numbers of the fissioning nuclei and fission fragments, excitation energy of the compound nucleus, and cumulative fission yields. It comprises 4128 evaluated data points from the JENDL database [11] and 1,258 experimental data points from the EXFOR database [15]. Based on the collected cumulative fission yield data, various BNN architectures, including single-hidden-layer and two-hidden-layer networks with varying numbers of neurons per layer, were systematically evaluated. Optimal performance was achieved using a two-hidden-layer architecture with 20 neurons per layer, which was adopted as the baseline model. Four configurations were tested: (1) without additional physical constraints (BNN₀), (2) with only the odd-even effect (BNN_1-1), (3) with only the isospin effect (BNN_1-2), and (4) with both physical constraints simultaneously (BNN₂). The implementation of these physical constraints follows the same approach as that in IND [67]. Comparative analysis against the evaluated data shows that the BNN₂ model, which incorporates both effects, achieves the best predictive accuracy and generalization capability among all the models. Consequently, only the BNN₂ model was retained for the subsequent analysis.

Figure 5 compares the evaluated cumulative fission yields (Y_JENDL) with the predictions of the BNN without additional physical constraints () and the physically constrained model () for four representative reactions. In the thermal neutron-induced fission of thorium isotopes (Fig. 5(a) and (b)), the exhibits a systematic overestimation compared to Y_JENDL, particularly in low-yield (blue) regions, and fails to reproduce the characteristic yield peaks. In contrast, BNN₂ provides a much improved prediction, especially for light fragments, where closely matches Y_JENDL. In particular, two pronounced peaks (red spots) corresponding to ¹³⁷Ba and ¹⁴⁰Ce appear in both and Y_JENDL, demonstrating the model’s ability to capture the key heavy fragment yields. In Fig. 5(c), corresponding to a different actinide target, BNN₀ again overpredicts the yields in the low-value regions. This trend of overestimation was significantly mitigated in BNN₂, which was in better agreement with the evaluated distribution. Figure 5(d), corresponding to fission at 14 MeV incident neutron energy, shows that BNN₂ substantially improves the yield predictions in the symmetric fission region, aligning more closely with Y_JENDL than BNN₀. In general, across the four reactions, demonstrated significantly better agreement with Y_JENDL than with , particularly in reproducing the heavy fragment yield peaks. These results demonstrate that incorporating the odd-even effect and isospin enhances the physical reliability and the predictive precision of the model. The mean squared error was reduced from 0.565 to 0.364, further confirming the effectiveness of the added physical constraints.

Fig. 5

(Color online) Comparisons of CUM distributions for thorium isotopes induced by neutrons at different incident energies. Shown are results from JENDL evaluated data (Y_JENDL), BNN predictions without additional physical constraints (), and BNN predictions including odd-even effect and isospin (). Panels (a) correspond to thermal neutron-induced fission of ²²⁷Th, (b) to thermal neutron-induced fission of ²²⁹Th, (c) to 0.5 MeV neutron-induced fission of ²³²Th, and (d) to 14 MeV neutron-induced fission of ²³²Th

Figure 6 presents the cumulative fission yields for heavy fragments with mass numbers A =134-144 from four neutron-induced Th fission reactions at different neutron energies. As fission products reach stability primarily through β-decay, these chain yields reflect the population of the final stable or long-lived nuclides along the decay chains. The evaluated data (Y_JENDL) show a clear odd-even staggering, with yields at odd mass numbers being notably higher than those of neighboring even-A isotopes. In addition, a pronounced enhancement was observed at A=140, corresponding to the production of ¹⁴⁰Ce, which is a β-stable nuclide with a magic neutron number (N=82) and one of the yield peaks identified in Fig. 5. BNN predictions without physical constraints () show a smoother trend, failing to capture the odd-even structure and underestimating the peak at A=140. In contrast, the constrained model (BNN₂), which incorporates both the odd-even effect and isospin, reproduces the evaluated pattern well (including the odd-even staggering and sharp rise at A=140). These results demonstrate that integrating physical features into the BNN framework significantly improves its ability to capture decay chain effects and local nuclear structure signatures. Thus, the BNN₂ model offers a more accurate and physically interpretable prediction of the cumulative yields of fission products.

Fig. 6

(Color online) Comparisons of chain yields for fission products with mass numbers from 134 to 144. The evaluated results from JENDL (Y_JENDL), BNN predictions without additional physical constraints (), and predictions including the odd-even effect and isospin () are shown. Panels (a) and (b) correspond to thermal neutron-induced fission of ²²⁷Th and ²²⁹Th, respectively, while (c) and (d) correspond to 0.5 MeV and 14 MeV neutron-induced fission of ²³²Th

As shown in Fig. 6, the BNN₂ model more accurately captures the mass-dependent features of the fission product yields, particularly the odd-even staggering and the notable deviation around A = 140. To quantitatively assess the improvement from the incorporation of physical constraints, Figure 7 compares the ratios of the model predictions to the JENDL evaluations of the four neutron-induced fission reactions in thorium isotopes. The results show that the yield ratios predicted by BNN₂ () are consistently closer to unity than those predicted by the original BNN₀ model (), indicating a significant enhancement in the prediction accuracy. This improvement was particularly evident for several mass numbers. For instance, at A = 137, the consistently aligns more closely with the Y_JENDL in all four cases, corresponding to the enhanced production of ¹³⁷Ba observed in Fig. 5. Around A = 140, the are nearly identical to the Y_JENDL in Fig. 7(a) and (d), and also exhibit notable improvements over the in Fig. 7(b) and (c), respectively. However, at A = 136, both and show notable deviations from Y_JENDL, suggesting that additional physical effects may influence the yields in this region and warrant further investigation. In summary, the overall trend of being consistently closer to unity than clearly demonstrates the benefit of incorporating the odd-even effect and isospin into the BNN framework. These physical enhancements significantly improved the model’s ability to reproduce the evaluated cumulative fission yields for thorium-induced reactions, emphasizing their indispensable role in refining fission yield predictions.

Fig. 7

(Color online) The yield ratios of fission products with mass numbers from 134 to 144, induced by neutrons of different energies, for thorium isotopes. The black squares represent the ratios of the initial BNN predictions without additional physical constraints () to the evaluated data (Y_JENDL). The blue circles represent the ratios of BNN predictions incorporating the odd-even effect and isospin () to the evaluated data (Y_JENDL)

Figure 8 presents the cumulative fission yields of four representative fission products—⁹⁵Zr, ⁹⁹Mo, ¹³²Te, and ¹³¹I as a function of the incident neutron energy from 0 to 14 MeV. These nuclides were selected because of their significance in nuclear applications, environmental safety, and human health. Specifically, ⁹⁵Zr is a precursor of ⁹⁵Nb, which is a widely used redox indicator in reactor monitoring. ⁹⁹Mo, with a high decay branching ratio (88%) to ^99mTc, plays a critical role in nuclear medicine. ¹³²Te promotes corrosion, shortening the service life of structural materials, whereas ¹³¹I is a radiotoxic nuclide with substantial biological impact. For ⁹⁵Zr (Fig. 8(a)), the yield initially increases, then gradually decreases at higher neutron energies (4–14 MeV). The BNN₀ model consistently underestimates the experimental data, whereas BNN₂ offers more accurate predictions across the energy range, with only a slight overestimation at 14 MeV. In the case of ⁹⁹Mo (Fig. 8(b)), the yield remains nearly constant from 2 to 12 MeV, with a small drop at 14 MeV. The BNN₂ predictions accurately captured this behavior and corrected the underestimation observed in BNN₀ at thermal energy (0.0253 eV). For ¹³²Te (Fig. 8(c)), the yield shows a pronounced increase at low energies, followed by a gradual decline beyond 8 MeV. The BNN₂ model matches the experimental trends more closely than BNN₀, particularly in the 0–4 MeV region. As shown in Fig. 8(d), the yield of ¹³¹I rises with increasing neutron energy. The predictions of BNN₂ align well with the experimental and evaluated data above 5 MeV and at thermal energy; however, both BNN predictions slightly overestimate the experimental results between 2 and 4 MeV. Overall, all experimental points fell within the 95% confidence intervals of the BNN predictions. Notably, the confidence bands associated with BNN₂ are narrower than those of BNN₀, demonstrating that incorporating odd-even effects and isospin corrections not only improves the predictive accuracy but also reduces the model uncertainty.

Fig. 8

(Color online) Cumulative fission yields of ⁹⁵Zr, ⁹⁹Mo, ¹³²Te, and ¹³¹I as a function of incident neutron energy from 0 to 14 MeV. The experimental data are shown as black dots with error bars, while the blue squares denote the evaluated yields from the JENDL. The dashed black lines represent BNN predictions without additional physical constraints (), and the dashed red lines show the results incorporating the odd-even effect and isospin dependence (). Shaded areas correspond to the 95% confidence intervals for each prediction

Summary

In this study, a Bayesian neural network (BNN) was employed to evaluate both independent (IND) and cumulative fission product yields (CUM), with the goal of improving the predictive accuracy beyond the limitations posed by sparse and inconsistent experimental data available only at a few neutron energies (thermal, 0.5 MeV, and 14 MeV). In the evaluation of IND, the input features were optimized by incorporating the proton and mass numbers of the fission fragments, along with their corresponding yields. This enabled the use of a broader set of experimental data for model training. A two-hidden-layer architecture with 28 neurons per layer was adopted based on the error minimization analysis. The incorporation of the odd-even effect and isospin significantly enhanced the predictive performance of the model. When both physical constraints were applied simultaneously, the model effectively suppressed unphysical negative yields, mitigated abnormal upturns, and reproduced the fine structures in the heavy fragment region. The uncertainty of the prediction, as reflected in the 95% confidence intervals, was also substantially reduced. For CUM evaluation, a two-hidden-layer architecture with 20 neurons per layer was selected after comprehensive testing of various configurations, including single-hidden-layer and two-hidden-layer networks with different neuron counts. Comparative studies were performed, including single-nuclide yield comparisons, analysis of odd-even staggering in mass yields (particularly in the A=134–144 region), and yield ratio analysis. These results demonstrate that the simultaneous incorporation of odd-even effects and isospin constraints significantly improved the reliability of the yield predictions. The model more accurately captured the features of stable nuclide production and successfully reproduced the anomalous structure near A=140. Overall, our study demonstrates that combining physical constraints, particularly the odd-even effect and isospin, significantly enhances the performance and robustness of BNN-based fission yield evaluations. These findings suggest that such physical features should be systematically incorporated into future data-driven models and theoretical approaches to improve the accuracy of fission yield predictions.

References

J.H. Hamilton, S. Hofmann, Y. Oganessian,

Search for superheavy nuclei

. Annu. Rev. Nucl. Part. Sci. 63, 383-405 (2013). https://doi.org/10.1146/annurev-nucl-102912-144535