Enhancing reliability in photonuclear cross-section fitting with Bayesian neural networks

NUCLEAR PHYSICS AND INTERDISCIPLINARY RESEARCH

Enhancing reliability in photonuclear cross-section fitting with Bayesian neural networks

Qian-Kun Sun ，

Yue Zhang，

Zi-Rui Hao，

Hong-Wei Wang，

Gong-Tao Fan，

Hang-Hua Xu，

Long-Xiang Liu，

Sheng Jin，

Yu-Xuan Yang，

Kai-Jie Chen，

Zhen-Wei Wang

Nuclear Science and Techniques

Vol.36, No.3

Article number 52

Published in print Mar 2025

Available online 13 Feb 2025

DOI：10.1007/s41365-024-01611-1

CSTR：32136.14.NST.2025.0352

166407

This study investigates photonuclear reaction (γ,n) cross-sections using Bayesian neural network (BNN) analysis. After determining the optimal network architecture, which features two hidden layers, each with 50 hidden nodes, training was conducted for 30,000 iterations to ensure comprehensive data capture. By analyzing the distribution of absolute errors positively correlated with the cross-section for the isotope ¹⁵⁹Tb, as well as the relative errors unrelated to the cross-section, we confirmed that the network effectively captured the data features without overfitting. Comparison with the TENDL-2021 Database demonstrated the BNN’s reliability in fitting photonuclear cross-sections with lower average errors. The predictions for nuclei with single and double giant dipole resonance peak cross-sections, the accurate determination of the photoneutron reaction threshold in the low-energy region, and the precise description of trends in the high-energy cross-sections further demonstrate the network’s generalization ability on the validation set. This can be attributed to the consistency of the training data. By using consistent training sets from different laboratories, Bayesian neural networks can predict nearby unknown cross-sections based on existing laboratory data, thereby estimating the potential differences between other laboratories’ existing data and their own measurement results. Experimental measurements of photonuclear reactions on the newly constructed SLEGS beamline will contribute to clarifying the differences in cross-sections within the existing data.

Photoneutron reactionBayesian neural networkMachine learningGamma sourceSLEGS

Introduction

Nuclear information is extracted by a photonuclear reaction using 1–30 MeV γ-rays. The photoneutron reaction is an important photonuclear reaction in which the excited atomic nucleus emits one or more neutrons. The measurement of the photoneutron reaction (γ, Xn) cross-section is closely related to the Giant Dipole Resonance(GDR) structure (i.e., resonance energy, width, and contribution of the GDR to the energy-weighted sum rule (EWSR)). According to the Brink-Axel hypothesis [1, 2], the γ-ray strength function obtained from photoneutron cross section is a crucial parameter for calculating the neutron capture cross section, which is relevant to the nucleosynthesis of elements heavier than iron in nuclear astrophysics.

Experimental photonuclear reaction data have been obtained from various types of measurements, including bremsstrahlung and quasi-monoenergetic photons from positron annihilation in flight and, more recently, from Laser Compton Scattering (LCS). Initially, bremsstrahlung beams were developed, and these photon beams were constructed in Russia, Canada, Australia, and Germany [3]. In the 1960’s, Lawrence Livermore National Laboratory (LLNL, USA) [4] and Centre d′Études Nucléaires de Saclay (France) [5] base on quasi-monochromatic beams using the positron annihilation in flight technique measure abundant photonuclear reaction data [6]. After the development of new-generation synchrotron radiation facilities and high-power lasers, quasi-monoenergetic γ-ray beams were produced in collisions of laser photons with relativistic electrons, which is referred to as Laser Compton Scattering (LCS) [7, 8]. Now SPring-8 LEPS/LEPS-2(Japan) [9], NewSUBARU BL01(Japan) [10-12], UVSOR-III BL1U(Japan), and HIγS(USA)[13] are running, and the Shanghai Laser Electron Gamma Source(SLEGS) has been constructed in Shanghai Synchrotron Radiation Facility(SSRF) in December 2021 [14-20]. The Variable Energy Gamma (VEGA) System at Extreme Light Infrastructure Nuclear Physics (ELI-NP, Romania) [21] and the very compact inverse-Compton scattering gamma ray source (VIGAS) at Tsinghua University [22] will contribute to the future. All these facilities already or will soon measure vast amounts of photoneutron reaction experimental data or photonuclear applications, anticipated in the foreseeable future. The key and challenge of extracting physical information from these data lies in the fact that results measured by different laboratories may vary significantly. For example, ¹⁹⁷Au is generally considered to accurately measure (γ,n) reaction cross-section, but there are still differences in the data provided by different laboratories. For ⁵⁹Co, different laboratories cannot agree on the single-peak and double-peak structures [3]. Therefore, when we seek the physical laws of GDR, the systematic errors arising from different γ-ray sources, detectors, and analysis methods in various laboratories cannot be ignored.

The experimental GDR data were fitted with several empirical Lorentzian functions [23], e.g. the Standard Lorentzian model(SLO) [1, 2], the Enhanced Generalized Lorentzian model(EGLO) [24], the Generalized Fermi-Liquid model(GFL) [25], the General Hybrid model(GH) [26], the modified Lorentzian approach(MLO1/MLO2/MLO3) [27], and its simplified version SMLO [28]. For this phenomenological approach, it is necessary to classify the number of GDR peaks in the nucleus using two sets of parameters to describe double GDR peak nuclei. On the other hand, the current micro-theoretical models for nuclear excitation mainly consist of two types, Random Phase Approximation model within the framework of density functional theory(QRPA) [29] and Configuration interaction shell model [30]. Calculations using both approaches require a significant amount of numerical computational resources. Both the phenomenological and microscopic methods mentioned above were used to describe the overall properties of the photoneutron reactions. A nuclear reaction model is required for studying a specific reaction channel. For instance, numerous nuclear reaction codes have been developed based on the Hauser–Festival statistical model such as the well-known NON-SMOKER [31] and TALYS [32], which are widely utilized in nuclear data and astrophysics research. In this study, data from the TALYS-based Evaluated Nuclear Data Library (TENDL-2021) [33] were used for comparison.

Recently, machine learning has been successfully applied to numerous nuclear physics issues. In particular, following the victory of AlphaGo over humans in 2016, a plethora of studies have emerged at the intersection of machine learning and nuclear physics [34-38]. Machine learning and neural networks have been successfully used to study nuclear structure, such as nuclear masses [39-46], charge radii [47], α-decay half-lives [48], and α-clustering structures[49]. In the field of nuclear reactions, machine learning also has a wide range of applications [50-54]. In Ref. [55], the key parameters of GDR were studied based on a traditional classification neural network and two multitask learning (MTL) neural networks. The training and validation sets were divided into single and double GDR peak nuclei, and different neural networks were used for each set. A similar approach was employed to study the total (γ, Xn) photoneutron yield cross-section based on the Lorentzian function-based BNN (LBNN) [52]. These studies describe the photoneutron production cross-section; however, there are many reaction channels in photoneutron reactions, and the ability to accurately describe each reaction channel is significant for photoneutron experiments. Among these reaction channels, the (γ, n) photoneutron reaction channel constitutes the main part of the (γ, Xn) photoneutron reactions and is one of the more accurately measured channels. Therefore, an accurate description of the (γ, n) photoneutron reaction is essential.

In this study, we employed a suitable Bayesian neural network structure with two hidden layers, each containing 50 hidden nodes, to describe the (γ, n) photoneutron cross-section. The training set consisted of consistent experimental data on photonuclear cross-sections from the EXFOR Database [56]. First, by analyzing the absolute and relative errors compared with the TENDL-2021 database and experimental values, our Bayesian neural network demonstrated reliability. Subsequently, predictions were made for three single GDR peak nuclei (¹²⁷I, ¹⁹⁷Au, ²⁰⁷Pb) and two double GDR peak nuclei (⁵⁹Co and ¹⁶⁵Ho). The results showed good agreement with the experimental values, with both double GDR peak nuclei exhibiting double GDR peak structures. Finally, we compared the prediction results for ¹²⁷I, ¹⁶⁵Ho, and ¹⁹⁷Au using Saclay and LLNL as training sets. We believe that selecting a consistent training set is crucial when employing machine learning approaches to study photonuclear reaction cross-sections.

This study relied primarily on standard Bayesian neural networks trained on a consistent experimental dataset to enhance the predictive capabilities of the model. The remainder of this paper is organized as follows: Sect. 2 describes the mathematical principles of Bayesian neural networks, Sect. 3 presents the details of model training and prediction, and Sect. 4 summarizes the findings.

BNN model

We employed a Bayesian neural network(BNN) for machine learning to predict the (γ, n) cross-section. The BNN sets the parameters of the neural network θ as probability distributions, treats the network parameters as prior distributions $P (θ)$ , and utilizes Bayesian statistical relations to learn and obtain posterior distributions $P (θ | D)$ [57], $P (θ | D) = \frac{P (D | θ) P (θ)}{P (D)},$ (1) where the input data xn and output data σn contribute to the data $D = (x_{n}, σ_{n})$ . The standard practice for the likelihood function is to assume the form of a Gaussian distribution based on an objective function obtained from a least-squares ﬁt of the empirical data. Likelihood function is, $P (D | θ) = exp (- χ^{2} (θ) / 2),$ (2) here, the objective function is $χ^{2} = \sum_{n}^{N} \frac{{[y_{n} - σ (x_{n}, θ)]}^{2}}{Δ t_{i}^{2} + Δ σ_{n}^{2}},$ (3) where N is the amount of empirical data, $Δ t_{i}^{2}$ is hyperparameters of noise; $Δ σ_{n}$ is the experimental uncertainty. The BNN output functions $σ (θ)$ of photoneutron (γ, n) are written according to the network parameters θ: $σ (x_{n}, θ) = a + \sum_{j = 1}^{H} b_{j} tanh (c_{j} + \sum_{i = 1}^{I} d_{j i} x_{i}),$ (4) where θ = a, bj, cj, dji, hidden node number of the hidden layer H, input variable number I, and the activation function is set as the tanh type.

Owing to the high dimensionality of the parameters, computing the posterior distribution P(θ| D) is difficult. Variational inference is employed to approximate P(θ| D). Variational inference aims to determine κ such that the distribution q(θ | κ) minimizes the Kullback-Leibler (KL) divergence, $\begin{matrix} θ = arg min KL [q (θ | κ) | | P (θ | D)] \\ = arg min E_{q (θ | κ)} [ln \frac{q (θ | κ)}{P (θ | D)}] \\ = arg min E_{q (θ | κ)} [ln \frac{q (θ | κ) P (D)}{P (D | θ) P (θ)}] \\ = arg min \sum_{k} [ln q (θ | κ) - ln P (θ^{(k)}) - ln P (D | θ^{(k)})] . \end{matrix}$ (5) When employing machine-learning techniques to study experimental data in the past, it was common to use evaluation databases or aggregate data from different laboratories. This practice may have obscured systematic biases in the experimental data, making it difficult to reconstruct the true experimental results. To ensure data consistency, we selected experimental data from LLNL and the Centre d′Études Nucléaires de Saclay (France), which are available in the EXFOR Database[56]. In Table 1, we have included all experimental data from LLNL as the training and validation sets, removing light nuclei with proton or neutron numbers less than or equal to 20 due to the presence of more complex structures in the light nuclei region. After obtaining the data, we excluded data points where the cross-sectional central value was less than 0.1 mb. During training, we processed the data using min-max normalization, specifically defined as $x_{norm} = (x - x_{min}) / (x_{max} - x_{min})$ . Our training set consisted of 40 nuclides, while the validation set included 5 nuclides, resulting in a low ratio of 8:1 for the training and validation sets. The input data types included the number of charges Z, the number of neutrons N, the mass A, and the energy ϵ of the injected γ ray.

The training and validation set, all experimental data were taken from the LLNL or Saclay, available in EXFOR Database[56]

Lab	Author	Training set	Validation set
LLNL	Alvarez	⁵⁵Mn	⁵⁹Co
	Berman	⁷⁵As, ⁸⁹Y, ^90,91,92,94Zr, ¹⁰⁷Ag, ¹³³Cs, ¹³⁸Ba, ¹⁴¹Pr,	¹²⁷I, ¹⁶⁵Ho,¹⁹⁷Au
		¹⁵³Eu, ¹⁶⁰Gd, ¹⁸⁶W, ^{186,188,189,190,192}Os, ²³⁹Pu,
	Bramblett	¹⁵⁹Tb, ¹⁸¹Ta,	¹²⁷I
	Caldwell	²³²Th, ^235,238U
	Harvey	^206,208Pb, ²⁰⁹Bi	²⁰⁷Pb
	Fultz	⁵¹V, ^58,60Ni, ^63,65Cu, ¹¹⁵In, ^{116,117,118,119,120,124}Sn	⁵⁹Co, ¹⁹⁷Au
Saclay	Veyssiere	⁵¹V, ²⁰⁸Pb, ²³²Th, ²³⁸U	¹⁹⁷Au
	Carlos	⁶⁴Zn, ^70,72,74,76Ge, ⁷⁵As, ^76,78,80,82Se,
		^{142,143,144,145,146,148,150}Nd, ^{144,148,150,152,154}Sm
	Lepretre	⁸⁹Y, ⁹⁰Zr, ⁹³Nb, ¹⁰³Rh, ¹¹⁵In, ¹³³Cs, ^140,142Ce,
		^{116,117,118,120,124}Sn, ^{124,126,128,130}Te
	Beil	^{92,94,96,98,100}Mo
	Bergere	¹³⁹La, ¹⁵⁹Tb, ¹⁸¹Ta,	¹²⁷I, ¹⁶⁵Ho

Results and discussions

To balance data complexity and computational efficiency, we selected a neural network structure based on the loss function. In Fig. 1, two hidden layers were defined and experiments were conducted with each hidden layer having 10, 30, 50, 100, and 300 hidden nodes. All the models were trained on an NVIDIA RTX 5000 Ada GPU with training times of 85, 186, 332, 503, and 1251 s. When the number of hidden nodes was relatively small, the loss function tended to plateau with an increasing number of training iterations before continuing to decrease. When the number of training iterations exceeded 20,000, the loss functions of the neural networks with different numbers of hidden nodes tended to converge. Thus, we used 30,000 iterations with 50 nodes per hidden layer to ensure thorough learning.

Fig. 1

(Color online) Comparison of the loss function (Mean Squared Error, MSE) deviation for both layers with 10, 30, 50, 100 and 300 hidden nodes. Shown in first + second notes number, respectively

To ensure that the neural network is not overfitted, absolute and relative errors are introduced: $\begin{array}{l} δ_{a} = \frac{1}{n} \sum_{i = 1}^{n} δ (x_{n}) \\ = \frac{1}{n} \sum_{i = 1}^{n} | σ_{BNN} (x_{n}) - σ_{exp} (x_{n}) | - δ_{exp} (x_{n}), \end{array}$ (6) where $σ_{BNN} (x_{n})$ are the BNN predicted values, $σ_{exp} (x_{n})$ and δ_exp(xn) are the experimental values and errors from LLNL, respectively. In Eq.6, when the absolute difference between the predicted and experimental values $| σ_{BNN} (x_{i}) - σ_{exp} (x_{i}) |$ is less than the experimental error δ_exp(xi). Here, δ(xi) was set to zero. Similarly, the relative error can be expressed as $\begin{matrix} δ_{r} = \frac{1}{n} \sum_{i = 1}^{n} \frac{δ (x_{n})}{σ_{exp} (x_{n})} \\ = \frac{1}{n} \sum_{i = 1}^{n} \frac{| σ_{BNN} (x_{n}) - σ_{exp} (x_{n}) | - δ_{exp} (x_{n})}{σ_{exp} (x_{n})} . \end{matrix}$ (7)

A comparison between the BNN predicted values (red curve) and the experimental values (black squares) for the isotope ¹⁵⁹Tb in the training set is shown in Fig. 2. The neural network accurately predicted the (γ,n) photoneutron cross-section of ¹⁵⁹Tb and the energies corresponding to the two GDR peaks. In Fig. 2(a), the red dots represent the absolute errors between the predicted values of ¹⁵⁹Tb from the BNN and the experimental values. In Fig. 2(b), the blue dots represent the corresponding relative errors. It can be observed that the absolute error shows a clear positive correlation with the experimental values, while the relative error does not exhibit a significant positive correlation with the experimental values. This suggests that, although the absolute error increases with the experimental values–indicating the neural network’s ability to capture the magnitude of deviations from the experimental results–the relative error does not show a similar correlation. This lack of correlation implies that the performance of the neural network was consistent across different ranges of experimental values, supporting the conclusion that the neural network training did not lead to overfitting.

Fig. 2

(Color online) Comparison of the BNN predictions (red curve) and experimental values (black squares) of ¹⁵⁹Tb for the training set, along with the absolute(a) and relative errors(b) of each point

In past studies of photonuclear reaction cross sections, experimental values from different laboratories or evaluation databases were typically aggregated. The data were then divided into two training sets: one for single GDR peak nuclei and the other for double GDR peak nuclei, facilitating improved learning[55] or similar processing, e.g., employing empirical formulas to control the values of hidden nodes in the output layer[52], to achieve better prediction results for the validation set Without separating these two types of nuclei, the predicted atomic nuclei would exhibit only a single GDR peak. In the present study, the model training did not distinguish between single and double GDR peak nuclei. Various devices used for photonuclear physics experiments employ different γ-ray sources, detector models, and data analysis methods, which can lead to significant discrepancies when measuring the photoneutron yield cross-section of the same nucleus. Notable data discrepancies were observed for 19 nuclei investigated between the LLNL and Centre d’Etudes Nucleaires of Saclay. If the training set consists of experimental values selected from multiple laboratories, systematic errors from these facilities may obscure structural information within the data. Consequently, this study utilized a training set derived exclusively from LLNL experimental data to ensure consistency.

Figure 3 illustrates a macroscopic comparison of the single nuclear mean absolute and mean relative errors on the training set of the BNN, along with the computed results from the TALYS-based Evaluated Nuclear Data Library (TENDL-2021) [33]. Figure 3 depicts the mean absolute and mean relative errors between the BNN’s predictions and experimental values from LLNL, as well as those between TENDL-2021 data and the same experimental values. Comparing Fig. 3 (a), 3 (c), the mean absolute errors for single nuclei between the BNN’s predictions and experimental values show a uniform distribution. The BNN’s minimum single nuclear absolute error is 1.7 mb, which is higher than the 1 mb reported in TENDL-2021. The overall mean absolute error for single nuclei between the BNN’s predictions and experimental values is 7.14 mb, smaller than the 13.56 mb reported in TENDL-2021. The distribution and minimum values of the single-nuclear mean relative errors are shown in Fig. 3 (b) and (d), which lead to conclusions similar to those drawn from the distribution of single nuclear mean absolute errors. Meanwhile, the BNN’s minimum single nuclear relative error was 0.046, higher than the 0.034 in TENDL-2021. In contrast, the overall mean relative error of a single nucleus between TENDL and the experimental values is 0.209, which is smaller than the 0.252 for the BNN. Based on the above comparison, it is evident that the BNN demonstrates good predictive ability for the training set.

Fig. 3

(Color online) Comparison between the Bayesian neural network’s single nuclear mean absolute and mean relative errors on the training set and the computed results from the TALYS-based Evaluated Nuclear Data Library (TENDL-2021)[33]. The values in (a) and (c) have been normalized according to their respective maximum values; the same processing has also been applied to (b) and (d)

In Fig. 4, the (γ, n) cross-section data for ¹²⁷I, ¹⁹⁷Au, and ²⁰⁷Pb are predicted and compared with experimental values and TENDL-2021 data. Notably, in the experimental measurements, all three nuclei exhibit only one GDR peak. The red curve represents the mean obtained by sampling the posterior distribution of the trained neural network using the Markov Chain Monte Carlo method, with the shaded region indicating a 99% confidence interval. The narrowness of these shaded confidence intervals demonstrates the high accuracy of the BNN in predicting these data. The TENDL-2021 database used for comparison employed linear interpolation to obtain its data. In Fig. 4(a), the comparison for isotope ¹²⁷I is presented. Here, the GDR peak position and width of the (γ, n) photonuclear cross-section predicted by the BNN align well with the experimental values, though the maximum cross-section value is slightly higher than the experimental result. The Giant Dipole Resonance (GDR) is associated with the collective excitation of the nucleus. Additionally, the onset of the photoneutron reaction, marked by the one-neutron separation energy S_1n, is crucial in calculating r-process observables. The quality of the high-energy tail description depends on distinguishing between one-neutron (γ, n), two-neutron (γ, 2n), and more-neutron reaction channels experimentally. Therefore, accurately describing both low- and high-energy cross-sections is essential. The BNN’s ability to accurately describe cross-sections at low energies is promising. In the high-energy tail, BNN predictions were closer to experimental values than the TENDL-2021 database. A comparison for a similar nuclear isotope, ¹⁹⁷Au, is shown in Fig. 4(b). The BNN predictions align very well with experimental values in terms of GDR peak position, width, and maximum cross-section, whereas the TENDL-2021 database shows some deviation, particularly in width. Both low-energy and high-energy cross-sections are well described by the BNN. In Fig. 4(c), the BNN’s predictions for the ²⁰⁷Pb isotope are also shown to be highly accurate. The BNN provided an excellent description of the high-energy tail, closely matching experimental data, including the subtle bump near ϵ=17 MeV and the rise when ϵ>24 MeV. These features were not captured by the TENDL-2021 database or in prior BNN-based studies [52]. It is worth emphasizing the significance of the selected nuclei. For ²⁰⁷Pb, the training set included neighboring nuclei ²⁰⁶Pb and ²⁰⁸Pb, which likely explains why the BNN predictions for ²⁰⁷Pb closely match the experimental values. This result is encouraging for predicting cross-section data of unstable nuclei adjacent to stable ones: by measuring the cross-section data of stable nuclei, the neural network can accurately predict the cross-section data of these unstable nuclei. Additionally, ¹²⁷I and ¹⁹⁷Au, whose neighboring mass numbers were not included in the training set, were still accurately described by the BNN. This highlights the strong capability of the BNN to predict the cross-sections of single-GDR-peak nuclei, capturing the GDR shape and both the low- and high-energy tails effectively.

Fig. 4

(Color online) The single GDR peak (γ, n) cross-section data for ¹²⁷I, ¹⁹⁷Au, and ²⁰⁷Pb predicted by the Bayesian neural network (red curve) are compared with experimental data (color dots) and TENDL-2021 data (green curve))

In Fig. 5, we also present the BNN predictions for (γ, n) cross-sections for nuclei with double GDR peaks, specifically ⁵⁹Co and ¹⁶⁵Ho. Similar to the comparisons shown in Fig. 4, the TENDL-2021 database results are also included for reference. In Fig. 5(a), the BNN prediction for the photonuclear cross-section of ⁵⁹Co accurately reproduces the double GDR peak structure, with peak positions, widths, and maximum cross-section values closely matching experimental data. Additionally, the confidence interval (red shading) for the BNN prediction for ⁵⁹Co is wider, due to the proximity to the boundaries of the training set, where BNN predictions carry less certainty. The BNN also accurately predicts the onset of (γ, n) cross-sections, aligning well with experimental data. Focusing on the high-energy tail, we note that two sets of experimental data are available from Lawrence Livermore National Laboratory. The BNN prediction aligns well with the dataset measured by Alvarez [58], while the TENDL-2021 database aligns better with the dataset measured by Fultz [59]. For another double GDR peak isotope, ¹⁶⁵Ho, shown in Fig. 5(b), the BNN predictions demonstrate good agreement with experimental data in terms of peak positions and widths of the cross-section structure. However, the cross-section values for the two GDR peaks are slightly lower than the experimental values, indicating partial agreement. This may be due to limited data near ¹⁶⁵Ho in the training set, with neighboring isotopes being ¹⁶⁰Gd and ¹⁸¹Ta–the latter having a mass number difference of 16 relative to ¹⁶⁵Ho. Despite this limitation, the BNN provides a cross-sectional prediction that is still close to the experimental values for both low-energy and high-energy tails. Both ⁵⁹Co and ¹⁶⁵Ho are double GDR peak nuclei, and the BNN’s red curves effectively capture the GDR peak positions and lower cross sections in the low- and high-energy tails for these nuclei. Thus, it appears that the BNN does not need to distinguish between single and double GDR peak nuclei when predicting nuclear cross-sections, though additional consistent experimental data may be needed.

Fig. 5

(Color online) the double GDR peaks (γ, n) cross-section data for ⁵⁹Co and ¹⁶⁵Ho were predicted by the BNN(red curve), which are compared with experimental data (color dots) and the TENDL-2021 data(green curve), respectively

Data from Centred’É tudes Nucléaires de Saclay (France) using the same γ-ray source were also included in the BNN training set, as these data are abundant and similar to those from the Lawrence Livermore National Laboratory (LLNL, USA). The training sets are presented in Table 1. Three nuclides ¹²⁷I, ¹⁶⁵Ho, and ¹⁹⁷Au—measured by both Saclay and LLNL were selected as the validation set. The BNN’s reliability in predicting single- and double-GDR peak nuclei has already been validated. In this study, we focused on comparing the BNN’s sensitivity to discrepancies when learning from data generated by different laboratories. In Fig. 6, it is notable that the confidence intervals (shaded areas) provided by the BNN, trained on both datasets for these three nuclides, were very narrow. In particular, when using the Saclay training set, the shaded area was nearly imperceptible, even though the central value curve was minimized in thickness in the figure. This may be due to Saclay’s more precise measurements. In Fig. 6(a), it is evident that for ¹²⁷I, the BNN cross-section predictions based on both Saclay and LLNL data show almost no difference. This similarity arises because the cross-section measurements from the two laboratories for this mass region were consistent. The BNN results for both training sets were similar to Saclay’s data, raising questions about the maximum GDR peak value obtained from LLNL for ¹²⁷I. In Fig. 6(b), the BNN, trained with the Saclay data, also accurately predicted the double GDR peaks of ¹⁶⁵Ho. When the Saclay data were used as the training set, the energy corresponding to the first GDR peak matched Saclay’s experimental values. Similarly, when LLNL data were used as the training set, the first GDR peak energy aligned with LLNL’s experimental values. This highlights the importance of consistency in the training set to enable the Bayesian neural network to accurately predict peak positions. Specifically, if the training set contains data from different laboratories, the predicted peak positions, especially for nuclides with double GDR peaks, may deviate. Figure 6(c) shows a comparison for ¹⁹⁷Au, where the second half of the GDR peak appears considerably higher than the experimental values. This discrepancy arises because nearby nuclides in the training set, ¹⁸¹Ta and ²⁰⁸Pb, have significantly higher values than those obtained from LLNL. However, Saclay’s ¹⁹⁷Au values were more consistent with LLNL’s results.

Fig. 6

(Color online) The (γ, n) cross-section data for ¹²⁷I, ¹⁶⁵Ho, and ¹⁹⁷Au, predicted by the Bayesian neural network using two training sets: one from Lawrence Livermore National Laboratory (solid red line) and the training set from Centred’É tudes Nucléaires de Saclay (blue dashed line). Predictions are compared with experimental data (colored dots) and TENDL-2021 data (green curve)

The ratio $σ_{X}^{int} / σ_{L}^{int}$ (X=S; S,BNN) is shown in Fig. 7, where $σ_{S,BNN}^{int}$ represents the BNN predictions based on the Saclay training set, and $σ_{S}^{int}$ and $σ_{L}^{int}$ are the experimentally integrated cross-sections from Saclay and LLNL, respectively. The integrated cross-sections were obtained following the approach from [3] $σ_{X,L}^{int} = \int_{E_{min}}^{E_{max}} σ_{X, L} (ϵ) d ϵ,$ (8) where E_min is the minimum intersection point of the energy ranges from the two laboratories’ γ-ray sources, and E_max is the maximum intersection point. The average values $〈 σ_{S}^{int} / σ_{L}^{int} 〉$ and $〈 σ_{S,BNN}^{int} / σ_{L}^{int} 〉$ were 1.192 and 1.176, respectively. The (γ,n) cross-sections from Saclay are generally higher than those from LLNL, aligning with the conclusions of Ref. [60]. For the Sn isotope chain, the cross-section data from both laboratories were very similar, as shown in the figures. This consistency led the BNN, trained on either set, to predict cross-section magnitudes comparable to those for ¹²⁷I, which more closely aligns with the Saclay data. In cross-section measurements for the Sn isotope chain, both laboratories reported peak values between 270 mb and 300 mb. However, for ¹²⁷I, LLNL recorded a significant decrease to around 220 mb, while Saclay’s measurements remained close to 300 mb. Therefore, we consider Saclay’s data to be more reliable for peak values. However, regarding peak width measurements, the BNN tended to favor data from LLNL. Given the high consistency between both laboratories’ measurements for nearby nuclei, we believe the BNN-predicted values likely approximate the true values. For ¹⁹⁷Au, both laboratories produced similar measurements, with a small ratio difference; however, the BNN predicted a slightly larger ratio. Despite this, the larger ratio still reflected the trend of actual differences between the two laboratories. Thus, when trained on a consistent dataset, the BNN effectively captures both individual nuclide trends and the broader systematic features of the training set, making it a valuable tool for comparing laboratory results. Furthermore, the BNN’s inherent generalization capability allows it to predict unknown data based on existing laboratory data, estimating potential differences between its predictions and measurements from other laboratories. Due to its robust capability to capture data structures and generalize across datasets, the BNN can be effectively applied to cross-sections for neutron, proton, and α-particle reaction channels. In cases with sparse experimental data, the BNN can provide reliable predictions for unmeasured values. Conversely, in data-rich scenarios, it can assess data reliability and help identify the most accurate measurements.

Fig. 7

(Color online) The mean difference between the BNN predictions using Saclay as the training set and LLNL is marked with asterisks; the mean difference within the training set is marked in blue, and in the validation set in red. The mean difference between Saclay and LLNL data is represented by a green square

Summary

We explored photonuclear reaction cross-sections using BNN analysis. Initially, we optimized the network architecture by examining the behavior of the loss function across various hidden node counts. The optimal configuration comprises two hidden layers with 50 nodes each. Since the loss functions for networks with different hidden-node counts stabilized after 20,000 iterations, we conducted 30,000 training iterations to ensure thorough capture of information from the training set.

To evaluate training performance and detect potential overfitting, we analyzed both absolute and relative errors. In studying the isotope ¹⁵⁹Tb, the BNN predictions yielded a mean absolute error of 3.09 mb and a mean relative error of 0.169 compared to experimental data. A positive correlation was observed between the absolute error and the experimental values, demonstrating the network’s ability to accurately capture data characteristics. However, there was no significant correlation between the relative error and experimental values, indicating that overfitting was not an issue. The predicted results further support this assertion, showing that the BNN performed comparably to the experimental values in the TENDL-2021 database. The BNN had an average absolute error of 7.14 mb and an average relative error of 0.252, whereas the TENDL-2021 database yielded an average absolute error of 13.56 mb and an average relative error of 0.209. This confirms that the BNN provides a reliable fit for (γ,n) photonuclear cross-sections.

After training, the BNN made predictions for three single-peak nuclei (¹²⁷I, ¹⁹⁷Au, and ²⁰⁷Pb) and two double GDR peak nuclei (⁵⁹Co and ¹⁶⁵Ho). Comparisons with experimental values and the TENDL-2021 database demonstrated the BNN’s ability to accurately predict GDR parameters for nuclei in the validation set. Furthermore, precise predictions in low- and high-energy cross-sectional regions–where data is sparse–highlighted the BNN’s strong generalization capabilities, likely due to the training set’s consistency, which primarily consisted of experimental data from Lawrence Livermore National Laboratory.

FinallyIn a secondary analysis, we employed another training set (Saclay) to evaluate the BNN’s versatility. We found that this training set could also reliably predict the (γ,n) photonuclear cross-sections for both single- and double GDR peak nuclei. Interestingly, when using data from both laboratories in the training sets, the BNN predictions for ¹²⁷I more closely matched Saclay’s data, whereas the TENDL database aligned more closely with LLNL’s data. The training set source also influenced predictions for the first peak in double GDR peak nuclei. For example, for ¹⁶⁵Ho in the validation set, the predicted first peak tended to align with the laboratory data used for training. This analysis illustrates that BNNs, when trained consistently on datasets from different laboratories, can estimate unknown data by extrapolating from available laboratory data and can thus reveal potential discrepancies between measurements from different laboratories. In future work, we aim to extend BNN-based studies to include cross-sections for all photonuclear reaction channels using physics-informed machine learning. This approach can provide reliable guidance for precise photonuclear reaction cross-section measurements at the SLEGS beamline. Additionally, we plan to predict cross-sectional data for photonuclear reactions that cannot be measured experimentally, thereby contributing to resolving significant questions regarding element formation in nuclear astrophysics.

References

D.M. Brink, Ph.D. thesis, Oxford University, Oxford, UK (1955).

P. Axel,

Electric dipole ground-state transition width strength function. Seven MeV photon interactions

. Phys. Rev. 126, 671–683 (1962). https://doi.org/10.1103/PhysRev.126.671