1 Introduction
The accompanying work (“part I”) has presented the newly developed mathematical framework, the “nth-order feature adjoint sensitivity analysis methodology for response-coupled forward/adjoint linear systems” (abbreviated as “nth-FASAM-L”), conceived by Cacuci (2024c). This work illustrates the application of the nth-FASAM-L to a representative energy-dependent neutron-slowing down model of fundamental importance to reactor physics. The physical considerations underlying this model are presented in Section 2, which briefly reviews the concept of “contributon-flux density response” and particularizes this concept within the modeling of neutron slowing down in a mixture of materials. This physical model is of fundamental importance in nuclear reactor physics and enables the derivation of exact closed-form results for the application of the nth-FASAM-L. Section 2 also defines the “features” inherent to this model, which enable the advantageous application of the nth-FASAM-L. By definition, there are considerably fewer “feature functions” of the primary model parameters than there are primary model parameters.
Section 3 presents the first-order adjoint sensitivity analysis of the contributon flux with respect to the features and primary model parameters of the slowing-down model, comparing the application of the 1st-FASAM-L versus the first-order comprehensive adjoint sensitivity analysis methodology for response-coupled forward/adjoint linear systems (1st-CASAM-L). Using either the 1st-FASAM-L or 1st-CASAM-L involves solving the same operator equations and boundary conditions within the respective 1st-LASS but with differing source terms. For the computation of the first-order sensitivities, the 1st-FASAM-L enjoys only a slight computational advantage since it requires only one quadrature per component of the feature function, whereas the 1st-CASAM-L requires one quadrature per primary model parameter.
Section 4 presents the second-order adjoint sensitivity analysis of the contributon flux with respect to the features and primary model parameters of the slowing-down model, comparing the application of the 2nd-FASAM-L versus the 2nd-CASAM-L. It is shown that the 2nd-FASAM-L requires as many large-scale “adjoint” computations as there are non-vanishing first-order response sensitivities with respect to the components of the feature functions, whereas the 2nd-CASAM-L requires as many large-scale computations as there are non-vanishing first-order response sensitivities with respect to the primary model parameters. Hence, the 2nd-FASAM-L is inherently more efficient than the 2nd-CASAM-L. In particular, one of the three distinct second-order sensitivities with respect to the model’s features vanishes identically within the 2nd-FASAM-L but none of the ca. 100 second-order sensitivities with respect to the primary model parameters vanish within the 2nd-CASAM-L.
Section 5 presents the third-order adjoint sensitivity analysis of the contributon flux with respect to the features and primary model parameters of the slowing-down model, comparing the application of the 3rd-FASAM-L versus the 3rd-CASAM-L. For computing the exact expressions of the third-order contributon-response sensitivities, the 3rd-FASAM-L requires only two large-scale computations, whereas the 3rd-CASAM-L would require hundreds of large-scale computations.
The concluding discussion presented in Section 6 emphasizes the fact that the unparalleled efficiency of the nth-FASAM-N increases as the order of computed sensitivities increases, and the probability of encountering vanishing sensitivities is much higher when using the nth-FASAM-L rather than any other methodology. Both the nth-FASAM-L and nth-CASAM-L overcome the limitation of dimensionality in the sensitivity analysis of linear systems, being incomparably more efficient and more accurate than any other method (statistical, finite differences, etc.) for computing exact expressions of response sensitivities (of any order) with respect to the uncertain parameters, boundaries, and internal interfaces of the model.
2 Modeling the contributon flux in a paradigm neutron slowing-down model
Fundamentally important responses of linear models depend simultaneously on both the forward and adjoint state functions governing the respective linear model, which makes it necessary to treat linear models/systems in their own right since such responses cannot be treated as particular cases of responses of nonlinear models. Typical examples of such responses arise in the modeling of self-diffusion processes in which the interaction mean free path is independent of the phase-space density. Such processes are modeled by linear equations of the Lorentz–Boltzmann type, and they occur in neutron, electron, and photon transport through media, as well as in certain types of transport processes in gas or plasma dynamics. Numerically solving such time-dependent integro-differential equations, albeit linear, is representative of “large-scale” computations and will be used in the sequel for illustrating the application of the nth-FASAM-L. In particular, the distribution of neutrons in a medium is modeled by the following standard form of the linear Boltzmann equation:
where the linear integro-differential operator is defined below:
The quantities that appear in the standard notation used in Equation 2 are defined as follows:
(i) denotes the three-dimensional position vector in space; denotes the energy-independent variable; the directional vector denotes the scattering solid angle; denotes the time-independent variable; and denotes the neutron particle speed.
(ii) denotes the flux of particles (i.e., particle number density multiplied by the particle speed) in the energy range about and volume element about , with directions of motion in the solid angle element about .
(iii) denotes the rate at which particles are produced in the same element of phase space from sources that are independent of the flux.
(iv) denotes the macroscopic total cross section.
(v) denotes the macroscopic scattering transfer cross section from energy to energy and from a scattering angle through angle .
(vi) denotes the number of particles emitted isotropically per fission.
(vii) denotes the macroscopic fission cross section.
(viii) denotes the fraction of fission particles appearing in energy about from fissions in about .
The adjoint Boltzmann transport equation is formulated in the Hilbert space denoted as and is endowed with the following inner product, denoted as , between two elements and :
In the Hilbert space , the generic adjoint Boltzmann transport equation is as follows:
where the (adjoint) linear integro-differential operator is defined below:
By construction, the forward and adjoint transport equations satisfy the following relation:
where denotes the bilinear concomitant evaluated on the boundary of the phase-space domain under consideration. The “generalized reciprocity relation” expressed by Equation 6 relates the bilinear concomitant, which is a functional of the forward and adjoint fluxes at the initial and final times along the incoming and outgoing directions at the surface of the medium, to the fluxes in the interior of the medium comprising fixed sources. This reciprocity relation provides a physical interpretation of the adjoint flux as an “importance function,” which quantifies the contribution of a source to a detector and enables transport problems to be posed either in the forward or adjoint descriptions. These reciprocity relations also restrict the combination of forward and adjoint boundary conditions to those that ensure both the forward and adjoint formulations are mathematically “well posed.” The reciprocity relation expressed by Equation 6 is extensively used in the so-called “source-detector” problems in steady-state subcritical systems, where models the detector properties (cross section) in the sub-region occupied by the respective detector.
When the boundary conditions for Equation 1 are homogeneous and there is no external source, i.e., when , the stationary neutron transport problem becomes an eigenvalue problem. The largest (i.e., fundamental) eigenvalue in such a case is called the “effective multiplication factor” and, depending on its value, corresponds to a critical, subcritical, or supercritical physical system (e.g., nuclear reactor). This eigenvalue (multiplication factor) is an important system (model) response, and its mathematical expression is a functional (“Raleigh quotient”) of the forward and the adjoint fluxes. Additional important model responses that are functionals of both the forward and adjoint fluxes include the reactivity, generation time, and lifetime of the system, along with several other Lagrangian functionals used in variational principles for developing efficient Raleigh–Ritz type numerical methods (see, e.g., Lewins, 1965; Stacey, 1974; Stacey, 2001). Perhaps the simplest quantity that depends on both the forward and adjoint fluxes—and has important applications in particle transport (particularly in particle shielding)—is the so-called “contributon flux” (Williams and Engle, 1977), which arises as follows:
(i) Multiplying the stationary form of Equation 1 by , multiplying the stationary form of Equation 4 by , subtracting the resulting equations from each other, and integrating the resulting equation over only the energy- and solid angle-independent variables yield the following relation:
where
(ii) The form of Equation 7 is the same as the mass continuity balance/equation for compressible flow, indicating that the “contributon response density” is conserved as it flows from the “contributon response source” toward the “contributon response sink” , with a “contributon response mean velocity” corresponding to the neutron speed .
The application of the nth-FASAM-L is illustrated in this section by considering the simplified model of the distribution in the asymptotic energy range of neutrons produced by a source of neutrons placed in an isotropic medium comprising a homogeneous mixture of “M” non-fissionable materials having constant (i.e., energy-independent) properties. For simplicity, but without diminishing the applicability of the nth-FASAM-L, this medium is considered to be infinitely large. The simplified form of the Boltzmann neutron transport equation, as shown in Equation 1, that models the energy distributions of neutrons within a mixture of materials is called the “neutron slowing-down equation.” This equation is written using neutron lethargy (rather than the neutron energy) as the independent variable. Neutron lethargy is customarily denoted using the variable/letter “u” and is defined as , where denotes the energy variable and denotes the highest energy in the system. Thus, the neutron slowing-down model (see, e.g., Meghreblian and Holmes, 1960; Lamarsh, 1966) for the energy distribution of the neutron flux in a homogeneous mixture of non-fissionable materials of infinite extent takes the following simplified form of Equation 1:
The quantities that appear in Equation 12 are defined as follows.
(i) The lethargy-dependent neutron flux is denoted as ; denotes a cut-off lethargy, usually corresponding to the thermal neutron energy (ca. 0.0024 electron volts).
(ii) The macroscopic elastic scattering cross section for the homogeneous mixture of “M” materials is denoted as and is defined as follows:
where denotes the elastic scattering cross section of material “i,” and the atomic or molecular number density of material “i” is denoted as and is defined as , where is Avogadro’s number , while and denote the mass number and density of the material, respectively.
(iii) The average gain in lethargy of a neutron per collision is denoted as and is defined as follows for the homogeneous mixture:
(iv) The macroscopic absorption cross section is denoted as and is defined as follows for the homogeneous mixture:
where denotes the microscopic radiative-capture cross section of material “i.”
(v) The macroscopic total cross section is denoted as and is defined as follows for the homogeneous mixture:
(vi) The source is considered to be a simplified “spontaneous fission” source stemming from fissionable actinides, such as 239Pu and 240Pu, emitting monoenergetic neutrons at the highest energy (i.e., zero lethargy). Such a source is comprised within the OECD/NEA polyethylene-reflected plutonium (PERP) OECD/NEA reactor physics benchmark (Valentine, 2006; Cacuci and Fang, 2023), which can be modeled using the following simplified expression:
where the superscript “S” indicates the “source; ” the subscript index k = 1 indicates material properties pertaining to the isotope 239Pu; the subscript index k = 2 indicates material properties pertaining to the isotope 240Pu; denotes the decay constant; denotes the atomic density of the respective actinide; denotes the spontaneous fission branching ratio; denotes the average number of neutrons per spontaneous fission; and denotes a function of parameters used in Watt’s fission spectrum to approximate the spontaneous fission neutron spectrum of the respective actinide. The detailed forms of the parameters are unimportant for illustrating the application of the nth-FASAM-L. The nominal values for these imprecisely known parameters are available from a library file contained in SOURCES 4C (Wilson et al., 2002).
Mirroring the considerations for the Boltzmann transport equation presented in Equations 1‒6, the “adjoint slowing-down model” is constructed in the Hilbert space of square-integrable functions and endowed with the following inner product, denoted as :
Using the inner product defined in Equation 19, the adjoint slowing-down model is constructed by the usual procedure, i.e., by (i) constructing the inner product of Equation 12 with a function ; (ii) integrating by parts the resulting relation so as to transfer the differential operation from the forward function onto the adjoint function ; (iii) using the initial condition provided in Equation 13 and eliminating the unknown function by choosing the final-value condition ; and (iv) choosing the source for the resulting adjoint slowing-down model so as to satisfy the generalized reciprocity relation shown in Equation 6. The result of these operations is the following adjoint slowing-down model for the adjoint slowing-down function :
The “contributon-flux response density” , as generally defined in Equation 8, specialized for the neutron slowing-down model, coincides with the inner product used in this context, i.e.,
It is important to note that does not depend explicitly on either the feature function or any primary model parameter. Therefore, the G-differential of will not comprise a direct-effect term but will consist entirely of the indirect-effect term.
For this “contributon-flux response density” model, the following primary model parameters are subject to experimental uncertainties.
(i) For each material “i,” , included in the homogeneous mixture, the following are primary model parameters: the atomic number densities ; the microscopic radiative-capture cross section ; and the scattering cross section ;
(ii) The source parameters , , , , and , for k = 1,2.
The above primary parameters are considered to constitute the components of a “vector of primary model parameters” defined as follows:
The first-level forward/adjoint system (1st-LFAS) for the “first-level forward/adjoint function” comprises Equations 12, 13, 20, and 21. The structure of the 1st-LFAS suggests that the components of the feature function can be defined as follows:
Solving Equations 12, 13 while using the definitions introduced in Equation 24 yields the following expression for the flux in terms of the components of the feature function :
Solving the above adjoint slowing-down model yields the following closed-form expression for the adjoint slowing-down function :
In terms of the components of the feature function , the closed-form expression of the “contributon response density” is obtained by substituting the expressions provided in Equations 25, 26 into Equation 22 and performing the integration over lethargy, which yields
In terms of the primary model parameters, the closed-form expression of the “contributon response density” is
As Equation 28 indicates, the model response can be considered to depend directly on primary model parameters. In view of Equation 27, however, the model response can alternatively be considered to depend directly on two feature functions and only indirectly (through the two feature functions) on the primary model parameters. In the former consideration/interpretation, the response sensitivities to the primary model parameters will be obtained by applying the nth-CASAM-L. In the later consideration/interpretation, the response sensitivities to the primary model parameters will be obtained by applying the nth-FASAM-L, which will involve two stages: (a) the response sensitivities with respect to the feature functions will be obtained in the first stage; (b) the subsequent computation of the response sensitivities to the primary model parameters will be performed in the second stage by using the response sensitivities with respect to the feature functions obtained in the first stage. The computational distinctions that stem from these differing considerations/interpretations underlying the nth-CASAM-L versus the nth-FASAM-L will become evident in the next section by using a paradigm neutron slowing-down model, which is representative of the general situation for any linear system.
3 First-order adjoint sensitivity analysis of the contributon flux to the slowing-down model’s features and parameters
The first-order sensitivities of the response , where , are obtained by determining the first-order Gateaux (G-)- differential, denoted as , of this response for variations around the phase-space point . By definition, the first-order G-differential is obtained as follows:
The sensitivities of with respect to the feature functions (and subsequently to the primary model parameters) will be determined in Section 3.1 by applying the 1st-FASAM-L. Alternatively, the sensitivities of directly with respect to the primary model parameters will be determined in Section 3.2 by applying the 1st-CASAM-L.
3.1 Application of the 1st-FASAM-L
The first-level variational sensitivity function is the solution of the first-level variational sensitivity system (1st-LVSS) obtained by differentiating the 1st-LFAS. The function is obtained by taking the first-order G-differentials of Equations 12, 13 to obtain
Carrying out the differentiations with respect to in the above equations and setting in the resulting expressions yields the following relations:
The equations satisfied by the variational function are obtained by G-differentiating Equations 20, 21 to obtain the equations below:
Concatenating Equations 32‒35 yields the following 1st-LVSS for the first-level variational sensitivity function :
where
Rather than repeatedly solving the 1st-LVSS for every possible variations , , the appearance of the first-level variational sensitivity function will be eliminated from the expression of the G-differential of the response , defined in Equation 29), by applying the principles of the 1st-FASAM-L outlined in the accompanying “Part I” by Cacuci (2024c). The specific steps are as follows:
1. A Hilbert space, denoted as , is introduced endowed with the following inner product denoted as , between two elements, and :
2. In the Hilbert , the inner product of Equation 36 is formed with a yet undefined vector-valued function to obtain the following relation:
3. The left-side of Equation 41 is integrated by parts to obtain the following relation, where the specification is omitted to simplify the notation:
4. The first two terms on the right side of Equation 42 are required to represent the G-differentiated response defined in Equation 29, and the unknown boundary values of the function are eliminated from the bilinear concomitant on the right side of Equation 42 to obtain the following 1st-LASS for the first-level adjoint sensitivity function :
where
5. It follows from Equations 29, 41‒44 that G-differentiated response defined in Equation 29 takes the following expression in terms of the first-level adjoint sensitivity function :
The expressions of the sensitivities of the response with respect to the components of the feature function are given by the expressions that multiply the respective components of in Equation 47, i.e.,
The above expressions are to be evaluated at the nominal parameter values , but the indication has been omitted for simplicity.
The first-order sensitivities of the response with respect to the primary model parameters are obtained by using the results obtained in Equations 48, 49, respectively, in conjunction with the “chain rule” of differentiating the components of the feature function with respect to the primary model parameters defined in Equation 29 to obtain the following expressions:
Solving the 1st-LASS defined by Equations 43, 44 yields the following closed-form expressions for the components of the first-level adjoint sensitivity function :
Using the above expressions in Equations 48, 49 yields the following closed-form expressions for the respective sensitivities:
The correctness of the expressions obtained in Equations 53, 54 can be verified by differentiating accordingly the closed-form expression given in Equation 27.
3.2 Application of the 1st-CASAM-L
The 1st-CASAM-L delivers the first-order sensitivities of the response directly with respect to the primary model parameters. The expression of the G-differentiated response is as shown in Equation 29, but the source term on the right side of the 1st-LVSS takes the following form:
If one were to actually solve the 1st-LVSS to obtain the first-level variational function and subsequently use the respective variational function to compute each sensitivity, one would need to solve the 1st-LVSS TP-times, using each time a source that would correspond to the ith-primary parameter, of the form , for each primary parameter
Since the left side of the 1st-LVSS remains the same as in Equation 36 and the boundary conditions also remain the same as obtained in Equation 37, it follows that the 1st-LASS and its solution remain unchanged. It therefore follows that the counterpart of the expression of the G-differential obtained in Equation 47 takes the following form:
The first-order sensitivities of the response with respect to the primary model parameters are obtained by identifying the expressions that multiply the respective variations in Equation 47, which yields the following result:
As expected, the result obtained from Equation 57 is identical to the result produced from Equation 50 by using the 1st-FASAM-L. Both the 1st-FASAM-L and 1st-CASAM-L require “one large-scale computation” for solving the 1st-LASS represented by Equations 43, 44.
4 Second-order adjoint sensitivity analysis of the contributon flux to the slowing-down model’s features and parameters
In practice, closed-form expressions such as those shown in Equations 53, 54 are unavailable. The 1st-FASAM-L yields the expressions provided in Equations 48, 49, while the 1st-CASAM-L yields the expressions provided in Equation 57. Hence, these expressions will provide the starting points for obtaining the second-order sensitivities that stem from the respective first-order sensitivities. As outlined within the general frameworks of both the nth-FASAM-L and nth-CASAM-L methodologies, the second-order sensitivities are obtained by conceptually considering them to arise as the “first-order sensitivities of the first-order sensitivities.”
4.1 Application of the 2nd-FASAM-L
The 2nd-FASAM-L uses the first-order sensitivities obtained from the 1st-CASAM-L, as provided in Equations 48, 49, to obtain the respective second-order sensitivities, as presented in Sections 4.1.1 and 4.1.2.
4.1.1 Second-order sensitivities stemming from the first-order sensitivity
The second-order sensitivities that stem from the first-order sensitivity are obtained by determining the G-differential of . For subsequent “bookkeeping” purposes, this first-order sensitivity will be denoted as , where the superscript “(1)” denotes “first-order” (sensitivity) and the argument “1” indicates that this sensitivity is with respect to the first component, i.e., , of the feature function . This sensitivity also depends on the function , which is the solution of the “second-level forward/adjoint system (2nd-LFAS)” obtained by concatenating the 1st-LFAS with the 1st-LASS, comprising Equations 12, 13, 20, 21, 43, and 44.
Applying the definition of the G-differential to Equation 48 yields the following expression for the G-differential :
The components , , , and of the second-level variational sensitivity function are the solutions of the 2nd-LVSS, which is obtained by G-differentiating the 2nd-LFAS. Thus, performing the G-differentiation of Equations 12, 13, 20, 21, 43, and 44 yields the following 2nd-LVSS for the second-level variational sensitivity function :
where
The second-level variational sensitivity function will be eliminated from the expression of by constructing the 2nd-LASS corresponding to the above 2nd-LVSS. The solution of the 2nd-LASS will be used in Equation 58 to construct , an alternative expression that will not depend on . This 2nd-LASS will be constructed in a Hilbert space denoted as , comprising four-component vector-valued functions of the form as elements, and is endowed with the following inner product between two vectors and :
The inner product defined in Equation 63 will be used to construct the inner product of Equation 59 with a function denoted as , where the argument “1” of the function indicates that this (adjoint) function corresponds to the first-order sensitivity of the response with respect to the “first” component, , of the feature function . Constructing this inner product yields the following relation, where the specification has been omitted to simplify the notation:
Integrating by parts the left side of Equation 64 yields the following relation:
The right side of Equation 65 is now tailored to represent the G-differential expressed by Equation 58 by requiring the second-level adjoint sensitivity function to be the solution of the following 2nd-LASS:
where
Implementing the equations underlying the 2nd-LVSS and the 2nd-LASS and substituting Equation 58 into Equation 64 provide the following alternative expression for the G-differential :
The expressions that multiply the respective components of in Equation 70 are the expressions of the second-order sensitivities (stemming from the first-order sensitivity ) of the response , with respect to the components of the feature function . Thus, identifying in Equation 70 the expressions that multiply the respective variations in the components of the feature function yields the following relations:
Solving the 2nd-LASS represented by Equations 66, 67 yields the following closed-form expressions for the components of the second-level adjoint sensitivity function :
Using the explicit closed-form expressions obtained in Equations 73‒76 and substituting them in Equations 71, 72 yield the following closed-form explicit expressions for the respective second-order sensitivities:
The correctness of the expressions obtained in Equations 77, 78 can be verified by differentiating accordingly the closed-form expression given in Equation 53.
4.1.2 Second-order sensitivities stemming from the first-order sensitivity
The second-order sensitivities that stem from the first-order sensitivity are obtained by determining the G-differential of . For subsequent “bookkeeping” purposes, this first-order sensitivity will be denoted as , where the superscript “(1)” denotes “first-order” (sensitivity) and the argument “2” indicates that this sensitivity is with respect to the second component, i.e., , of the feature function . This sensitivity also depends on the function . Applying the definition of the G-differential to the expression provided in Equation 49 yields the result below for the G-differential :
The function , as shown in Equation 79, is the component of the second-level variational sensitivity function , which is the solution of the 2nd-LVSS comprising Equations 59, 60. The component will be eliminated from the expression of by following the same procedure as described in Section 4.1.1 to construct a 2nd-LASS, the solution of which will be denoted as and will be used in Equation 79 to eliminate . The argument “2” in indicates that this second-level adjoint sensitivity function corresponds to the first-order sensitivity of the response with respect to the “second” component, , of the feature function . The 2nd-LASS for the function will have the same left side and boundary conditions as obtained in Equations 66, 67, but the right-side of this 2nd-LASS will correspond to the G-differential obtained in Equation 79, which leads to the following 2nd-LASS:
where
The alternative expression for the G-differential in terms of the components of has the same formal expression as shown in Equation 70 but with the components of the function being replaced by the components of , i.e.,:
Solving the 2nd-LASS represented by Equations 80, 81 yields the following expressions:
Identifying in Equation 83 the expressions that multiply the respective variations , in the components of the feature function and using the closed-form expressions obtained in Equations 84‒87, 26, 51 yield the following closed-form explicit expressions for the respective second-order sensitivities:
The correctness of the expressions obtained in Equations 88, 89 can be verified by differentiating accordingly the closed-form expression given in Equation 54.
Notably, due to the symmetry of the mixed second-order sensitivities, the expressions obtained in Equations 88, 72 provide an intrinsic mutual verification mechanism of the accuracy of the computations of the second-level adjoint sensitivity functions and .
4.2 Application of the 2nd-CASAM-L
The starting point for the application of the 2nd-CASAM-L is to determine the G-differential of the TP first-order sensitivities represented by Equation 57. For “bookkeeping” purposes, it is convenient to designate these TP first-order sensitivities as follows:
where
The G-differential of the expression in Equation 90 is obtained, by definition, as follows:
where the direct-effect and indirect-effect terms are defined, respectively, as follows:
The direct-effect term can be evaluated/computed already at this stage. On the other hand, the indirect-effect depends on the second-level variational function , which is the solution of the counterpart of 2nd-LVSS defined by Equations 59, 60, with the same boundary conditions and right-side but with distinct source terms, each source term involving the quantities and for . If this path were chosen to compute the second-order sensitivities, the 2nd-LVSS would need to be solved TP2 times, with TP2 different sources on the respective right sides, albeit with the same left side and boundary conditions.
The components are eliminated from the expression of the indirect-effect term defined in Equation 94 by constructing a corresponding 2nd-LASS in the Hilbert space by following the same sequence of steps as described in Section 4.1. The formal expression of the 2nd-LASS thus obtained will have the same left side and boundary conditions as those described in Section 4.1, but the right side of this formal 2nd-LASS will have a source term that will correspond to the indirect-effect term defined in Equation 94 and, hence, will be different for each , i.e.,
where
In terms of the solution of the 2nd-LASS represented by Equations 95, 96, the indirect-effect term defined in Equation 94 will have a representation that will formally resemble the expressions provided in Section 4.1, e.g., Equation 83, but with the second-level adjoint function(s) from Section 4.1 being replaced by the second-level adjoint sensitivity function . Finally, the total G-differential will be obtained, as shown in Equation 92, by adding the expression of the indirect-effect term obtained in terms of the second-level adjoint sensitivity function and the expression of the direct-effect term provided in Equation 93. The expression of the individual second-order sensitivities will subsequently be obtained by identifying in the final expression of the total G-differential those terms that multiply the parameter variations , .
4.2.1 Comparing the 2nd-FASAM-L versus the 2nd-CASAM-L
The computational savings provided by using, whenever possible, the 2nd-FASAM-L rather than the 2nd-CASAM-L are evident by comparing the results obtained in Section 4.1 versus the results obtained in Section 4.2. The feature function comprises two components , ; consequently, the 2nd-FASAM-L requires two large-scale computations (to solve the corresponding 2nd-LASS) to obtain the second-order response sensitivities with respect to the components of the feature function. Subsequently, the second-order response sensitivities with respect to the primary model parameters are obtained analytically using the chain-rule of differentiation.
In contradistinction, there is , where the number (M) of materials in the medium can easily exceed two dozen primary model parameters. Consequently, the 2nd-CASAM-L requires TP large-scale computations (to solve the corresponding 2nd-LASS) to obtain the second-order response sensitivities with respect to the primary model parameters. The boundary conditions and the operators on the left sides for all of the 2nd-LASS, for both the 2nd-FASAM-L and 2nd-CASAM-L, are the same; only the source terms on the left sides of these 2nd-LASS differ from each other. It is therefore computationally advantageous if the inverse operators of the left sides of these 2nd-LASS could be computed just once and stored for subsequent use, in which case the computational advantage of using the 2nd-FASAM-L would not be massive. Such a procedure could be feasible for relatively small models but would be impractical for large-scale problems, for which the advantage of using the 2nd-FASAM-L rather than the 2nd-CASAM-L increases as the number of model parameters increases.
5 Third-order adjoint sensitivity analysis of the contributon flux to the slowing-down model’s features and parameters
The 3rd-FASAM-L determines the third-order sensitivities by applying the principles of the 1st-FASAM to the second-order sensitivities, i.e., considering that the third-order sensitivities are “the first-order sensitivities of the second-order sensitivities.” The unmixed second-order sensitivity is identically zero. The two non-zero second-order sensitivities of the model response with respect to the components of the feature function are as follows: (i) the unmixed second-order sensitivity , expressed in Equation 71, and (ii) the mixed second-order sensitivity , expressed in either Equation 72 or Equation 88, which are equivalent, in view of the symmetry property of the mixed second-order sensitivities. Therefore, either the expression obtained in Equation 88 or Equation 72 can be used as the starting point for obtaining the third-order sensitivities stemming from this mixed second-order sensitivity. It appears that the expression provided in Equation 72 is the simpler of the two, so it will be used as the starting point for obtaining the corresponding third-order sensitivities.
The second-order sensitivity expressed in Equation 71 depends on the components of the third-level forward/adjoint function, denoted as , which is the solution of the third-level forward/adjoint system (3rd-LFAS) obtained by concatenating the 2nd-LFAS with the 2nd-LASS, thus comprising Equations 12, 13, 20, 21, 43, 44, 66 and 67. The argument “1;1” of indicates that this third-level function corresponds to the (unmixed) second-order sensitivity of the response with respect to the “first” feature function, . Therefore, the second-order sensitivity is denoted as follows: , where the argument “1;1” indicates that this third-level function corresponds to the (unmixed) second-order sensitivity and the arguments of the function were omitted, for simplicity. Similarly, the mixed second-order sensitivity depends on the components of the same function and will, therefore, be denoted as , where the argument “2;1” indicates that this second-order sensitivity is with respect to the components of .
5.1 Application of the 3rd-FASAM-L to compute the third-order sensitivities stemming from
The third-order sensitivities stemming from are obtained from the G-differential of Equation 71, which will be denoted as , and they are, by definition, determined as follows:
Performing the differentiation with respect to in Equation 98 and setting in the resulting expression yield
The third-level variational function , where , is the solution of the 3rd-LVSS obtained by concatenating the 2nd-LVSS (i.e., Equations 59, 60), with the equations obtained by G-differentiating the 2nd-LASS, represented by Equations 66, 67, for the function . The resulting 3rd-LVSS for the third-level variational function comprises the following matrix equation, where the dots are used to denote zero-elements for better visibility of the structure:
The 3rd-LVSS comprising Equations 100, 101 can be formally expressed in the following -matrix form:
The above matrix form of the 3rd-LVSS will be used as a “condensed notation” to construct the 3rd-LASS, the solution of which will be used to derive the alternative expression for the G-differential . This 3rd-LASS will be constructed in a Hilbert space denoted as , comprising as elements eight-component vector-valued functions of the form , and endowed with the following inner product between two vectors and :
The inner product defined in Equation 104 will be used to construct the inner product of Equation 102 with a function denoted as , where the argument “1,1” of the function indicates that this (third-level adjoint) function corresponds to the unmixed second-order sensitivity of the response with respect to the “first” component, , of the feature function . Constructing this inner product yields the following relation, where the specification has been omitted to simplify the notation:
The component for Equation 105 can be written as follows:
The left side of Equation 106 is integrated by parts to obtain the relation given below, in which the argument “1;1” has been omitted when writing the components to simplify the notation:
The boundary terms that appear in Equation 107 will vanish by using Equation 101 and imposing the following boundary conditions on the components of the third-level adjoint sensitivity function :
Equation 107 can be written in matrix form as follows:
where denotes the formal adjoint of . The right side of Equation 109 is now required to represent the G-differential by imposing the following relation:
where
The relations provided in Equations 108, 110 constitute the 3rd-LASS for the third-level adjoint sensitivity function . In component form, Equation 110 has the following expression, where the dots are used to denote zero-elements for better visibility of the structure:
Using the relations in Equations 99, 102, 103, 108, 110 yields the following alternative expression for :
The third-order sensitivities stemming from the relation obtained in Equation 113 are the expressions that multiply the respective variations and and are as follows:
The expressions obtained in Equations 114, 115 are to be evaluated at the nominal values of parameters and state functions, but the notation has been omitted for simplicity.
Solving Equations 112, 108 yields the following expressions for the components of the third-level adjoint sensitivity function :
Using the expressions obtained in in Equations 132, 133 and performing the respective operations yield the following results:
5.2 Application of the 3rd-FASAM-L to compute the 3rd-order sensitivities stemming from
The third-order sensitivities stemming from will be obtained from the G-differential of (Equation 72), which will be denoted as , and which is by definition determined as follows:
The function is one of the components of the third-level variational function , which is the solution of the 3rd-LVSS represented by Equations 101, 102. To avoid the need for solving the 3rd-LVSS, the appearance of this function will be eliminated from Equation 126 by deriving an alternative expression for the G-differential in terms of a third-level adjoint sensitivity function, denoted as . The argument “2,1” of the function indicates that this (third-level adjoint) function corresponds to the mixed second-order sensitivity of the response with respect to the “second and first” components, , of the feature function .
The third-level adjoint sensitivity function will be the solution of 3rd-LASS to be constructed in the Hilbert space using Equation 104 to construct the inner product of with Equation 102. Constructing this inner product yields the following relation, where the specification has been omitted to simplify the notation:
The left side of Equation 127 is integrated by parts to obtain the following relation:
where the following boundary conditions were imposed on the components , of the third-level adjoint sensitivity function :
The right side of Equation 109 is now required to represent the G-differential by imposing the following relation:
The relations provided in Equations 108, 110 constitute the 3rd-LASS for the third-level adjoint sensitivity function . Using the relations in Equations 99, 102, 103, 108, and 110 yields the following alternative expression for , in which the function has been replaced by the function :
The third-order sensitivities stemming from the relation obtained in Equation 131 are the expressions that multiply the respective variations and and are as follows:
The expressions obtained in Equations 132, 133 are to be evaluated at the nominal values of parameters and state functions, but the notation has been omitted for simplicity.
In component form, the 3rd-LASS for the third-level adjoint sensitivity function has the following expression, where dots are used to denote zero-elements for better visibility of the structure:
Solving Equation 134 yields the following expressions for the components of the third-level adjoint sensitivity function :
Using the expressions obtained in Equation 135, substituting them into Equations 132, 133 and performing the respective operations yield the following results:
6 Concluding discussion
This work has presented illustrative applications of the “nth-FASAM-L,” which has been specifically developed to be the most efficient methodology for computing exact expressions of sensitivities of responses (of such unique linear models) to features of model parameters and, subsequently, to the model parameters themselves. The efficiency of the nth-FASAM-L stems from the maximal reduction of the number of adjoint computations (which are “large-scale” computations) compared to the extant conventional high-order adjoint sensitivity analysis methodology nth-CASAM-L (Cacuci, 2022). The unique characteristics of the nth-FASAM-L have been illustrated in this work using a paradigm model of a “contributon-flux density response” that occurs in the energy distribution of neutrons stemming from a fission source in a homogeneous mixture of materials. This analytically solvable illustrative paradigm model has been used to demonstrate the following general conclusions regarding the characteristics and applicability of the nth-FASAM-L.
(i) Comparing the mathematical framework of the nth-FASAM-L to that of the nth-CASAM-L indicates that the components of the “feature function” play within the nth-FASAM-L the same role as played by the components of the “vector of primary model parameters” within the framework of the nth-CASAM-L. It is paramount to underscore, at the outset, that the total number of model parameters is always larger (usually by a wide margin) than the total number of components of the feature function , i.e., . The illustrative paradigm model of “neutron slowing down in a homogeneous mixture of materials” presented in this work comprised a feature function with two components (i.e., ) denoted as and , which were, in turn, functions of imprecisely known model parameters (where M denotes the number of materials and/or isotopes in the mixture, which is of the order of 20–50 in a nuclear reactor, depending on its service in operation).
(ii) For computing the exact expressions of the first-order sensitivities of a model response to the uncertain parameters, boundaries, and internal interfaces of the model, both the 1st-FASAM-L and 1st-CASAM-L require a single large-scale “adjoint” computation. This “large-scale” computation using either the 1st-FASAM-L or 1st-CASAM-L involves solving the same operator equations and boundary conditions within the respective 1st-LASS; only the sources for the respective 1st-LASS differ from each other. The 1st-FASAM-L enjoys a slight computational advantage since it requires only TF quadratures (one quadrature per component of the feature function), while the 1st-CASAM-L requires TP quadratures (one quadrature per model parameter). For the illustrative “contributon response of the neutron slowing-down” paradigm model, the computation of the first-order response sensitivities with respect to the model parameters required two quadratures using the 1st-FASAM-L, while the 1st-CASAM-L required TP-quadratures. Within the 1st-FASAM-L, the sensitivities with respect to the primary model parameters are obtained by using the first-order sensitivities and (with respect to the components of the feature function) in conjunction with the chain rule of differentiation of the exactly known expressions of the components and in terms of the primary model parameters.
(iii) Both the 2nd-FASAM-L and 2nd-CASAM-L conceptually determine the second-order sensitivities by using the fundamental concept that “the second-order sensitivities are the first-order sensitivities of the first-order sensitivities.” For computing the exact expressions of the second-order response sensitivities with respect to the primary model’s parameters, the fundamental difference between the 2nd-FASAM-L and 2nd-CASAM-L is obtained as follows: the 2nd-FASAM-L requires as many large-scale “adjoint” computations as there are “feature functions of parameters” (where denotes the total number of feature functions) for solving the left side of the 2nd-LASS with distinct sources on its right side. In contradistinction, the 2nd-CASAM-L requires (where denotes the total number of model parameters or non-zero first-order sensitivities) large-scale computations for solving the same left side of the 2nd-LASS but with distinct sources. Remarkably, the types of “large-scale” computations are the same in both the 2nd-FASAM-L and 2nd-CASAM-L since they both solve the same operator equations and boundary conditions within the respective 2nd-LASS systems; only the sources for these adjoint systems differ from each other. Since , the 2nd-FASAM-L is considerably more efficient than the 2nd-CASAM-L for computing the exact expressions of the second-order sensitivities of a model response to the uncertain parameters, boundaries, and internal interfaces of the model. For the illustrative contributon-response paradigm model, the computation of the second-order response sensitivities with respect to the model parameters using the 2nd-FASAM-L requires just two large-scale computations, for solving the two 2nd-LASS that correspond to the first-order sensitivities, and , of the contributon response with respect to the respective components, and , of the model’s “feature function” . In contradistinction, computing the second-order sensitivities to the model parameters using the 2nd-CASAM-L requires TP large-scale computations, one for solving each of the 2nd-LASS that corresponds to each one of the distinct first-order sensitivities , , of the response with respect to the TP model parameters. Remarkably, only the unmixed second-order sensitivity and the mixed second-order sensitivity are non-zero. The unmixed second-order sensitivity is identically zero, i.e., . In contradistinction, computing the second-order sensitivities to the model parameters using the 2nd-CASAM-L requires TP large-scale computations, one for solving each of the 2nd-LASS that corresponds to one of the distinct TP model parameters. None of the second-order sensitivities with respect to the primary model parameters vanish.
(iv) For computing the exact expressions of the third-order response sensitivities with respect to the primary model’s parameters, the 3rd-FASAM-L requires at most large-scale “adjoint” computations for solving the 3rd-LASS with distinct sources, while the 3rd-CASAM-L requires at most large-scale computations for solving the 3rd-LASS with distinct sources. For the illustrative “contributon response of the neutron slowing-down” paradigm model, the computation of the third-order response sensitivities with respect to the model parameters using the 3rd-FASAM-L requires only two large-scale computations for solving the two 3rd-LASS that correspond to the respective non-zero second-order sensitivities and . Only the unmixed third-order sensitivity and the mixed third-order sensitivity are non-zero; all other third-order sensitivities vanish identically. In contradistinction, the 3rd-CASAM-L requires all large-scale computations for solving the 3rd-LASS since all of the second-order sensitivities with respect to the primary model parameters are non-zero. Furthermore, all of the third-order response sensitivities with respect to the primary model parameters are non-zero.
(v) The same computational count of “large-scale computations” caries over when computing the fourth- and higher-order sensitivities, i.e., the formula for calculating the “number of large-scale adjoint computations” is formally the same for both the nth-FASAM-N (Cacuci, 2024a, 2024b) and nth-CASAM-N (Cacuci, 2023a), but the “variable” in the formula for determining the number of adjoint computations for the nth-FASAM-N is (i.e., total number of feature functions), while the counterpart for the formula for determining the number of adjoint computations for the nth-CASAM-N is (i.e., total number of model parameters). Since , it follows that the higher the order of computed sensitivities, the more efficient the nth-FASAM-N (Cacuci, 2024a, 2024b) becomes compared to the nth-CASAM-N (Cacuci, 2023a).
(vi) The probability of encountering vanishing sensitivities is much higher when using the nth-FASAM-L than when using the nth-CASAM-L. For the illustrative “contributon response of the neutron slowing-down” paradigm model, it is evident that the only a few of the response sensitivities of fourth order (and higher order) with respect to the components of the feature function will not vanish, and the non-vanishing sensitivities will all involve the component of the feature function since this component appears in an exponential, whereas the other component appears just as a multiplicative factor. In contradistinction, none of the higher-order response sensitivities with respect to the primary model parameters will vanish using the 2nd-CASAM-L.
(vii) When a model has no “feature” functions of parameters, but only comprises primary parameters, the nth-FASAM-L becomes identical to the nth-CASAM-L.
(viii) Both the nth-FASAM-L and nth-CASAM-L are formulated in linearly increasing higher-dimensional Hilbert spaces—as opposed to exponentially increasing parameter-dimensional spaces—thus overcoming the limitation of dimensionality in the sensitivity analysis of linear systems. Both the nth-FASAM-L and nth-CASAM-L are incomparably more efficient and more accurate than any other method (statistical, finite differences, etc.) for computing the exact expressions of response sensitivities (of any order) with respect to the uncertain parameters, boundaries, and internal interfaces of the model.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.
Author contributions
DC: conceptualization, data curation, formal analysis, investigation, methodology, project administration, resources, software, supervision, validation, visualization, writing–original draft, and writing–review and editing.
Funding
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Conflict of interest
The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Cacuci, D. G. (2022). “The nth-order comprehensive adjoint sensitivity analysis methodology (nth-CASAM): overcoming the curse of dimensionality in sensitivity and uncertainty analysis,” in Volume I: Linear systems (Cham, Switzerland: Springer Nature), 362. doi:10.1007/978-3-030-96364-4
CrossRef Full Text | Google Scholar
Cacuci, D. G. (2023a). “The nth-order comprehensive adjoint sensitivity analysis methodology (nth-CASAM): overcoming the curse of dimensionality in sensitivity and uncertainty analysis,” in Volume III: Nonlinear Systems (Cham, Switzerland: Springer Nature), 369. doi:10.1007/978-3-031-22757-8
CrossRef Full Text | Google Scholar
Cacuci, D. G. (2024a). Introducing the nth-order features adjoint sensitivity analysis methodology for nonlinear systems (nth-FASAM-N): I. Mathematical framework. Am. J. Comput. Math. 14, 11–42. doi:10.4236/ajcm.2024.141002
CrossRef Full Text | Google Scholar
Cacuci, D. G. (2024b). Introducing the nth-order features adjoint sensitivity analysis methodology for nonlinear systems (nth-FASAM-N): II. Illustrative example. Am. J. Comput. Math. 14, 43–95. doi:10.4236/ajcm.2024.141003
CrossRef Full Text | Google Scholar
Cacuci, D. G. (2024c). The nth-order features adjoint sensitivity analysis methodology for response-coupled forward/adjoint linear systems (nth-FASAM-L):I. I. Mathematical framework. Front. Energy Res. 12, 1417594. doi:10.3389/fenrg.2024.1417594
CrossRef Full Text | Google Scholar
Cacuci, D. G., and Fang, R. (2023). “The nth-order comprehensive adjoint sensitivity analysis methodology (nth-CASAM): overcoming the curse of dimensionality in sensitivity and uncertainty analysis,” in Application to a large-scale system (Nature Switzerland, Cham: Springer), 463. doi:10.1007/978-3-031-19635-5
CrossRef Full Text | Google Scholar
Lamarsh, J. R. (1966). Introduction to nuclear reactor theory. Reading MA, USA: Adison-Wesley Publishing Co., 491–492.
Google Scholar
Lewins, J. (1965). IMPORTANCE: the adjoint function. Oxford, UK: Pergamon Press Ltd.
Google Scholar
Meghreblian, R. V., and Holmes, D. K. (1960). Reactor analysis. New York, USA: McGraw-Hill.
Google Scholar
Stacey, W. M. (1974). Variational methods in nuclear reactor physics. New York, USA: Academic Press.
Google Scholar
Stacey, W. M. (2001). Nuclear reactor physics. New York. USA: John Wiley and Sons.
Google Scholar
Valentine, T. E. (2006). “Polyethylene-reflected plutonium metal sphere subcritical noise measurements, SUB-PU-METMIXED-001,” in International handbook of evaluated criticality safety benchmark experiments, NEA/NSC/DOC(95)03/I-ix, organization for economic Co-operation and development (OECD). Paris, France: Nuclear Energy Agency.
Google Scholar
Williams, M. L., and Engle, W. W. (1977). The concept of spatial channel theory applied to reactor shielding analysis. Nucl. Sci. Eng. 62, 92–104. doi:10.13182/nse77-a26941
CrossRef Full Text | Google Scholar
Wilson, W. B., Perry, R. T., Shores, E. F., Charlton, W. S., Parish, T. A., Estes, G. P., et al. (2002). “SOURCES4C: a code for calculating (α, n), spontaneous fission, and delayed neutron sources and spectra,” in Proceedings of the American nuclear society/radiation protection and shielding division 12th biennial topical meeting (Santa Fe, NM, USA), 14–18.
Google Scholar