Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index

https://doi.org/10.1016/j.cpc.2009.09.018Get rights and content

Abstract

Variance based methods have assessed themselves as versatile and effective among the various available techniques for sensitivity analysis of model output. Practitioners can in principle describe the sensitivity pattern of a model Y=f(X1,X2,,Xk) with k uncertain input factors via a full decomposition of the variance V of Y into terms depending on the factors and their interactions. More often practitioners are satisfied with computing just k first order effects and k total effects, the latter describing synthetically interactions among input factors. In sensitivity analysis a key concern is the computational cost of the analysis, defined in terms of number of evaluations of f(X1,X2,,Xk) needed to complete the analysis, as f(X1,X2,,Xk) is often in the form of a numerical model which may take long processing time. While the computational cost is relatively cheap and weakly dependent on k for estimating first order effects, it remains expensive and strictly k-dependent for total effect indices. In the present note we compare existing and new practices for this index and offer recommendations on which to use.

Section snippets

Introduction to variance based measures

Sensitivity analysis is the study of how uncertainty in the output of a model (numerical or otherwise) can be apportioned to different sources of uncertainty in the model input factors, factors from now on [30]. Existing regulatory documents on impact assessment recommend the use of quantitative sensitivity analysis [7], [21]. Official guidelines insist on the importance of taking factor interactions into account [7], [9]. Variance based methods [6], [37] are well suited to this task and have

Sensitivity indices

Given a model of the form Y=f(X1,X2,Xk), with Y a scalar, a variance based first order effect for a generic factor Xi can be written as (see notations in Table 1):VXi(EXi(Y|Xi)) where Xi is the i-th factor and Xi denotes the matrix of all factors but Xi. The meaning of the inner expectation operator is that the mean of Y is taken over all possible values of Xi while keeping Xi fixed. The outer variance is taken over all possible values of Xi. The associated sensitivity measure (first order

Best practices for the simultaneous computation of Si and STi

We discuss here existing estimators to compute in a single set of simulations both sets of indices Si and STi. By ‘simulation’ we mean here the computation of an individual value for Y corresponding to a sampled set of k factors X1,X2,,Xk.

We imagine to have two independent sampling matrices A and B, with aji and bji as generic elements. The index i runs from one to k, the number of factors, while the index j runs from one to N, the number of simulations. We now introduce matrix AB(i) (BA(i))

Computational scheme for STi

To compute STi from formula (f), which represents the best practice so far, the design matrices A and AB(i) have to be set-up. Different methods may be used. In the following two different designs are compared: the first, called ‘radial design’, has been firstly presented in [25]; the second, called ‘winding design’ derives from the method discussed in [14]. The two designs are illustrated in Table 3. Let us focus first on the left-hand side. This shows how – starting from the fist row made of

Using Sobol' quasi-random sequences

Several types of quasi-random (QR) sequences have been suggested by Faure, Niederreiter, Halton, Hammersley, Sobol' and other investigators, see Bratley and Fox [3] for a review of these works.

QR sequences are specifically designed to generate samples of X1,X2,,Xk as uniformly as possible over the unit hypercube Ω.

Unlike random numbers, successive quasi-random points know about the position of previously sampled points and fill the gaps between them. For this reason they are also called

Numerical experiments

The following research questions concerning the estimation of STi are tackled here:

  • 1.

    Which is the best estimator for STi between estimators (e) and (f) in Table 24?

  • 2.

    Which is the best strategy between winding stairs and radial sampling?

  • 3.

    Is n>1 convenient with either of the above strategies?

  • 4.

    Is the answer to the questions above dependent upon the typology of the

Conclusions

The theory and the computational tools available to compute total sensitivity indices STi have been revised. The main motivation for the present work is that previous comparisons of different methods to estimate STi were based on incomplete combinations of sampling designs and estimators [5] or a limited set of test functions [25]. In this work a larger set of test functions has been employed reflecting different degrees of linearity, additivity and effective dimension. Further the simulations

Acknowledgements

Authors are particularly grateful to an anonymous reviewer who considerably helped in improving the manuscript.

References (48)

  • A. Saltelli et al.

    About the use of rank transformation in sensitivity analysis of model output

    Reliability Engineering and System Safety

    (1995)
  • I.M. Sobol'

    On the distribution of points in a cube and the approximate evaluation of integrals

    USSR Comput. Maths. Math. Phys.

    (1967)
  • I.M. Sobol'

    Uniformly distributed sequences with an addition uniform property

    USSR Comput. Maths. Math. Phys.

    (1976)
  • I.M. Sobol'

    Global sensitivity analysis indices for the investigation of nonlinear mathematical models

    Matematicheskoe Modelirovanie

    (2007)
  • I.M. Sobol' et al.

    Quasirandom sequence generators

    Keldysh Inst. Appl. Maths RAS Acad. Sci.

    (1992)
  • S. Tarantola et al.

    Random balance designs for the estimation of first order global sensitivity indices

    Reliability Engineering and System Safety

    (2006)
  • S. Da Veiga, F. Wahl, F. Gamboa, Local polynomial estimation for sensitivity analysis for models with correlated...
  • G. Archer et al.

    Sensitivity measures, anova-like techniques and the use of bootstrap

    Journal of Statistical Computation and Simulation

    (1997)
  • B. Bratley et al.

    Implementation and test of low discrepancy sequences

    ACM Transactions on Modeling and Computer Simulation

    (1992)
  • P. Bratley et al.

    Algorithm 659 implementing Sobol's quasirandom sequence generator

    ACM Transactions on Mathematical Software

    (1988)
  • K. Chan et al.

    Winding stairs: A sampling tool to compute sensitivity indices

    Statistics and Computing

    (2000)
  • R.I. Cukier et al.

    Study of the sensitivity of coupled reaction systems to uncertainties in rate coefficients. i theory

    The Journal of Chemical Physics

    (1973)
  • B. Efron et al.

    The jackknife estimate of variance

    Annals of Statistics

    (1981)
  • Cited by (2382)

    • Large-scale sandwich structures optimization using Bayesian method

      2024, International Journal of Mechanical Sciences
    View all citing articles on Scopus
    View full text