Experimental designs

An experimental design is a sample of the domain of the curve we are fitting. We may use the submodule design to generate experimental designs.

Latin-hypercube sampling

A popular method for generating a design is Latin-hypercube sampling [M79]. As a rule of thumb, a sample size of \(10d\), where \(d\) is the dimension of the training inputs, is sufficient [L09]. We may generate a Latin-hypercube design of size \(10d\) (here, \(n = 30\)) for the three-dimensional region \([0, 1] \times [0, 1] \times [0, 1]\) as follows.

>>> import pymimic as mim
>>> bounds = [[0., 1.], [0., 1.], [0., 1.]]
>>> mim.design(bounds, method="lhs", n=30)
array([[0.98333333, 0.35      , 0.75      ],
       ...
       [0.01666667, 0.71666667, 0.38333333]])
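
The defining property of a Latin-hypercube design can be illustrated with a minimal sketch. It assumes that each axis is divided into \(n\) equal strata and that each stratum is used exactly once, at its midpoint, which is consistent with values such as 0.01666667 (that is, \(0.5/30\)) in the output above. The function latin_hypercube() is hypothetical and pairs the strata by simple random permutation; it is not how PyMimic constructs its designs.

```python
import random


def latin_hypercube(bounds, n, seed=None):
    """Return n points forming a centred Latin-hypercube design.

    Illustrative sketch only: strata are paired by independent random
    permutations, not by the look-up tables that PyMimic uses.
    """
    rng = random.Random(seed)
    columns = []
    for lower, upper in bounds:
        # Midpoints of n equal-width strata along this axis.
        midpoints = [lower + (upper - lower) * (i + 0.5) / n for i in range(n)]
        # Shuffle so that each stratum is used exactly once on this axis.
        rng.shuffle(midpoints)
        columns.append(midpoints)
    # Transpose the per-axis columns into a list of n points.
    return [list(point) for point in zip(*columns)]


bounds = [[0., 1.], [0., 1.], [0., 1.]]
design = latin_hypercube(bounds, n=30, seed=0)
```

Projecting this design onto any single axis gives exactly one point in each of the 30 strata, which is what distinguishes it from a purely random sample.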

In fact, the function design() returns a Latin-hypercube sample of size \(10d\) by default. Such a design may be generated more simply as follows.

>>> mim.design(bounds)
array([[0.01666667, 0.35      , 0.38333333],
       ...
       [0.98333333, 0.71666667, 0.75      ]])

PyMimic generates Latin-hypercube designs from look-up tables constructed by the method of maximum projection [JB15, BJ18], and then randomly rotates and reflects them.
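
Random rotation and reflection leave the Latin-hypercube property intact, as the following sketch suggests. It assumes that "rotates and reflects" refers to symmetries of the hypercube, modelled here as a random reflection of each axis followed by a random permutation of the axes, and that the design lies in the unit hypercube. The function reflect_and_permute() is hypothetical and is not part of PyMimic.

```python
import random


def reflect_and_permute(design, seed=None):
    """Randomly reflect and reorder the axes of a design in [0, 1]^d."""
    rng = random.Random(seed)
    d = len(design[0])
    # Decide independently for each axis whether to reflect it.
    flips = [rng.random() < 0.5 for _ in range(d)]
    # Choose a random reordering of the axes.
    order = list(range(d))
    rng.shuffle(order)
    result = []
    for point in design:
        # Reflection about the centre of the unit interval: x -> 1 - x.
        reflected = [1.0 - x if flip else x for x, flip in zip(point, flips)]
        result.append([reflected[j] for j in order])
    return result


# A small two-dimensional Latin-hypercube design with four strata per axis.
unit_design = [[0.125, 0.375], [0.375, 0.875], [0.625, 0.125], [0.875, 0.625]]
transformed = reflect_and_permute(unit_design, seed=1)
```

Each column of the transformed design still contains every stratum midpoint exactly once, so the result is again a Latin-hypercube design.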

Generalized Latin-hypercube sampling

An extension of Latin-hypercube sampling is generalized Latin-hypercube sampling [DP10], which places training inputs preferentially near the boundary of the sample region. We may generate a generalized Latin-hypercube design as follows.

>>> bounds = [[0., 1.], [0., 1.], [0., 1.]]
>>> mim.design(bounds, method="gmlhs", n=30)
array([[9.99314767e-01, 8.14660196e-01, 8.53553391e-01],
       ...
       [6.85232623e-04, 2.73004750e-01, 3.20816025e-01]])

PyMimic generates a generalized Latin-hypercube design by transforming a Latin-hypercube design (see Latin-hypercube sampling, above).
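
The transformation can be illustrated with a short sketch. It assumes that each coordinate of a Latin-hypercube design is rescaled to the unit interval and passed through the quantile function of the arcsine distribution, \(u \mapsto (1 - \cos \pi u)/2\). This assumption is consistent with the output shown above, but it is not confirmed as PyMimic's implementation, and the function generalize() is hypothetical.

```python
import math


def generalize(design, bounds):
    """Map a Latin-hypercube design to a generalized Latin-hypercube design.

    Illustrative sketch only: assumes the arcsine quantile transformation.
    """
    result = []
    for point in design:
        new_point = []
        for x, (lower, upper) in zip(point, bounds):
            # Rescale the coordinate to the unit interval.
            u = (x - lower) / (upper - lower)
            # Quantile function of the arcsine distribution, Beta(1/2, 1/2).
            v = (1.0 - math.cos(math.pi * u)) / 2.0
            # Map back to the original bounds.
            new_point.append(lower + (upper - lower) * v)
        result.append(new_point)
    return result


bounds = [[0., 1.], [0., 1.], [0., 1.]]
# Three stratum midpoints of a 30-point Latin-hypercube design.
lhs_point = [59 / 60, 43 / 60, 45 / 60]
glhs_point = generalize([lhs_point], bounds)[0]
```

Under this assumption the midpoints \(59/60\), \(43/60\) and \(45/60\) map to approximately 0.999314767, 0.814660196 and 0.853553391, the values in the first row of the output above. Points near the centre of an axis move little, while points near either end are pushed towards the boundary.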

Regular-lattice sampling

We may also generate a design using a regular lattice.

>>> bounds = [[0., 1.], [0., 1.], [0., 1.]]
>>> mim.design(bounds, method="regular", n=30)
array([[0.        , 0.        , 0.        ],
       ...
       [1.        , 1.        , 1.        ]])
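
A regular lattice is the Cartesian product of equally spaced points along each axis, as in the following sketch. The function regular_lattice() is hypothetical; it takes the number of points per axis (at least two) rather than a total sample size, because how design() treats a value of n that is not a perfect \(d\)-th power is not stated here.

```python
import itertools


def regular_lattice(bounds, points_per_axis):
    """Return a regular lattice with the same number of points on each axis.

    Illustrative sketch only: requires points_per_axis >= 2.
    """
    axes = []
    for lower, upper in bounds:
        # Equally spaced points that include both end points of the axis.
        step = (upper - lower) / (points_per_axis - 1)
        axes.append([lower + i * step for i in range(points_per_axis)])
    # Every combination of per-axis values is a lattice point.
    return [list(point) for point in itertools.product(*axes)]


bounds = [[0., 1.], [0., 1.], [0., 1.]]
lattice = regular_lattice(bounds, points_per_axis=3)
```

With three points per axis in three dimensions the lattice has \(3^3 = 27\) points, running from one corner of the region to the opposite corner.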

Random sampling

We may also generate a design using random sampling.

>>> bounds = [[0., 1.], [0., 1.], [0., 1.]]
>>> mim.design(bounds, method="random", n=30)
array([[0.01865849, 0.457221  , 0.00652817],
       ...
       [0.25116118, 0.46654406, 0.22595428]])
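
For comparison, random sampling draws each coordinate independently and uniformly within its bounds. The sketch below assumes a uniform distribution; the function random_design(), its seed argument, and the example bounds are hypothetical and are not part of PyMimic.

```python
import random


def random_design(bounds, n, seed=None):
    """Return n points drawn independently and uniformly within the bounds."""
    rng = random.Random(seed)
    return [
        [rng.uniform(lower, upper) for lower, upper in bounds]
        for _ in range(n)
    ]


# Bounds need not be the unit interval; each axis is sampled within its own range.
example_bounds = [[0., 1.], [-1., 1.], [10., 20.]]
sample = random_design(example_bounds, n=30, seed=0)
```

Unlike a Latin-hypercube design, a random design gives no guarantee that its projections onto each axis are evenly spread.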

References

[M79]

McKay, M. D., Beckman, R. J., and W. J. Conover. 1979. 'A comparison of three methods for selecting values of input variables in the analysis of output from a computer code' in Technometrics, 21 (2): 239–45. Available at https://www.doi.org/10.2307/1268522.

[JB15]

Joseph, V. R., Gul, E., and S. Ba. 2015. 'Maximum projection designs for computer experiments' in Biometrika, 102: 371–80. Available at https://doi.org/10.1093/biomet/asv002.

[BJ18]

Ba, S., and V. R. Joseph. 2018. MaxPro: maximum projection designs [software]. Available at https://cran.r-project.org/web/packages/MaxPro/index.html.

[DP10]

Dette, H., and A. Pepelyshev. 2010. 'Generalized Latin hypercube design for computer experiments' in Technometrics, 52 (4): 421–9. Available at https://doi.org/10.1198/TECH.2010.09157.

[L09]

Loeppky, J. L., Sacks, J., and W. J. Welch. 2009. 'Choosing the sample size of a computer experiment: a practical guide' in Technometrics, 51 (4): 366–76. Available at https://doi.org/10.1198/TECH.2009.08040.