A mixed finite differences scheme for gradient approximation (Q2671431)

From MaRDI portal
scientific article
Language Label Description Also known as
English
A mixed finite differences scheme for gradient approximation
scientific article

    Statements

    A mixed finite differences scheme for gradient approximation (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    3 June 2022
    0 references
    The authors focus on linear functionals defining an approximate version of the gradient of a function. These functionals are often used when dealing with optimization problems where the computation of the gradient of the objective function is costly or the objective function values are affected by some noise. These functionals have been recently considered to estimate the gradient of the objective function by the expected value of the function variations in the space of directions. The expected value is then approximated by a sample average over a proper (random) choice of sample directions in the domain of integration. In this way, the approximation error is characterized by statistical properties of the sample average estimate, typically its variance. Therefore, while useful and attractive bounds for the error variance can be expressed in terms of the number of function evaluations, nothing can be said on the error of a single experiment that could be quite large. This work instead is aimed at deriving an approximation scheme for linear functionals approximating the gradient, whose error of approximation can be characterized by a deterministic point of view in the case of noise-free data. The previously mentioned linear functionals are no longer considered as expected values over the space of directions, but rather as the filtered derivative of the objective function by a Gaussian kernel. By using this new approach, a gradient estimation based on a suitable linear combination of central finite differences at different step sizes is proposed and deterministic bounds that do not depend on the particular sample of points considered are computed. In the noisy setting, on the other end, the variance of the estimation error of the proposed method is shown to be strictly lower than the one of the estimation error of the central finite difference scheme. Numerical experiments on a set of test functions with a significant benchmark are encouraging, showing good performances compared to those of some methods commonly used in the literature, also in the noisy setting.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    gradient approximation
    0 references
    filtered derivative
    0 references
    derivative free optimization
    0 references
    0 references
    0 references