Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
Citations
2,348 citations
Cites background or methods from "Multivariate stochastic approximati..."
...Following Feynman’s idea for quantum simulation, a quantum algorithm for the ground state problem of interacting fermions was proposed in [14] and [15]....
[...]
...The convergence of θk to the optimal solution ~ θ ∗ can be proven even in the presence of stochastic fluctuations, if the starting point is in the domain of the attraction of the problem [15], ....
[...]
...The simultaneous perturbation stochastic approximation (SPSA) algorithm, introduced in [15], is a gradient-descent method that gives a level of accuracy in the optimization of the cost function that is comparable with finite-difference gradient approximations, while saving an order O(p) of cost function evaluations....
[...]
2,178 citations
Cites background or methods from "Multivariate stochastic approximati..."
...Instead, a perturbation-based estimator such as found in Simultaneous Perturbation Stochastic Approximation (SPSA) (Spall, 1992) chooses a random perturbation vector z (e.g., isotropic Gaussian noise of variance σ2) and estimates the gradient of the expected loss with respect to ui…...
[...]
...Unlike the SPSA (Spall, 1992) estimator, our estimator is unbiased even though the perturbations are not small (0 or 1), and it multiplies by the perturbation rather than dividing by it....
[...]
...Gradient estimators based on stochastic perturbations have long been shown to be much more efficient than standard finite-difference approximations (Spall, 1992)....
[...]
...Instead, a perturbation-based estimator such as found in Simultaneous Perturbation Stochastic Approximation (SPSA) (Spall, 1992) chooses a random perturbation vector z (e.g., isotropic Gaussian noise of variance σ2) and estimates the gradient of the expected loss with respect to ui through L(u+z)−L(u−z) 2zi ....
[...]
...Instead, a perturbation-based estimator such as found in Simultaneous Perturbation Stochastic Approximation (SPSA) (Spall, 1992) chooses a random perturbation vector z (e....
[...]
1,436 citations
Cites methods from "Multivariate stochastic approximati..."
...Recently, Arnold in his Ph.D. thesis (Arnold, 2001) extensively tested numerous optimization methods under noise, including: (1) the direct pattern search algorithm of Hooke and Jeeves (Hooke and Jeeves, 1961), (2) the simplex metdod of Nelder and Mead (Nelder and Mead, 1965), (3) the multi-directional search algorithm of Torczon (Torczon, 1989), (4) the implicit filtering algorithm of Gilmore and Kelley (Gilmore and Kelley, 1995; Kelley, 1999) that is based on explicitly approximating the local gradient of the objective functions by means of finite differencing, (5) the simultaneous perturbation stochastic approximation algorithm due to Spall (Spall, 1992; Spall, 1998a; Spall, 1998b), (6) the evolutionary gradient search algorithm of Salomon (Salomon, 1998), (7) the evolution strategy with cumulative mutation strength adaptation mechanism by Hansen and Ostermeier (Hansen, 1998; Hansen and Ostermeier, 2001)....
[...]
...…of the objective functions by means of finite differencing, (5) the simultaneous perturbation stochastic approximation algorithm due to Spall (Spall, 1992; Spall, 1998a; Spall, 1998b), (6) the evolutionary gradient search algorithm of Salomon (Salomon, 1998), (7) the evolution strategy with…...
[...]
...(2) the simplex metdod of Nelder and Mead (Nelder and Mead, 1965), (3) the multi-directional search algorithm of Torczon (Torczon, 1989), (4) the implicit filtering algorithm of Gilmore and Kelley (Gilmore and Kelley, 1995; Kelley, 1999) that is based on explicitly approximating the local gradient of the objective functions by means of finite differencing, (5) the simultaneous perturbation stochastic approximation algorithm due to Spall (Spall, 1992; Spall, 1998a; Spall, 1998b), (6) the evolutionary gradient search algorithm of Salomon (Salomon, 1998), (7) the evolution strategy with cumulative mutation strength adaptation mechanism by Hansen and Ostermeier (Hansen, 1998; Hansen and Ostermeier, 2001)....
[...]
1,434 citations
Cites background from "Multivariate stochastic approximati..."
...The second one, known as Simultaneous Perturbation (SP) [379], estimates the gradient by perturbing it not along the basis axis but instead along a random perturbation vector ∆ whose elements are independent and symmetrically Bernoulli distributed....
[...]
1,218 citations
Cites background or methods from "Multivariate stochastic approximati..."
...…For the special case where pψ is factored Gaussian (as in this work), the resulting gradient estimator is also known as simultaneous perturbation stochastic approximation [Spall, 1992], parameterexploring policy gradients [Sehnke et al., 2010], or zero-order gradient estimation [Nesterov and…...
[...]
...Specifically, using the score function estimator for∇ψEθ∼pψF (θ) in a fashion similar to REINFORCE [Williams, 1992], NES algorithms take gradient steps on ψ with the following estimator: ∇ψEθ∼pψF (θ) = Eθ∼pψ {F (θ)∇ψ log pψ(θ)} For the special case where pψ is factored Gaussian (as in this work), the resulting gradient estimator is also known as simultaneous perturbation stochastic approximation [Spall, 1992], parameterexploring policy gradients [Sehnke et al....
[...]
References
11,456 citations
2,141 citations
508 citations
403 citations