Non-parametric Approximate Dynamic Programming via the Kernel Method
Citations
2 citations
2 citations
2 citations
Cites background or methods from "Non-parametric Approximate Dynamic ..."
...A range of approximations have been explored in the literature including linear (Powell and Carvalho (1998), De Farias and Van Roy (2003)), separable concave (Topaloglu and Powell (2003)), non-parametric kernel (Bhat et al. (2012)) or neural networks (Mnih et al. (2013), Mnih et al....
[...]
...A range of approximations have been explored in the literature including linear (Powell and Carvalho (1998), De Farias and Van Roy (2003)), separable concave (Topaloglu and Powell (2003)), non-parametric kernel (Bhat et al. (2012)) or neural networks (Mnih et al. (2013), Mnih et al. (2015)). A number of these techniques have also been applied to a vehicle routing setting (Godfrey and Powell (2002), Powell and Topaloglu (2003), Topaloglu and Powell (2006), Powell et al....
[...]
...A range of approximations have been explored in the literature including linear (Powell and Carvalho (1998), De Farias and Van Roy (2003)), separable concave (Topaloglu and Powell (2003)), non-parametric kernel (Bhat et al. (2012)) or neural networks (Mnih et al. (2013), Mnih et al. (2015))....
[...]
...A range of approximations have been explored in the literature including linear (Powell and Carvalho (1998), De Farias and Van Roy (2003)), separable concave (Topaloglu and Powell (2003)), non-parametric kernel (Bhat et al. (2012)) or neural networks (Mnih et al. (2013), Mnih et al. (2015)). A number of these techniques have also been applied to a vehicle routing setting (Godfrey and Powell (2002), Powell and Topaloglu (2003), Topaloglu and Powell (2006), Powell et al. (2007) Novoa and Storer (2009))....
[...]
...A range of approximations have been explored in the literature including linear (Powell and Carvalho (1998), De Farias and Van Roy (2003)), separable concave (Topaloglu and Powell (2003)), non-parametric kernel (Bhat et al. (2012)) or neural networks (Mnih et al....
[...]
1 citations
Cites methods from "Non-parametric Approximate Dynamic ..."
...The initial selection and potential modification of basis functions in steps (i) and (iv), respectively, are implementation bottlenecks when using ALP but this issue has received limited attention in the literature (Klabjan and Adelman 2007, Adelman and Klabjan 2012, and Bhat et al. 2012)....
[...]
...Bhat et al. (2012) side-step basis function selection when computing a VFA by applying the kernel trick (see, e.g., chapter 5 of Mohri et al. 2012) to replace inner-products of such functions in the dual of a regularized ALP relaxation....
[...]
1 citations
References
10,834 citations
7,880 citations
"Non-parametric Approximate Dynamic ..." refers background in this paper
...For certain sets S, Mercer’s theorem provides another important construction of such a Hilbert space. more examples can be found in the text of Scholkopf and Smola (2001)....
[...]
...The Gaussian kernel is known to be full-dimensional (see, e.g., Theorem 2.18, Scholkopf and Smola, 2001), so that employing such a kernel in our setting would correspond to working with an infinite dimensional approximation architecture....
[...]
...more examples can be found in the text of Scholkopf and Smola (2001)....
[...]
5,667 citations
3,018 citations
"Non-parametric Approximate Dynamic ..." refers methods in this paper
...Max-Weight (Tassiulas and Ephremides, 1992)....
[...]
...We prepare the ground for the proof by developing appropriate uniform concentration guarantees for appropriate function classes....
[...]
2,535 citations