Dual coordinate descent methods for logistic regression and maximum entropy models
Citations
7,848 citations
Cites methods from "Dual coordinate descent methods for..."
...L2-regularized Logistic Regression (Solving Dual) See Yu et al. (2011) for details of a dual coordinate descent method....
[...]
...See Yu et al. (2011) for details of a dual coordinate descent method....
[...]
...Appendix I. L2-regularized Logistic Regression (Solving Dual) See Yu et al. (2011) for details of a dual coordinate descent method....
[...]
308 citations
Cites methods from "Dual coordinate descent methods for..."
...Recently it has been successfully applied to various large-scale problems such as linear SVMs [14], maximum entropy models [15], NMF problems [9], [10], and sparse inverse covariance estimation...
[...]
303 citations
Cites methods from "Dual coordinate descent methods for..."
...The work in [29] follows [64] to apply a two-level coordinate descent method, but uses a different method in the second level to decide variables for update....
[...]
...One example is a coordinate descent method (LIBLINEAR [29])....
[...]
273 citations
Cites background from "Dual coordinate descent methods for..."
...Yu et al. (2010) proposed a quasi Newton approach to solve non-smooth convex optimization problems....
[...]
257 citations
Cites result from "Dual coordinate descent methods for..."
...…the 5 Other models that were tested and yielded similar results include decision trees (Breiman et al. 1984), one vs. one multiclass strategy with support vector classification (Chang and Lin 2011), and one vs. all multiclass strategy with logistic regression (Yu et al. 2011) and ridge regression....
[...]
References
12,671 citations
Additional excerpts
...1 by Bertsekas (1999), which gives the convergence of coordinate descent methods for the following problem: min D(α) subject to α ∈ A1 × · · · ×Al, (75)...
[...]
7,004 citations
6,562 citations
"Dual coordinate descent methods for..." refers methods in this paper
...Therefore, we follow Memisevic (2006) and earlier SVM works (Crammer and Singer 2000; Hsu and Lin 2002; Keerthi et al. 2008) to consider variables associated with an xi as a block....
[...]
5,506 citations
4,145 citations