Skip to content

Latest commit

 

History

History
42 lines (42 loc) · 1.43 KB

2024-06-30-chen24b.md

File metadata and controls

42 lines (42 loc) · 1.43 KB
title section abstract layout series publisher issn id month tex_title firstpage lastpage page order cycles bibtex_author author date address container-title volume genre issued pdf extras
A faster and simpler algorithm for learning shallow networks
Original Papers
We revisit the well-studied problem of learning a linear combination of $k$ ReLU activations given labeled examples drawn from the standard $d$-dimensional Gaussian measure. Chen et al. recently gave the first algorithm for this problem to run in $\mathrm{poly}(d,1/\epsilon)$ time when $k = O(1)$, where $\epsilon$ is the target error. More precisely, their algorithm runs in time $(d/\epsilon)^{\mathrm{quasipoly}(k)}$ and learns over multiple stages. Here we show that a much simpler one-stage version of their algorithm suffices, and moreover its runtime is only $(d k/\epsilon)^{O(k^2)}$.
inproceedings
Proceedings of Machine Learning Research
PMLR
2640-3498
chen24b
0
A faster and simpler algorithm for learning shallow networks
981
994
981-994
981
false
Chen, Sitan and Narayanan, Shyam
given family
Sitan
Chen
given family
Shyam
Narayanan
2024-06-30
Proceedings of Thirty Seventh Conference on Learning Theory
247
inproceedings
date-parts
2024
6
30