title

section

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

volume

genre

issued

pdf

extras

A faster and simpler algorithm for learning shallow networks

Original Papers

We revisit the well-studied problem of learning a linear combination of $k$ ReLU activations given labeled examples drawn from the standard $d$-dimensional Gaussian measure. Chen et al. recently gave the first algorithm for this problem to run in $\mathrm{poly}(d,1/\epsilon)$ time when $k = O(1)$, where $\epsilon$ is the target error. More precisely, their algorithm runs in time $(d/\epsilon)^{\mathrm{quasipoly}(k)}$ and learns over multiple stages. Here we show that a much simpler one-stage version of their algorithm suffices, and moreover its runtime is only $(d k/\epsilon)^{O(k^2)}$.

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

chen24b

0

A faster and simpler algorithm for learning shallow networks

981

994

981-994

981

false

Chen, Sitan and Narayanan, Shyam

given	family
Sitan	Chen

given	family
Shyam	Narayanan

2024-06-30

Proceedings of Thirty Seventh Conference on Learning Theory

247

inproceedings

date-parts

2024

6

30

https://proceedings.mlr.press/v247/chen24b/chen24b.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2024-06-30-chen24b.md

2024-06-30-chen24b.md

Files

2024-06-30-chen24b.md

Latest commit

History

2024-06-30-chen24b.md

File metadata and controls