Skip to content

[Question] Why SAETrainer do not resample dead neurons? #543

@jasonrichdarmawan

Description

@jasonrichdarmawan

I suspect there is no logic for resampling dead neurons?

It seems the sparsity/dead_features metric never decrease and I can't find code which resample dead neurons

There is logic for sampling dead neurons. However, it's only for metrics and not to resample the dead neurons

If you are interested, you can see the WandB runs below

Note: gated_detach-16384 run variants use via_gate_reconstruction = pi_gate_act @ self.W_dec.detach() + self.b_dec.detach() to compute the auxiliary_reconstruction_loss. Meanwhile gated-16384 run variants use the unmodified version of SAELens discussed in #539

https://wandb.ai/jasonrichdarma/sonar_sae?nw=nwuserjasonrichdarmawan

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions