I suspect there is no logic for resampling dead neurons?
It seems the sparsity/dead_features metric never decrease and I can't find code which resample dead neurons
There is logic for sampling dead neurons. However, it's only for metrics and not to resample the dead neurons
If you are interested, you can see the WandB runs below
Note: gated_detach-16384 run variants use via_gate_reconstruction = pi_gate_act @ self.W_dec.detach() + self.b_dec.detach() to compute the auxiliary_reconstruction_loss. Meanwhile gated-16384 run variants use the unmodified version of SAELens discussed in #539
https://wandb.ai/jasonrichdarma/sonar_sae?nw=nwuserjasonrichdarmawan

I suspect there is no logic for resampling dead neurons?
It seems the
sparsity/dead_featuresmetric never decrease and I can't find code which resample dead neuronsThere is logic for sampling dead neurons. However, it's only for metrics and not to resample the dead neurons
If you are interested, you can see the WandB runs below
Note:
gated_detach-16384run variants usevia_gate_reconstruction = pi_gate_act @ self.W_dec.detach() + self.b_dec.detach()to compute theauxiliary_reconstruction_loss. Meanwhilegated-16384run variants use the unmodified version ofSAELensdiscussed in #539https://wandb.ai/jasonrichdarma/sonar_sae?nw=nwuserjasonrichdarmawan