Skip to content

Encoder n_bins_in doesn't update if changing fmin and fmax #2

@lamipaul

Description

@lamipaul

Hi, and thanks a lot for this really cool repo!

I am trying to train pesto on my own data, and I had to change fmin and fmax (having relatively short audio files, otherwise CQT kernels would not fit).
As a result, my CQT frames have a different size than expected (157 and 125 before and after cropping respectively, instead of 216).
This results in a crash when entering the layernorm of the Resnet encoder, because its input dimension seems fixed.
I could bypass this by manually setting model.encoder.n_bins_in=125, but I guess it would be nice to have it adapt automatically.

Really sorry if I understood something wrong and this is not a bug.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions