Skip to content

Enquiry about the experiment setting. #9

@KingGZX

Description

@KingGZX

I have a question regarding the bash files in the EMoE/Language/scripts folder. It appears that you applied MoEfication to only one [10th] layer of the model. Additionally, since you set expert_repeat to be greater than or equal to num_experts, it seems that each expert is initialized with the original weights. As a result, the dimensionality isn't effectively reduced, correct?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions