
Can the dataset loading code used for training be made public? Or can you describe how the scripts in the datasets folder and the training datasets in the project are used? #4

@myprospection

Thank you very much for sharing your code. I am currently working on applying your approach to my own research, and I would like to gain a deeper understanding of the entire training process.

However, I am having difficulty reproducing the training pipeline, mainly because of issues with dataset loading. Even after carefully reading the accompanying paper, some points remain unclear to me.

My main goal in writing this message is to ask about the usage of the scripts in the datasets folder. I would like to know how these scripts are intended to be used, and whether the dataset loading logic used during training is publicly available or could be described in more detail. Additionally, I would appreciate any advice on how I could adapt the provided dataset scripts to construct a data loader for my own dataset that fits into your training framework.

To resolve these questions and move forward with my research efficiently, I would like to understand the complete training process from a data flow perspective: how the data gets from the dataset scripts into the training loop. This would greatly help me grasp the structure and logic of the pipeline.

I would truly appreciate any guidance or clarification you could provide. Thank you again for your valuable work!
