Skip to content

keep_in_memory=False leads to the fact that dataset.get_vw_document() is almost unworkable #59

Open
@Alvant

Description

@Alvant

The method is too slow!

Do we really need dask.dataframe? Maybe better to store documents on disk as single files (and not as one big .csv)?

References:

Metadata

Metadata

Assignees

No one assigned

    Labels

    discussNot everything clear, further communication required

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions