Display original deposited (rather than ingested) version of tabular files #925

@isaacpratt

This feature request follows up on a previous discussion going back to 2021 (IQSS/dataverse#7956), and I am copying text from there.

Overview of the Feature Request
When an ingestable tabular file is deposited (.xlsx, .sav, .dta), the default download format (and the displayed file extension) is the ingested .tab version of the file. The original file format is only available from the File access menu.
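
For API users, the same split between the ingested and the deposited file can be illustrated with the Data Access API, which accepts `format=original` to return the file as deposited. The snippet below is only a minimal sketch of how the two download URLs differ; the server address and file id are hypothetical placeholders, and `datafile_url` is an illustrative helper, not part of Dataverse.

```python
from urllib.parse import urlencode


def datafile_url(base_url: str, file_id: int, original: bool = False) -> str:
    """Build a Data Access API download URL for a data file.

    Without parameters the API serves the default version of the file
    (the ingested .tab version for tabular files); with format=original
    it serves the file as deposited.
    """
    url = f"{base_url.rstrip('/')}/api/access/datafile/{file_id}"
    if original:
        url += "?" + urlencode({"format": "original"})
    return url


# Hypothetical server and file id, for illustration only.
print(datafile_url("https://dataverse.example.org", 42))
print(datafile_url("https://dataverse.example.org", 42, original=True))
```

The request in this issue amounts to swapping which of these two versions is treated as the default in the user interface.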

I suggest that the deposited file format should be the default download format, with .tab available through the File access menu.

I would also suggest changing the drop-down menu text to note that the Tab-Delimited and RData formats are automatic conversions, so that users are aware these files might be missing some formatting that is present in the original file.

Finally, I suggest that Dataverse use .tsv instead of .tab, as it is a more commonly recognized extension for tab-delimited files (see IQSS/dataverse#6330).

What kind of user is the feature intended for?
(Example users roles: API User, Curator, Depositor, Guest, Superuser, Sysadmin)

What inspired the request?

IQSS/dataverse#7956 provided several good reasons deposited file formats are preferable:

  1. The default display of the ingested file is confusing for depositors and users
  2. Frequently, deposit formats are richer than the extracted .tab. E.g., Excel files may have additional rich text formatting like colour, which makes them easier to read than their plain text counterparts
  3. In some cases, ingest can cause data loss (e.g. for Excel files with multiple tabs, undesirable as those may be).

Defaulting to the deposited format somewhat mitigates this, even though it is still problematic.
