-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Name should be a required field on DRSObject #385
Comments
One of the main goals of DRS is to abstract the locations of files with a persistent identifier, such that a change in location or file name doesn't break existing references (e.g. we're enabling research reproducibility in a world of moving data). That being said, we need to be really careful about requiring and relying on a name separate from the explicit locations - because we're beginning to crack that abstraction a bit.
I think a file name hint for download clients is fine, if that's all |
Agree with Alex on the problem of putting data in the file name. And it's not just clinical Data Connect exists alongside DRS to handle this problem. There are plenty of examples we could use to illustrate this. See also this issue which is intended to deal with a specific and important part of what Michele raises. |
hi @Avantol13 and @ianfore , thanks for the feedback. I see two main aspects raised:
The PR #381, which has already received positive feedback and will be merged soon will resolve this problem: a user will be able to perform the
Using the
As for the suggesting on using the For example, this is from the DRS Server in production from the INCLUDE Project:
If instead I do import via just the id I have:
Does this make sense to you? |
I understand what you're saying but all the things I previously said still stand. Requiring an additional I sympathize with the user experience problem of file naming, but clients should just be flexible in their logic for naming things. First, try to get a filename from the The reality is that users should not be using the filename as a consistent and reliable source of data. The intention is that there is some other external source of information that makes these files useful by linking such data to the DRS ID, like @ianfore 's mention of Data Connect. I could perhaps get behind a |
Hi @Avantol13 , I think it's totally ok if the storage location changes, and the I do not understand the change in the filename instead. The DRS standard has the idea that once, an object is indexing, the content should not change. Do you have use-cases where the drs object stays the same, but the filename changes? |
Hi all,
I'm opening this issue to discuss the change for the field
name
on the DRSObject from optional to required.As per version 1.2 the field
name
is optional on theDRSObject
, while it is required on theContentsObject
.We suggest to make the
name
required also on theDRSObject
.In our experience, the name of the file carry a certain type of information that are extremely useful to the final user. who uses this information to operate on the data pointed by the DRSObject. The information usually are:
information related to the content of the file, identifying for example the sample_id or other type of "sequencing" metadata which the user can utilize to distinguish the files and operate on them correctly.
Extension of the file, which provides an implicit assertion on what is the content of the file, and gives the user the ability to choose the right tool to continue the analysis (for example a
.bam
vs.vcf
vs.dcm
informs the user what type of files they have, and what would be the next step that makes sense.) While the info on point 1 could be addressed via some ancillary metadata import, the information about the extension are usually not covered.The main reason to not have the
name
as required it is only if thename
is overloaded with controlled type metadata, which should be accessed only by authorized party. However this problem will not be present anymore as soon the #381 gets merged in, given that the OPTION call would be used to assess the authorization of the files, in case there is the need to enforce the authorization directly from the first GET.The text was updated successfully, but these errors were encountered: