-
Notifications
You must be signed in to change notification settings - Fork 14
Update uploading_to_cpg.md #123
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
add clarification on how to verify correct transfer to staging bucket
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks or adding this! I simplified and clarified a bit through suggestions.
- Total file size on origin: `du -sh --apparent-size PATH/TO/YOUR/FILES` | ||
- Number of files on origin: `find PATH/TO/YOUR/FILES -type f | wc -l` | ||
with | ||
- Total object size on the Staging bucket: `aws s3 ls s3://staging-cellpainting-gallery/$PROJECT_PREFIX/$SOURCE/$YOUR_FILES --summarize --human-readable --recursive | grep Total` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Total object size on the Staging bucket: `aws s3 ls s3://staging-cellpainting-gallery/$PROJECT_PREFIX/$SOURCE/$YOUR_FILES --summarize --human-readable --recursive | grep Total` |
@@ -105,4 +105,15 @@ Run your transfer commands to `staging-cellpainting-gallery`. | |||
|
|||
Once the transfers are complete, either you (Imaging Platform internal) or your data champion (if external) must verify the data transferred to `staging-cellpainting-gallery` is complete. | |||
(Currently this is done manually, though this will be programatic in the future.) | |||
|
|||
To verify if the transfer was succesful you (Imaging Platform internal) can compare: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To verify if the transfer was succesful you (Imaging Platform internal) can compare: | |
To verify if the transfer was successful, compare object counts between your source and destination. | |
Because of differences in the way file sizes are calculated between file systems and object storage, file size is not a reliable metric for comparison. |
@@ -105,4 +105,15 @@ Run your transfer commands to `staging-cellpainting-gallery`. | |||
|
|||
Once the transfers are complete, either you (Imaging Platform internal) or your data champion (if external) must verify the data transferred to `staging-cellpainting-gallery` is complete. | |||
(Currently this is done manually, though this will be programatic in the future.) | |||
|
|||
To verify if the transfer was succesful you (Imaging Platform internal) can compare: | |||
- Total file size on origin: `du -sh --apparent-size PATH/TO/YOUR/FILES` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Total file size on origin: `du -sh --apparent-size PATH/TO/YOUR/FILES` |
|
||
To verify if the transfer was succesful you (Imaging Platform internal) can compare: | ||
- Total file size on origin: `du -sh --apparent-size PATH/TO/YOUR/FILES` | ||
- Number of files on origin: `find PATH/TO/YOUR/FILES -type f | wc -l` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Number of files on origin: `find PATH/TO/YOUR/FILES -type f | wc -l` | |
- Number of files on origin (for a file system): `find PATH/TO/YOUR/FILES -type f | wc -l` |
add clarification on how to verify correct transfer to staging bucket