Proposed - Move tempDir and chunkDir to scratch space #16

@milechin

Hi @seamore-z

You might get some speedup by moving tempDir and chunkDir off the shared file system and onto the compute node's local scratch space. Heavy I/O on local scratch will be faster and more consistent than accessing the chunks through the file system. The change needs to be made in MSLSP_runTile_SCC.sh, in the code block below:

tempDir="${workDir}${tile}/temp/"
chunkDir="${workDir}${tile}/imageChunks/"
phenDir="${workDir}${tile}/phenoMetrics/"

The updated code would look like this. Note that ${workDir} is replaced with ${TMPDIR}/ for tempDir and chunkDir, while phenDir is unchanged:

tempDir="${TMPDIR}/${tile}/temp/"
chunkDir="${TMPDIR}/${tile}/imageChunks/"
phenDir="${workDir}${tile}/phenoMetrics/"
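
In case it helps, here is a minimal sketch of how that block could guard against TMPDIR being unset (for example when the script is run outside the scheduler) and make sure the directories exist. The default values for workDir and tile and the scratchBase variable are placeholders I made up for illustration, and the script may already create these directories elsewhere, so treat this as a suggestion rather than a drop-in patch:

```shell
# Placeholder values for illustration only; the real script sets these elsewhere.
workDir="${workDir:-/tmp/mslsp_work/}"
tile="${tile:-exampleTile}"

# Use node-local scratch when TMPDIR is set; otherwise fall back to workDir
# (with its trailing slash stripped so the paths below are built consistently).
scratchBase="${TMPDIR:-${workDir%/}}"

tempDir="${scratchBase}/${tile}/temp/"
chunkDir="${scratchBase}/${tile}/imageChunks/"
phenDir="${workDir}${tile}/phenoMetrics/"

# Create the directories if they do not exist yet.
mkdir -p "$tempDir" "$chunkDir" "$phenDir"
```

With the fallback, the same script behaves as before when no scratch directory is available.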

A couple of things to note:

  1. These files will be deleted when the job finishes, so if you want to keep them, you will need to copy them to the file system before the job ends. Since these look like temporary files, I am assuming it is OK for them to be deleted.
  2. There is a limit on how much storage is available on a node's local scratch disk. Many of the nodes have 885 GB dedicated to scratch storage. How much storage do you estimate tempDir and chunkDir will require for a single tile run?
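
On the storage limit, one option is to check the free space on scratch before the tile run starts. The sketch below is only an illustration: free_gb and check_scratch are hypothetical helper names, and requiredGB is a placeholder until there is a real per-tile estimate.

```shell
# Hypothetical helper: free space, in whole GB, on the file system holding a path.
# df -Pk prints one POSIX-format line per file system in 1024-byte blocks;
# column 4 is the available space.
free_gb() {
    df -Pk "$1" | awk 'NR==2 { print int($4 / 1024 / 1024) }'
}

# Hypothetical helper: succeed only if the path has at least the required GB free.
check_scratch() {
    if [ "$(free_gb "$1")" -lt "$2" ]; then
        echo "Only $(free_gb "$1") GB free in $1, need $2 GB" >&2
        return 1
    fi
}

requiredGB=0   # placeholder; replace with the estimated per-tile requirement
check_scratch "${TMPDIR:-/tmp}" "$requiredGB"
```

To get the estimate itself, running `du -sh "$tempDir" "$chunkDir"` at the end of a single trial tile would show how much space the two directories actually used.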
