Skip to content

ANNIZHENG/CDS-2024-Fall-Capstone

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

230 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CDS-2024-Fall-Capstone

References

  1. https://www.nature.com/articles/s41588-024-01682-1
  2. https://www.nature.com/articles/s41588-024-01689-8
  3. https://www.biorxiv.org/content/10.1101/2023.04.26.538501v2.full

Past codes and models:

  1. SCENT (poisson regression) Github: https://github.com/immunogenomics/SCENT
  2. SCAR (regression) Github: https://github.com/snehamitra/SCARlinka

Environment Set-Up

  1. NYU HPC: https://ood.hpc.nyu.edu/
  2. Open HPC Console
  3. Clone or Update the Repository

Clone

git clone https://github.com/ANNIZHENG/CDS-2024-Fall-Capstone.git

Update

git pull origin main
  1. Manually create data folder and upload data

The path should match what's specified in CDS-2024-Fall-Capstone/config/config_PBMC.yaml

cd CDS-2024-Fall-Capstone
mkdir data
  1. Send task to HPC Cluster

Direct Cluster Training

sbatch --time=48:00:00 run_snakemake.sh

Interactive Session

srun -t 1:00:00 --mem 100G --pty /bin/bash
bash run_snakemake.sh
  1. Useful command to clear some memory space
# Remove all created files from the pipeline
rm -rf resources && rm -rf results 
rm -rf *.txt && rm -rf *.out && rm -rf *.err
rm -rf conda_cache && rm -rf .snakemake/conda/ && conda clean --all

# Create a specific env
conda env create -f <file_name>.yaml
conda activate <file_name>
conda deactivate

# Find top 20 files that take the most space
du -ah | sort -rh | head -n 20

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors