- https://www.nature.com/articles/s41588-024-01682-1
- https://www.nature.com/articles/s41588-024-01689-8
- https://www.biorxiv.org/content/10.1101/2023.04.26.538501v2.full
Past codes and models:
- SCENT (poisson regression) Github: https://github.com/immunogenomics/SCENT
- SCAR (regression) Github: https://github.com/snehamitra/SCARlinka
- NYU HPC: https://ood.hpc.nyu.edu/
- Open HPC Console
- Clone or Update the Repository
Clone
git clone https://github.com/ANNIZHENG/CDS-2024-Fall-Capstone.git
Update
git pull origin main
- Manually create data folder and upload data
The path should match what's specified in CDS-2024-Fall-Capstone/config/config_PBMC.yaml
cd CDS-2024-Fall-Capstone
mkdir data
- Send task to HPC Cluster
Direct Cluster Training
sbatch --time=48:00:00 run_snakemake.sh
Interactive Session
srun -t 1:00:00 --mem 100G --pty /bin/bash
bash run_snakemake.sh
- Useful command to clear some memory space
# Remove all created files from the pipeline
rm -rf resources && rm -rf results
rm -rf *.txt && rm -rf *.out && rm -rf *.err
rm -rf conda_cache && rm -rf .snakemake/conda/ && conda clean --all
# Create a specific env
conda env create -f <file_name>.yaml
conda activate <file_name>
conda deactivate
# Find top 20 files that take the most space
du -ah | sort -rh | head -n 20