Branch version1 is the code for the article "Gene-regulation modules in nonalcoholic fatty liver disease revealed by single-nucleus ATAC-seq."
Files are in R Markdown/Notebook (.Rmd) format. They are named 1_, 2_, ... in the order of execution. I recommend using RStudio for editing and running.
Questions? Please submit to GitHub Issues or e-mail fumihiko AT takeuchi DOT name
- See fields
sample_name
,age
andtreatment
for eachbiosample
in DDBJ (BioProject PRJDB13870)
- single-nucleus ATAC-seq
- BAM files produced by Cell Ranger are available from DDBJ (DRA DRA014511 within BioProject PRJDB13870)
- Start by running
1_unify_multiple_barcodes_for_single_droplet.Rmd
- bulk ATAC-seq
- FASTQ files are available from DDBJ (DRA DRA014458 within BioProject PRJDB13870)
- I used the ATAC-seq data processing workflow by Reske et al.
- single-nucleus ATAC-seq
- ArchR project files (*.zip; unzip before use) are available from figshare
- Start by running
2_ArchR_setting.Rmd
and then4_LSI_and_clustering_of_tile_matrix.Rmd
- bulk ATAC-seq
- BAM files (*.bam *.bai) are available from figshare
- The data is analyzed in
6_cell_type_composition.Rmd
- Run Use case 2.
- 5_LSI_and_clustering_of_peak_matrix.Rmd >
Clustering with taylored parameter >
Uniform Manifold Approximation and Projection (UMAP)
- Fig.1B
- 6_cell_type_composition.Rmd >
Refine and inspect cell type composition of bulk samples >
Cross check, bulk vs snATAC-seq
- Fig.1D
- 6_cell_type_composition.Rmd >
Refine and inspect cell type composition of bulk samples >
Test difference and plot cell type composition
- Fig.1E
- 7_GeneScoreMatrix.Rmd >
QC
- Fig.1C
- 7_GeneScoreMatrix.Rmd >
Compare between conditions
- Table S2, Dataset S1
- 7_GeneScoreMatrix.Rmd >
Cross-check with bulk differential gene expression experiments
- Fig.S1
- 9_GSEA.Rmd >
Summarize results across tgt/bgd combinations
- Fig.2 (The R code outputs result for each gene set. The heatmap was drawn using Excel.)
- 10_GRN.Rmd >
Multiple motifs-to-one gene analysis >
Nonnegative matrix factorization of regulator-regulatee matrix
- Fig.3A, Fig.4A, Fig.5A, Fig.6A
- 10_GRN.Rmd >
Multiple motifs-to-one gene analysis >
Hyperparameters for the discovery of factors (modules) of TF regulation
- Fig.S3
- 10_GRN.Rmd >
Multiple motifs-to-one gene analysis >
Inspect enriched gene sets
- Fig.3B, Fig.4B, Fig.5B, Fig.6B
- 10_GRN.Rmd >
Activity score and co-regulation of a gene set >
Compute gene set activity score
- Fig.7 (top panel), Fig.S2 (top panel)
- 10_GRN.Rmd >
Activity score and co-regulation of a gene set >
Core genes as those central in co-regulation and protein-protein interaction
- Fig.7 (middle & bottom panels), Fig.S2 (middle & bottom panels)
Please cite