LLM Contextizer

A zero-dependency Python CLI for turning a codebase into an LLM-ready text snapshot.

LLM Contextizer scans a project directory, prints a readable ASCII tree, and then emits a single concatenated text output of file contents. The result is designed for copy/paste into ChatGPT, Claude, Gemini, or any other LLM workflow where you need a fast, deterministic snapshot of a repository.

At a glance

Runtime: standard-library-only Python CLI
Input: any local project directory
Output: ASCII tree followed by concatenated file contents
Safety model: default ignore rules, truncation for log-like files, and self-inclusion protection
Configuration: optional per-project .llmignore
Citation: Zenodo archival DOI available for research use

python3 src/contextizer.py /path/to/project > context.txt

Why this exists

LLMs have limited context windows, and real repositories contain:

dependency folders (node_modules, venv, etc.)
build artifacts, caches, binaries
large data and log files that explode token budgets

This tool aims to be:

portable: standard library only
deterministic: stable ordering makes runs predictable
safe-by-default: avoids common footguns
LLM-friendly: tree first, then contents

Key features

Project tree visualization
- Prints an ASCII directory tree before the content dump.
Noise filtering by default
- Skips common junk folders and binary extensions such as .git, node_modules, .venv, images, and archives.
Truncation for large or log-like files
- Truncates selected file types such as .csv, .log, .tsv, .jsonl, and .sql.
Per-project .llmignore
- Supports lightweight per-project exclusions without introducing extra dependencies.
Output safety
- Detects redirected output files inside the scanned root and excludes them to avoid self-inclusion.

Installation

Clone the repository. There are no runtime dependencies.

git clone https://github.com/ulissesflores/llm-contextizer.git
cd llm-contextizer

Optional: create a shell alias for faster local use.

alias llmctx='python3 /absolute/path/to/llm-contextizer/src/contextizer.py'

Quick start

python3 src/contextizer.py . > context.txt

Run against the current directory:

python3 src/contextizer.py
# or, with the alias:
llmctx

Target another directory:

python3 src/contextizer.py ../some-project

Pipe to clipboard:

python3 src/contextizer.py . | pbcopy   # macOS
python3 src/contextizer.py . | xclip    # Linux

Output format

The output has two parts:

PROJECT STRUCTURE
PROJECT FILE CONTENTS with file separators and relative paths

This consistent structure helps an LLM understand the repository before reading the code.

Configuration

`.llmignore`

Create a .llmignore file at the project root you are scanning.

Supported syntax is intentionally minimal:

dir/ ignores a directory by name
*.ext ignores a file extension
filename ignores a specific file name

Example:

# directories (by name)
dist/
build/
coverage/
__pycache__/

# secrets
.env
.env.*
*.pem
*.key

# large/noisy
*.log
*.csv
*.tsv
*.jsonl

Tip: the repository includes a ready-to-copy template in .llmignore.example.

Architectural decisions

Core decisions are documented under docs/adr/:

ADR 0001: Zero Dependencies Policy
ADR 0002: Output Safety (Prevent self-inclusion on stdout redirect)

Versioning

This project follows Semantic Versioning and maintains a human-readable changelog in CHANGELOG.md.

Patch: bug fixes (0.1.x)
Minor: backwards-compatible feature additions (0.x.0)
Major: breaking changes (x.0.0)

Contributing

Contributions are welcome.

Fork the repository
Create a branch from main
Keep changes focused
Add or update docs when behavior changes
Open a PR

📜 Citation

If you use this software in academic or research contexts, cite the Zenodo archival record below.

The repository continues beyond the archived runtime release through documentation and metadata maintenance, but the DOI remains the stable scholarly reference point.

Archival DOI: https://doi.org/10.5281/zenodo.18343438
Archival runtime release: v0.1.1
Latest GitHub release carrying the DOI reference: v0.1.2

BibTeX

@software{flores_2026_llm_contextizer,
  author       = {Flores, Carlos Ulisses},
  title        = {LLM Contextizer},
  year         = {2026},
  version      = {v0.1.1},
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.18343438},
  url          = {https://doi.org/10.5281/zenodo.18343438}
}

APA

Flores, C. U. (2026). LLM Contextizer (v0.1.1) [Software]. Zenodo. https://doi.org/10.5281/zenodo.18343438

For reproducibility and scholarly reference, prefer the Zenodo DOI rather than an unversioned repository URL.

License

Licensed under the Apache License 2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
docs		docs
src		src
.gitignore		.gitignore
.llmignore.example		.llmignore.example
.zenodo.json		.zenodo.json
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
codemeta.json		codemeta.json
example.txt		example.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Contextizer

At a glance

Why this exists

Key features

Installation

Quick start

Output format

Configuration

`.llmignore`

Architectural decisions

Versioning

Contributing

📜 Citation

BibTeX

APA

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM Contextizer

At a glance

Why this exists

Key features

Installation

Quick start

Output format

Configuration

.llmignore

Architectural decisions

Versioning

Contributing

📜 Citation

BibTeX

APA

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`.llmignore`

Packages