LLM-MindMap

Paper · Project Page

Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLMs
Zhen Xiong · Yujun Cai · Zhecheng Li · Yiwei Wang
Empirical Methods in Natural Language Processing (EMNLP) 2025

Conversion

This repository implements the analytical framework introduced in Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLMs. The toolkit builds a reasoning graph from raw chain-of-thought (CoT) traces, enabling quantitative analysis of model cognition beyond coarse token-level statistics.

Pipeline

  1. Unit Segmentation – split raw CoT transcripts into reasoning units at paragraph boundaries (\n\n).
  2. Logical Clustering – aggregate adjacent units into semantically coherent reasoning steps, then select the best clustering using intra-step coherence, step separation, and length-regularity criteria; a rough sketch of steps 1–2 appears after this list.
  3. Semantics Detection – estimate support/contradiction probabilities for every ordered pair of steps, using an adaptive sampling scheme and a dual-threshold consensus.
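
As a rough illustration of steps 1–2, the sketch below splits a transcript on blank lines and scores one candidate grouping of adjacent units with sentence embeddings. The helper names, the boundaries argument, and the simple cosine-based coherence score are illustrative assumptions, not the repository's exact criteria.

import numpy as np
from sentence_transformers import SentenceTransformer

def segment_units(cot_text: str) -> list[str]:
    # Step 1: split the raw CoT transcript into reasoning units at paragraph boundaries.
    return [u.strip() for u in cot_text.split("\n\n") if u.strip()]

def coherence_score(units: list[str], boundaries: list[int], embedder) -> float:
    # Illustrative stand-in for the intra-step coherence criterion:
    # average cosine similarity of adjacent units that fall inside the same step.
    emb = embedder.encode(units, normalize_embeddings=True)
    sims = []
    for i in range(len(units) - 1):
        if i + 1 not in boundaries:  # no step boundary between units i and i+1
            sims.append(float(np.dot(emb[i], emb[i + 1])))
    return float(np.mean(sims)) if sims else 0.0

embedder = SentenceTransformer("all-mpnet-base-v2")
units = segment_units(open("path/to/cot.txt").read())
print(coherence_score(units, boundaries=[3, 7], embedder=embedder))  # hypothetical step boundaries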

Usage

1. Install Dependencies

The default implementation uses Qwen/Qwen3-32B and the SentenceTransformer model all-mpnet-base-v2. Make sure you have sufficient compute (80 GB of GPU memory is recommended) and valid Hugging Face credentials.

pip install torch transformers accelerate sentence-transformers
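
As a quick sanity check that the dependencies and credentials are in place, the snippet below loads the two default models named above. The loading options are ordinary transformers/sentence-transformers usage, not flags specific to this repository.

from transformers import AutoModelForCausalLM, AutoTokenizer
from sentence_transformers import SentenceTransformer

# Embedding model used for clustering reasoning units.
embedder = SentenceTransformer("all-mpnet-base-v2")

# LLM used for semantics detection; expect to need the recommended ~80 GB of GPU memory.
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-32B")
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-32B", device_map="auto", torch_dtype="auto"
)
print("models loaded")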

2. Run MindMap

Run MindMap on any chain-of-thought transcript:

python mindmap.py --input path/to/cot.txt

Cached JSON payloads can optionally be supplied to bypass live LLM calls:

python mindmap.py --input path/to/trace.txt \
                  --cluster-json cached_cluster.json \
                  --semantics-json cached_semantics_*.json

MindMap prints the ordered reasoning steps, the computed graph metrics, and the signed edge confidences.

3. Graph Metrics

compute_metrics reports:

  • exploration_density – normalised edge density.
  • branching_ratio – fraction of steps with out-degree greater than one.
  • convergence_ratio – fraction of steps with in-degree greater than one.
  • linearity – proportion of steps with total degree ≤ 2.

These metrics provide structural signatures of the reasoning graph; a rough illustration of how they can be computed from a directed edge list follows.
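
A minimal sketch of these definitions, assuming the graph is given as a list of directed edges between step indices; the actual compute_metrics signature in mindmap.py may differ.

from collections import Counter

def compute_metrics(num_steps: int, edges: list[tuple[int, int]]) -> dict:
    # edges: directed (source_step, target_step) pairs from semantics detection.
    out_deg = Counter(src for src, _ in edges)
    in_deg = Counter(dst for _, dst in edges)
    max_edges = num_steps * (num_steps - 1)  # all ordered pairs of distinct steps
    return {
        # Normalised edge density over all possible ordered step pairs.
        "exploration_density": len(edges) / max_edges if max_edges else 0.0,
        # Fraction of steps with out-degree greater than one.
        "branching_ratio": sum(out_deg[s] > 1 for s in range(num_steps)) / num_steps,
        # Fraction of steps with in-degree greater than one.
        "convergence_ratio": sum(in_deg[s] > 1 for s in range(num_steps)) / num_steps,
        # Proportion of steps with total degree <= 2.
        "linearity": sum(out_deg[s] + in_deg[s] <= 2 for s in range(num_steps)) / num_steps,
    }

print(compute_metrics(4, [(0, 1), (1, 2), (1, 3), (2, 3)]))  # toy example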

Citation

@article{xiong2025mapping,
  title={Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLM},
  author={Xiong, Zhen and Cai, Yujun and Li, Zhecheng and Wang, Yiwei},
  journal={arXiv preprint arXiv:2505.13890},
  year={2025}
}
