Name	Name	Last commit message	Last commit date
parent directory ..
results	results
results_v2	results_v2
BLIND_DECODE_TEST_LOG.md	BLIND_DECODE_TEST_LOG.md
README.md	README.md
__init__.py	__init__.py
baseline.py	baseline.py
compare.py	compare.py
config.py	config.py
decoder.py	decoder.py
report.py	report.py
requirements.txt	requirements.txt
run_test.py	run_test.py
run_test_v2.py	run_test_v2.py
utils.py	utils.py
v2_baselines.py	v2_baselines.py
v2_compare.py	v2_compare.py
v2_config.py	v2_config.py
v2_generators.py	v2_generators.py
v2_report.py	v2_report.py

ZFD Blind Decode Falsification Test

Current Status: v2 PASSED

Test v2 demonstrates that the ZFD decoder's vocabulary mappings are specific to Voynich manuscript morphology. Real Voynich text produces significantly higher coherence than three non-Voynich baselines (synthetic EVA, character-shuffled, random Latin) through the same frozen pipeline.

The "degrees of freedom" criticism is empirically refuted.

Quick Start

# Run v2 vocabulary specificity test (full: 1500 baseline decodes)
python run_test_v2.py

# Quick mode (150 baseline decodes)
python run_test_v2.py --quick

# Run v1.1 position-independence test
python run_test.py --quick

Test v2 Results: Vocabulary Specificity (2026-02-04)

Folio	Real Coherence	Synthetic EVA	Char-Shuffled	Random Latin	Verdict
f10r	0.704	0.43 ± 0.10	0.53 ± 0.06	0.35 ± 0.03	DISCRIMINATING
f23v	0.765	0.42 ± 0.10	0.55 ± 0.09	0.34 ± 0.05	DISCRIMINATING
f47r	0.700	0.42 ± 0.10	0.45 ± 0.08	0.33 ± 0.05	DISCRIMINATING
f89r	0.694	0.55 ± 0.01	0.59 ± 0.01	0.41 ± 0.01	DISCRIMINATING
f101v	0.744	0.50 ± 0.08	0.58 ± 0.05	0.39 ± 0.03	DISCRIMINATING

Overall Verdict: PASS (5/5 folios discriminating)

Hierarchy confirmed: Real (~0.70) > Char-Shuffled (~0.55) > Synthetic EVA (~0.45) > Latin (~0.35)

Full report: results_v2/V2_VOCABULARY_SPECIFICITY_REPORT.md

Test v1.1 Results: Position Independence (2026-02-04)

Folio	Tokens	Known Ratio	Coherence	vs. Shuffled
f10r	89	41.6%	0.7043	Identical
f23v	83	57.8%	0.7655	Identical
f47r	82	39.0%	0.7001	Identical
f89r	387	38.3%	0.6938	Identical
f101v	208	52.4%	0.7442	Identical

Finding: Decoder is position-independent (processes tokens in isolation). Word-order shuffling has no effect on coherence because each token is decoded based on its morphological structure alone.

This is expected behavior for pharmaceutical shorthand, where each abbreviation decodes to its meaning regardless of position.

Full report: results/BLIND_DECODE_REPORT.md

Test History

Version	Date	Result	Finding
v1.0	2026-02-04 17:12	Bug	Tokenizer didn't handle dot-separated EVA words
v1.1	2026-02-04 17:20	Complete	Position-independent decoder confirmed
v2	2026-02-04	PASS	Vocabulary mappings are Voynich-specific

Full details: BLIND_DECODE_TEST_LOG.md

What These Tests Prove

v1.1 proved: The decoder processes tokens independently (bag-of-words). Word order doesn't matter.

v2 proved: The decoder's vocabulary mappings are specific to Voynich morphology. Non-Voynich input (synthetic EVA, character-shuffled words, Latin) produces significantly lower coherence.

Combined interpretation: The ZFD decoder is detecting morphological patterns specific to the Voynich manuscript. The "degrees of freedom" criticism is empirically refuted: the system cannot produce comparable output from arbitrary input.

Key Statement

"Test v2 asks whether the ZFD pipeline is specific to Voynich manuscript text or flexible enough to produce comparable output from any input. Three non-Voynich baselines were tested: synthetic EVA strings matching manuscript statistics, character-shuffled Voynich words destroying morphological patterns, and random medieval Latin pharmaceutical vocabulary. If real Voynich text produces significantly higher coherence than all three baselines through the same frozen pipeline, the decoder's vocabulary mappings are detecting structure specific to the manuscript."

Result: Real Voynich significantly outperforms all three baselines. Specificity confirmed.

Output Directories

results/ - v1.1 test outputs (position-independence)
results_v2/ - v2 test outputs (vocabulary specificity)

Reproducibility

All tests use fixed random seeds for deterministic results. Lexicon is checksummed at test start and verified at test end.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

ZFD Blind Decode Falsification Test

Current Status: v2 PASSED

Quick Start

Test v2 Results: Vocabulary Specificity (2026-02-04)

Test v1.1 Results: Position Independence (2026-02-04)

Test History

What These Tests Prove

Key Statement

Output Directories

Reproducibility

FilesExpand file tree

blind_decode_test

Directory actions

More options

Directory actions

More options

Latest commit

History

blind_decode_test

Folders and files

parent directory

README.md

ZFD Blind Decode Falsification Test

Current Status: v2 PASSED

Quick Start

Test v2 Results: Vocabulary Specificity (2026-02-04)

Test v1.1 Results: Position Independence (2026-02-04)

Test History

What These Tests Prove

Key Statement

Output Directories

Reproducibility