Skip to content

Align Holmes NLP component coverage#6

Merged
mdheller merged 8 commits into
mainfrom
codex/watson-nlp-alignment
May 12, 2026
Merged

Align Holmes NLP component coverage#6
mdheller merged 8 commits into
mainfrom
codex/watson-nlp-alignment

Conversation

@mdheller

@mdheller mdheller commented May 8, 2026

Copy link
Copy Markdown
Member

Summary

Adds the first explicit Holmes NLP component alignment contract and expands the Holmes surface validator so the product spine covers lower-layer NLP families instead of only high-level investigation components.

This PR adds:

  • docs/NLP_COMPONENT_ALIGNMENT.md
  • explicit componentFamilies in examples/holmes-surface.json
  • explicit nlpTasks coverage for primitive, advanced, rule, classical ML, neural, transformer, task-model, graph, and governance lanes
  • stronger requiredPromotionEvidence fields for algorithm family, task contract, and latency/footprint evidence
  • validator checks for component families and NLP task coverage
  • README pointer to the alignment contract

Why

Holmes should absorb a disciplined NLP component taxonomy while remaining broader than a component NLP library. The target architecture is not a model zoo: it is a governed investigation fabric spanning primitives, extraction, classification, retrieval, evidence, graph conversion, policy, evals, and agent/tool orchestration.

Boundary

This PR makes the architecture enforceable at the Holmes contract layer. Runtime implementation and benchmarks belong next in SociOS-Linux/nlplab, with stable service graduation into SocioProphet/prophet-platform and pointer-backed retrieval through SocioProphet/sherlock-search.

Validation

Expected local validation:

make validate

The validator now requires:

  • product components
  • component families
  • NLP task coverage
  • promotion evidence fields
  • cross-repo integrations

Copy link
Copy Markdown
Member Author

Added Slash Topics training alignment to this PR.

Changes now make Holmes explicitly support Slash Topics as a downstream governed topic-model training surface, not merely a retrieval/indexing consumer:

  • README now includes a dedicated Slash Topics training role and layer-stack entry.
  • docs/NLP_COMPONENT_ALIGNMENT.md now defines Holmes-owned Slash Topics outputs: TopicSeedCandidate, TopicBoundaryEvidence, TopicLabelCandidate, TopicTaxonomyCandidate, SlashTopicTrainingRef, and TopicPackGenerationReceipt.
  • examples/holmes-surface.json now includes topic-model-training, topic-taxonomy-induction, topic-pack-generation, language.topic.v1/Propose, language.topic.v1/Train, slashTopicsTrainingRef, and the SocioProphet/slash-topics integration.
  • tools/validate_holmes.py now validates the Slash Topics training tasks, method families, promotion evidence, and integration key.

Boundary remains clear: Holmes produces evidence-bound training artifacts and model-training support; Slash Topics owns /topic pack semantics and policy membranes; Policy Fabric owns admission.

@mdheller mdheller merged commit c172b8f into main May 12, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant