Finish LLM text exporter #1417

Mpdreamz · 2025-06-23T12:45:46Z

Reorganize Elastic.Markdown so that code is grouped by purpose, not type.

I thought this was a prerequisite for LLM text output, using Markdig's roundtrip serializer as the base for our own. However, that relies heavily on parsing with trivia tracking, which we cannot do because it breaks list continuations. See #435.

We now manually parse includes and re-evaluate substitutions on the full included files. Adding the exporter adds little overhead on top of the HTML exporter.
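
To illustrate the idea of expanding includes by hand and then re-evaluating substitutions over the full text, here is a minimal Python sketch. It is not the exporter's actual code: the `:::{include} path` directive shape, the `{{key}}` substitution syntax, and the names `expand_includes`, `apply_substitutions`, and `export_markdown` are all assumptions made for the example.

```python
import re
from pathlib import Path

# Assumed directive shape: an opening ":::{include} <path>" line followed by a closing ":::" line.
INCLUDE_RE = re.compile(
    r"^:::\{include\}[ \t]+(?P<path>\S+)[ \t]*\n:::[ \t]*$", re.MULTILINE
)
# Assumed substitution syntax: {{key}}
SUB_RE = re.compile(r"\{\{\s*(?P<key>[\w.-]+)\s*\}\}")


def expand_includes(text: str, root: Path, depth: int = 0) -> str:
    """Replace every include directive with the (recursively expanded) file content."""
    if depth > 10:
        raise RecursionError("include nesting too deep, possible cycle")

    def inline(match: re.Match) -> str:
        included = (root / match.group("path")).read_text(encoding="utf-8")
        return expand_includes(included, root, depth + 1).rstrip("\n")

    return INCLUDE_RE.sub(inline, text)


def apply_substitutions(text: str, subs: dict) -> str:
    """Resolve {{key}} placeholders; unknown keys are left untouched."""
    return SUB_RE.sub(lambda m: subs.get(m.group("key"), m.group(0)), text)


def export_markdown(text: str, root: Path, subs: dict) -> str:
    """Expand includes first, then evaluate substitutions over the full result."""
    return apply_substitutions(expand_includes(text, root), subs)
```

Running substitutions after the includes are inlined means placeholders inside included files are resolved with the same values as the including page, which is what keeps the Markdown output in line with the rendered HTML.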

This emits a `filename.md` next to each `filename/index.html`.

In addition, it emits an `llm.zip` that can be used to download everything at once.
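
As a rough sketch of that output layout, the Python below maps each `filename/index.html` to a sibling `filename.md` and bundles every Markdown file under the output root into `llm.zip`. The function names and the choice to store paths relative to the output root are assumptions for illustration, not the exporter's implementation.

```python
import zipfile
from pathlib import Path


def markdown_path_for(html_path: Path) -> Path:
    """Map `<dir>/filename/index.html` to `<dir>/filename.md`."""
    if html_path.name != "index.html":
        raise ValueError(f"expected an index.html path, got {html_path}")
    folder = html_path.parent
    # Append ".md" to the folder name instead of using with_suffix(),
    # so a folder such as "v1.2" becomes "v1.2.md" rather than "v1.md".
    return folder.with_name(folder.name + ".md")


def write_llm_zip(output_root: Path, zip_name: str = "llm.zip") -> Path:
    """Bundle every .md file under output_root into a single archive."""
    zip_path = output_root / zip_name
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for md_file in sorted(output_root.rglob("*.md")):
            # Store entries relative to the output root, with forward slashes.
            archive.write(md_file, md_file.relative_to(output_root).as_posix())
    return zip_path
```

How the site's root `index.html` is handled is not covered by this sketch.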

theletterf · 2025-06-23T12:55:15Z

Niceeee! How does this work? Do we generate the final Markdown from an intermediate representation/AST? It's important that the final file we produce has the same content as the rendered HTML, that is, resolved substitutions, etc.

Mpdreamz · 2025-06-23T15:29:11Z

> Do we generate the final Markdown from an intermediate representation/AST?

Sadly, we do not. That was the initial plan, but it would be too time-consuming to implement due to a quirk in our parser's handling of TrackTrivia and loose list continuations.

> It's important that the final file we produce has the same content as the rendered HTML, that is, resolved substitutions, etc.

In the end we do get the same content. We might need to revisit this when we implement more dynamic {applies_to} output.

theletterf · 2025-06-23T16:14:14Z

I guess converting from the final HTML to Markdown would be too primitive / slow? I used that approach in the past on a Next.js project to generate the llms.txt file, and it wasn't too bad.

Mpdreamz · 2025-06-23T16:35:30Z

> I guess converting from the final HTML to Markdown would be too primitive / slow?

Yeah, potentially. It's also more labor-intensive to project everything back with proper indentation and so on. I would worry a lot about nested lists and the like.

Not closing the door on doing that, but what we have now is good enough.

Mpdreamz added 5 commits June 19, 2025 11:21

Reorganize Elastic.Markdown so that code is grouped by purpose not type

26e6343

LLMText exports resolves includes and substitutions

72e0dd3

ensure llm exporter writes output files

3482d44

tweak output paths of markdown files

f470203

Ensure we emit llm data as zip too

d453f6a

Mpdreamz requested a review from a team as a code owner June 23, 2025 12:45

Mpdreamz added the feature label Jun 23, 2025

Mpdreamz self-assigned this Jun 23, 2025

Mpdreamz changed the title ~~feature/llm text output~~ Finish LLM text exporter Jun 23, 2025

Merge remote-tracking branch 'origin/main' into feature/llm-text-output

709f1f9

reakaleek approved these changes Jun 24, 2025


Mpdreamz added 3 commits June 24, 2025 15:06

Merge remote-tracking branch 'origin/main' into feature/llm-text-output

71fc52f

blind windows test fix attempt

b4b45d6

Merge branch 'main' into feature/llm-text-output

88eea18

Mpdreamz enabled auto-merge (squash) June 24, 2025 13:21

Mpdreamz merged commit cb72d5e into main Jun 24, 2025
16 checks passed

Mpdreamz deleted the feature/llm-text-output branch June 24, 2025 13:28
