
fix: Improved semantic highlighting performance for huge files #828

Merged · 1 commit · Feb 19, 2025

Conversation

@FalsePattern (Contributor) commented Feb 10, 2025

I encountered a consistent stutter (2-3 second freezes) when working with huge generated files in Zig. After analysing with a profiler, I've pinpointed the source to the LSPSemanticTokensHighlightVisitor.highlightSemanticTokens method, which blocks the EDT while it adds every single HighlightInfo for the entire file to the holder, an expensive operation due to internal checks.
Additionally, for very large files (>50k lines), the ProgressManager.checkCanceled() call inside SemanticTokensData.highlight, along with the HighlightInfo creations, also became a major contributor (>30% of sampler time) to the total freeze duration.

This pull request attempts to resolve these issues using the following steps:

  1. HighlightInfo inside SemanticTokensData.highlight is replaced with a LazyHighlightInfo record, which stores the bare minimum information required to create an actual HighlightInfo on demand.
  2. semanticTokens.highlight inside the visitor is no longer passed holder::add directly; instead, the lazy highlight infos are stored in a lookup array.
  3. This lookup array is iterated over inside the visit() method for each leaf PSI element, which, for languages with existing PSI structures, prevents the IDE from freezing, because the platform internally pumps EDT events after every few hundred PSI elements.
  4. Because the highlight infos are no longer resolved and added inside the SemanticTokensData.highlight method, ProgressManager.checkCanceled() is only called once every 100 data elements, effectively nullifying its overhead while still checking frequently enough that cancellation is not noticeably delayed.
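The four steps above can be sketched as a small, self-contained model in plain Java. The LazyHighlightInfo record, its fields, and the String-returning resolve() here are simplified stand-ins for the PR's actual classes, which build real HighlightInfo objects against the IntelliJ API:

```java
import java.util.ArrayList;
import java.util.List;

public class LazyHighlightSketch {
    // Step 1: store only the bare minimum needed to build a real highlight later.
    // (Simplified stand-in for the PR's LazyHighlightInfo record.)
    public record LazyHighlightInfo(int start, int end, String colorKey) {
        public String resolve() {
            // The real code builds a HighlightInfo on demand here.
            return colorKey + "[" + start + "," + end + ")";
        }
    }

    public static void main(String[] args) {
        int fileLength = 40;
        // Step 2: instead of holder::add, populate a lookup array indexed by offset.
        LazyHighlightInfo[] lazyInfos = new LazyHighlightInfo[fileLength];
        int[][] tokens = {{0, 5}, {10, 14}, {20, 28}};
        for (int i = 0; i < tokens.length; i++) {
            // Step 4: check for cancellation only every 100th element.
            if (i % 100 == 0) { /* ProgressManager.checkCanceled() in the real code */ }
            lazyInfos[tokens[i][0]] =
                    new LazyHighlightInfo(tokens[i][0], tokens[i][1], "KEYWORD");
        }
        // Step 3: visit() iterates the slice covered by each leaf PSI element,
        // resolving lazily; the platform pumps EDT events between elements.
        List<String> resolved = new ArrayList<>();
        for (LazyHighlightInfo info : lazyInfos) {
            if (info != null) resolved.add(info.resolve());
        }
        System.out.println(resolved); // [KEYWORD[0,5), KEYWORD[10,14), KEYWORD[20,28)]
    }
}
```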

@FalsePattern (Contributor, author) commented Feb 10, 2025

The only case where this did not noticeably improve the overall performance is when the entire file is a single LeafElement (plaintext files with no plugin providing a lexer+parser for them).

@FalsePattern (Contributor, author):

Storing data in fields of the HighlightVisitor instance at the start of analyze(), calling action.run(), reading those fields in visit(), and then cleaning them up in a try/finally block seems to be a common pattern in many other highlighters in the IntelliJ IDEA codebase.
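A minimal model of that lifecycle in plain Java; the field and method names are illustrative stand-ins, not the actual HighlightVisitor API:

```java
import java.util.ArrayList;
import java.util.List;

public class VisitorLifecycleSketch {
    // Per-pass state: written in analyze(), read in visit(), cleared in finally.
    private String[] lazyInfos;
    private final List<String> applied = new ArrayList<>();

    public boolean analyze(Runnable action) {
        try {
            lazyInfos = new String[]{"KEYWORD", null, "STRING"}; // "initialize"
            action.run(); // the platform drives visit() calls from inside here
        } finally {
            lazyInfos = null; // cleanup: no stale state leaks between passes
        }
        return true;
    }

    public void visit(int offset) {
        // Reads the state set up in analyze().
        if (lazyInfos != null && lazyInfos[offset] != null) {
            applied.add(lazyInfos[offset]);
        }
    }

    public List<String> applied() { return applied; }

    public static void main(String[] args) {
        VisitorLifecycleSketch v = new VisitorLifecycleSketch();
        v.analyze(() -> { v.visit(0); v.visit(1); v.visit(2); });
        System.out.println(v.applied()); // [KEYWORD, STRING]
    }
}
```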

@FalsePattern (Contributor, author) commented Feb 10, 2025

Memory-wise, with a 2 MB generated Zig file (Vulkan bindings), the allocated lazyInfos array had 2 million elements (8 MB, assuming 4-byte JVM pointers), of which 144k were populated. With a worst-case 16-byte object header, plus 2 ints and an object reference in the record (+12 bytes, 28 total, rounded up to 32 bytes), that is about 4 MB on top.
In total that's about 12 MB for a 2 MB file, or roughly 6x the raw file size while analysis is running, after which it is immediately discarded.
In the absolute worst case with every element populated (every single character having a separate highlight info), it would be 72 MB of heap for a 2 MB file, about 36x the raw file size, but a range of 4-8x is realistic for most languages.
This should not cause any problems: IntelliJ applies syntax highlighting to files one by one, and the highlighter cleans up the allocated arrays in a try/finally block, so a leak can never happen.
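The arithmetic above can be restated as a runnable check. The element counts are the ones quoted in this comment, the 32-byte per-record figure is the worst-case estimate described, and megabytes are decimal:

```java
public class MemoryEstimate {
    public static void main(String[] args) {
        long slots = 2_000_000L;       // lazyInfos array length for the 2 MB file
        long pointerBytes = 4;         // compressed JVM pointers
        long arrayMB = slots * pointerBytes / 1_000_000;      // 8 MB for the array
        long populated = 144_000L;
        long recordBytes = 32;         // 16 B header + 2 ints + 1 ref, rounded up
        long recordsMB = populated * recordBytes / 1_000_000; // ~4 MB of records
        System.out.println("array: " + arrayMB + " MB, records: ~" + recordsMB
                + " MB, total: ~" + (arrayMB + recordsMB) + " MB");
        // Absolute worst case: every slot holds its own record.
        long worstMB = (slots * pointerBytes + slots * recordBytes) / 1_000_000;
        System.out.println("worst case: " + worstMB + " MB ("
                + worstMB / 2 + "x a 2 MB file)");
    }
}
```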

@angelozerr (Contributor):

Thanks so much @FalsePattern for your contribution.

I would like to create the 0.10.0 release tomorrow; can we wait for 0.11.0 to merge your PR?

@angelozerr (Contributor):

The only case where this did not noticeably improve the overall performance is when the entire file is a single LeafElement (plaintext files with no plugin providing a lexer+parser for them).

I think we should keep today's behavior for TextMate + the TEXT language. You can use SimpleLanguageUtils.isSupported(language) to manage that.

@FalsePattern (Contributor, author):

Thanks so much @FalsePattern for your contribution.

I would like to create the 0.10.0 release tomorrow; can we wait for 0.11.0 to merge your PR?

Yep, this is not urgent, it can wait until 0.11.0

@FalsePattern force-pushed the main branch 2 times, most recently from 7e413f9 to cdc1e80 on February 10, 2025 20:47
@FalsePattern (Contributor, author) commented Feb 10, 2025

I think we should keep today's behavior for TextMate + the TEXT language. You can use SimpleLanguageUtils.isSupported(language) to manage that.

Done, the latest push has an extra check so that simple languages skip the lazy-array-based highlighting and instead push highlight infos directly to holder.add like before, and visit() returns instantly.
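A sketch of what that dispatch might look like, with a stub standing in for SimpleLanguageUtils.isSupported; only that method name comes from the discussion, everything else here is illustrative:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

public class SimpleLanguageDispatch {
    // Stand-in for SimpleLanguageUtils.isSupported(language).
    static boolean isSimpleLanguage(String language) {
        return language.equals("TEXT") || language.equals("textmate");
    }

    static void highlight(String language, Consumer<String> holderAdd,
                          List<String> lazyInfos) {
        if (isSimpleLanguage(language)) {
            // Single giant leaf: the lazy lookup buys nothing,
            // so push highlight infos straight to the holder like before.
            holderAdd.accept("info");
        } else {
            // Real PSI tree: store lazily; visit() resolves per leaf element.
            lazyInfos.add("lazy-info");
        }
    }

    public static void main(String[] args) {
        List<String> direct = new ArrayList<>();
        List<String> lazy = new ArrayList<>();
        highlight("TEXT", direct::add, lazy);
        highlight("Zig", direct::add, lazy);
        System.out.println(direct.size() + " direct, " + lazy.size() + " lazy");
    }
}
```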

@angelozerr (Contributor):

@FalsePattern I wonder if it would be possible to add your Zig PsiFile to the LSP4IJ tests and write tests with it? Do you think it could be doable?

@FalsePattern (Contributor, author) commented Feb 11, 2025

@FalsePattern I wonder if it would be possible to add your Zig PsiFile to the LSP4IJ tests and write tests with it? Do you think it could be doable?

How do we do that without also pulling in the full parser from ZigBrains as a dependency?

@FalsePattern (Contributor, author) commented Feb 11, 2025

Also, there is another major bottleneck I found: the LSPInlayHintsProvider.doCollect method blocks inside a read action, and for that large file the LSP itself takes about half a second to complete, which also contributes to the stutter:
[profiler screenshot]
This is a much easier fix, though: just make the inlay hint future itself "pending" unconditionally if it's not finished, and rely on the caching behaviour of LSPInlayHintsSupport to provide the same future once refreshEditorFeatureWhenAllDone triggers another inlay hint pass. This change, combined with this pull request, completely fixed the large-file editing stutter for me.
[profiler screenshot]
I didn't add this change to the PR because it's a very dirty workaround.

Maybe we could add a boolean waitUntilDoneOrTimeout(future, psiFile, int timeoutMillis, List<CompletableFuture<?>> pendingFutures) method to CompletableFutures so that inlay hints that complete fast (<50ms) get resolved synchronously, but ones that take longer take the async path?
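A rough sketch of the proposed helper using only java.util.concurrent. The name and rough signature come from the suggestion above; the psiFile parameter is dropped because it is IntelliJ-specific, and this is not the actual LSP4IJ CompletableFutures API:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class CompletableFuturesSketch {
    /**
     * Waits up to timeoutMillis for the future. If it completes in time,
     * returns true (synchronous fast path). Otherwise the future is added
     * to pendingFutures so the caller can refresh when it completes later.
     */
    public static boolean waitUntilDoneOrTimeout(CompletableFuture<?> future,
                                                 int timeoutMillis,
                                                 List<CompletableFuture<?>> pendingFutures) {
        try {
            future.get(timeoutMillis, TimeUnit.MILLISECONDS);
            return true;
        } catch (TimeoutException e) {
            pendingFutures.add(future); // slow hint: take the async path
            return false;
        } catch (InterruptedException | ExecutionException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        List<CompletableFuture<?>> pending = new ArrayList<>();
        // Fast future: completes immediately, handled synchronously.
        System.out.println(waitUntilDoneOrTimeout(
                CompletableFuture.completedFuture("hints"), 50, pending)); // true
        // Slow future: never completes here, so it goes on the pending list.
        System.out.println(waitUntilDoneOrTimeout(
                new CompletableFuture<>(), 50, pending)); // false
        System.out.println("pending: " + pending.size()); // pending: 1
    }
}
```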

@angelozerr (Contributor):

Also, there is another major bottleneck I found: the LSPInlayHintsProvider.doCollect method blocks inside a read action, and for that large file the LSP itself takes about half a second to complete, which also contributes to the stutter:
[profiler screenshot]
This is a much easier fix, though: just make the inlay hint future itself "pending" unconditionally if it's not finished, and rely on the caching behaviour of LSPInlayHintsSupport to provide the same future once refreshEditorFeatureWhenAllDone triggers another inlay hint pass. This change, combined with this pull request, completely fixed the large-file editing stutter for me.
[profiler screenshot]
I didn't add this change to the PR because it's a very dirty workaround.

Maybe we could add a boolean waitUntilDoneOrTimeout(future, psiFile, int timeoutMillis, List<CompletableFuture<?>> pendingFutures) method to CompletableFutures so that inlay hints that complete fast (<50ms) get resolved synchronously, but ones that take longer take the async path?

Indeed, I have tried to avoid using timeouts. Please create a new issue for the inlay hint topic.

@angelozerr (Contributor):

@FalsePattern I wonder if it would be possible to add your Zig PsiFile to the LSP4IJ tests and write tests with it? Do you think it could be doable?

How do we do that without also pulling in the full parser from ZigBrains as a dependency?

No, my idea is just to copy/paste your parser inside lsp4ij.

We have no custom PsiFile in our tests, and the plugin and language server that you use seem very advanced.

We could also write other tests, like completion, based on your copied Zig PsiFile.

What do you think about that?

@FalsePattern (Contributor, author) commented Feb 11, 2025

Indeed, I have tried to avoid using timeouts. Please create a new issue for the inlay hint topic.

#835

No, my idea is just to copy/paste your parser inside lsp4ij.

We have no custom PsiFile in our tests, and the plugin and language server that you use seem very advanced.

We could also write other tests, like completion, based on your copied Zig PsiFile.

What do you think about that?

I'm fine with that. I'll dual-license the PSI lexer+parser code under EPLv2, and that way it can be used in lsp4ij freely.

The "large Zig file" I've been performance testing on is just an autogenerated Vulkan bindings file, generated from the Vulkan registry (MIT / Apache 2.0 licensed) using vulkan-zig (MIT licensed), so it should be fine to include that file as a whole, and I can provide JSON dumps of the LSP message traces from the project I use that file in for the test cases.

@angelozerr (Contributor):

Indeed, I have tried to avoid using timeouts. Please create a new issue for the inlay hint topic.

#835

No, my idea is just to copy/paste your parser inside lsp4ij.
We have no custom PsiFile in our tests, and the plugin and language server that you use seem very advanced.
We could also write other tests, like completion, based on your copied Zig PsiFile.
What do you think about that?

I'm fine with that. I'll dual-license the PSI lexer+parser code under EPLv2, and that way it can be used in lsp4ij freely.

That's super news! After that, we could add other tests (completion, CodeLens, etc.) with a real custom PsiFile.

The "large Zig file" I've been performance testing on is just an autogenerated Vulkan bindings file, generated from the Vulkan registry (MIT / Apache 2.0 licensed) using vulkan-zig (MIT licensed), so it should be fine to include that file as a whole, and I can provide JSON dumps of the LSP message traces from the project I use that file in for the test cases.

Great!

@angelozerr (Contributor):

@ericdallo could you please test this PR with your plugin?

@ericdallo (Contributor):

Will do

@ericdallo (Contributor):

@angelozerr I tested it and didn't notice any problems so far.

@angelozerr (Contributor):

Thanks! And do you see a performance improvement when your PsiFile is big?

@ericdallo (Contributor):

I didn't notice perf issues, but Clojure files are mostly small; it's rare to have Clojure files bigger than 2k lines.

@angelozerr (Contributor):

@ericdallo thanks for your feedback!

@CppCXY could you please test this PR, since I know you have a custom PsiFile, and give us feedback (whether you see any problems, and whether it improves performance with large files)? Thanks!

@CppCXY (Contributor) commented Feb 14, 2025

This is a version that enables the plugin with its custom PSI and custom rendering together with lsp4ij:
(performance1 recording)
This one disables the plugin, enabling only lsp4ij:
(performance2 recording)

It can be seen that the rendering is still slow. I think we might keep the previous rendering result until the new LSP result is returned and rendering is complete.

@CppCXY (Contributor) commented Feb 14, 2025

As a comparison, this is the performance of my language server in VS Code. As you can see, rendering is extremely fast, without the prolonged period of colorlessness seen in IntelliJ.
(performance3 recording)

@angelozerr (Contributor):

Thanks @CppCXY for your feedback. I think this PR avoids blocking the EDT with large files, but it doesn't improve the speed of the renderer.

Do you see any blocking issue without this PR?

@angelozerr (Contributor):

As a comparison, this is the performance of my language server in VS Code. As you can see, rendering is extremely fast, without the prolonged period of colorlessness seen in IntelliJ.

I wonder if in VS Code you consume SemanticTokensRange? If yes, that could explain the performance issue in IJ, because LSP4IJ doesn't support it (it supports only SemanticTokensFull).

@CppCXY (Contributor) commented Feb 14, 2025

I wonder if in VS Code you consume SemanticTokensRange? If yes, that could explain the performance issue in IJ, because LSP4IJ doesn't support it (it supports only SemanticTokensFull).

No, I don't implement SemanticTokensRange.

@CppCXY (Contributor) commented Feb 14, 2025

I wonder if in VS Code you consume SemanticTokensRange? If yes, that could explain the performance issue in IJ, because LSP4IJ doesn't support it (it supports only SemanticTokensFull).

No, I don't implement SemanticTokensRange.

But I remember that in VS Code, after editing code, it doesn't immediately send semanticTokens/full (there might be some debouncing), and newly entered characters first inherit the color of the character to their left.

@angelozerr (Contributor):

I wonder if in VS Code you consume SemanticTokensRange? If yes, that could explain the performance issue in IJ, because LSP4IJ doesn't support it (it supports only SemanticTokensFull).

No, I don't implement SemanticTokensRange.

But I remember that in VS Code, after editing code, it doesn't immediately send semanticTokens/full (there might be some debouncing), and newly entered characters first inherit the color of the character to their left.

OK, I think that is another issue (please create it). This PR seems to avoid freezing the IDE. Do you see that problem without this PR with large files?

@CppCXY (Contributor) commented Feb 14, 2025

I wonder if in VS Code you consume SemanticTokensRange? If yes, that could explain the performance issue in IJ, because LSP4IJ doesn't support it (it supports only SemanticTokensFull).

No, I don't implement SemanticTokensRange.

But I remember that in VS Code, after editing code, it doesn't immediately send semanticTokens/full (there might be some debouncing), and newly entered characters first inherit the color of the character to their left.

OK, I think that is another issue (please create it). This PR seems to avoid freezing the IDE. Do you see that problem without this PR with large files?

Regardless of whether this patch is applied, I don't see any difference.

@CppCXY (Contributor) commented Feb 14, 2025

I observed a lag: when I continuously hold down the Enter key, the IDE freezes for a long time, regardless of the version.
(performance1 recording)

@FalsePattern (Contributor, author):

I observed a lag: when I continuously hold down the Enter key, the IDE freezes for a long time, regardless of the version.

Disable inlay hints while you're testing this PR; they're also a lag source. I have a separate PR for those.

this.lazyInfos = highlightSemanticTokens(file, null);
this.holder = holder;
}
action.run();
Review comment (Contributor):

Is there any reason why action.run() is called at the end, although before it was called first?


public static HighlightInfo resolve(int start, int end, TextAttributesKey colorKey) {
return HighlightInfo
.newHighlightInfo(RAINBOW_ELEMENT)
Review comment (Contributor):

Why RAINBOW_ELEMENT?

Reply (Contributor):

Oh sorry, it was like this.

@angelozerr (Contributor):

Additionally, for very large files (>50k lines), the ProgressManager.checkCanceled() call inside SemanticTokensData.highlight, along with the HighlightInfo creations, also became a major contributor (>30% of sampler time) to the total freeze duration.

You mean that just calling ProgressManager.checkCanceled() can be expensive?

@angelozerr merged commit 8974e19 into redhat-developer:main on Feb 19, 2025
6 checks passed
@angelozerr (Contributor):

Great improvement. Thanks @FalsePattern !

@FalsePattern (Contributor, author) commented Feb 20, 2025

You mean that just calling ProgressManager.checkCanceled() can be expensive?

It's because it was called for every single integer in the semantic highlighting payload, which for that file was approximately 600k times, and at such high call counts even low-overhead function calls start to add up. The per-100-element check reduces that by two orders of magnitude while still checking frequently enough that a triggered cancel does not cause a noticeable stall.
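The per-100 throttling is just a modulo guard. A tiny model of the effect (checkCanceled() here is a counting stub, not the real ProgressManager):

```java
public class ThrottledCancelCheck {
    public static int checks = 0;

    // Stand-in for ProgressManager.checkCanceled(); just counts invocations.
    static void checkCanceled() { checks++; }

    public static void main(String[] args) {
        int tokenInts = 600_000; // roughly the payload size quoted for the 2 MB file
        for (int i = 0; i < tokenInts; i++) {
            if (i % 100 == 0) checkCanceled(); // throttled to every 100th element
        }
        // 600,000 candidate calls become 6,000: two orders of magnitude fewer,
        // yet a cancel is still noticed within ~100 elements of work.
        System.out.println(checks + " checks for " + tokenInts + " elements");
    }
}
```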
