Add rate limit handling to embedders #425
Conversation
Just worried about the tests; if you have some time, could you explain? The other comments are marginal and non-blocking.
  def is_rate_limit_error(exception: Exception) -> bool:
-     """Check if an exception is a rate limit error from any LLM provider.
+     """Check if an exception is a rate limit error from any LLM provider or embedder.
Do you think we can move this file somewhere else now that it's not only used for LLMs?
    )
    embedder = OpenAIEmbeddings(api_key="my key")
    with pytest.raises(
        EmbeddingsGenerationError, match="Failed to generate embedding with OpenAI"
I don't understand this test: shouldn't this error be retried? I'm not sure the test captures that. I would add a check to make sure the function is called the expected number of times (see the sketch below).
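A hedged sketch of that kind of assertion; the import path, the patched method name (`_embed_query`), and the fake exception class are assumptions for illustration, not this PR's actual test code:

```python
from unittest import mock

# Import path is an assumption based on the classes used in the test above.
from neo4j_graphrag.embeddings import OpenAIEmbeddings


class FakeRateLimitError(Exception):
    """Stand-in for a provider rate limit error (must be one the handler recognizes)."""

    status_code = 429


def test_embedder_retries_on_rate_limit() -> None:
    # First call hits the rate limit, the retried call returns an embedding.
    embed_mock = mock.Mock(side_effect=[FakeRateLimitError("429"), [0.1, 0.2, 0.3]])

    # Patch whichever method actually performs the provider API call.
    with mock.patch.object(OpenAIEmbeddings, "_embed_query", embed_mock):
        embedder = OpenAIEmbeddings(api_key="my key")
        result = embedder.embed_query("hello")

    assert result == [0.1, 0.2, 0.3]
    # Two calls prove the rate-limited attempt was retried rather than re-raised.
    assert embed_mock.call_count == 2
```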
            self._rate_limit_handler = rate_limit_handler
        else:
            self._rate_limit_handler = DEFAULT_RATE_LIMIT_HANDLER
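For callers, the pattern above means a custom handler can be injected while everything else keeps the default. A usage sketch, assuming the constructor keyword is `rate_limit_handler` as in the diff and that a `RetryRateLimitHandler` (or equivalent) class is exposed by the rate limit module:

```python
# The import path and the RetryRateLimitHandler parameters are assumptions for illustration.
from neo4j_graphrag.embeddings import OpenAIEmbeddings
from neo4j_graphrag.llm.rate_limit import RetryRateLimitHandler

embedder = OpenAIEmbeddings(
    api_key="my key",
    # Override the default retry behaviour for stricter provider quotas.
    rate_limit_handler=RetryRateLimitHandler(max_attempts=5),
)
```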
We could consider having the `embed_query` method in the base class, where it handles the common retry logic and calls an `_embed_query` (or another name) implemented by the subclasses. That way, the subclass implementations don't have to change (just rename the method).

In short, in the base class:

    @rate_limit_handler
    def embed_query(self, query):
        try:
            return self._embed_query(query)
        except Exception as e:
            raise EmbeddingsGenerationError() from e

    @abstractmethod
    def _embed_query(self, query):
        ...

and in the children:

    # no need to add the decorator
    def _embed_query(self, query):
        # rest of the code

(Up to you if you want to do it this way; it's not a request for changes now.)
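For completeness, here is a self-contained sketch of that pattern with a toy retry decorator; every name in it (`RateLimitError`, `rate_limit_retry`, `_embed_query`) is an illustrative assumption rather than the library's API. One subtlety worth noting: `embed_query` has to let the rate limit error pass through unchanged, otherwise the decorator only ever sees `EmbeddingsGenerationError` and never retries, which ties back to the test question above.

```python
import time
from abc import ABC, abstractmethod
from functools import wraps


class RateLimitError(Exception):
    """Stand-in for a provider rate limit error."""


class EmbeddingsGenerationError(Exception):
    """Raised when an embedder fails to produce an embedding."""


def rate_limit_retry(max_attempts: int = 3, base_delay: float = 1.0):
    """Toy retry decorator standing in for the real rate limit handler."""

    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(max_attempts):
                try:
                    return func(*args, **kwargs)
                except RateLimitError:
                    if attempt == max_attempts - 1:
                        raise
                    time.sleep(base_delay * 2**attempt)  # exponential backoff

        return wrapper

    return decorator


class Embedder(ABC):
    @rate_limit_retry()
    def embed_query(self, text: str) -> list[float]:
        try:
            return self._embed_query(text)
        except RateLimitError:
            # Re-raise unchanged so the decorator above can see and retry it.
            raise
        except Exception as e:
            raise EmbeddingsGenerationError("Failed to generate embedding") from e

    @abstractmethod
    def _embed_query(self, text: str) -> list[float]:
        ...
```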
Description
This PR extends the rate limiting functionality (previously available only for LLMs) to all embedding providers, and ensures consistent error handling across them.
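In practice this means rate-limited embedder calls are retried automatically, and remaining failures surface as `EmbeddingsGenerationError`, so callers can treat every provider the same way. A usage sketch (import paths are assumptions based on the class names in the diff):

```python
from neo4j_graphrag.embeddings import OpenAIEmbeddings
from neo4j_graphrag.exceptions import EmbeddingsGenerationError

embedder = OpenAIEmbeddings(api_key="my key")
try:
    vector = embedder.embed_query("What does this PR change?")
except EmbeddingsGenerationError as exc:
    # Raised once the rate limit handler has exhausted its retries,
    # or when the failure is not a rate limit error at all.
    print(f"Embedding failed: {exc}")
```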
Type of Change
Complexity
Low
How Has This Been Tested?
Checklist
The following requirements should have been met (depending on the changes in the branch):