Tests take too long #235

wRAR · 2024-12-06T17:08:09Z

A full test run is now about 20-30 minutes which is a problem for testing changes. I tried to investigate it and so far I've found that a single spider run in tests takes 5-7 seconds here, if ScrapyZyteAPIDownloadHandler is enabled. I will continue the investigation later, and if we cannot fix this we may want to enable parallel test runs.

The text was updated successfully, but these errors were encountered:

wRAR · 2025-01-08T14:50:03Z

The reason for "a single spider run in tests takes 5-7 seconds" is that the first scrapy.core.engine.ExecutionEngine._next_request() doesn't do anything and the next one is only called after 5s (self.slot.heartbeat.start(5)). It doesn't do anything because by the time it's first executed, the engine isn't yet marked as running, because scrapy.core.engine.ExecutionEngine.start() waits until all engine_started handlers finish (and both scrapy-zyte-api download handlers have non-trivial async engine_started handlers, though that's likely not important), and my logging shows that the first _next_request() call is between they finish and self.running = True is executed. Which may be just an unfortunate order of coroutines executed, but I don't know if Scrapy is really expected to wait 5s before downloading the first request in certain cases so maybe there is a different underlying cause.

wRAR mentioned this issue Jan 14, 2025

Add pytest-xdist and enable it on CI. #240

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tests take too long #235

Tests take too long #235

wRAR commented Dec 6, 2024

wRAR commented Jan 8, 2025

Tests take too long #235

Tests take too long #235

Comments

wRAR commented Dec 6, 2024

wRAR commented Jan 8, 2025