GraphQL Error: RunStepStatsQuery Fails Due to Message: 'NoneType' object has no attribute 'value'
#27180
Labels
type: bug
What's the issue?
We are running open-source Dagster 1.9.8 on Kubernetes. I have a graph that triggers a few hundred operations in total. It works fine at first, but around 4 hours into the job I can no longer check on its status. The UI returns the GraphQL error quoted in the title.
If I keep the run open for about 5 minutes, the view eventually populates.
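For context on the message itself: this is the text Python produces when `.value` is accessed on `None`, typically where an enum member was expected. The sketch below is not Dagster code and does not identify where the failure occurs; `StepStatus`, `status_label`, and `safe_status_label` are hypothetical names used only to show how that exact message can arise and how a `None` guard avoids it.

```python
from enum import Enum
from typing import Optional


class StepStatus(Enum):
    """Hypothetical status enum; not a Dagster class."""
    SUCCESS = "SUCCESS"
    FAILURE = "FAILURE"


def status_label(status: Optional[StepStatus]) -> str:
    # Assumes a status is always present; raises AttributeError if it is None.
    return status.value


def safe_status_label(status: Optional[StepStatus]) -> str:
    # Guarded variant: falls back to a placeholder when no status is available.
    return status.value if status is not None else "UNKNOWN"


try:
    status_label(None)
except AttributeError as exc:
    error_message = str(exc)

print(error_message)
print(safe_status_label(None))
```

Whether the query fails for this reason is an assumption on my part; the debug file would be needed to confirm which field is `None`.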
What did you expect to happen?
I do not believe a GraphQL error should be displayed here. I'm also unsure why the client cannot jump straight to the current step, rather than spending an extended amount of time loading all of the prior log and job data before populating the view.
How to reproduce?
I'm not sure how to reproduce this without replicating the job that causes the issue. Would it be best to supply the debug file from the job? It's 128 MB so far.
Dagster version
dagster, version 1.9.8
Deployment type
Dagster Helm chart
Deployment details
Dagster is installed via Helm on GKE with Autopilot enabled.
Additional information
No response
Message from the maintainers
Impacted by this issue? Give it a 👍! We factor engagement into prioritization.