ACA-Py Test Sets Failing to start...usually #896
Comments
@ianco -- could you look into this? This is pretty high priority -- not having the tests running at all is bad.
on it
I think the ngrok error is a red herring. Ngrok changed their model a while ago to require a "token" - which we don't have - we should probably remove ngrok from the tails startup ...
Agreed - since it gives the same error on success or failure, it's not likely the issue. Looking forward to determining what it is!
Re-running the test now. I started to yesterday, but stopped because of GitHub issues. I'll see if anything pops up in that.
I'll update the manage script to dump out the aca-py logs if the agent fails to start. (I'm not having any luck reproducing the error)
I restarted a couple of the tests (aca-py aip10 and aip20) and they are both working :-S I'll check tomorrow to see if any of the "auto" runs failed.
I ran aip20 today, and it did run -- w00t. However, a number of the tests failed. I'm re-running locally, but pretty sure I ran them yesterday and all passed. Weird...
I'm seeing this error:
I think that is fine. It’s not crucial. Are those errors showing up in the failed tests? It’s odd those are failing in the GHA runs, but working locally. I’m trying to understand why redis would be used in some tests in the runset, but not all or none.
I don't have a lot of experience with this project but I remember it had something for testing the redis events plugin. I don't think this is the right place for that testing, and the plugin repo is also testing it. Perhaps that whole thing should be removed from here.
The redis update didn't fix the issue! (Just a "redis" herring.) It looks like the agent is starting but then the backchannel either can't connect or times out. Back to the drawing board :-(
OK I managed to reproduce the issue locally (just by fluke I think). It looks like at some point on startup the aca-py agent just "hangs" and then the startup times out. Might be a threading issue where aca-py "yields" and then never gets control back?
OK the agent is just getting frozen when starting up, here is a log from my local this morning:
Notice the agent isn't doing anything from
This is still happening sporadically in the GitHub test runs, however this morning on my local it's happening consistently and I can't run any tests :-( I found this link: https://thraxil.org/users/anders/posts/2008/03/13/Subprocess-Hanging-PIPE-is-your-enemy/ (and a couple of Stack Overflow comments that were similar) Basically there's a known bug with I'll take a look, since this is blocking me right now ...
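For anyone following along, here is a minimal sketch of the kind of hang the linked post describes. It is an illustration only: it assumes the problem is a child process started with `stdout=subprocess.PIPE` whose output is never read, which is not confirmed for the test harness in this thread. The helper names (`run_risky`, `run_draining`, `run_to_file`) and the stand-in child command are hypothetical and are not taken from the harness code.

```python
import subprocess
import sys
import tempfile

# Hypothetical stand-in for a chatty agent: writes about 1 MB to stdout,
# which is more than a typical OS pipe buffer holds.
CHILD_CODE = "import sys; sys.stdout.write('x' * 1_000_000); sys.stdout.flush()"
CHILD_CMD = [sys.executable, "-c", CHILD_CODE]


def run_risky(timeout: float = 2.0) -> bool:
    """Start the child with a pipe but never read it while waiting.

    Returns True if the child exited within the timeout, False if it
    appeared to hang (it is then killed and cleaned up).
    """
    proc = subprocess.Popen(CHILD_CMD, stdout=subprocess.PIPE)
    try:
        proc.wait(timeout=timeout)  # nothing drains the pipe here
        return True
    except subprocess.TimeoutExpired:
        proc.kill()
        proc.communicate()  # drain leftover output and reap the child
        return False


def run_draining(timeout: float = 30.0) -> int:
    """Same pipe, but communicate() reads the output while waiting."""
    proc = subprocess.Popen(CHILD_CMD, stdout=subprocess.PIPE)
    out, _ = proc.communicate(timeout=timeout)
    return len(out)


def run_to_file(timeout: float = 30.0) -> int:
    """Avoid the pipe entirely by sending output to a temporary file."""
    with tempfile.TemporaryFile() as log:
        proc = subprocess.Popen(CHILD_CMD, stdout=log)
        proc.wait(timeout=timeout)
        log.seek(0)
        return len(log.read())


if __name__ == "__main__":
    print("risky run finished:", run_risky())
    print("bytes read via communicate():", run_draining())
    print("bytes written to file:", run_to_file())
```

If the child writes more than the pipe buffer can hold and nobody reads it, the child blocks on its write and looks frozen from the outside, which would match a startup that stalls and then times out. Draining the pipe or redirecting output to a file are the two usual ways around it.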
Using
Recently (since Dec. 20) the ACA-Py tests have been failing because the agents are not starting. Running the tests locally works fine -- agents run and all ACA-Py tests pass.
When I first saw this, I thought it was the unpinned poetry update that broke ACA-Py briefly, but I see that is not the issue. When I look at the logs from the runs, I see that ngrok gives errors (see attached). However, I checked a successful run from a few days ago (ACA-Py aip10 tests) and the ngrok log looked the same.
docker-ngrok-tails-server-1.log