-
-
Notifications
You must be signed in to change notification settings - Fork 8.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rp2: Fix lost CYW43 WiFi events when using both cores. #16915
rp2: Fix lost CYW43 WiFi events when using both cores. #16915
Conversation
Code size report:
|
This comment was marked as outdated.
This comment was marked as outdated.
84f7744
to
efff02c
Compare
Thank you for delving deeper into this and detailing the cause, I didn't quite understand why what I was seeing was happening, but now I do. CYW43 never fails to surface the wildest bugs! |
This comment was marked as outdated.
This comment was marked as outdated.
efff02c
to
d24704a
Compare
New version pushed which keeps the level-triggered interrupt but hand-rolls a function to only ever enable it on CPU0. Have also a raised a feature request in pico-sdk to have this in the GPIO API: raspberrypi/pico-sdk#2354 |
0854938
to
6a72c58
Compare
The latest changes look really good. I will test them. |
6a72c58
to
069cd2f
Compare
I've done extensive tests of this PR:
I didn't see any issues or regressions. |
There's a very odd but predictable sequence of events that breaks Wi-Fi when using both cores: 1) CPU1 calls pendsv_suspend() - for example sleep() causes a softtimer node to be inserted, which calls pendsv_suspend(). 2) CYW43 sends wakeup IRQ. CPU0 GPIO IRQ handler schedules PendSV and disables the GPIO IRQ on CPU0, to re-enable after cyw43_poll() runs and completes. 3) CPU0 PendSV_Handler runs, sees pendsv is suspended, exits. 4) CPU1 calls pendsv_resume() and pendsv_resume() sees PendSV is pending and triggers it on CPU1. 5) CPU1 runs PendSV_Handler, runs cyw43_poll(), and at the end it re-enables the IRQ *but now on CPU1*. However CPU1 has GPIO IRQs disabled, so the CYW43 interrupt never runs again... The fix in this commit is to always enable/disable the interrupt on CPU0. This isn't supported by the pico-sdk, but it is supported by the hardware. Fixes issue micropython#16779. This work was funded through GitHub Sponsors. Signed-off-by: Angus Gratton <[email protected]>
A better indication of whether a cyw43 event is pending is the actual flag in the PendSV handler table. (If this fails, could also use the GPIO interrupt enabled register bit). This commit was needed of a previous version of the fix in the parent commit, but it turned out not strictly necessary for the current version. However, it's still a good clean up. This work was funded through GitHub Sponsors. Signed-off-by: Angus Gratton <[email protected]>
Changes: - Move setting of PendSV priority to pendsv_init(). - Call pendsv_init() from CPU1 as well, to ensure priority is the same. This work was funded through GitHub Sponsors. Signed-off-by: Angus Gratton <[email protected]>
069cd2f
to
35d4d2d
Compare
Thanks for splitting this up into a set of very clean, independent commits. |
Summary
Closes #16779. Thanks to @coencoensmeets for a clear and easy to reproduce bug report, and to @Gadgetoid for doing some deep digging to zero in on the root cause.
There's a very odd but predictable sequence of events that breaks Wi-Fi when using both cores:
pendsv_suspend()
- for example sleep() causes a softtimer node to be inserted, which callspendsv_suspend()
.cyw43_poll()
runs and completes.PendSV_Handler
runs, sees pendsv is suspended, exits.pendsv_resume()
andpendsv_resume()
sees PendSV is pending and triggers it on CPU1.PendSV_Handler
, runscyw43_poll()
, and at the end it re-enables the pin IRQ but now on CPU1.However CPU1 has the overall GPIO IRQ line disabled, so the CYW43 interrupt never runs again...
Fix is to specifically only enable the GPIO interrupt on CPU0. This isn't supported in pico-sdk but appears to be supported by the hardware.
Plus two follow-up commits:
cyw43_has_pending
global flag to instead be a check of the pendsv table. This was necessary for the previous version of this PR. Not necessary now, but seems like a worthwhile change.This work was funded through GitHub Sponsors.
Testing
Trade-offs and Alternatives
pendsv_schedule_dispatch()
runs on CPU0).pendsv_resume()
. However it might be easier to reason about overall.