As a continuation of #2377, we have a regression in parsing invalid URLs. Previously, urllib was much more liberal in processing URLs; now it rejects many more cases.
We use it to sanitize URLs, and html_parser is an example of a bot that relies on the liberal behavior in tests:
Stricter validation in urllib causes trouble
when processing invalid URLs. The correct solution
on our side is unclear at the moment, see certtools#2382
intelmq/intelmq/tests/bots/parsers/html_table/test_parser_column_split.py
Line 47 in 61c45ac
intelmq/intelmq/tests/bots/parsers/html_table/test_parser_column_split.py
Lines 73 to 80 in 61c45ac
In patched Python versions (e.g. 3.11.4), this URL is rejected. We need to either decide against allowing such URLs, or redesign our sanitization.
Temporarily, the test is skipped to unblock other work.
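To illustrate the choice described above, here is a minimal sketch of what "deciding against allowing such URLs" could look like in a sanitization helper. The function name `safe_hostname` is hypothetical and is not IntelMQ's actual code. The sketch assumes only that `urllib.parse.urlsplit` raises `ValueError` for inputs it refuses to parse; which inputs fall into that category differs between Python versions.

```python
from typing import Optional
from urllib.parse import urlsplit


def safe_hostname(url: str) -> Optional[str]:
    """Return the host part of ``url``, or None if urllib refuses to parse it.

    Hypothetical helper for illustration only, not part of IntelMQ.
    """
    try:
        # urlsplit raises ValueError for netlocs it considers invalid,
        # for example one with an unbalanced IPv6 bracket.
        return urlsplit(url).hostname
    except ValueError:
        # Treat the URL as unparseable instead of letting the error propagate.
        return None


print(safe_hostname("http://example.com/path"))
print(safe_hostname("http://[::1"))
```

Returning `None` makes the rejection explicit to the caller. The alternative mentioned in the issue, redesigning the sanitization so that it does not depend on urllib's validation, would need its own parsing logic and is not shown here.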