dict() overload insufficient #10013

imba-tjd · 2023-04-05T15:17:28Z

    # Next overload is for dict(string.split(sep) for string in iterable)
    # Cannot be Iterable[Sequence[_T]] or otherwise dict(["foo", "bar", "baz"]) is not an error
    @overload
    def __init__(self: dict[str, str], __iterable: Iterable[list[str]]) -> None: ...

This typing works with dict([['a', '1'], ['b', '2']]), but not dict([['a', 1], ['b', 2]])

Or

    @overload
    def __init__(self, __iterable: Iterable[tuple[_KT, _VT]]) -> None: ...
    @overload
    def __init__(self: dict[str, _VT], __iterable: Iterable[tuple[str, _VT]], **kwargs: _VT) -> None: ...

This works with dict([('a', 1), ('b', 2)]), but not dict([['a', 1], ['b', 2]])

The text was updated successfully, but these errors were encountered:

srittau · 2023-04-05T16:15:23Z

Currently typing does not support multi-type lists. See python/typing#592. Unfortunately, there is no way for us to fix/work around this in typeshed.

ods · 2023-04-18T15:30:11Z

It's strange to see it closed. According to docs

[…] the positional argument must be an iterable object. Each item in the iterable must itself be an iterable with exactly two objects.

It's OK if typing tools don't catch all the cases that are not possible to describe with current typing system. But explicitly disallowing valid cases is certainly a bug. The problem is not only with multi-type list, and even not lists only. The following case is valid too, but is not acceptable according to current annotations:

dict([iter([1, 2]), iter([2, 3])])

Akuli · 2023-04-18T17:04:00Z

Do you actually have a use case for dict([[1, 2], [3, 4]])? As the comment says, we currently support strings because passing an iterable of foo.split() to dict() is common (e.g. when parsing a text file that consists of key=value lines). If you want us to support something more general, please explain why you need it :)

As the comment says, we can't just accept an iterable of iterables, because then type checkers would not complain about dict(iterable_of_strings). We really want a type checker error for that. It's a somewhat common mistake, because iterables of strings are very common. For example, you might forget to .split() your strings when attempting dict(foo.split("=") for foo in blah).

JelleZijlstra · 2023-04-18T18:33:17Z

Yes, the trouble here is that it's hard to write the types so that they accept everything the runtime accepts, while still rejecting a reasonable set of common mistakes.

ods · 2023-04-19T08:01:52Z

Do you actually have a use case for dict([[1, 2], [3, 4]])?

Not exactly, it was just a simplest example of valid code with false positive complaint due to bug in typeshed. Here is real life example with SQLAlchemy:

result: CursorResult[tuple[str, some_type]] = await connection.execute(
    select(some_table.c.string_field, some_table.c.some_type_field)
)
dict(result.all())

Here CursorResult.all() returns a Sequence[Row[…]], and Row is an Iterable[Any].

If you want us to support something more general, please explain why you need it :)

Because for typing false negative is unfortunate, but ok, while false positive is a certain bug.

As the comment says, we can't just accept an iterable of iterables, because then type checkers would not complain about dict(iterable_of_strings).

But it MUST NOT complaint about dict(iterable_of_strings), because it's pretty valid both according to docs and implementation:

>>> dict(['ab', 'cd'])
{'a': 'b', 'c': 'd'}

Do you mean we can potentially use strings of different length? But typing is not expected to cover all possible runtime errors. So dict(['abc', 'cde']) is pretty valid from typing point of view, even if it causes an error when run.

imba-tjd · 2023-04-22T12:47:07Z

In my case, I was parsing HTTP Headers into dict

header_raw = b'''\
Host: example.com\r
User-Agent: curl/8.0.1\r
Accept: */*'''

result = dict(line.split(b': ') for line in header_raw.split(b'\r\n'))

print(result)

When using str rather than bytes, it can be inferred.

Not fixing this is not a big issue for me. I understand there are limitations.

Akuli · 2023-04-22T15:35:30Z

I added a similar overload with bytes and checked that the HTTP headers example works with the latest typeshed.

So dict(['abc', 'cde']) is pretty valid from typing point of view, even if it causes an error when run.

I like "practicality beats purity": it just isn't practical for type checkers to be happy with that. It's more likely a mistake than not, and IMO it's exactly the kind of mistake that people expect type checkers to point out.

srittau closed this as not planned Won't fix, can't repro, duplicate, stale Apr 5, 2023

ods mentioned this issue Apr 18, 2023

Over-restrictive requirements for iterable argument to dict python/mypy#15074

Closed

Akuli mentioned this issue Apr 22, 2023

Support dict(foo.split() for foo in bar) with bytes #10072

Merged

Akuli mentioned this issue Mar 6, 2024

Should dict() constructor overloads be more permissive? #11532

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dict() overload insufficient #10013

dict() overload insufficient #10013

imba-tjd commented Apr 5, 2023 •

edited

Loading

srittau commented Apr 5, 2023

ods commented Apr 18, 2023

Akuli commented Apr 18, 2023

JelleZijlstra commented Apr 18, 2023

ods commented Apr 19, 2023 •

edited

Loading

imba-tjd commented Apr 22, 2023 •

edited

Loading

Akuli commented Apr 22, 2023

dict() overload insufficient #10013

dict() overload insufficient #10013

Comments

imba-tjd commented Apr 5, 2023 • edited Loading

srittau commented Apr 5, 2023

ods commented Apr 18, 2023

Akuli commented Apr 18, 2023

JelleZijlstra commented Apr 18, 2023

ods commented Apr 19, 2023 • edited Loading

imba-tjd commented Apr 22, 2023 • edited Loading

Akuli commented Apr 22, 2023

imba-tjd commented Apr 5, 2023 •

edited

Loading

ods commented Apr 19, 2023 •

edited

Loading

imba-tjd commented Apr 22, 2023 •

edited

Loading