Fix #60766:.map,.apply would convert element type for extension array #61396

pedromfdiogo · 2025-05-03T21:54:02Z

closes BUG: .map & .apply would convert element type for extension array. #60766
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/v3.0.0.rst file if fixing a bug or adding a new feature.

The Int32Dtype type allows representing integers with support for null values (pd.NA). However, when using .map(f) or .apply(f), the elements passed to f are converted to float64, and pd.NA is transformed into np.nan.

This happens because .map() and .apply() internally use numpy, which automatically converts the data to float64, even when the original type is Int32Dtype.

The fix (just remove the method to_numpy()) ensures that when using .map() or .apply(), the elements in the series retain their original type (Int32, Float64, boolean, etc.), preventing unnecessary conversions to float64 and ensuring that pd.NA remains correctly handled.

…sion array. The Int32Dtype type allows representing integers with support for null values (pd.NA). However, when using .map(f) or .apply(f), the elements passed to f are converted to float64, and pd.NA is transformed into np.nan. This happens because .map() and .apply() internally use numpy, which automatically converts the data to float64, even when the original type is Int32Dtype. The fix (just remove the method to_numpy()) ensures that when using .map() or .apply(), the elements in the series retain their original type (Int32, Float64, boolean, etc.), preventing unnecessary conversions to float64 and ensuring that pd.NA remains correctly handled.

pandas/tests/arrays/masked/test_basemaskedarray_map.py

doc/source/whatsnew/v3.0.0.rst

pandas/tests/arrays/masked/test_basemaskedarray_map.py

pandas/tests/extension/test_masked.py

datapythonista · 2025-06-03T07:23:05Z

pandas/tests/extension/test_masked.py

@@ -181,10 +187,15 @@ def test_map(self, data_missing, na_action):
    def test_map_na_action_ignore(self, data_missing_for_sorting):
        zero = data_missing_for_sorting[2]
        result = data_missing_for_sorting.map(lambda x: zero, na_action="ignore")
+


Better to avoid this unrelated changes

jbrockmendel · 2025-07-08T18:21:17Z

pandas/tests/arrays/masked/test_basemaskedarray_map.py

+        return x + 1
+
+    result = s.map(transform)
+    expected = Series([2, 3, NA, 5], dtype=result.dtype)


can you be explicit about the expected dtype. i.e. is it Int32?

jbrockmendel · 2025-08-20T17:09:09Z

pandas/tests/arrays/masked/test_basemaskedarray_map.py

@@ -0,0 +1,20 @@
+from pandas import (


can you name this file just test_map.py

jbrockmendel · 2025-08-20T17:09:17Z

pandas/tests/arrays/masked/test_basemaskedarray_map.py

+    Series,
+    isna,
+)
+from pandas.testing import assert_series_equal


use tm.assert_series_equal

jbrockmendel · 2025-08-20T17:10:09Z

pandas/tests/extension/test_masked.py

+        if data_missing.dtype.kind != "b":
+            for i in range(len(result)):
+                if result[i] is pd.NA:
+                    result[i] = "nan"


isnt this pretty unwanted behavior?

jbrockmendel · 2025-08-20T17:10:45Z

i suspect the correct thing to do for map involves the just-implemented EA._cast_pointwise_result

pedromfdiogo added 5 commits April 22, 2025 16:03

Update v3.0.0.rst

bf6aaef

fixed test_masked.py

e8edcea

Apply Ruff and Ruff-format auto-fixes

d845306

Merge branch 'main' into bug#60766

ef9812e

datapythonista reviewed Jun 3, 2025

View reviewed changes

datapythonista added Bug Apply Apply, Aggregate, Transform, Map labels Jun 3, 2025

pedromfdiogo added 4 commits June 20, 2025 20:35

fixed some errors

8222c21

fixed test_map_na_action_ignore

f4df033

fixed space

9e97dba

fixed spaces

1cf2604

jbrockmendel reviewed Jul 8, 2025

View reviewed changes

jbrockmendel added the pyarrow dtype retention op with pyarrow dtype -> expect pyarrow result label Aug 14, 2025

jbrockmendel reviewed Aug 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix #60766:.map,.apply would convert element type for extension array #61396

Fix #60766:.map,.apply would convert element type for extension array #61396

pedromfdiogo commented May 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

datapythonista Jun 3, 2025

Uh oh!

jbrockmendel Jul 8, 2025

Uh oh!

jbrockmendel Aug 20, 2025

Uh oh!

jbrockmendel Aug 20, 2025

Uh oh!

jbrockmendel Aug 20, 2025

Uh oh!

jbrockmendel commented Aug 20, 2025

Uh oh!

Uh oh!

Uh oh!

Fix #60766:.map,.apply would convert element type for extension array #61396

Are you sure you want to change the base?

Fix #60766:.map,.apply would convert element type for extension array #61396

Conversation

pedromfdiogo commented May 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

datapythonista Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Jul 8, 2025

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

jbrockmendel commented Aug 20, 2025

Uh oh!

Uh oh!

pedromfdiogo commented May 3, 2025 •

edited

Loading