Skip to content

Dataframe shuffle: Behaviour for NULL partitions differs from Pandas 3 #9184

@crusaderky

Description

@crusaderky

FAILED distributed/shuffle/tests/test_shuffle.py::test_handle_null_partitions_2 - AssertionError: Attributes of DataFrame.iloc[:, 1] (column name="b") are different

Attribute "dtype" are different
[left]: <StringDtype(na_value=)>
[right]: float64

This test has been broken since the day Pandas 3 was released:

Image

Note that the 3.10 environment still uses pandas 2.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething is brokenp2Affects more than a few users but doesn't prevent core functionsshuffle

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions