Skip to content

[BUG] Oversampling warning spurious / misleading #979

@amueller

Description

@amueller

Describe the bug

If I resample a dataset 10x for each class, I might get a warning like

After over-sampling, the number of samples in class 1 will be larger than the number of samples in the majority class

Which is true, but I don't know why it deserves a warning. Maybe the criterion for the warning should be that after sampling another class is bigger than the majority class? But not sure.

Expected Results

No warning.

Actual Results

Warning, potentially for each other class. If the classes are of similar sizes, I will likely get a warning for each class.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions