Skip to content

Boxplot Aggregation #33112

@jtibshirani

Description

@jtibshirani

(Previous title: 'Interquartile Range Aggregation')

The interquartile range is a common robust measure of statistical dispersion. Compared to the standard deviation, the IQR is less sensitive to outliers in the data, with a breakdown point of 0.25. Along with the median, it is often used in creating a box plot, a simple yet common way to summarize data and identify potential outliers.

The IQR is equal to the third minus the first quartile of a dataset, and could be calculated from the output of a percentiles aggregation. Even though it can be easily calculated from quantile information, it may still be useful to provide it as an aggregation for convenience, and to increase its visibility. An alternative option would be to describe the IQR as part of the percentiles documentation.

Compared to the MAD (#26681), the IQR has a lower breakdown point (0.25, compared to 0.5). However, it is simple to calculate and is better equipped to handle skewed (asymmetric) data.

Relates to #26681.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions