I really like yardstick's API for computing discrimination. Calibration is also an important metric for classification models. Is there any interest in developing some yardstick functions to compute a calibration statistic?
Frank Harrell's Unreliability index seems like a good candidate. The Unreliability index is a chi-square statistic with 2 degrees of freedom testing the hypothesis that calibration slope and intercept are ideal (i.e., the slope is 1, the intercept is 0).
I really like
yardstick's API for computing discrimination. Calibration is also an important metric for classification models. Is there any interest in developing someyardstickfunctions to compute a calibration statistic?Frank Harrell's Unreliability index seems like a good candidate. The Unreliability index is a chi-square statistic with 2 degrees of freedom testing the hypothesis that calibration slope and intercept are ideal (i.e., the slope is 1, the intercept is 0).