I haven’t had too much trouble explaining the concept, and while there is sometimes debate as to the best way to choose the intervals for any given situation, I feel like that debate is a very healthy one. “What is in A, anyway?” is a good question to ask. Even if you go with a definition like “they don’t seem to notice any further improvements in [metric]”, you still get very interesting debates like “[metric] is still pretty bad, why are they not noticing when we make it better?”

All of that is part of the goodness.

