Six heads in ten flips doesn’t prove a biased coin. A campaign’s observed rate is one sample; the true rate sits somewhere in a range around it. Small samples mean wide ranges.
The two curves are those ranges. Where they overlap is the uncertainty. The question isn’t which rate is higher. It’s how much of B’s range sits above A’s.
“B beats A” isn’t enough if the gap is trivial. Set a threshold for what counts as a meaningful win, and read the probability of clearing it. That’s the number to act on.