Is the new one better than the old one?
Successful commercialization of products and services is fueled by one fundamental – making the new one better than the old one. If the new one is better the customer experience is better, the marketing is better, the sales are better and the profits are better.
It’s not enough to know in your heart that the new one is better, there’s got to be objective evidence that demonstrates the improvement. The only way to do that is with testing. There are a number of types testing mechanisms, but whether it’s surveys, interviews or in-the-lab experiments, test results must be quantifiable and repeatable.
The best way I know to determine if the new one is better than the old one is to test both populations with the same test protocol done on the same test setup and measure the results (in a quantified way) using the same measurement system. Sounds easy, but it’s not. The biggest mistake is the confusion between the “same” test conditions and “almost the same” test conditions. If the test protocol is slightly different there’s no way to tell if the difference between new and old is due to goodness of the new design or the badness of the test setup. This type of uncertainty won’t cut it.
You can never be 100% sure that new one is better than the old one, but that’s were statistics come in handy. Without getting deep into the statistics, here’s how it goes. For both population’s test results the mean and standard deviation (spread) are calculated, and taking into consideration the sample size of the test results, the statistical test will tell you if they’re different and confidence of it’s discernment.
The statistical calculations (Student’s t-test) aren’t all that important, what’s important is to understand the implications of the calculations. When there’s a small difference between new and old, the sample size must be large for the statistics to recognize a difference. When the difference between populations is huge, a sample size of one will do nicely. When the spread of the data within a population is large, the statistics need a large sample size or it can’t tell new from old. But when the data is tight, they can see more clearly and need fewer samples to see a difference.
If marketing claims are based on large sample sizes, the difference between new and old is small. (No one uses large sample sizes unless they have to because they’re expensive.) But if in a design review for the new product the sample size is three and the statistical confidence is 95%, new is far better than old. If the average of new is much larger than the average of old and the sample size is large yet the confidence is low, the statistics know the there’s a lot of variability within the populations. (A visual check should show the distributions to more wide than tall.)
The measurement systems used in the experiments can give a good indication of the difference between new and old. If the measurement system is expensive and complicated, likely the difference between new and old is small. Like with large sample sizes, the only time to use an expensive measurement system is when it is needed. And when the difference between new and old is small, the expensive measurement system’s ability accurately and repeatably measure small differences (micrometers vs. meters).
If you need large sample sizes, expensive measurement systems and complicated statistical analyses, the new one isn’t all that different from the old one. And when that’s the case, your new profits will be much like your old ones. But if your naked eye can see the difference with a back-to-back comparison using a sample size of one, you’re on to something.
Image credit – amanda tipton