**#1** May 13th, 2010

I'll most likely be rebuffed by someone much more knowledgeable on the subject of statistics than I am, but statistics can show pretty much what you want them to show, depending on the parameters set out to display the information.

*The whole Climate thing I'm not going to touch*, so let's use crime as an example. Crime rates have been dropping for a long time... everywhere, it seems. Hmmm...

Anecdotally, my Father or any of his Friends 30+ years ago *never* had their cars broken into or stolen. *I'm sure it happened, but not to anyone they knew*. Now here I am, and I'm not sure if I know anyone personally who hasn't had a car broken into or stolen in the last five years (& I'm leaving the never or ever out of this now).

Similar story with B&Es of people's homes (& many were not locked much of the time) back 30+ years ago. If you heard of it, it was a rare occurrence. Now I'm hard pressed to think of anyone I know personally who hasn't had a B&E (or an attempt made on their home) in the last 5-10 years... and so on & so forth. The crime rate has been dropping for years though, according to the statistics. What's going on?

What's going on, Ron, is that the human brain is selective and subjective. Statistics should be objective, and when Stats Canada says that crime rates are dropping, it's fairly certain that this is the case.

If you really want to know why there is a disconnect, think of it this way. Statistics like these deal with averages. They take a large sample, or census data, and make inferences about the population based on the mean. You and your friends and family are single realizations, and you will remember an event (car theft, break and enter) more than a non-event (no crime today, did you take notice?).

One part of doing statistics is having proper amounts of information to realistically be able to say something about the population you're interested in. Anecdotes aren't going to cut it.

I know you didn't want to touch the climate thing, but I will say one more thing. A proper amount of information in statistics means you're dealing with sample size. The more values you gather, the more the average of those values tends to behave as if it came from something called the normal distribution (the bell-shaped curve). For a normal distribution, a value far away from the top of the bell, the mean, is very unlikely, while a value close to the mean is very likely. A common rule of thumb is that as you get close to thirty values, averages of many types of data will begin to appear normally distributed. Much statistical analysis relies on the normal distribution to make well-reasoned inferences.

Anyways, climate is conventionally defined over thirty-year periods, and at roughly this many values the averages should be close to normally distributed. This tendency is described by the central limit theorem. So climatologists can make meaningful claims about changes in the system when this normality is apparent. This is a highly simplified explanation, of course; there's far more to it once you go down the rabbit hole.
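For anyone who wants to see that tendency rather than take it on faith, here is a rough sketch. Note that the central limit theorem is usually stated for averages of samples rather than for single data points, so that is what the sketch measures. The data source (an exponential distribution) and the helper names `sample_means` and `skewness` are my own assumptions for illustration, not anything from a climate data set.

```python
import random
import statistics

def sample_means(draw, n, trials=5000, seed=0):
    """Return `trials` averages, each computed from `n` independent draws."""
    rng = random.Random(seed)
    return [statistics.fmean(draw(rng) for _ in range(n)) for _ in range(trials)]

def skewness(values):
    """Skewness of a list of values; near 0 for a symmetric bell shape."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return statistics.fmean(((v - m) / s) ** 3 for v in values)

# Hypothetical, strongly lopsided data source: exponential with mean 1.
def draw(rng):
    return rng.expovariate(1.0)

# Skewness of the averages for sample sizes 1, 5 and 30.
skew_by_n = {n: skewness(sample_means(draw, n)) for n in (1, 5, 30)}
for n, sk in skew_by_n.items():
    print(f"n={n:2d}  skewness of the sample averages = {sk:.2f}")
```

The skewness should shrink toward zero as `n` grows, which is the bell shape emerging. Thirty is a rule of thumb, not a magic number: the more lopsided the raw data, the more values it takes.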

We all know stats are as reliable as they need them to sound. I've said it over and over: stats are changed all the time to suit the needs of the person needing to show them. My husband was listening to a report on how crime has dropped across the country, and he's one of those people who seems to think that if he speaks loud enough to the TV, someone will hear him. So he's yelling across the room: "That's wrong, that's wrong. All they can do stats on are reported crimes, and many crimes go unreported!" I certainly heard him.

From my field: if we want to test a new vaccine, we don't use more fish in the trial than we need to. First, it's an animal welfare issue, and second, using more fish would tie up resources that could be used in other trials (tank space, fish, heated or chilled water, etc.). We use statistics to determine how many fish we need for statistical significance.
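To give a feel for how that kind of calculation can look, here is a minimal sketch using the standard textbook normal approximation for comparing two proportions. This is not necessarily the method any particular lab uses, and the mortality figures, the 5% significance level, the 80% power, and the function name `fish_per_group` are all assumptions made up for the example.

```python
import math
from statistics import NormalDist

def fish_per_group(p_control, p_vaccine, alpha=0.05, power=0.80):
    """Approximate fish needed per tank group to detect a difference between
    two mortality proportions (two-sided test, normal approximation)."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # significance threshold
    z_power = NormalDist().inv_cdf(power)          # desired chance of detection
    p_bar = (p_control + p_vaccine) / 2
    numerator = (
        z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
        + z_power * math.sqrt(p_control * (1 - p_control)
                              + p_vaccine * (1 - p_vaccine))
    ) ** 2
    return math.ceil(numerator / (p_control - p_vaccine) ** 2)

# Hypothetical trial: 60% mortality unvaccinated, 30% hoped for with vaccine.
big_effect = fish_per_group(0.60, 0.30)
# Hypothetical weaker vaccine: 60% versus 50% mortality.
small_effect = fish_per_group(0.60, 0.50)
print(big_effect, small_effect)
```

The point to take away is the trade-off: the smaller the difference you hope to detect, the more fish the trial needs, so the calculation sets a floor and anything beyond it is waste.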

Mark Twain said: "There are lies. There are damned lies. And then, there are statistics".

Telling the whole story?


If you try to make more out of the stats than they can give, then you're using them improperly. Despite what you all might think of stats, the truth of the matter is that we would have made only a fraction of the progress we have in science if we did not use statistics when analyzing scientific results.

One poster gave a good example of the fallibility of statistics, in that for crime statistics, not all the information is entered because not all of it is even reported.

But now here's a pickle for you. Do you think that this non-reporting of crime is a new phenomenon, or something that has always been there? If it has always been there, at roughly the same level, then a decreasing trend in crime rates is very likely to be a robust result. If it is new, then the statistics which show that it is a new phenomenon lead to investigations, which can find ways of addressing the problem.
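Here is a small arithmetic sketch of that argument. Every number in it is made up for illustration (none of it is Stats Canada data), and `reported` and `percent_change` are hypothetical helper names.

```python
def reported(true_counts, reporting_rates):
    """Crimes that reach the statistics: true count times fraction reported."""
    return [t * r for t, r in zip(true_counts, reporting_rates)]

def percent_change(series):
    """Percent change from the first value in the series to the last."""
    return 100 * (series[-1] - series[0]) / series[0]

# Case 1 (made-up numbers): real crime falls, a steady 40% gets reported.
true_falling = [1000, 900, 800, 700]
steady_reporting = reported(true_falling, [0.40, 0.40, 0.40, 0.40])

# Case 2 (made-up numbers): real crime is flat, but reporting drops off.
true_flat = [1000, 1000, 1000, 1000]
fading_reporting = reported(true_flat, [0.40, 0.36, 0.32, 0.28])

print(percent_change(true_falling), percent_change(steady_reporting))
print(percent_change(true_flat), percent_change(fading_reporting))
```

With steady under-reporting, the reported trend matches the real one even though the raw counts are too low. Only when the reporting rate itself changes can the statistics show a drop that isn't there, which is why the history of the reporting rate is the thing worth checking.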

Statistics are only as good as the quality of the data. Caveat.

Are five-day weather forecasts 100% accurate? There's statistics in action.

Around twenty-five years ago, I was dumped by Wawanesa Insurance because, according to their statistics, my luck was soon going to run out. It hasn't happened yet - and Wawanesa cheated themselves out of a quarter century's worth of free money.

Statistics? If we lived in a world without change, they'd have a better chance.

... and the description of what the stats are supposed to show.

Interesting thing in BC lately: someone did a study on the public's view of the RCMP and found that only about 35% of the public had confidence in the RCMP here. I can imagine that this stat would cause a few people to just suffer minor crimes and keep quiet about them rather than reporting them. That could cause a lowering of crime stats and reporting. Ironic, huh?

What was the % before say... the tasering incidents? That's a crucial bit of information to have before anyone makes any kind of inference.

Somebody else said, "Facts? I don't need facts, I just make 'em up as I need 'em."

"And I'll ask you the same thing I asked Jack. Do you know what context that quote is derived from?"

Tonnignton, you are asking two people who only quoted a very quotable quote by a very wise person.

There is one person who could give you the answer to your question: Samuel Clemens.

The meaning of that quote is two-fold, and it's wise because of that. The people who most often use this quote use it predominantly in one form only, and thus miss the true wisdom in the quote.

It means that some people use statistics to tell lies, sure enough, and that's what most people use it (the quote) for. But it also refers to the fact that some people choose to write off statistics that they have cognitive issues with, or find inconvenient. So when someone uses this quote when they have no reason to discard the finding, they are actually the other side of the coin that this wise person was referring to.

Incidentally, there were others before him who made similar statements.

Statistics is historical information, and probability is using statistics to predict the future.

