Previous Month | RSS/XML | Current | Next Month

WEBLOG

December 25th, 2015 (Permalink)

A Puzzle for Christmas

Klaus has a problem: an old college friend of his has invited him to visit on Christmas day, and Klaus has decided to bring presents for his friend's children. Unfortunately, he hasn't seen his friend since the childless days of college, and has never met the children before. Moreover, he has a notoriously faulty memory for other people's children. He does remember his friend announcing the birth of a new child twice; furthermore, he recalls the friend using the masculine pronoun in reference to one of the two, but not which one. However, that's all that he can remember for sure: his friend has two children and at least one is a boy. What kind of presents should he buy?

Obviously, he should buy one present for a boy, but what about the other one? Should he purchase a present that a girl might like or another one for a boy? Of course, he could always call his friend and ask, but that would be embarrassing: how could he have forgotten such vitally important information? He could buy three presents, two for boys and one for a girl, but how would he manage to explain away the third present? It would appear that he thought his friend had three children, which would be even more embarrassing! Another possibility was to buy a present that would be suitable for either sex, but Klaus simply couldn't think of a toy that would work. It was hard enough figuring out what a modern boy or girl would like!

Finally, Klaus decided that he simply must take his chances and buy a single present for a boy or a girl based on the odds. But was it an even chance that his friend had an additional boy or a girl? If so, then he might as well just flip a coin. What should Klaus do? Should he buy a present for a boy or for a girl? When you think that you know the answer, click on "Solution", below.

Solution


December 15th, 2015 (Permalink)

At Risk of Pinocchios

The Washington Post's fact-checker Glenn Kessler put a list out yesterday of "the biggest Pinocchios of 2015", that is, the "biggest" errors that failed a fact-check in the past year. I gather that what made the list were those errors that received four "Pinocchios", which is the highest―or lowest, depending on how you look at it―rating in the fact-checker's rating system. At the end of the list is a "special award" for "bushels of bogus sex trafficking statistics", among which is one from May concerning the claim that 300,000 American children are "at risk" of sexual exploitation.

I missed the May article, which is unfortunate since it concerns a topic I had dealt with last year―see the Resource, below. Before looking at this specific issue, there's a general problem with the notion of "at risk of X", whatever X is. Even if X is a precisely defined concept, "at risk of X" will probably not be. So, why not just count cases of X rather than trying to count the less precise concept of "at risk of X"? One obvious reason is that the latter is likely to be a larger class, and the resulting number of cases greater. For this reason, activists who wish to use large numbers to galvanize support for anti-X activities will gravitate towards using the broader concept.

Furthermore, in order to count cases, we need a precise definition of what it is to be at risk of X, but that will be defined by those doing the counting. Also, the definition of a vague concept such as "at risk of X" is more open to manipulation than that of a more precise concept such as X itself. Assuming that the definition doesn't provide a large enough number to scare up anti-X support, it can always be broadened.

For these reasons, when you see the concept of "at risk for X", whatever X happens to be, you should be on your guard. Here are some critical questions that should be asked about the use of this concept:

If you can't answer these questions at all, or if the answers aren't reassuring, then you probably shouldn't rely upon the number of "at risk for X". Now, let's turn to the specific issue of "at risk of sexual exploitation" and fact-check the fact-checker.

The claim that Kessler fact-checked is that 300,000 American children are at risk of commercial sexual exploitation. Here's what Kessler says about the provenance of this figure:

…[T]he 300,000 figure comes from a 2001 report written by Richard J. Estes and Neil Weiner of the University of Pennsylvania. … The report suggested that about 326,000 children were "at risk for commercial sexual exploitation"….

The second claim is correct, but the first is at least dubious, as I pointed out in my previous entry on this issue. If the Estes and Weiner (E&W) estimate is the source of the number, why would it be rounded down to 300,000? The last thing that politicians who are advocating legislation against commercial sexual exploitation would want to do is minimize the number of those at risk.

My own research indicated that the estimate came from a previous report from the mid-'90s, which means that it's almost twenty years old, and E&W specifically rejected all previous estimates as unreliable. However, the E&W report may be the proximate source for the 100,000-300,000 estimate, since it does mention it if only to reject it. It's mentioned early (p. 4) in a long report―over 200 pages―so it's possible that a lazy researcher looking for numbers to cite came across it and didn't notice or care that it was not the estimate of the report itself, but a previous one rejected by the report.

I give Kessler one Pinocchio.

Sources:

Resource: One "myth" that's not quite dead yet, 9/5/2014


December 9th, 2015 (Permalink)

Dueling Headlines, Dueling Polls

Ted Cruz takes lead from Donald Trump in new Iowa poll

Trump holds 13-point lead in CNN's Iowa poll

Can both of these polls be right? Trump's 13 percentage point lead over Cruz in the CNN/ORC poll is highly significant, so you can't explain this away as simply statistical noise.

You might suspect that they were conducted at different times, which would certainly explain how two polls could differ so much. However, though they were not conducted during exactly the same time period, the polls did overlap: the first was conducted by Monmouth University from the 3rd to the 6th of this month, whereas the second, the CNN/ORC poll, ended on the same day but started five days earlier. Could the inclusion of some samples from November 28th to December 2nd account for such a large difference in results? If public opinion can change so much so fast then it's unlikely that any poll this far from the caucuses will be of any use in predicting the results.

How about sampling bias, that is, were the samples different in some relevant respect? Both polls were aimed at sampling likely voters in the Iowa Republican caucuses, but according to The Hill's article:

Monmouth drew its sample from lists of registered voters who voted in at least one prior state primary, in a recent general election or registered to vote in the past year. CNN drew its sample by asking adults about their past participation patterns and intentions.

One problem with polls aimed at sampling likely voters is that every polling organization has its own definition of "likely voter". As a result, each is sampling a somewhat different population, which makes it difficult to compare polls from different pollsters. If the different results are explained by the different definitions, which definition comes closest to capturing the group of people who will actually take part in the caucuses? We may have to wait until after the caucuses are over to find out.

Sources:

Resource: How to Read a Poll

Update (12/13/2015): A newer poll, conducted for Bloomberg Politics and the Des Moines Register newspaper, puts Cruz ahead of Trump by ten percentage points, which is a significant lead. This suggests that the Monmouth poll showing Cruz in the lead was capturing a real surge in support for Cruz. Of course, this doesn't explain what happened with the CNN/ORC poll. Perhaps the CNN poll missed most of the surge, or maybe it's just that one-in-twenty poll that's wrong by more than its margin of error. In any case, if it's true that Cruz has risen so much in such a short time, then it appears that the race for support in Iowa is highly volatile and could change an equal amount in the month and a half left before the caucuses.
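The "one-in-twenty" remark reflects the usual 95% confidence level behind a reported margin of error. As a rough illustration only, the sketch below computes the exact chance that a simple random sample misses the true level of support by more than its nominal margin. The function name, the sample size of 400, and the 30% support figure are hypothetical values chosen for the example, not figures from any of the polls discussed here, and the margin is computed from the true share rather than the estimated one, which is a simplification.

```python
from math import comb, sqrt

def miss_probability(n, p, z=1.96):
    """Exact chance that a simple random sample of size n misses the
    true share p by more than the nominal 95% margin of error.

    n and p are hypothetical inputs; z=1.96 is the usual 95% multiplier.
    """
    margin = z * sqrt(p * (1 - p) / n)
    total = 0.0
    for k in range(n + 1):
        # Add the binomial probability of every outcome whose sample
        # share lands outside the margin of error.
        if abs(k / n - p) > margin:
            total += comb(n, k) * p**k * (1 - p)**(n - k)
    return total

# Hypothetical poll: 400 respondents, true support of 30%.
print(round(miss_probability(400, 0.30), 3))
```

Under these assumptions the result should come out close to 0.05, that is, about one poll in twenty. Real polls involve weighting and likely-voter screens that this sketch ignores.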

Sources:


December 4th, 2015 (Permalink)

Headline

400-Year-Old Hearts Had Same Diseases As Hearts Of Today

If they're 400 years old they're doing pretty well.


Solution to a Puzzle for Christmas: As counter-intuitive as it may seem, Klaus should buy a present for a girl, though not because a girl was "due" after his friend had a boy. In fact, the chance that any particular child is a girl is almost exactly one half, which might make you think that it doesn't matter what present Klaus buys, since the chances will be the same for a boy or a girl.

However, all that Klaus knows is that his friend has two children, one of whom is a boy. Thus, since both children being girls is ruled out, there are three equally likely possibilities:

            1      2      3
Oldest:     Boy    Boy    Girl
Youngest:   Boy    Girl   Boy

In only one of these possibilities―namely, the first―is the other child a boy. In the other two possibilities, the other child is a girl. Thus, if Klaus buys a present for a girl, the odds are 2 in 3 that he will have bought the right present.
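For readers who would like to check the counting, here is a minimal sketch that enumerates the possibilities. It assumes, as the solution does, that each child is equally likely to be a boy or a girl independently of the other; the function name is made up for illustration.

```python
from fractions import Fraction
from itertools import product

def other_child_is_girl_probability():
    # All equally likely (oldest, youngest) combinations, assuming each
    # child is a boy or a girl with probability 1/2, independently.
    families = list(product(["Boy", "Girl"], repeat=2))
    # Keep only the families consistent with what Klaus remembers:
    # at least one of the two children is a boy.
    with_a_boy = [f for f in families if "Boy" in f]
    # Among those, the other child is a girl when the family has a girl.
    with_a_girl = [f for f in with_a_boy if "Girl" in f]
    return Fraction(len(with_a_girl), len(with_a_boy))

print(other_child_is_girl_probability())  # prints 2/3
```

The three surviving families are the same three columns shown in the table above, and two of them include a girl.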

Sources:

Resource: "Ask Dr. Math: Boy or Girl?", Math Forum at Drexel University. If you're still puzzled or not convinced by the above solution, read this brief article.
