Dewey Defeats Truman

How to Read a Poll

Every other year, during election campaigns, the American public is polled, surveyed, and canvassed for its opinions, and the news media continuously inform us of the results. The media report polls in the same breathless way that racetrack announcers describe horse races: "As they round the corner of the convention, the Republican is pulling ahead on the right! Now, they're entering the home stretch and the Democrat is pulling up on the left!" Et cetera.

There is little drama in simply waiting until after the election to report the results. Instead, reporters use polls to add suspense to their coverage, with a leader and an underdog to root for. Moreover, every news outlet is trying to scoop the others by being the first to correctly predict the winner. Unfortunately, much of this coverage sacrifices accuracy for artificial excitement.

This article explains how to read a news report of a poll without being duped by the hype. You don't need to be a statistician to avoid being taken in, because the problems are often not with the polls themselves but with the way that they're reported.

First, please take the following unscientific poll:

Fallacy Files Online Poll

  1. A poll that people voluntarily take online, such as this one, is just as good at measuring public opinion as a poll in which the participants are chosen at random.

    Agree / Disagree

  2. If a scientific poll shows a candidate with 37% support, say, that means that if the election were held today 37% of the voters would vote for that candidate.

    Agree / Disagree

  3. In a scientific poll on a race between two candidates for office, with a margin of error of plus-or-minus three percentage points, Candidate R has 49% support and Candidate D has 47%, so we can be sure that R is supported by more voters than D.

    Agree / Disagree

  4. A scientific poll with a margin of error of plus-or-minus three percentage points at the 95% confidence level taken ahead of an election shows Candidate D with 52% support and Candidate R with 48%. Since the difference between the two candidates' results is greater than the margin of error, we can be 95% confident that Candidate D is going to win the election.

    Agree / Disagree

  5. A scientific poll of registered American voters with a margin of error of plus-or-minus three percentage points shows that 83% of white male Republicans support Donald Trump, which means that the true percentage who support Trump is between 80% and 86%.

    Agree / Disagree

  6. This poll is a worthwhile measure of people's understanding of polls.

    Agree / Disagree

Are you looking for the results of the poll you just took? Read on!

Scientific Versus Self-Selected

Opinion polls, like other surveys, are a way of inferring the characteristics of a large group—called "the population"—from a small sample of it. In order for this inference to be cogent, the sample must represent the population. Thus, the main error to avoid is an unrepresentative sample. For example, the most infamous polling fiasco in history is the Literary Digest poll in the 1936 presidential election [1]. The magazine surveyed over two million people, primarily chosen from its subscribers, telephone books, and car registrations. Even though the sample was enormous, the poll showed Republican Alf Landon beating the actual winner, Democrat Franklin Roosevelt. Despite its size, the Digest's sample was unrepresentative of the population of voters because not everyone could afford a phone or car during the Depression year of 1936, and those who could tended to vote Republican in greater numbers than those who couldn't [2].

So, the first question that you should ask of a poll report you read is: "Was the sample chosen scientifically?" If the poll is a scientific one, then an effort has been made to choose the sample randomly from the population. However, many polls are unscientific, such as most online polls, telephone surveys in which you must call a certain number, or mail-in questionnaires in magazines or sent to you by interest groups. Such surveys suffer from the fault that the sample is self-selected, that is, you decide whether you wish to participate. Self-selected samples are not likely to be representative of the population for various reasons:

  • The readers of a particular magazine or the contributors to a specific charity are likely to differ from the rest of the population in other respects.
  • Those who take the time and trouble to volunteer for a poll are more motivated than the average person, and probably care more about the survey subject.
  • Many such polls allow individuals to vote more than once, thus allowing the results to be skewed by people who stuff the ballot box.

For example, some media outlets sponsor scientific polls but, when the results are reported in their online editions, they are sometimes accompanied by an online poll using a self-selected sample and asking some of the same questions. It is instructive to compare the two, as the results are usually very different.

So, self-selected samples are likely to be biased and are, at best, a form of entertainment, as opposed to a source of information about the population as a whole.


Margin of Error Errors [3]

Because polls question only a sample of the population, there is always a chance of sampling error, that is, of drawing a sample that is unrepresentative of the population as a whole. For instance, in a political poll, it is possible that a random sample of voters would consist entirely of Democrats, though this is highly unlikely. However, less extreme errors of the same kind are not so unlikely, and this means that every poll has some degree of imprecision or fuzziness. Because the sample may not be precisely representative of the population as a whole, there is some chance that the poll results will be off by a certain amount. Statisticians quantify this imprecision with the "margin of error", or "MoE" for short.

The MoE takes the form "±N percentage points" [4], where usually N=3 in national polls. This margin determines what is called a "confidence interval": for example, if the percentage of a sample that supports candidate R is 51%, and the MoE is ±3 percentage points, then the confidence interval is 48-54%. In turn, the confidence interval and the MoE are determined by the "level of confidence", which is usually set at 95% in national polls―for more on the confidence level, see the next section.
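
To make the arithmetic concrete, here's a minimal Python sketch of that calculation; the function name and figures are mine, for illustration only:

    # How a margin of error determines a confidence interval.
    def confidence_interval(percent, moe_points):
        """Return the (low, high) ends of the interval, in percent."""
        return percent - moe_points, percent + moe_points

    print(confidence_interval(51, 3))  # (48, 54): R's support lies between 48% and 54%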

The MoE is a common source of error in news reports of poll results. Most reputable news sources require their reporters to include the MoE in a report on a poll, at least in a note at the end. However, many reporters ignore the MoE in the body of their articles, perhaps because they don't understand what the number means.

Reporters often use polls for "horse race" reporting by comparing the poll numbers of candidates, or to compare current polls to past ones to see if the results are changing. The MoE needs to be factored into such comparisons. There are two kinds of errors about MoEs frequently committed in news reports of poll results:

  1. Suppose that in one poll with a MoE of ±3 percentage points, candidate C polls at 36%, and in a later poll with the same MoE C is at 38%. Many newspapers will report this as a two percentage point "shift" in support for C between the two polls, as if that many voters or previous supporters of other candidates had decided to vote for C since the previous poll. However, given that the MoE is ±3 percentage points in both polls, the result in the first poll could be as high as 39%, and in the second one as low as 35%. In other words, C's support could have dropped by as much as four percentage points! The poll results are simply not precise enough to say with 95% confidence that there is a real increase in C's support, let alone that such an increase is exactly two points.
  2. Another type of mistake occurs in reporting on polls, such as those conducted during presidential elections, in which the results for two candidates are compared. For instance, suppose that in a national poll with a MoE of ±3 percentage points candidate R's support is 52% while D's is 48%. The difference in their support is 52 minus 48, that is, four percentage points, which is greater than the MoE. Most news outlets would report that candidate R is ahead by four points. However, R's support could be as low as 49% and D's as high as 51%, which would mean that D actually leads by two points. A good rule of thumb, illustrated in the sketch after this list, is that a candidate needs to lead by at least twice the MoE to be ahead in the race [5].
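
Here's a minimal Python sketch of that rule of thumb; the function name and the poll numbers are made up for illustration:

    # Rule of thumb: a lead is only clear if it exceeds twice the MoE,
    # since each candidate's number can be off by the MoE in either direction.
    def lead_is_clear(support_a, support_b, moe_points):
        return abs(support_a - support_b) > 2 * moe_points

    print(lead_is_clear(52, 48, 3))  # False: a four-point "lead" is within the fuzziness
    print(lead_is_clear(56, 48, 3))  # True: an eight-point lead exceeds twice the MoE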

So, when you read a news report of a poll, always look for the margin of error, then take that margin into account when evaluating the results. Don't expect the reporter to do this for you: some will but many won't.


Samples, Sub-Samples, and the MoE

You may well wonder why the MoE is usually around plus-or-minus three percentage points in most national polls. The reason is that most such polls have about a thousand respondents, and that's the MoE for samples of that size.

An important mathematical fact about polls is that the sample size and the MoE are inversely related [5]: the larger the sample, the smaller the MoE, and the smaller the sample, the larger the MoE. Since the MoE is a measure of the precision of a poll, this means that if you want more precise poll results, you need a larger sample.

The problem is that the sample size and the cost of polling in both time and money are directly related, that is, the larger the sample, the more time it takes to do the poll and the more it costs. This is why you seldom see national polls with a sample size of much more than a thousand respondents. To reduce the usual MoE by a single percentage point requires more than doubling the sample size [6], and thus probably more than doubling the time and money spent on the poll. So, most pollsters decide that it isn't worth it.

What is true of an entire sample is equally true of its sub-samples: a "sub-sample" is simply a subset of the whole sample, and is itself a sample. Since sub-samples are smaller than the whole sample, they have larger MoEs. Journalists writing about polls and discussing the results for various subgroups seldom mention that the MoE is larger for such results, let alone actually supply that larger MoE [7], so you need to be on the lookout for it.
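
The rough formula from note 6 shows both effects at once. Here's a small Python sketch using the 1/√n estimate of the MoE at the 95% confidence level; the sample sizes are illustrative:

    import math

    def moe_estimate(n):
        # Rough MoE in percentage points at 95% confidence: 1/sqrt(n) (see note 6).
        return 100 / math.sqrt(n)

    print(round(moe_estimate(1000), 1))  # 3.2: the usual national poll
    print(round(moe_estimate(2400), 1))  # 2.0: gaining one point costs over double the sample
    print(round(moe_estimate(250), 1))   # 6.3: a quarter-sized sub-sample is twice as fuzzy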

So, keep this fact in mind when reading a report about poll results: if it reports on the opinions of female independents, or black Republicans, or whatever, these are almost certainly sub-samples that will have a larger MoE than that reported for the whole sample.


The Confidence Game

In the previous section, I mentioned the level of confidence—usually 95%—used to determine the MoE and, therefore, the confidence interval. The purpose of a survey is to measure some characteristic of a sample, such as support for a candidate, in order to infer its level in the whole population. A 95% confidence level means that in 19 out of 20 samples, the percentage of the sample with the characteristic should be within the MoE of the percentage of the population with it. However, we can also expect one out of twenty samples to differ from the population by more than the MoE.

95% confidence sounds pretty confident—and it is!—however, there are a lot of polls done these days. In fact, there are hundreds of national polls conducted in the U.S. during a presidential election year. This means that with a confidence level of 95%, we can expect 5% of those polls to be off by more than the MoE as a result of sampling error.

How can we tell when the results of a poll are off by more than the MoE? If a poll gives very different results from others taken around the same time, or shows a sudden and large change from previous polls, this suggests that the unusual result may be due to sampling error. No one can know for sure whether sampling error is responsible for polls with surprising results, but the fact that 1 in 20 polls can be expected to be significantly in error should encourage us to regard such poll results with skepticism. Moreover, it's important to pay attention to all of the polls taken on a given topic at a particular time; otherwise, you'll have no way of knowing whether a poll you're looking at is giving wildly different results from comparable ones.
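
You can check the one-in-twenty figure for yourself with a short simulation. This Python sketch assumes a population in which exactly half support a candidate, then draws repeated thousand-person samples; all of the numbers are illustrative:

    import random

    TRUE_SUPPORT = 0.50   # assumed: half the population supports the candidate
    SAMPLE_SIZE = 1000    # the usual national poll
    MOE = 3.1             # percentage points, for a sample of 1,000 at 95% confidence
    TRIALS = 10_000

    misses = 0
    for _ in range(TRIALS):
        hits = sum(random.random() < TRUE_SUPPORT for _ in range(SAMPLE_SIZE))
        sample_percent = 100 * hits / SAMPLE_SIZE
        if abs(sample_percent - 100 * TRUE_SUPPORT) > MOE:
            misses += 1

    print(misses / TRIALS)  # about 0.05: roughly 1 poll in 20 misses by more than the MoE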


Polling the Polls

Here's another reason to pay attention to all the comparable polls, as opposed to concentrating on just one. Suppose that five polls are conducted at about the same time showing the following results with a MoE of ±3 percentage points:

Poll    Candidate D    Candidate R    Undecided
  1         43%            42%           15%
  2         42%            41%           17%
  3         44%            42%           14%
  4         44%            44%           12%
  5         46%            43%           15%

Each of these differences is within the MoE so, taken individually, you would have to conclude that neither candidate is really ahead. However, four of the five polls show candidate D with a lead, and the other shows a tie; no poll shows candidate R leading. Of course, it's highly improbable that both candidates have exactly the same level of support, but if they are within a percentage point of one another you would expect the polls showing one candidate ahead to be about evenly divided between the two. Instead, in this example, all of the polls showing one candidate ahead favor candidate D, which is unlikely unless D has a real, albeit small, lead. Thus, even when individual polls do not show a clear leader, the consensus of all polls may do so.
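
How unlikely? Here's a back-of-the-envelope check in Python, treating each poll that shows a leader as a fair coin flip between the candidates (an assumption for illustration, not a pollster's model):

    # If the race were truly even, each poll showing a leader would be equally
    # likely to favor either candidate.
    non_tied_polls = 4
    print(0.5 ** non_tied_polls)  # 0.0625: about 1 chance in 16 that all four favor D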

Another possible explanation for the results in the table above is a systematic bias in favor of candidate D, a possibility that cannot be ruled out, as we've seen examples of it in recent elections. Such biases may result from unrepresentative sampling due to some groups of people being less likely to respond to pollsters, which is the same thing that happened in the Literary Digest debacle, discussed above. Unfortunately, the source of such biases is not well understood, and there's little that the poll consumer can do to allow for it.

Unfortunately, news stories on polls usually concentrate on one poll at the expense of all others. Many polls are sponsored by news outlets, which get their money's worth by reporting only the results of their own polls, ignoring those sponsored by their competitors. Therefore, it's up to you to check whether there are other polls on the same topic, and to compare the results of any comparable polls. Thankfully, there is now more than one site that collects and aggregates the results of all recent polls, which makes the task of comparing poll results much easier than it used to be.


Snapshots and Photo Finishes

Even the best, least-biased polls are only a snapshot of public opinion during a brief period of time. Public opinion can, and does, change over the course of time, depending on what's happening, so it's important to know when a poll was conducted. Most polls take a few days to conduct, and the dates during which the survey was made should be indicated somewhere―most often at the end of a news article reporting on the poll. Unless there is some major event in the news, you should be skeptical of a poll or polls that seem to show a large change in public opinion over the course of a few days. If you encounter such polling, check the fine print for the dates the poll or polls were conducted and the MoEs [7].

When elections are very close, as many presidential elections have been in the last few decades, polls cannot be expected to predict a winner. A close election is like a photo finish in a horse race when one candidate wins by a nose. Given its usual MoE, it's unrealistic to expect a poll to call such a race. Even poll aggregators often miscall such races, since the polls that are aggregated may be systematically biased.

People are often disappointed with polling because they expect too much. When a close election surprises people, as happened in 2016, many conclude that the polls were wrong. Yet, an average of polls in 2016 was only about one percentage point shy of the popular vote total [8], which is better than we have any right to expect.


A Checklist of Questions

When you are confronted with a new poll, ask the following questions about it:

  • Was the sample scientifically selected or self-selected? If self-selected, the poll is only good for entertainment.
  • How recently was the poll conducted? Remember that at best a poll can tell you something about public opinion at the time it was taken. If it's more than a week old, it's probably outdated.
  • What is the poll's margin of error? Are any comparisons reported―such as changes in popularity, record highs or lows, or comparisons between candidates―that lie within the MoE? If so, they are not statistically significant.
  • Is the reported information based on the full sample or only on a sub-sample? If the latter, keep in mind that the MoE for a sub-sample is greater than that for the whole sample.
  • Have other polls on this issue been done recently? If so, it's a good idea to compare the new poll with these others:
    • If the results of the new poll are significantly different from those of most other polls then the new poll is of dubious reliability.
    • If the results of the new poll are within the MoE of those of most other polls, then those results may be reliable, even if they are not individually statistically significant.

If the poll you are confronted with fails at any step of this checklist, or if you can't find the answer to these questions in the report, then your confidence in the poll should be much less than 95%.


The Poll Results

If you haven't guessed by now, the online poll was bogus, but not much more bogus than most such polls. If you go back and retake the poll having read the entire article, I hope that you will agree to disagree with all of its statements!


Technical Appendix: A Point about Percentages and Percentage Points

Poll results are most often reported in terms of percentages; for instance, a news article may report that the president's job approval rating is 40%. The approval rating is based on a poll that asks respondents whether they approve of the president's job performance, and the report indicates that 40% of those who answered the poll approved. In a later poll, the approval rating might be 45%. Both of these approval ratings are percentages, and the difference between them is measured in percentage points: five percentage points, in this case. In other words, if you subtract 40% from 45%, you get five percentage points. So, percentage points are the way to measure the difference between two percentages.

Confusingly, the percentage notation "N%" is often used to mean N percentage points―I did it myself in earlier versions of this article. Thus, the percent sign is ambiguous between percentages and percentage points. This ambiguity can lead to confusion when percentage points are taken to be percentages, or vice versa [9]. To avoid such potential confusion, I refer to the MoE in terms of percentage points and not percentages throughout this article.

Keep in mind, however, that news articles usually report the MoE of a poll as "±3%", rather than "±3 percentage points". So, if the president's approval rating is reported to be "40% ±3%", that means that it is between 37% and 43%.
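
A short Python sketch, with illustrative numbers, shows why the two readings differ:

    approval, moe_points = 40, 3

    # Correct reading: "40% plus-or-minus 3%" means 3 percentage points.
    print(approval - moe_points, approval + moe_points)  # 37 43: between 37% and 43%

    # Incorrect reading: plus-or-minus 3 percent *of* 40%.
    print(approval * 0.97, approval * 1.03)  # 38.8 41.2: a different, narrower range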


Notes:

  1. Peverill Squire, "Why the 1936 Literary Digest Poll Failed", Public Opinion Quarterly 52, pp. 125-133.
  2. This is a necessarily simplified explanation of what happened. There was also what pollsters call "nonresponse bias", that is, not everyone who was polled responded to the survey. For some reason, fewer Democratic voters returned the survey than Republicans did. Ibid., pp. 130-131.
  3. Parts of this section are based on the following entry: MOE-mentum, 10/22/2022.
  4. See the Technical Appendix if you're unclear about the difference between percentages and percentage points.
  5. Andrew Mercer, "5 key things to know about the margin of error in election polls", Pew Research Center, 9/8/2016.
  6. To be more precise myself, a sample size of 2,400 will produce a MoE of two percentage points, give or take. I estimated this using the formula 1/√n, for sample size n, which gives a good estimate of the MoE at the 95% confidence level. See: Jessica M. Utts, Seeing Through Statistics (2nd edition, 1999), p. 53.
  7. For an example of such reporting, see the entry under note 3, above.
  8. See: "It ain't over till it's over.", 9/23/2020.
  9. For an example of such confusion, see the paragraphs on Social Security in: John Allen Paulos, "Misleading Numbers in the News", Who's Counting?, 12/5/2004.

Revised: 12/6/2022, 10/28/2024