State Winning Percentages
After estimating the most accurate possible poll, the next step is to ask, what does that poll tell us about each candidates chance to win the state. I use a logistic model for each state informed by how far we are away from the election and the state’s effective sample size. For example for a fully sample poll (defined at n = 10,000) done today the relationship between polling advantage and chance of winning looks like:
However on Election Day it looks like:
This is steeper (and implies errors smaller) than a typical poll one might read on election day, however it’s important to remember a model using many aggregated polls should have a much lower standard error than any one poll.
The logistic function is calibrated as follows. Polls currently conducted have two main sources of error:
1) The poll sample doesn’t necessarily represent the population (sample size)
2) The poll doesn’t know what will happen between now and election day (cone of uncertainty)
Error source #1 can be calculated. We discount polls against what we have defined as a full sample (10k) using the square root of the ratio of the sample / 10,000 (quite a bit went into selecting square root ratio over log-ratio or standard ratio if anyone is interested).
Quantifying the error from Source #2 is more difficult. The currently model assumes a linear flow of information (we learn, on average, as much going from 60 to 59 days out as we do from 9 to 8 days). We then calibrate the logistic using the assumption that a candidate who is 1 point down in a state in a full 10k sample today has a 45% chance to win that state. Plugging everything in we get the following results:
In summary:
1) Aggregate poll is translated into a likelihood of winning using a logistic curve that gets sharper as sample size increase or the election grows closer.
2) Error comes from two primary sources (1) sample size and (2) cone of uncertainty.
3) The logistic is calibrated assuming the error from (2) is zero on Election Day, and assuming a candidate ahead by 1 point in a full sample today has a 55% chance to win the state.