|
|
| Politics political discourse |
05-30-2012, 12:45 AM
|
#31
|
|
Carpal \'Tunnel
Join Date: Apr 2007
Location: J. Edgar Hoover Elementary
Posts: 10,255
|
Re: Election Modeling
Quote:
Originally Posted by goofball
That's what my package does (it's called @risk, I love it) but the question is how correlated to make my variables. I currently have them all correlated at the same level in both my high and medium correlation models, but I think that cold use refining. For example, michigan and wisconsin are probably more closely correlated than say, michigan and north carolina.
|
Setting aside manually evaluating all 1225 pairings of states, my two initial thoughts are to weight on some measure of distance (either by counting how many state borders are between any given pairing or by actual distance) and on state PVI. Both of these have obvious shortcomings. The former discounts the fact that the northeast and west coast are a lot more similar to each other than they are to a lot of states in closer proximity. The latter will obviously create some strange pairings of states it thinks should be similar. While not perfect, combing the two of them somehow should give you a workable, but very rough, first attempt.
|
|
|
05-30-2012, 01:43 AM
|
#32
|
|
Beat Nate Silver
Join Date: Oct 2003
Location: Blogging 2012 Election Projections
Posts: 12,225
|
Re: Election Modeling
Both of those seem like decent methodologies, but I guess I have no way to estimate how PVI or distance should figure in.
I feel like the correlation model should eventually end up looking like (national correl) + (regional correl) + (some kind of demographic similarity?) but I haven't thought of a good way to estimate data driven values for those 3 variables.
On the plus side I'm only considering swing states, sure if Obama loses CA he's definitely losing NH but if Obama loses CA he's getting his ass kicked and the election just isn't interesting to model. So I'm "only" looking at 12 states worth of combos but still.
|
|
|
05-30-2012, 05:09 AM
|
#33
|
|
Carpal \'Tunnel
Join Date: Oct 2007
Location: The water hemisphere
Posts: 8,058
|
Re: Election Modeling
Quote:
Originally Posted by goofball
Remember the graphs provided are for an aggregate poll of effective sample size 10,000 - iow imagine 10 polls were released on election day all showing a margin of 3 for Romney in Arizona, I think Romney losing arizona in that case would be a big big surprise.
If you look more at individual polls scale, the logistic gives someone who trails by 1 point on election day in a 500 n sample a 37% chance to win, and someone who trails by 3 points a 16% chance to win.
|
10 polls all showing the exact same margin for Romney would be odd whatever that margin was. But it appears that you think a 3 point average lead over 10 polls produces a near 100% chance of winning. That doesn't seem to jive with recent history. I know this is a Canadian provincial election, but in Alberta this year the incumbent government trailed in every single poll (22 in total) for nearly a month before the election, mostly by high single or low double digit margins. The last poll had them only losing by two, but no other polls indicated a tightening. However, they ended up winning by 10 points. I'm guessing your model would have considered this result to be something like a one-in-a-billion event.
Quote:
The logistic function is calibrated as follows. Polls currently conducted have two main sources of error:
1) The poll sample doesn’t necessarily represent the population (sample size)
2) The poll doesn’t know what will happen between now and election day (cone of uncertainty)
|
You're missing #3 - the polls are systematically biased.
|
|
|
05-30-2012, 09:12 AM
|
#34
|
|
Carpal \'Tunnel
Join Date: Apr 2007
Location: J. Edgar Hoover Elementary
Posts: 10,255
|
Re: Election Modeling
Quote:
Originally Posted by goofball
Both of those seem like decent methodologies, but I guess I have no way to estimate how PVI or distance should figure in.
I feel like the correlation model should eventually end up looking like (national correl) + (regional correl) + (some kind of demographic similarity?) but I haven't thought of a good way to estimate data driven values for those 3 variables.
On the plus side I'm only considering swing states, sure if Obama loses CA he's definitely losing NH but if Obama loses CA he's getting his ass kicked and the election just isn't interesting to model. So I'm "only" looking at 12 states worth of combos but still.
|
Just going off intuition, but not a whole lot of theory, I'd say that your correlation numbers should all be between like .05 and .2 or something, so I'd just create and index using the two (scaling them both so they are the same order of magnitude) and then normalize values to fall on the interval of .05-.2
Depending on how long you plan to be working on this, about a month from now, I'll have access to some insanely good data and can probably get you a whole host of numbers you'd be interested in for fine tuning this model
|
|
|
05-30-2012, 11:30 AM
|
#35
|
|
Beat Nate Silver
Join Date: Oct 2003
Location: Blogging 2012 Election Projections
Posts: 12,225
|
Re: Election Modeling
Quote:
Originally Posted by Nichlemn
10 polls all showing the exact same margin for Romney would be odd whatever that margin was. But it appears that you think a 3 point average lead over 10 polls produces a near 100% chance of winning. That doesn't seem to jive with recent history. I know this is a Canadian provincial election, but in Alberta this year the incumbent government trailed in every single poll (22 in total) for nearly a month before the election, mostly by high single or low double digit margins. The last poll had them only losing by two, but no other polls indicated a tightening. However, they ended up winning by 10 points. I'm guessing your model would have considered this result to be something like a one-in-a-billion event.
|
Canada though. Multiple parties blah blah. Not saying you don't have a point with..
Quote:
|
You're missing #3 - the polls are systematically biased.
|
Do you mean
a) polls have house effects (rasumussen is consistently biased towards republicans)
or
b) polling is biased
For the former I agree and hopefully keeping a large aggregation will mitigate that some.
For the latter I think you're better off saying Polling may be biased (undersampling of cell phones, oversampling of people willing to stay on the phone and talk to a pollster, etc) which could be true but I don't really know how to quantify.
I may end up adding just extra noise/error to account for that kind of thinig but I don't have a feel for the scale (if it's there at all), and I don't know how to calculate it using data.
|
|
|
05-30-2012, 11:32 AM
|
#36
|
|
Beat Nate Silver
Join Date: Oct 2003
Location: Blogging 2012 Election Projections
Posts: 12,225
|
Re: Election Modeling
Quote:
Originally Posted by reno expat
Just going off intuition, but not a whole lot of theory, I'd say that your correlation numbers should all be between like .05 and .2 or something, so I'd just create and index using the two (scaling them both so they are the same order of magnitude) and then normalize values to fall on the interval of .05-.2
Depending on how long you plan to be working on this, about a month from now, I'll have access to some insanely good data and can probably get you a whole host of numbers you'd be interested in for fine tuning this model
|
But I don't even really have a way to verify what kind of correlations exist and how big they are at the moment. Like for example teh distance thing would not work with OR and ID. Plus no reason to think TX and OR are more alike than OR and FL. I think for that approach I'm better off going with some kind of defined region but again, we keep coming back to scale.
|
|
|
05-30-2012, 12:22 PM
|
#37
|
|
Carpal \'Tunnel
Join Date: Apr 2007
Location: J. Edgar Hoover Elementary
Posts: 10,255
|
Re: Election Modeling
Quote:
Originally Posted by goofball
But I don't even really have a way to verify what kind of correlations exist and how big they are at the moment. Like for example teh distance thing would not work with OR and ID. Plus no reason to think TX and OR are more alike than OR and FL. I think for that approach I'm better off going with some kind of defined region but again, we keep coming back to scale.
|
the part of the reason to include PVI is because of oregon is a lot more like Washington than it is like idaho.
as far as scale goes, i stand by my thought about .05-.2. Every should be positively correlated. Its really hard to imagine that as different as they are, a single event would move oregon one direction and idaho the other. The more likely case is that they are either weakly correlated or at worst, not at all correlated. I would argue that no states are totally uncorrelated because there are plenty of demographic groups that are the same across even extremely different states.
|
|
|
05-30-2012, 12:28 PM
|
#38
|
|
4l Mod of the Year
Join Date: Sep 2006
Location: It's a town full of losers.
Posts: 47,706
|
Re: Election Modeling
Quote:
Originally Posted by goofball
Assuming middling state to state correlation drops President Obama’s chance to win to 73%, while using a very high correlation reduces it to 66%.
|
What are you defining as "middling" and "very high?"
Great thread here.
|
|
|
05-30-2012, 01:58 PM
|
#39
|
|
Carpal \'Tunnel
Join Date: Jun 2005
Posts: 9,646
|
Re: Election Modeling
cool stuff, will follow this thread.
|
|
|
05-30-2012, 04:00 PM
|
#40
|
|
Beat Nate Silver
Join Date: Oct 2003
Location: Blogging 2012 Election Projections
Posts: 12,225
|
Re: Election Modeling
Quote:
Originally Posted by reno expat
Just going off intuition, but not a whole lot of theory, I'd say that your correlation numbers should all be between like .05 and .2 or something, so I'd just create and index using the two (scaling them both so they are the same order of magnitude) and then normalize values to fall on the interval of .05-.2
Depending on how long you plan to be working on this, about a month from now, I'll have access to some insanely good data and can probably get you a whole host of numbers you'd be interested in for fine tuning this model
|
I plan to update it every week until the election. Odds are I won't get tired of it but who knows.
|
|
|
05-31-2012, 06:27 AM
|
#41
|
|
Pooh-Bah
Join Date: Feb 2009
Posts: 4,633
|
can you explain the reasoning behind using teh sqrt
|
|
|
05-31-2012, 07:00 AM
|
#42
|
|
Carpal \'Tunnel
Join Date: Oct 2007
Location: The water hemisphere
Posts: 8,058
|
Re: Election Modeling
Quote:
Originally Posted by goofball
Canada though. Multiple parties blah blah. Not saying you don't have a point with..
Do you mean
a) polls have house effects (rasumussen is consistently biased towards republicans)
or
b) polling is biased
For the former I agree and hopefully keeping a large aggregation will mitigate that some.
For the latter I think you're better off saying Polling may be biased (undersampling of cell phones, oversampling of people willing to stay on the phone and talk to a pollster, etc) which could be true but I don't really know how to quantify.
I may end up adding just extra noise/error to account for that kind of thinig but I don't have a feel for the scale (if it's there at all), and I don't know how to calculate it using data.
|
There's always going to be some polling bias. No-one can never get a completely random sample. The main risk is that a large number of pollsters are systematically biased to a significant degree.
Also, there's the whole difficulty of working out the likelihood of a voter turning out even if your sample isn't biased, as well as the possibility of voters being systematically misleading in their intentions (like how the Bradley effect, even though it didn't eventually materialise, should have made Obama's 2008 win less certain than polls indicated).
It's difficult to quantify, there isn't really a way around that. You could perhaps gain some sense of objectiveness by evaluating how polls have historically performed, but there are some things you're just going to have to basically guess.
|
|
|
05-31-2012, 12:00 PM
|
#43
|
|
journeyman
Join Date: May 2012
Location: I use to be jackaaron
Posts: 268
|
Re: Election Modeling
Quote:
Originally Posted by swinginglory
You don't think Obama is a polarizing enough figure to get white religious conservatives in southern Ohio to get off their asses and vote?
A warm bucket of spit could be running on the R side and Obama would drive turn out.
|
I don't think that the white religious conservatives in southern Ohio favor Romney enough to really care.
But, on second thought, they DO hate them some B.O.
|
|
|
06-04-2012, 03:34 AM
|
#44
|
|
Beat Nate Silver
Join Date: Oct 2003
Location: Blogging 2012 Election Projections
Posts: 12,225
|
Re: Election Modeling
Quote:
Originally Posted by reno expat
the part of the reason to include PVI is because of oregon is a lot more like Washington than it is like idaho.
as far as scale goes, i stand by my thought about .05-.2. Every should be positively correlated. Its really hard to imagine that as different as they are, a single event would move oregon one direction and idaho the other. The more likely case is that they are either weakly correlated or at worst, not at all correlated. I would argue that no states are totally uncorrelated because there are plenty of demographic groups that are the same across even extremely different states.
|
I definitely agree every state is correlated with every other state to some degree. (Those who disagree see: 9/11).
In this week's update I explored a high/mid/low just for funsies.
|
|
|
06-04-2012, 03:35 AM
|
#45
|
|
Beat Nate Silver
Join Date: Oct 2003
Location: Blogging 2012 Election Projections
Posts: 12,225
|
Re: Election Modeling
Quote:
Originally Posted by Mayo
What are you defining as "middling" and "very high?"
Great thread here.
|
Simply every state correlated with every other state at .5 and .9 respectively. This week I added in a .2 scenario.
|
|
|
| Thread Tools |
|
|
| Display Modes |
Linear Mode
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT -4. The time now is 05:30 PM.
|