|
|
| Poker Theory General poker theory |
03-14-2012, 03:56 PM
|
#1
|
|
Pooh-Bah
Join Date: Aug 2007
Location: ithaca
Posts: 5,005
|
Theory terminology
This is a theory forum. Although we accomodate posters with a wide variety of backgrounds, some of the discussions can become quite technical. These discussions often draw upon concepts, facts, and words from the field of mathematics known as game theory.
Unfortunately, many words commonly used in game theory have very specific, exact meanings which may or may not line up with their usage in everyday speech. In order that we are able to communicate effectively, it is important to understand the technical meanings of a few terms. Way too many threads have degenerated into arguments between posters who mean different things when they say "optimal".
With that in mind, here's a quick intro to some common terminology, some of which comes from game theory.
range
A range is basically just a group of hands, possibly with some frequency information attached. Ranges are particularly useful for specifying strategies. For example, you might want to say that you take a particular action with all of some particular hands, half of others, etc.
equity
The equity of a hand or a range is the percentage of the pot expected to be won if all betting were stopped and all players checked down to showdown. This frequency is averaged over all the cards that can come and all the ranges involved.
game
There's a few different ways to define a game depending on the situation you want to study. Basically, it's just going to be a sequence of decisions which lead to some payoff depending on what decisions you make and what decisions the other players make. That is, your payoff depends on your decisions as well as other peoples' decisions.
Sometimes you have a situation, and you get to make some decisions, and the results only depend on your own decisions (and maybe some randomness from Nature). These might be interesting problems, but they're not questions for game theory. It's the fact that your payoff depends on other peoples' decisions too that puts us in the field of game theory.
For example, rock paper scissors (RPS) is a game with only one decision point, and three payoffs are possible: win, lose, or tie, depending on your choice and your opponent's choice.
strategy
A strategy specifies how you choose your move at every decision point in a game, that is, at every situation you could possibly be faced with in the game.
pure strategy
A pure strategy is a strategy that specifies exactly how you will play in every situation you could face. In RPS, there are three pure strategies: throw rock, throw paper, or throw scissors.
mixed strategy
A mixed strategy is one which associates some probability with each of the possible pure strategies. For example, in RPS, you could play a mixed strategy by using "throw rock" 60% of the time and using "throw paper" the other 40%. It is a good idea to play a mixed strategy in RPS. It's not as clear that this is necessary to be successful in poker.
dominated strategy
A strategy is (strictly) dominated if there is another strategy possible which is more profitable regardless of your opponents' strategies. This actually does not come up too often in poker, since almost anything can be good if your particular opponent happens to play particularly poorly against it.
One example from hold'em is as follows: any strategy that involves folding the nuts on the river is dominated (at least in a cash game context, ignoring rake). In particular, it is dominated by a strategy which is the same except that the nuts are played somehow other than by folding. This second strategy will be more profitable regardless of how your opponents play.
+EV
+EV is short for "positive expected value" or "higher expected value". This is vague for a couple of reasons. First, expected value? expected value of what? Second, higher? higher than what?
Expected value just means the average value of some random quantity. In poker, the quantity we're interested in (and which we're interested in maximizing) is the size of our chip stack. However, when doing calculations, we don't always calculate the average or expected size of our chip stack. Sometimes we calculate the expected change in our stack size over the course of the hand, starting from the beginning, and try to make choices that maximze that. Or, sometimes we calculate the expected change in stack size starting from some specific point in the hand.
Basically, it doesn't matter which way you do it, as long as you are consistent in your choice and clear about what you are doing when you write a post.
Second, higher than what? Sometimes people say "higher than 0", but that doesn't necessarily mean anything depending on what expectation we're considering. Other times, people are implying "higher than than the EV of folding if I computed it with the same convention", but that may or may not be important in any particular situation (say, if we are really interested in deciding between calling and raising).
From a strategic point of view, it's important to choose the option with an EV higher than all your other possible choices. So if you are arguing that something is "+EV", please say what it is that your favored move has a higher EV than, and if you write down EV equations, make it clear what you're finding the expectation of.
best response strategy or maximally-exploitative strategy or nemesis strategy
Suppose all your opponents' strategies are fixed and you know them. Then, you can compute the very most profitable way to play against them. This best strategy is known as a best response or a maximally-exploitative strategy. An (imaginary) player who automatically knows your strategy and always plays maximally exploitatively in response is sometimes refered to as nemesis.
nash equilibrium or (game theory) optimal strategies or unexploitable strategies
A Nash equilibrium is a set of strategies (one for each player in the game) with a couple properties. These properties are equivalent, they're just different ways of looking at the same thing:
- No player in the game can unilaterally change his strategy to improve his expectation.
- Each player's strategy is maximally exploiting those of his opponent(s), at the same time.
Notice that nothing about these definitions implies that the players will break-even on average. However, it turns out that in poker, if all players are playing their equilibrium strategies, they will break even in the long-term average sense when we average over all positions in the game.
The existence of a set of strategies like this has some special consequences (at least in heads-up play!). Whenever players are not playing their equilibrium strategies, and one player is making more money than he would at equilibrium, and thus the other is making less, then the guy who is making less has incentive to switch to his equilibrium strategy. However, whenever both players are playing the equilibrium, neither has any incentive to change. Thus, if both players are rational and smart enough to compute the equilibrium strategies, they those are the strategies they will end up playing. It's only then that neither has any incentive to deviate.
Now it is not immediately obvious that such a set of strategies exist, but John Nash proved it for a class of games that includes poker. Of course, knowing that it exists is different than knowing exactly what it is, and in fact the equilibrium strategies are unknown for all "real" poker games. But they definitely exist, and also, equilibriums for greatly-simplified versions of some games are known. For example, if the SB is restricted to playing shove-or-fold preflop in heads up no limit hold 'em, then the game becomes much simpler, and we can find the well-known shove/fold equilibrium which can be useful for short-stacked play.
Any non-equilibrium strategy may also be refered to as exploitable. Notice that, if you were playing unexploitably, but then you changed your strategy to take advantage of some mistakes of your opponent, then you yourself are now playing exploitably, but that's OK if your opponent isn't taking advantage of it.
Nash equilibrium strategies are also known as optimal or game theory optimal or GTO or unexploitable. The fact that "optimal" does not simply mean "maximally exploitative" really seems to trip people up and is thus unfortunate, but that's the language the mathematicians chose, so we're stuck with it.
The usage of "optimal" or "game theory optimal" to refer to the Nash Equilibrium appears to be somewhat unique to poker. It's genesis may be from the book "The Mathematics of Poker" which uses the term this way. This usage does not appear to be common among game theorists and definitely causes some confusion at times.
Last edited by RustyBrooks; 03-19-2012 at 12:33 PM.
|
|
|
03-14-2012, 04:31 PM
|
#2
|
|
grinder
Join Date: Mar 2012
Posts: 455
|
Re: Theory terminology
Quote:
|
However, it turns out that in poker, if all players are playing their equilibrium strategies, they will break even in the long-term average sense when we average over all positions in the game.
|
if all Players would shove and call any two cards they would be break even too, maybe add that point
ty for ya post =)
|
|
|
03-14-2012, 05:08 PM
|
#3
|
|
adept
Join Date: Apr 2007
Location: Vienna
Posts: 949
|
Re: Theory terminology
Looks pretty good, thanks!
|
|
|
03-14-2012, 05:13 PM
|
#4
|
|
adept
Join Date: Apr 2008
Posts: 915
|
Re: Theory terminology
sticky at the top imo.
|
|
|
03-14-2012, 07:42 PM
|
#5
|
|
veteran
Join Date: Jun 2007
Posts: 2,344
|
Re: Theory terminology
fwiw, I'm not sure if people would be interested in distinguishing between "hand distribution" and "range".
Hand Distribution: All hands for a player before any action is taken on a particular street
Range: All hands used by player when taking an action
For example, on the Flop:
Hand Distribution: All pocket pairs 22+
Betting Range = JJ+
Checking Range = 22 - TT
Check-Calling Range = 66 - TT
Check-Folding Range = 22 - 66
I'm not sure if it's necessary, but I feel that I get confused sometimes on these boards.
thx for doing this.
|
|
|
03-14-2012, 08:56 PM
|
#6
|
|
Pooh-Bah
Join Date: Feb 2007
Location: e^πi=-1
Posts: 5,743
|
Re: Theory terminology
Quote:
Originally Posted by yaqh
dominated strategy
...folding the nuts on the river...
|
Perhaps a more clear example of the definition is how folding first to act is dominated by checking.
|
|
|
03-15-2012, 01:55 AM
|
#7
|
|
newbie
Join Date: Mar 2012
Posts: 37
|
Re: Theory terminology
Quote:
Originally Posted by yaqh
This is a theory forum. Although we accomodate posters with a wide variety of backgrounds, some of the discussions can become quite technical. These discussions often draw upon concepts, facts, and words from the field of mathematics known as game theory.
Unfortunately, many words commonly used in game theory have very specific, exact meanings which may or may not line up with their usage in everyday speech. In order that we are able to communicate effectively, it is important to understand the technical meanings of a few terms. Way too many threads have degenerated into arguments between posters who mean different things when they say "optimal".
With that in mind, here's a quick intro to some common terminology, some of which comes from game theory.
range
A range is basically just a group of hands, possibly with some frequency information attached. Ranges are particularly useful for specifying strategies. For example, you might want to say that you take a particular action with all of some particular hands, half of others, etc.
equity
The equity of a hand or a range is how often it will win at showdown if all betting were stopped and all players checked down to showdown. This frequency is averaged over all the cards that can come and all the ranges involved.
game
There's a few different ways to define a game depending on the situation you want to study. Basically, it's just going to be a sequence of decisions which lead to some payoff depending on what decisions you make and what decisions the other players make. That is, your payoff depends on your decisions as well as other peoples' decisions.
Sometimes you have a situation, and you get to make some decisions, and the results only depend on your own decisions (and maybe some randomness from Nature). These might be interesting problems, but they're not questions for game theory. It's the fact that your payoff depends on other peoples' decisions too that puts us in the field of game theory.
For example, rock paper scissors (RPS) is a game with only one decision point, and three payoffs are possible: win, lose, or tie, depending on your choice and your opponent's choice.
strategy
A strategy specifies how you choose your move at every decision point in a game, that is, at every situation you could possibly be faced with in the game.
pure strategy
A pure strategy is a strategy that specifies exactly how you will play in every situation you could face. In RPS, there are three pure strategies: throw rock, throw paper, or throw scissors.
mixed strategy
A mixed strategy is one which associates some probability with each of the possible pure strategies. For example, in RPS, you could play a mixed strategy by using "throw rock" 60% of the time and using "throw paper" the other 40%. It is a good idea to play a mixed strategy in RPS. It's not as clear that this is necessary to be successful in poker.
dominated strategy
A strategy is (strictly) dominated if there is another strategy possible which is more profitable regardless of your opponents' strategies. This actually does not come up too often in poker, since almost anything can be good if your particular opponent happens to play particularly poorly against it.
One example from hold'em is as follows: any strategy that involves folding the nuts on the river is dominated (at least in a cash game context, ignoring rake). In particular, it is dominated by a strategy which is the same except that the nuts are played somehow other than by folding. This second strategy will be more profitable regardless of how your opponents play.
+EV
+EV is short for "positive expected value" or "higher expected value". This is vague for a couple of reasons. First, expected value? expected value of what? Second, higher? higher than what?
Expected value just means the average value of some random quantity. In poker, the quantity we're interested in (and which we're interested in maximizing) is the size of our chip stack. However, when doing calculations, we don't always calculate the average or expected size of our chip stack. Sometimes we calculate the expected change in our stack size over the course of the hand, starting from the beginning, and try to make choices that maximze that. Or, sometimes we calculate the expected change in stack size starting from some specific point in the hand.
Basically, it doesn't matter which way you do it, as long as you are consistent in your choice and clear about what you are doing when you write a post.
Second, higher than what? Sometimes people say "higher than 0", but that doesn't necessarily mean anything depending on what expectation we're considering. Other times, people are implying "higher than than the EV of folding if I computed it with the same convention", but that may or may not be important in any particular situation (say, if we are really interested in deciding between calling and raising).
From a strategic point of view, it's important to choose the option with an EV higher than all your other possible choices. So if you are arguing that something is "+EV", please say what it is that your favored move has a higher EV than, and if you write down EV equations, make it clear what you're finding the expectation of.
best response strategy or maximally-exploitative strategy or nemesis strategy
Suppose all your opponents' strategies are fixed and you know them. Then, you can compute the very most profitable way to play against them. This best strategy is known as a best response or a maximally-exploitative strategy. An (imaginary) player who automatically knows your strategy and always plays maximally exploitatively in response is sometimes refered to as nemesis.
nash equilibrium or (game theory) optimal strategies or unexploitable strategies
A Nash equilibrium is a set of strategies (one for each player in the game) with a couple properties. These properties are equivalent, they're just different ways of looking at the same thing:
- No player in the game can unilaterally change his strategy to improve his expectation.
- Each player's strategy is maximally exploiting those of his opponent(s), at the same time.
Notice that nothing about these definitions implies that the players will break-even on average. However, it turns out that in poker, if all players are playing their equilibrium strategies, they will break even in the long-term average sense when we average over all positions in the game.
The existence of a set of strategies like this has some special consequences (at least in heads-up play!). Whenever players are not playing their equilibrium strategies, and one player is making more money than he would at equilibrium, and thus the other is making less, then the guy who is making less has incentive to switch to his equilibrium strategy. However, whenever both players are playing the equilibrium, neither has any incentive to change. Thus, if both players are rational and smart enough to compute the equilibrium strategies, they those are the strategies they will end up playing. It's only then that neither has any incentive to deviate.
Now it is not immediately obvious that such a set of strategies exist, but John Nash proved it for a class of games that includes poker. Of course, knowing that it exists is different than knowing exactly what it is, and in fact the equilibrium strategies are unknown for all "real" poker games. But they definitely exist, and also, equilibriums for greatly-simplified versions of some games are known. For example, if the SB is restricted to playing shove-or-fold preflop in heads up no limit hold 'em, then the game becomes much simpler, and we can find the well-known shove/fold equilibrium which can be useful for short-stacked play.
Nash equilibrium strategies are also known as optimal or game theory optimal or GTO or unexploitable. The fact that "optimal" does not simply mean "maximally exploitative" really seems to trip people up and is thus unfortunate, but that's the language the mathematicians chose, so we're stuck with it.
Any non-equilibrium strategy may also be refered to as exploitable. Notice that, if you were playing unexploitably, but then you changed your strategy to take advantage of some mistakes of your opponent, then you yourself are now playing exploitably, but that's OK if your opponent isn't taking advantage of it.
|
good post. btw, i come from a stock/futures (and to a lesser extent - options) trading background and i find poker interesting in a lot of ways. a lot of the game theory carries over, but a lot is quite unique to poker (and trading)
the primary difference i see is that while technically any trade affects the market (even lifting one share from the offer), it has far more minimal effects on the other player's actions vs. poker (obviously) since there might be thousands or millions of players in a future contract in a given day, such that one trade doesn't have the effect that one bet, raise, call or fold does in a poker hand. also, while the futures market is zero sum (like poker) , the stock market is not zero sum.
i have found traders who don't "get" that last point ... which boggles my mind, but it's indisputable
the other difference is that in trading, the "cards" so to speak are the market price at the time (as well as how thin or thick the current orders are on the DOM), but the cards to come are affected by the current actions which is very distinguishable from poker
in brief, in trading, you cannot "force" the game and doing the equivalent of a three barrel bluff will just cause you to lose more money. unless you are soros or something, you cannot appreciably move the market
in poker, you CAN - the market is each individual hand.
can you bluff? sure. people do it on the DOM all the time by flashing fake size to try to get others to react. a market maker can create the illusion of demand where none really exists, in order to move the price. otoh, he (unlike the poker player) has to be there to provide liquidity at all times, whereas the poker player can simply fold
but ultimately, it's most similar in bankroll management (poor money management is amongst the #1 downfall of traders, even if they are very talented at trading itself), in not going on tilt (making emotional/revenge trades will KILL you) and especially in being able to say "i'm wrong and i'm getting out". the best traders are more than willing to take their stop loss. if the market turns around and it would have been a winning trade ... oh well. just like the poker player must fold when he thinks he's beat.
the good trader wants to lose small and win big... "cut your losses short and let your winners ride", so often a winning strategy in trading might only have 30% successful trades, but you still make money.
also, the costs of slippage and commission are similar to the costs of rake.
options guys remind me the most of the internet whizzes because the options guys are VERY comfortable with the math, and with probability curves, and some very complex , galfond'ian if you will , math. the futures guys, ESPECIALLY the pit guys (who are a dying breed) are much more like "feel players" who have great intuition and card sense. they know when they are beat, they are more sensitive to physical tells, and especially for a floor trader, they use their physicality to help execute a trade, just like a live player can use his presence, false tells, etc. to sell his bet.
it's interesting stuff i am seeing here in this forum.
|
|
|
03-16-2012, 11:49 AM
|
#8
|
|
Pooh-Bah
Join Date: Aug 2007
Location: ithaca
Posts: 5,005
|
Re: Theory terminology
Quote:
Originally Posted by wbuster
if all Players would shove and call any two cards they would be break even too, maybe add that point
ty for ya post =)
|
Sure, but at that point, each player can improve his expectation by changing his strategy, something which is not true at the equilibrium.
I just wanted to make the point about the value of the equilibrium being 0 only on average over all positions because it seems like that's something people get confused about. I've even seen people solve for the "equilibrium" in a subgame by setting the EV equal to 0, as if EV=0 is the definition of Nash equilibrium.
Quote:
Originally Posted by plexiq
Looks pretty good, thanks!
|
Quote:
Originally Posted by Bob147
sticky at the top imo.
|
Thanks for the positive feedback guys.
Quote:
Originally Posted by RainbowBright
fwiw, I'm not sure if people would be interested in distinguishing between "hand distribution" and "range".
Hand Distribution: All hands for a player before any action is taken on a particular street
Range: All hands used by player when taking an action
For example, on the Flop:
Hand Distribution: All pocket pairs 22+
Betting Range = JJ+
Checking Range = 22 - TT
Check-Calling Range = 66 - TT
Check-Folding Range = 22 - 66
I'm not sure if it's necessary, but I feel that I get confused sometimes on these boards.
thx for doing this.
|
Ah, to be honest, I think I use the terms "range" and "hand distribution" almost interchangably. In particular, to specify the range at the beginning of a street, I usually go with 'turn starting distribution' or 'turn starting range'.
If anything, 'range' is sort of a grouping of hands without any context, whereas 'hand distribution' usually implies a distribution over the various possible hand strengths, which means that there's opponent(s)' range(s) and a particular board involved.
And of course the 'equity distribution' is something specific and different.
But anyway these are kind of poker-specific (not game theory) words, so I wouldn't expect this thread to be authoritative about their meanings. But I am happy to talk about it.
Quote:
Originally Posted by gedanken
Perhaps a more clear example of the definition is how folding first to act is dominated by checking.
|
Yup, it's true that it's another example, and it's probably more intuitive. However, it's a bit harder to construct a strategy that dominates it. I guess you could take the strategy which is the same except that instead of open-folding, you check and then proceed by only putting any more money in the pot if you get to the river and have the nuts. It should be clear that this strategy does at least as well as the one that involved open-folding, and so it weakly dominates it. (Weakly because it doesn't do strictly better.. it might do the same.)
Quote:
Originally Posted by Tua
good post. btw, i come from a stock/futures (and to a lesser extent - options) trading background and i find poker interesting in a lot of ways. a lot of the game theory carries over, but a lot is quite unique to poker (and trading)
|
Interesting stuff, thanks for the response.
|
|
|
03-17-2012, 02:16 AM
|
#9
|
|
old hand
Join Date: Aug 2011
Location: Canada!
Posts: 1,690
|
Re: Theory terminology
Quote:
Originally Posted by yaqh
Nash equilibrium strategies are also known as optimal or game theory optimal or GTO or unexploitable. The fact that "optimal" does not simply mean "maximally exploitative" really seems to trip people up and is thus unfortunate, but that's the language the mathematicians chose, so we're stuck with it.
|
Very nice post, yaqh, but I just wanted to comment on the above. If you talk to a game theorist (my brother is an economics professor specializing in game theory, so I did  ), his or her understanding of the word 'optimal' is completely different from the meaning used in poker theory. An optimal strategy in game theory is not typically an equilibrium or unexploitable strategy; it's a max EV strategy when the game parameters are completely specified (including opponent responses). When people talk about GTO strategy in poker, they really mean minimax strategy, and the misuse of the word 'optimal' has somehow crept into poker-speak and has stuck. It's unfortunate, not only because it's confusing, but ironically because it contradicts the usage from game theory itself.
If you simply google the phrase 'game theory optimal' in quotes, you won't find a single usage of the phrase in any mathematical literature. The only hits you'll get are from poker.
|
|
|
03-17-2012, 02:58 AM
|
#10
|
|
Pooh-Bah
Join Date: Aug 2007
Location: ithaca
Posts: 5,005
|
Re: Theory terminology
Quote:
Originally Posted by kamikaze baby
Very nice post, yaqh, but I just wanted to comment on the above. If you talk to a game theorist (my brother is an economics professor specializing in game theory, so I did  ), his or her understanding of the word 'optimal' is completely different from the meaning used in poker theory. An optimal strategy in game theory is not typically an equilibrium or unexploitable strategy; it's a max EV strategy when the game parameters are completely specified (including opponent responses). When people talk about GTO strategy in poker, they really mean minimax strategy, and the misuse of the word 'optimal' has somehow crept into poker-speak and has stuck. It's unfortunate, not only because it's confusing, but ironically because it contradicts the usage from game theory itself.
|
Gah, I didn't want it to be so, but you might be right. Chen and Ankenman definitely use 'optimal' to mean 'equilibrium' in The Mathematics of Poker, though, which may be where the confusion began. I wonder why they did that. I'm going to keep looking for uses of the word optimal in the math literature. I don't trust economists  .
Quote:
Originally Posted by kamikaze baby
If you simply google the phrase 'game theory optimal' in quotes, you won't find a single usage of the phrase in any mathematical literature. The only hits you'll get are from poker.
|
Well, of course they don't call it game theory optimal in game theory.
Last edited by yaqh; 03-17-2012 at 03:04 AM.
|
|
|
03-17-2012, 12:28 PM
|
#13
|
|
enthusiast
Join Date: Feb 2011
Posts: 88
|
Re: Theory terminology
Good job, should be sticked or at least linked in the FAQ page for me.
As a side note, another possible "dominated strategy" is folding preflop hand XYsuited while raising hand XYoffsuited. If you start folding some of the offsuited and raising the suited hand you gain on every possible spot.
Or folding AA preflop (in cash game, in certain spot in sat tournament can be right), opposed to go all-in.
|
|
|
03-17-2012, 08:44 PM
|
#14
|
|
Pooh-Bah
Join Date: Aug 2007
Location: ithaca
Posts: 5,005
|
Re: Theory terminology
We should just outlaw the use of the word optimal in this forum
|
|
|
03-17-2012, 08:52 PM
|
#15
|
|
old hand
Join Date: Aug 2011
Location: Canada!
Posts: 1,690
|
Re: Theory terminology
Quote:
Originally Posted by yaqh
We should just outlaw the use of the word optimal in this forum 
|
When I spoke to him a couple of days ago, I asked my brother that exact question - is it possible that mathematician game theorists use the word 'optimal' differently from economist game theorists, and he said "I don't think so". I'll point him to the links you posted and see what he has to say!
|
|
|
| Thread Tools |
|
|
| Display Modes |
Linear Mode
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT -4. The time now is 02:51 PM.
|