Open Side Menu Go to the Top
Register
Discussing GTO Discussing GTO

10-27-2011 , 11:05 AM
Quote:
Originally Posted by Ronin Talken
In the second example, I obviously understand how that affects a normal range. But against a GTO range, the checking range is as balanced as the betting range, isn't it?
I'm not sure exactly what "as balanced" means... but I am saying that in that example, the GTO's betting range is going to be different than his calling range. Likewise with villain's "expected" ranges for those two lines. So the strategies for future betting rounds are going to be different.

To me it's intuitive, not counter-intuitive. Perfect strategy depends on the two ranges. Different actions (betting lines) imply different ranges even if they lead to the same pot/stack sizes.

Last edited by bobf; 10-27-2011 at 11:15 AM.
Discussing GTO Quote
10-27-2011 , 01:54 PM
Sorry, it's hard to visualize in a GTO v GTO scenario. I understand what you're saying.

Im just having trouble, because a Check should be protected in GTO.

Keep riffing. I'm learning a lot and appreciating very much. =D
Discussing GTO Quote
10-27-2011 , 06:19 PM
Sorry for the double-post, i can't edit anymore.

I have a really stupid question...

Does Chubukov require that the hands referenced are shoved, or is that a proof that it is unexploitable to be AI at the designated stack depths in a HU scenario with the given holdings?

I'm not sure if, for example, it is a proof that raise, 4bet calling TT HU for 100bb (more, I just don't have the chart in front of me) is unexploitable, or if it's just a proof that open-shoving TT HU for 100bb is unexploitable.

I have no clue. 0.o
Discussing GTO Quote
10-27-2011 , 07:51 PM
Quote:
Originally Posted by Ronin Talken
I'm not sure if, for example, it is a proof that raise, 4bet calling TT HU for 100bb (more, I just don't have the chart in front of me) is unexploitable, or if it's just a proof that open-shoving TT HU for 100bb is unexploitable.
It's just a proof open-shoving is unexploitable. An open-shove is necessarily up against a range of ATC, whereas a 4bet-shove or a 4bet-call is not.
Discussing GTO Quote
10-27-2011 , 08:14 PM
Quote:
Originally Posted by Ronin Talken

So is a GTO range shaped only by:
- Dead money
- effective stacks
- number of people in a given hand
IMO you should just try to understand Nash Equilibrium in a mathematical sense. It will answer all of your questions and it's not that complicated.

BTW 3+ player games usually don't have Nash Equilibria, so it doesn't make sense to talk about GTO outside HU situations.

GTO is just a fancy word for one of the strategies of the Nash Equilibrium.
Discussing GTO Quote
10-27-2011 , 08:42 PM
Quote:
Originally Posted by SquirrelsUnite
IMO you should just try to understand Nash Equilibrium in a mathematical sense. It will answer all of your questions and it's not that complicated.
This is very helpful! - "Just try and understand! Then you will understand!"

Perhaps you'll enlighten us all about the "uncomplicated" relationship between an abstract mathematical proof about games in general, and the conditions under which Q5o forms part of a GTO UTG open-shoving range.

Quote:
Originally Posted by SquirrelsUnite
BTW 3+ player games usually don't have Nash Equilibria, so it doesn't make sense to talk about GTO outside HU situations.
"Our survery says these things are unusual. It is therefore meaningless to talk about them".
Discussing GTO Quote
10-27-2011 , 08:54 PM
Ok. This isn't going to be as thorough of a response as I hoped for. I'm just going to spew out some stuff before I have to go back to studying/working and grading 80 papers over the weekend.

Quote:
Originally Posted by Clicken
I seems to me that a degenerate mixed strategy just is a pure strategy. Is there any way to differentiate the two? If not it would seem, that the whole concept of a "degenerate mixed strategy" was nothing other than a useful slight of hand used to subsume pure strategies within the concept of mixed strategies.

Presumably it would be more useful to stick to the term "pure strategy" for this, and "mixed strategy" for non-degerate-mixed-strategies.

When we consider that Nash's proof might not in fact be the proof of the existence of non-generate-equilibrium-mixed-strategies but merely the existence of mixed-equilibrium-strategies-degenerate-or-not, then surely we need to re-examine some of the assumptions commonly made about GTO.

Specifically, one such assumption is that in GTO mixing is at work, and that a NE strategy sometimes plays one hand one way, and yet in exactly the same situation, play it another way. A probability distribution which assigns the value of 1 to one cherry picked pure strategy, and a value of 0 to all others, is not consistent with this assumption, the consensus or the received understanding that the poker world has about "GTO".
A pure strategy is a special case (degenerate) of a mixed strategy. You can tell the difference based on the probabilities assigned to the actions. It's not really a slight of hand, its just that you are moving to a more general thing which includes the specific case of a pure strategy.

I don't think so, but yes typically when you say mixed strategy you are referring to non degenerate cases but not excluding them.

The proof I provided is for the existense of a mixed strategy NE. To me it seems pretty clear there won't be a pure strategy NE to poker (you can probably show it if you had to, but its not worth the time). The proof posted also doesn't say anything about the number of equilibrium, it just guarantees that there is at least one.

Quote:
Originally Posted by Montrealcorp
seem to me the talk on mixed and pure strategy is complicated in this thread for no reason ?


pure strategy only means the strategy is decided in advance ,so the same action will be execute in the same order in the same situation everytime.

mixed strategy means strategy will be decide randomly between 2 or more strategy , so actions in some situation will be different when those situation will happen again.

All strategies are decided in advance. A strategy is a complete plan of what to do in any and all situations. If you gave someone else your strategy and went and did someething instead of playing they game they could play it identical to you based on your instructions. They are deciding between the actions available at a given node, but the probabilities assigned to each node have weights that were assigned in advance.


Quote:
Originally Posted by Ronin Talken
Thanks for the replies. What a great thread (other than the random "GTO is exploitable" post, lol). Im still digesting, so I'll hold back on entering too deeply into the discussion...

- any chance someone could offer outside readings for those of us who only recognize "dominated strategy" and other super-simple game theory terms? =P
How good are you at math? What are the highest courses/subjects you know?

Quote:
- I think Clicken summed up the argument about Nash not being a proof of the existence of GTO quite well. However, I don't think that means it's unprovable nor non-existent. Just a subject of debate to which threads attempting to define it contribute.
It's easier to prove it's not pure by contradiction. I think it could be done, but that's just my initial impression.

Quote:
Not sure who responded to those last bullets, but my response is definitely more questions:
- you said our opponent's play affects the GTO strategy. I don't understand how that is possible. A GTO strategy is indifferent to opponents reaction. Is it possible that what you mean is how large the pot becomes? It seems like the narrowing of a GTO range throughout the course of the hand isn't due to our opponents action, but actually the narrowing of how many hands can invest that many BBs while remaining unexploitable. For example, Preflop, we may be able to flat a range of 15% of hands without the possibility of being exploited by the initial raiser or other players, due to the fact that there is no range which can squeeze nor be the initial raiser that would exploit us. However, if that 3BB flat does get squeezed, the GTO range would naturally narrow--not because someone raised, but because the number of BBs invested into the Pot has risen, narrowing the number of hands that can call/raise while remaining unexploited.

Or am I way off?
You see what your opponent does. If he bets an amount x you are at set of nodes contained within the same information set where all you know is that the opponent bet x. If he bets y=/=x then you are at a different information set.

Quote:
Given a GTO range, is there a difference between our opponent calling and our opponent raising (other than the fact that some hands would benefit from fold equity)?
They are different strategies? I mean it seems like you want to be somewhere in the middle of the game tree rather than at the beginning, then the answer is it depends.

Quote:
I just thought that a part of the definition of GTO play was that we are indifferent to our opponent's actions. Saying that the range will changed based on what actions he is most likely to take seems counter-intuitive.
No, the opponent should be indifferent between his choice of strategies against our strategy (note this is before nature decides who goes first, deals the cards, determines the order of following cards on the flop, turn, river, etc). The strategy we play, since it isn't optimal, doesn't adjust to information revealed about the type of strategy our opponent is playing.

Quote:
The more i read, the more It seems as if the total width of a GTO range is defined only by the range of hands that are infrequent enough of occurrences that the existence of dead money causes them to be unexploitable.

For example, A range of KK+ is only bearable by a range of AA. The fact that AA is only dealt 0.5% of the time means that waiting for only AA would require folding more dead money than is gained by stacking off against KK+.

If that's the case, then the width of a GTO range is only subject to the width of that unexploitable nut range, which is only a factor of number of opponents with hands, effective stacks and the pot.

I see no reason to believe this. Your opponent doesn't know what you hold, that's why you can do this. He's at an information set, but doesn't know which node within the set he is at. He has no idea if you are 3 betting to X with great cards or with marginal holdings.

Last edited by ArBar; 10-27-2011 at 09:02 PM.
Discussing GTO Quote
10-27-2011 , 09:03 PM
Quote:
Originally Posted by bobf
Say GTO is playing heads up no-limit with infinite stacks (so stack size never changes nor does stack-to-pot ratio). Suppose villain

a) bets 2bb
b) bets 5bb
c) bets 10bb
d) bets 100bb

GTO's folding, calling, raising ranges will vary in each of those situations.

So I would say it is due to villains actions (and his perceived/assumed GTO range for those actions).

Another example... consider two possible betting lines...

a) check | bet pot | call
b) bet pot | call

In both cases the pot and stacks are the same for future betting rounds. But a GTO will likely behave differently on future betting rounds, depending on (a) or (b), because the GTO ranges of each player are likely to be different.
Uhh, can you prove that there is a NE here? Infinite stacks=infinite actions. Not to mention potential problems of infinite pay offs, and having to do dumb stuff that won't work out nicely with math. Pretty bad example, and wrong stuff in this post.

Quote:
Originally Posted by SquirrelsUnite
IMO you should just try to understand Nash Equilibrium in a mathematical sense. It will answer all of your questions and it's not that complicated.

BTW 3+ player games usually don't have Nash Equilibria, so it doesn't make sense to talk about GTO outside HU situations.

GTO is just a fancy word for one of the strategies of the Nash Equilibrium.

I concur. Once you understand what NEs are then it should be clearer the role each of the things you ask about should be to you. What do you mean about 3+player games not have NE? They can if they meet certain conditions which a whole lot of games do...

GTO is a misnomer for one of the strategies in a symmetric NE(not just any NE) from my take on it.

Edit:
Go bulls. Ronin, do you play at hammond? I'm from/my family still lives in NW IL.

Last edited by ArBar; 10-27-2011 at 09:08 PM.
Discussing GTO Quote
10-27-2011 , 09:04 PM
Quote:
Originally Posted by SquirrelsUnite
BTW 3+ player games usually don't have Nash Equilibria, so it doesn't make sense to talk about GTO outside HU situations.
A Nash equilibrium is a set of strategies such that no player can do better by unilaterally changing strategies. So this concept does exist for multi-player games, but choosing a NE strategy only guarantees you'll be unexploitable in a two player context. With 3+ players you could be playing a NE strategy and still get exploited if multiple opponents deviate from NE, but not if only one of them does (hence the "unilaterally" caveat).
Discussing GTO Quote
10-27-2011 , 09:22 PM
Quote:
Originally Posted by Clicken
This is very helpful! - "Just try and understand! Then you will understand!"

Perhaps you'll enlighten us all about the "uncomplicated" relationship between an abstract mathematical proof about games in general, and the conditions under which Q5o forms part of a GTO UTG open-shoving range.
I was answering the specific question in OP. If you actually understand what Nash Equilibrium is it's clear that it will depend only the starting SPR and the players' relative position. Of course it doesn't tell what the specific strategy is but that wasn't the question.

"Our survery says these things are unusual. It is therefore meaningless to talk about them".[/QUOTE]

Alright, let me repraphrase. In 3 or more player poker games with reasonable stack sizes there is no strategy that guarantees a 0 payout. There's always a chance that the other players are colluding against you. Possibly unintentionally. I don't have a mathematical proof for this and it's possible this isn't true for very short stacks (like 5BBs or less). But if you've played any poker at all, I'm sure you agree it's true.

I suspect there aren't Nash Equilibria either, but even if there is it's not a very useful concept. If a player realized he was stuck in a NE that gave him a negative payout he would either leave the game or played a strategy that lowered his payout in the short term hoping that it would lead to others changing their strategy.
Discussing GTO Quote
10-27-2011 , 09:31 PM
Quote:
Originally Posted by EvilSteve
A Nash equilibrium is a set of strategies such that no player can do better by unilaterally changing strategies. So this concept does exist for multi-player games, but choosing a NE strategy only guarantees you'll be unexploitable in a two player context. With 3+ players you could be playing a NE strategy and still get exploited if multiple opponents deviate from NE, but not if only one of them does (hence the "unilaterally" caveat).
I'm pretty sure there are even 3 player zero-sum games that are symmetric and have multiple NE, each with non-zero payouts.
Discussing GTO Quote
10-27-2011 , 09:32 PM
Quote:
Originally Posted by ArBar
The proof I provided is for the existense of a mixed strategy NE. To me it seems pretty clear there won't be a pure strategy NE to poker (you can probably show it if you had to, but its not worth the time).


It's easier to prove it's not pure by contradiction. I think it could be done, but that's just my initial impression.
I agree that it's implausible that a pure strategy NE for poker exists.

As I understand it though, at present it simply hasn't been proven that a (non-degenerate) mixed stratgegy NE exists for poker. This isn't to say it does not exist, but just that there is no *proof*.

I think it's important to recognise this if only because there is a widely held assumption that the proof now exists - and it doesn't (as far as I can see).
Discussing GTO Quote
10-27-2011 , 09:36 PM
Quote:
Originally Posted by Clicken
I agree that it's implausible that a pure strategy NE for poker exists.

As I understand it though, at present it simply hasn't been proven that a (non-degenerate) mixed stratgegy NE exists for poker. This isn't to say it does not exist, but just that there is no *proof*.

I think it's important to recognise this if only because there is a widely held assumption that the proof now exists - and it doesn't (as far as I can see).
It's been proven that 2 player fixed-sum games have mixed strategy NE. Since HU poker is a 2 player fixed-sum game (ignoring rake) it has a NE.
Discussing GTO Quote
10-27-2011 , 09:42 PM
Quote:
Originally Posted by SquirrelsUnite
I'm pretty sure there are even 3 player zero-sum games that are symmetric and have multiple NE, each with non-zero payouts.
Sure, but what I was responding to...
"3+ player games usually don't have Nash Equilibria"
They definitely do have NE. Possibly multiple NE as you later stated.
Discussing GTO Quote
10-27-2011 , 09:54 PM
Quote:
Originally Posted by SquirrelsUnite
In 3 or more player poker games with reasonable stack sizes there is no strategy that guarantees a 0 payout. There's always a chance that the other players are colluding against you. Possibly unintentionally. I don't have a mathematical proof for this and it's possible this isn't true for very short stacks (like 5BBs or less). But if you've played any poker at all, I'm sure you agree it's true.
I think there are two ways of looking at this.

(1) A "GTO-bot" playing multiple opponents who were all colluding (in some optimal fashion) would have a negative winrate over any random sample of hands. A negative win-rate in the blinds is important for this to be the case.

(2) A "GTO-bot" can guarantee a payout of 0 for itself by folding. Folding is 0EV, even from the blinds. A game theoretic definition of NLHE(Cash Game) would be that it is *not* a zero-sum game.
Discussing GTO Quote
10-27-2011 , 09:55 PM
Quote:
Originally Posted by EvilSteve
Sure, but what I was responding to...
"3+ player games usually don't have Nash Equilibria"
They definitely do have NE. Possibly multiple NE as you later stated.
Yeah, I guess that's wrong. I guess I was thinking along the lines of 3+ player poker games have no unexploitable strategies. I'm still 99.99% sure that's true (for realistic stack sizes).
Discussing GTO Quote
10-27-2011 , 09:58 PM
Quote:
Originally Posted by SquirrelsUnite
Yeah, I guess that's wrong. I guess I was thinking along the lines of 3+ player poker games have no unexploitable strategies. I'm still 99.99% sure that's true (for realistic stack sizes).
Could probably construct trivial examples where there are unexploitable strategies but I think that's what you're trying to eliminate with the "realistic stack sizes" clause. But generally collusion (implicit or explicit) is going to be a problem with multiplayer equilibrium strategies, so to that extent I agree.
Discussing GTO Quote
10-27-2011 , 09:58 PM
Quote:
Originally Posted by Clicken
(2) A "GTO-bot" can guarantee a payout of 0 for itself by folding. Folding is 0EV, even from the blinds.
Wasn't this covered by Sklansky in one of his books? In this game you fold all your hands except AA where you shove.
Discussing GTO Quote
10-27-2011 , 10:01 PM
Quote:
Originally Posted by SquirrelsUnite
It's been proven that 2 player fixed-sum games have mixed strategy NE.
I wasn't disputing this.

The point was that it's not been proven that 2 player fixed-sum games have non-degenerate mixed strategy NE.
Discussing GTO Quote
10-27-2011 , 10:07 PM
Quote:
Originally Posted by Clicken
I wasn't disputing this.

The point was that it's not been proven that 2 player fixed-sum games have non-degenerate mixed strategy NE.
What does non-degenerate mean here? Not pure strategy?
Discussing GTO Quote
10-28-2011 , 12:45 AM
Quote:
Originally Posted by SquirrelsUnite
What does non-degenerate mean here? Not pure strategy?
Yes - a mixed strategy in which no pure strategy is assigned a probability of 1.

Also in retrospect it was incorrect to include the qualification "2 player fixed-sum games". I should just have said "games", but just copy and paisted from your post.
Discussing GTO Quote
10-28-2011 , 05:05 AM
Quote:
Originally Posted by ArBar
Uhh, can you prove that there is a NE here? Infinite stacks=infinite actions. Not to mention potential problems of infinite pay offs, and having to do dumb stuff that won't work out nicely with math. Pretty bad example, and wrong stuff in this post.
Ok, yes, infinite stacks was a bad example. But that got me thinking about what happens to the NE's as the stack size tends to infinity: Will there be some kind of convergence of the NE's? Doubtful, since as stack size grows we keep adding new possible actions.

My intention was to counter the notion that GTO actions are based only on (stack size, pot, position, board, hole cards) but not on the past betting for the current hand.

So my second example was better: For finite stack sizes heads up if we consider these two betting lines:

a) Villain checks | GTO bets pot | Villain calls
b) Villain bets pot | GTO calls

The stacks, pot, position, etc. are the same for (a) and (b). Yet it seems likely and intuitive, at least to me, that the GTO strategy for future betting after (a) will not be the same as after (b) even for the same exact hand, because:

After (a) Villain supposedly has calling range, GTO has a betting range
After (b) Villain supposedly has a betting range, GTO has a calling range

Since each player has a different range of hands in (a) vs (b) one would not expect the GTO strategy to be the same following (a) vs following (b).

Last edited by bobf; 10-28-2011 at 05:11 AM.
Discussing GTO Quote
10-28-2011 , 10:08 AM
Quote:
Originally Posted by EvilSteve
A Nash equilibrium is a set of strategies such that no player can do better by unilaterally changing strategies. So this concept does exist for multi-player games, but choosing a NE strategy only guarantees you'll be unexploitable in a two player context. With 3+ players you could be playing a NE strategy and still get exploited if multiple opponents deviate from NE, but not if only one of them does (hence the "unilaterally" caveat).
The bolded is wrong. In a 3+ player game you could get less than your equilibrium EV by playing the NE, even if only one player deviates, and even if the game is zero sum. All the NE guarantees is that if only one player deviates it won't benefit them, but their deviation might hurt both you and them while benefiting a third player. This can happen for example in a 3 player poker tournament where the top two positions pay. From the small blind you go all-in, the big blind calls wider than he should, hurting you both while benefiting the third player.
Discussing GTO Quote
10-28-2011 , 10:41 AM
Quote:
Originally Posted by Clicken
Yes - a mixed strategy in which no pure strategy is assigned a probability of 1.

Also in retrospect it was incorrect to include the qualification "2 player fixed-sum games". I should just have said "games", but just copy and paisted from your post.
Well, consider this: You are facing a bet on the river. Your calling frequency is determined by size of the bet in relation to the size of the pot. Unless the number of hand combinations line up perfectly there's going to be a hand that you're calling sometimes and folding sometimes.

Given that there are millions of different boards and thousands to millions of ways to reach the river on each one it's practically impossible that it always just works out.
Discussing GTO Quote
10-28-2011 , 11:08 AM
Quote:
Originally Posted by SquirrelsUnite
Well, consider this: You are facing a bet on the river. Your calling frequency is determined by size of the bet in relation to the size of the pot. Unless the number of hand combinations line up perfectly there's going to be a hand that you're calling sometimes and folding sometimes.

Given that there are millions of different boards and thousands to millions of ways to reach the river on each one it's practically impossible that it always just works out.
This is convincing.

Clearly, the existence of a pure strategy NE for poker is unlikely. I just thought that it was important to note that the fixed point theorems' proofs of the existence of mixed NE in general might not in themsleves be sufficient to prove the existence of a non-degenerate variety for poker. Something else would be required - such as an argument along the lines that you've just offered here.
Discussing GTO Quote

      
m