OK, I think I have time to write a reply to all of this now:
Quote:
Originally Posted by Lapidator
My point it that it will always be more profitable to play an exploitable strategy then a GTO strategy, simply because our opponents (at LLSNL) are not capable of playing a GTO strategy. We don't even have to know whether such a strategy might exist -- because we can easily use common sense to show that our opponents could not execute it if it did.
Quote:
Originally Posted by AltronIV
it's nearly impossible to "solve" the game.
So, I didn't say anything about poker itself in my OP because I wanted to limit it to the bare-bones mathematical facts of game theory, but now I want to sort of give my opinion on game theory and poker.
Both of you seem to echo a sentiment that I've heard before, which is that the reason we shouldn't care about GTO play in poker is that poker is too hard to solve. The implication is that if we knew how to play GTO, we would, but since no one can, we don't need to worry about it.
Both of you are missing the point.
I am not claiming that the reason we don't want to try playing GTO poker is because it's too hard. I'm claiming that
even if we knew how to play GTO poker, we probably still shouldn't want to.
To show where I'm coming from, I'm going to give an example of a third game that illustrates a concept that I didn't get to show in my OP--iteratively dominated strategies. Let's say you are playing a game where, similar to RPS, you both make your "move" at the same time and compare results. Only this time, let's change the payout matrix. Let's also change the names of the strategies since this isn't RPS anymore. Imagine each player has 3 strategies: let's call them C, E, and G.
The payout matrix, which I'll write as ordered pairs of strategies and payouts for the first player in a zero-sum game (because this format makes it really hard to align matrices), looks like this:
(C,C) = 0_____(C,E) = -15_____(C,G) = -6
(E,C) = 15_____(E,E) = 0______(E,G) = -3
(G,C) = 6______(G,E) = 3______(G,G) = 0
Given this payout matrix, how would we solve for the Nash equilibrium?
The first step, as always, is to look for dominated strategies. When we do, this is what we find:
-G is clearly not dominated. There is no other row where the third entry is at least 0.
-E is clearly not dominated. There is no other row where the first entry is at least 15.
-C clearly *is* dominated. No matter what column you are in, the payout for C is less than the payout for G, so C is dominated by G. (It's also dominated by E, but we only needed one "witness".)
That means a Nash equilibrium cannot involve C. So once we know that both players will only play E and G, we can "reduce" the game by making a 2x2 matrix where the only strategies are E and G. It obviously will look like this:
(E,E) = 0______(E,G) = -3
(G,E) = 3______(G,G) = 0
Now here's the next step: once we have this reduced matrix, it is now clear that a Nash equilibrium cannot include E! The reason is that once we eliminate C, E now appears to be dominated by G. We've thrown away the only strategy that stopped E from being dominated, so now we can treat it as if it is.
This means that the GTO strategy for this game is unique and static--throw G 100% of the time.
Let's back up and consider this for a second. Here we have a game where, if you play GTO and your opponent plays GTO, you will both have 0EV. But if your opponent deviates from GTO and you don't, you will have a positive outcome. This is exactly the "ideal" situation that many people erroneously believe GTO to be all the time (hopefully by this point in the thread you know why this is erroneous). In this case, G functions as a "perfect" strategy. And yet--and yet!--
even in this "ideal" situation, it would be wrong to jump to the conclusion that you should always be playing G.
Why is that? Well, solving for the Nash equilibrium means making a leap in logic--one that clearly does not apply to a game like poker. The only reason we were able to conclude that we should not throw E was because we assumed our opponent would not throw C.
Clearly, in this game, if our opponent is deviating from GTO, but only doing so by mixing up E and G, we maximize our expectation by sticking to G. But if our opponent will not always avoid C, all bets are off and now we might do even better than GTO by putting E back on the table.
Here is an example, which I'll leave as questions and let you guys answer and interpret however you want. Suppose you are playing an opponent who is randomizing his play so that he will play C one-third of the time, and G the other two-thirds of the time. If you know your opponent is doing this:
1) What should you do to minimize your probability of a negative outcome?
2) What should you do to maximize your EV?