Quote:
Originally Posted by David Sklansky
1. In the specific case of drawing to the nuts with one card to come is it possible that the computer would fold getting a smidgeon above correct odds (as the casino bot sometimes does)? If you cant say no with absolute certainty, why not add the instruction not to?
I guess we can't say it doesn't ever do this with absolute certainty, and there's always a bit of noise because it's essentially solved, not perfectly solved. It's certainly possible to check scenarios, one at a time: given a specific hand, board cards and betting, I can ask what Cepheus does (as could you or anyone else - it will always be a bit slow, on purpose - but the strategy query tool is up and running again after the big media hit.)
Why not use a hard coded rule? We've gotten (lightly) burned trying to do this in the past. At one point, back in the days of PSOpti, we added some hard-coded rules to force correct play in a few circumstances where the bot would otherwise do strange things. This code sat there, turned on, effecting everything we did for the next 8 or so years. Eventually, we realised it was still on, tested things with it off, and discovered the hard-coded rules weren't quite correct: once we finally got good enough, they made things just a little bit worse.
So it's safer just to let the algorithm do its own thing. We can verify after the fact that anything strange it's doing is either i) correct, despite expectations, or ii) not worth worrying about.
Now that we've computed the strategy, could we possibly do something like this to clean up a few messy small numbers? Maybe. But why? Good poker players would already know because it was using existing human knowledge, and science wouldn't care...
Quote:
2 Do you actually know the precise strategy that beats the computer or do you just know that it exists? If you know it, what can you say about strategies that are in between your strategy and that strategy?
We know one strategy that "beats" it, in the sense of getting as much money as possible: itself. That's one property of a GTO solution. For two player games, each player's strategy has to be a best response to the opponent's strategy. It's just that they're special best responses. Ignoring the "essentially" part of "essentially solved", there is no way to play against Cepheus that gets more money than Cepheus itself gets. *
If you were emphasising "precise" strategy in the sense of the exact strategy that wins 1/1000 big blinds a hand against our strategy, we don't have that. We computed it, one bit at a time, in order to find out how close to GTO we are, but we throw it away as we go to save space (and time.)
* Breaking even against a GTO doesn't mean you have perfect play: think "always rock" against the 1/3,1/3,1/3 equilibrium in rock-paper-scissors. Similarly, bad play doesn't necessarily mean you will lose against GTO. You'll only lose if you make a mistake -- something NO GTO solution would ever do. That's easy in rock-paper-scissors, but poker is a bit harder and it's hard to make absolutely no mistakes.