Pokersnowie question - Page 22 - Poker Theory

Thanks for that. The RNG is the standard C++ random generator. I appreciate the fact that since our software is not open source, and we are not monitored by any third party, you don't have a way to make heavy statistical analysis on it.

Like RusyBrooks said, the C++ rand() function is not considered to be a great RNG (http://channel9.msdn.com/Events/Goin...idered-Harmful ). I have no idea how important this is for a software like pokersnowie, but obviously any serious poker room such as stars would use something more refined.

It is also quite possible that your learning algorithm may be biased if you use a poor RNG.

Quote:

Originally Posted by RustyBrooks

I honestly don't think most people are particularly worried about this.

Not most people, indeed. But there have been claims about it (even ITT), and during a snowie challenge on a French website, a challenger made public his HH and claimed that snowie was cheating. By using snowie's preflop ranges I showed that snowie's rungood was very unlikely (~0.5%). If anyone wants to investigate further, or check my analysis, I can share the HH and my calculations. I have never considered this analysis to be a proof that snowie was cheating, but it obviously does legitimate doubts some players may have.

Therefore, even if this is not the main issue currently, I think it's good to see that they are making serious efforts towards more transparency.

Quote

03-29-2014 , 09:16 PM

#527

oracle3001

adept

Join Date: Nov 2006 Posts: 790

I strongly back-up the other posters who have pointed out the massive deficiencies in using the C++ rand function (or for that matter any other simplistic RNG) for any task whereby good pseudo "randomness" is essential. Seeding it properly is another issue that needs serious consideration.

Personally, I try to avoid it like the plague, especially for anything whereby I am using stochastic methods e.g. MCMC, etc.

Last edited by oracle3001; 03-29-2014 at 09:23 PM.

Quote

03-30-2014 , 03:47 PM

#528

TopPair2Pair

veteran

Join Date: Nov 2009 Posts: 3,366

Agreed. Even Rand() within Excel sucks stained pants.

Quote

03-31-2014 , 03:45 PM

#529

PokerSnowie

verified representative

Join Date: Jul 2013 Posts: 47

Quote:

Originally Posted by RustyBrooks

The absolute best way to remove the chance of cheating is to have your software not generate cards at all. Ideally you would have it set up so that it has no more information than any poker "client", i.e. it gets told it's cards and the board, bet sizes, etc, but all dealing and scoring is done outside the client. This way you don't have control of the cards. Provided you use one of the more standard means available to do this, it would also let you participate in matches against other AI agents.

Hi RustyBrooks,

I fully agree with your comments. Finally it would mean playing on a third party client and server.

The features we added to export the deck and probs in order to give more control to the users were very important in my opinion, because the objective was to show that PokerSnowie is not taking any unfair advantage and it is not cheating while playing vs. the user.

About the random generator, I read some more comments and suggestions also from other users, like Babarberousse, TopPair2Pair and Oracle3001. Thanks for that. We are in fact aware that it's not the ideal RNG, but on the other hand we did not expect this to be an issue for the challenges purposes.

In particular we are using this one:
http://qt-project.org/doc/qt-4.8/qtglobal.html#qrand

Regarding the AI training, here a much more solid RNG is used.

Best Regards,

Roberto Gobbo
CEO - Snowie Games Ltd.

Quote

03-31-2014 , 04:27 PM

#530

PokerSnowie

verified representative

Join Date: Jul 2013 Posts: 47

Quote:

Originally Posted by Tackleberry

"Thanks to people’s feedback and internal analysis, we found several leaks in PokerSnowie's strategy. We have fixed some of them by reviewing and changing our learning algorithm."

Can you explain that a bit more (w/o revealing any "secrets")? For obvious reasons it is / was not possible to fix the "strategy", now you said you fixed the "learning algorithm". How can that fix improve Snowies strategy in short time - when it´s current strategy is the result of billions and billions of played hands so far?

Hi Tackleberry,

you are right to wonder. In fact the answer is not easy, without entering into to many technical details.

Occasionally, in order to fix some issues in a very short term, we do like we wrote in the answer to David Sklansky: "for bet sizes that Snowie has never seen in training (like a 1% pot bet or a 100 times pot bet), algorithms have been implemented to handle these situations as good as possible."

Like you said, a full solution of an issue like this would need a new training to run and this would require some time. I admit that the short term solution in having some algorithms to handle these situations is suboptimal and it's more a patch than a real solution. When possible, we always try to avoid patches.

This point might seem conflicting with our statement that we don’t introduce expert knowledge in the AI, and in fact it partially is – apologies for that. But it should be clear that these patches are used only in very special and very uncommon situations, where we think it’s not worth to spend so much time, because they would not bring value to our customers. The 1% pot bet issue for example, is something which makes the AI technically exploitable, but on the other hand no one would play this strategy online and therefore fixing it was more to avoid this exploitation than to make it more useful for our customers. For this purpose, in the short term, we felt that a patch was enough.

In fact, because our CPU time is obviously limited, we need to choose on which aspects of PokerSnowie to focus. If we feel that how to play vs. a 1% pot bet is not crucial for the PokerSnowie goal, then we prefer to use our CPU for other improvements, like for example introducing other bet sizes. Obviously when you start a new training you can’t be sure if and when you will get the results.

Also, when I write “We fixed some of them”, I don’t always mean that the AI is ready. I mean that we fixed what we believe was causing the problem, but we still need to wait for the training to progress. Finally, in certain situation it’s not needed to start the training from scratch.
I hope this answer, at least at high level

Best Regards,

Roberto Gobbo
CEO - Snowie Games Ltd.

Quote

04-01-2014 , 08:38 PM

#531

buggits30

adept

Join Date: Jan 2012 Posts: 1,181

if you guys want some lolz heres a video using pokersnowie that makes little sense

http://www.pokersnowie.com/blog/2014...v#.UztY0PRDtv1

Quote

04-01-2014 , 10:09 PM

#532

andyg2001

veteran

Join Date: Mar 2009 Posts: 2,905

Quote:

Originally Posted by buggits30

if you guys want some lolz heres a video using pokersnowie that makes little sense

http://www.pokersnowie.com/blog/2014...v#.UztY0PRDtv1

Why do you think it makes little sense?

Quote

04-01-2014 , 10:43 PM

#533

oracle3001

adept

Join Date: Nov 2006 Posts: 790

Quote:

Originally Posted by buggits30

if you guys want some lolz heres a video using pokersnowie that makes little sense

http://www.pokersnowie.com/blog/2014...v#.UztY0PRDtv1

Is that strategy video an April Fools joke?

Quote

04-02-2014 , 01:00 AM

#534

buggits30

adept

Join Date: Jan 2012 Posts: 1,181

Quote:

Originally Posted by andyg2001

Why do you think it makes little sense?

Well one it doesn't bet the river because it can't balance it's range, but then it check/bombs in a spot where it can never balance it's check/bomb range?

Quote

04-02-2014 , 01:47 AM

#535

andyg2001

veteran

Join Date: Mar 2009 Posts: 2,905

Quote:

Originally Posted by buggits30

Well one it doesn't bet the river because it can't balance it's range, but then it check/bombs in a spot where it can never balance it's check/bomb range?

It can't have a check range in the spot it bluffed shoved as it's facing a bet, and call ip. It does have a bet range in the spot it checked.

Pokersnowie does get there with some combos of the nut flush. It does bet most of them by the turn but checks some.

Quote

04-02-2014 , 04:37 AM

#536

oracle3001

adept

Join Date: Nov 2006 Posts: 790

Quote:

Originally Posted by andyg2001

It can't have a check range in the spot it bluffed shoved as it's facing a bet, and call ip. It does have a bet range in the spot it checked.

I think you need to watch the video again, you have the action sequence wrong.

Quote

04-02-2014 , 09:44 AM

#537

buggits30

adept

Join Date: Jan 2012 Posts: 1,181

Quote:

Originally Posted by andyg2001

And it checks those nut flushes on the turn and river?

Quote

04-02-2014 , 12:33 PM

#538

andyg2001

veteran

Join Date: Mar 2009 Posts: 2,905

Quote:

Originally Posted by oracle3001

I think you need to watch the video again, you have the action sequence wrong.

I was sure it was CO vs SB, BB. But on looking again it's Co vs btn, SB. And it does bet all it's flushes on the turn when this is the case.

It check raises 12.28% of it's range. The only strong hand it has is a set of twos. Most of the time it has a pair below top pair, mainly a ten.

Quote

04-02-2014 , 01:45 PM

#539

ProPokerHUDs

centurion

Join Date: Nov 2013 Posts: 122

Quote:

I was sure it was CO vs SB, BB. But on looking again it's Co vs btn, SB.

Exactly, and so the check raise jam line is complete nonsense on several levels.

Quote

04-02-2014 , 05:04 PM

#540

samooth

veteran

Join Date: May 2009 Posts: 3,350

lol wp snowie

Quote

04-02-2014 , 07:40 PM

#541

Elmo_Shedtax

stranger

Join Date: Oct 2009 Posts: 4

PokerSnowie

Can you explain to subscribers (i.e. me), either here or within the software, the methodology used to train Team PokerSnowie members so that they were able to become winning cash game and tourney players within just a few weeks?

Thanks
Simon

Quote

04-03-2014 , 12:17 AM

#542

Seven7s

adept

Join Date: Apr 2010 Posts: 910

Let's stop telling Snowie where it's leaks are, right now it has quite a few leaks, I know at least i'd prefer it that way, we don't need a bunch of bot type programs making their way onto the tables.

Seems like it'd only be a matter of time where someone creates a program outside of snowie using Snowie's strategy that sites can't detect. Not that i'd have a problem playing current snowie, an improved one, where ppl can use real-time would be disasterous.

Quote

04-03-2014 , 05:23 AM

#543

PokerSnowie

verified representative

Join Date: Jul 2013 Posts: 47

Quote:

Originally Posted by Elmo_Shedtax

Dear Elmo_Shedtax,

if you refer to Zeljko story published here:

http://www.pokersnowie.com/about/pok...timonials.html

as described, it took few months full time of hard work to reach a strong level.

There isn't a specific pre-defined methodology, as this field is all new and everything is evolving very fast. The first stage for a beginner who would like to learn poker by using our product, would be to play a lot of hands against PokerSnowie.

He could then review the hands, give a look to the blunders, the balance, learn the ranges in various spots, learn the PF strategy, drill down the areas of the game and see where he has have more weaknesses (btw, with our new 2.6 release coming out today, we have improved the filters, exactly to help this process).

As we all know, improving at poker requires a lot of hard work, discipline and we also recognize that more talented players could improve faster than others.

Best Regards,

Roberto Gobbo
CEO - Snowie Games Ltd.

Quote

04-03-2014 , 08:11 AM

#544

PokerSnowie

verified representative

Join Date: Jul 2013 Posts: 47

Quote:

Originally Posted by Qlka

I did some checking for short stack scenarios and ranges proposed by Snowie are far away from push/fold equilibrium. That means solution for the full game is even further...

Snowie has been trained for very short stacks up to very deep stacks (400 blinds). The neural networks try to avoid the biggest errors first. For very short stacks the size of possible errors is much smaller than for deep stacks, so it is possible that Snowie is in fact better for deep stacks than for short stacks.

The PokerSnowie Team

Quote

04-03-2014 , 08:32 AM

#545

PokerSnowie

verified representative

Join Date: Jul 2013 Posts: 47

Quote:

Originally Posted by fontaine

I just completed the second challenge. Won 1482bb in 6017 hands for a win rate of 24.6 bb/100. To be fair i ran really hot the last 1k hands, the win rate after 5000 hands was only 7.5 bb/100. It was much harder this time, after Snowie learned how to combat my minbets in 3bet pots.

I played 66/64 with 5.38 AF. 100% open in utg, btn and sb positions, and 100% 3bet bb vs sb. I defended against most 3bets and 4bets when in LP.

That's true Fontaine, and thanks for have tried it again.

Just to recap, originally you were min-betting into blown-up pots, like betting $1 into a $50 pot. Snowie didn't handle these situations well. An algorithm was implemented to react correctly to those mini-bets and the problem was fixed.

In general we believe it is very possible to win a 5000 hands challenge against Snowie, especially by pumping up the pot each time, the variance becomes extraordinary high, so that you have a good chance of being ahead after 5000 hands. A more reliable challenge would need many more hands.

The PokerSnowie Team

Quote

04-03-2014 , 08:47 AM

#546

plexiq

veteran

Join Date: Apr 2007 Posts: 2,558

Quote:

Originally Posted by PokerSnowie

If Snowie fails to find decent solutions for a subgame where very strong solutions can be calculated in a matter of seconds using traditional techniques, what makes you assume it can find better solutions for cases where the complexity is dozens of magnitudes higher?

This is marketing talk at its best, unless you have data to back it up.

Quote

04-03-2014 , 09:09 AM

#547

Sevendeuceo

centurion

Join Date: Jul 2009 Posts: 181

Quote:

Originally Posted by plexiq

I don't think this objection is all that strong. The complexity of the task is relative to the resolution method. The short stack pre-flop games with simple decision trees admit of precise analytic solutions that can be computed easily. But neural-network training algorithms can sometimes rapidly find solutions to problems that don't admit of analytic solutions at all and, on the other hand, can be stumped by problems that can easily be solved analytically. It's more akin to an empirical trial and error method where we have an idea of what a good result looks like but we don't understand at all (initially) why some method or other works best. Of course, when an efficient method is found -- semi-empirically -- we can investigate why it works and, hopefully, derive general principles.

That said, it's true that performance in cases where we know the true GTO solution provide an ideal benchmark to the effectiveness of the training algorithm. I'll give you that.

Last edited by Sevendeuceo; 04-03-2014 at 09:22 AM.

Quote

04-03-2014 , 09:28 AM

#548

plexiq

veteran

Join Date: Apr 2007 Posts: 2,558

By "traditional techniques" i was actually referring to iterative algorithms like Fictitious Play or CFRM, not solving analytically. These algorithms actually very roughly resemble the iterative training of a neural-network.

Your example regarding "simple" problems stumping neural nets while much harder problems can be solved very well is misleading. This typically refers to problems with entirely different structure, which is not the case here imo.

Can you give an example of a problem where neural nets fail to find good solutions to very simple instances but succeed in solving huge instances of the same general structure?

Quote

04-03-2014 , 10:10 AM

#549

Sevendeuceo

centurion

Join Date: Jul 2009 Posts: 181

Quote:

Originally Posted by plexiq

Can you give an example of a problem where neural nets fail to find good solutions to very simple instances but succeed in solving huge instances of the same general structure?

(On Edit: I regrettably confused the push/fold game with the raise/shove game below. That seriously undermines my argument, but I'll leave it since it raises an issue regarding training algorithms and the indifference principle for GTO mixed strategies.)

That's a good question.

In the case of the push/fold game, it was brought up that the ranges were wrong. How about the frequencies for the bet-call action or the combined (open-fold + bet-fold) frequency? It might be more difficult for a neural network training algorithm to home in on all the correct GTO ranges in this sort of cases because the equilibrium condition dictates that open-folded hands have the exact same EV as raise-folded hands at that decision point. (At equilibrium, the BB shove frequency must be such that the SB is indifferent between the two actions with hands that never see a showdown). The very slight differences in EV stems only from card removal effects on the BB shoving frequency. But those effects can either be computed analytically or discovered by a neural network training method that is focused on finding them using huge samples that would make those small effects statistically significant in between random choices of ranges. Pokersnowie's training algorithm understandably isn't optimized for defining ranges that have so very little effects on overall EV.

This failure for Snowie to define the proper open-fold and raise-fold ranges then will translate in a corresponding failure to home in on the correct individual frequencies for those two action. (It ought to find the correct calling frequency, though, since the larger EV of the hands he can call a shove with dictate the proper action).

More complex decision trees that don't include such indifference points will possibly more easily be solved -- or will be solved modulo the detailed partition of ranges between the (roughly) indifferent decisions when there are some such points.

Last edited by Sevendeuceo; 04-03-2014 at 10:33 AM.

Quote

04-03-2014 , 11:00 AM

#550

plexiq

veteran

Join Date: Apr 2007 Posts: 2,558

Quote:

More complex decision trees that don't include such indifference points will possibly more easily be solved -- or will be solved modulo the detailed partition of ranges between the (roughly) indifferent decisions when there are some such points.

I agree that indifference will likely not play the same role deep-stacked that it does in the raise/fold game. (And on a side note, the raise/fold game also gives the other iterative algorithms i mentioned earlier a fairly hard time.)

The general indifference problem is still present whenever we have pure bluffs as part of a range though. So at least on the river this can be expected to be an extremely common situation deep-stacked. Snowie needs to somehow figure out the correct bluffing ratio based on the tiny EV differences caused by removal. If it fails to do this for simple Preflop games i would not expect Snowie to solve the same general problem Postflop.

There are plenty of other factors that make the deep-stacked games much, much harder to solve generally, so we really have absolutely no idea if Snowie can solve them better. Theoretically possible? I guess. But absent any evidence to support it, that's going to be very tough to sell.

Quote

Page 22 of 93

First

2 12 17 18 19 20 21 22 23 24 25 26 27 32 42 72

Last

Post Reply Subscribe

...

Page 22 of 93

First

2 12 17 18 19 20 21 22 23 24 25 26 27 32 42 72

Last