Quote:
Originally Posted by Sam Ganzfried
It's not clear to me if it's doable, and what the scientific value is of computing a bound on our exploitability like this. When I have some actual time (i.e., after I have my diploma in hand) I'll think about it some more and get back to you.
Come on, man.
We had a thread in late September last year dedicated to Tartanian7:
http://forumserver.twoplustwo.com/29...25/index5.html. In that thread, several posters pointed out that -- since the bot plays a static strategy resembling your best approximation of a NE in 200bb-deep HUNL -- an exploitability number, or at least a lower bound on exploitability, is not just useful but essential for measuring how good the bot/strategy actually is. If you go to the last page of the thread, you'll see it died with exactly this discussion going on, which was eventually left unaddressed.
After a 6 month posting hiatus, you come back and post
Quote:
Originally Posted by Sam Ganzfried
Our 2014 agent Tartanian7 beat everyone with statistical significance in the computer poker competition, including teams that had programmers with expert poker knowledge. Claudico is way better than Tartanian7.
Together with your questioning of the "scientific value" of exploitability, this post indicates that for some reason, you and your team are unwilling to acknowledge the importance of the issue. punter11235 has made a lot of good posts explaining exploitability and its importance -- it's the most natural test, and it lets you/us actually quantify how good the approximation is, rather than relying on relative benchmarks (other bots/humans) whose results are also affected by variance. The fact that you beat other bots and that you (probably) lost with Claudico at a 9ish bb/100 rate really tells us nothing beyond the obvious (x is better than y over z hands).
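For anyone reading along who hasn't followed the earlier threads: exploitability is just the expected winrate of a best-responding opponent against a fixed strategy. A Nash equilibrium has exploitability zero; the further a strategy drifts from equilibrium, the bigger the number. Here's a minimal sketch in rock-paper-scissors (purely illustrative -- in HUNL the best-response computation is enormously harder, but the principle is identical):

```python
# Exploitability in a tiny zero-sum game: rock-paper-scissors.
# Illustrative only -- computing a best response in HUNL is a huge
# computation, but the concept being measured is the same.

def exploitability(strategy):
    """Best-response payoff against a fixed RPS mix (p_rock, p_paper, p_scissors).

    The equilibrium (1/3, 1/3, 1/3) scores 0; any deviation hands the
    best-responding opponent a positive expected payoff.
    """
    p_rock, p_paper, p_scissors = strategy
    # Opponent's expected payoff for each pure counter-strategy
    # (win = +1, loss = -1, tie = 0):
    payoff_rock = p_scissors - p_paper      # rock beats scissors, loses to paper
    payoff_paper = p_rock - p_scissors      # paper beats rock, loses to scissors
    payoff_scissors = p_paper - p_rock      # scissors beats paper, loses to rock
    return max(payoff_rock, payoff_paper, payoff_scissors)

print(exploitability((1/3, 1/3, 1/3)))  # → 0.0, the equilibrium is unexploitable
print(exploitability((0.4, 0.3, 0.3)))  # → ~0.1, over-playing rock gets punished by paper
```

The point of the number is exactly this: it's an absolute yardstick against a worst-case opponent, not a relative one that depends on who happened to sit across the table.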
I'm not coming here to bring any hate; I congratulated you on Tartanian7 and I really enjoyed following this challenge as well. It is simply inconceivable to me that you and your team apparently refuse to engage with the concept. It becomes even more inconceivable after reading
Quote:
Originally Posted by Sam Ganzfried
There are some limitations in academia that wouldn't apply to people outside of academia. We can only work on approaches that can be published in strong conferences. Often reviewers know close to nothing about poker, and a paper that is considered "poker specific" will just be auto rejected. (Earlier versions of) my endgame solving paper that was linked above for example was rejected four times, before it was finally accepted with scores right around the minimum threshold for acceptance. People outside of academia are free to work on poker-specific approaches without regard to whether they would lead to publication.
How are you going to publish papers based on these poker bots without giving out that number? If the answer is "the referees don't demand it", then that is in line with my own experience in academia (although in an unrelated field). meh
Also, this question still stands:
Quote:
Originally Posted by punter11235
Is there any scientific value in approximating something you don't have a measure for?
In contrast, however, you have quoted the Cepheus bot and its exploitability number, implying you understand its usefulness -- which brings me to my last point:
Quote:
Originally Posted by Sam Ganzfried
A big goal of this research is to develop techniques that will apply to domains outside of poker, e.g., national security and medicine, where problems are modeled as imperfect-information games. So we prefer approaches that are more broadly applicable to ones that are purely poker specific.
You absolutely cannot recommend any technique to other domains like national security or medicine without measuring how well it actually works, so as long as you are not giving out that number...
Looking forward to your reply and to all the info on how Claudico works.