Quote:
Originally Posted by AltruisticRaven
I think that if Deepmind does decide to focus on poker, the challenge will be an exploitative one. Approximating equilibria is likely a trivial task to DeepMind.
An exploitative challenge is open ended and involves poorly defined elements like human psychology. It would be extremely interesting to see how a sophisticated neural network would preform at that compared to humans. I have a feeling that it would be somewhat easy to get it better than most humans, but quite difficult to reach a superhuman level.
Humans and robots learn in the same way:
Trial and error +
evolution.
You make a interaction with the environment, if the interaction is helpfull to reach your goal you save it in the strategy and pick that move more often, if it fails you throw it away.
Humans learn with 7Billion CPUs at the same. Every time one CPU(human brain) learns something he tells it to the other CPUs trough our system of information sharing (Language + internet). Each human will use an algorithm to update his mind with the new info. This way the knowledge spreads like a virus.
The way
humans learnt to exploit other humans at poker works like this as well: Maybe 10M human brains played poker online, Each time some1 learns an exploitative move or population tendency while playing or studying the data of his sessions he updates his own mind+ shares it with some of his friends or students or posts it on a internet forum.
If a
robot wants to get good at exploiting humans, he also will need to play vs humans. (Or at least use a database of human play). There is no other way. He will need to see the way humans play, then trial and error on the best way to exploit, then trial and error too detect the adjustments
The
problem is: (This is the problem for all real-life robot learning).
This learning takes
time. The robot can play 100K hands vs itself in one minute, because it can simulate the task on a pc. But it can not play 100K hands vs humans in one minute. It will take way more time. There is no easy way to speed-up this process This is why robots are crushing GO and CHESS, and will get superhuman all all pcgames+ doing stuff on the internet,long before they will get superhuman at tasks that require difficult to simulate slow interaction with humans.
However if u would feed a large DB of hands to the google AI, or pokerstars would be OK with letting loose say a couple of hundred Google bots in the games, I think it would get way superhuman at exploiting. humans arent that good at complex number processing
speaking mostly about HU, dont know enough about 6m
Last edited by icoon; 12-07-2017 at 11:14 AM.