Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Strategies for for games with incomplete information

Tags:

game-theory

Are there general strategies for games with incomplete information, especially trick-taking games like Bridge and Doppelkopf?

I am interested in ways to implement AI strategies for such games.

The bounty is for the answer with the best description of a particular strategy.

like image 261
fuz Avatar asked Jul 07 '11 19:07

fuz


People also ask

What is the incomplete information in game theory?

Informally, a game of incomplete information is a game where the players do not have common knowledge of the game being played. This idea is tremendously important in capturing many economic situations, where a variety of features of the environment may not be commonly known.

What are the game strategies?

Therefore on the basis of outcome, the strategies of the game theory are classified as pure and mixed strategies, dominant and dominated strategies, minimax strategy, and maximin strategy.

What is imperfect information game theory?

Imperfect information appears when decisions have to be made simultaneously, and players need to balance all possible outcomes when making a decision. A good example of imperfect information games is a card game where each player's card are hidden from the rest of the players.

Which methods can be used to solve game theory problem?

Specifically, the solution methods and techniques discussed are Nash Equilibrium Method; Pareto Optimality Technique; Shapley Values Technique; Maximin-Minimax Method; Dominance Method; Arithmetic Method; Matrix Method; Graphical Method and Linear Programming Method.


1 Answers

I would like to contribute with some concrete information for the card game Doppelkopf, which the author asked exemplarily. In 2012 a master thesis by Sievers was written where he adopted the UCT algorithm for the Doppelkopf game.

UCT normally assumes a perfect information game, thus he first solves the "card assignment" problem, which is to guess a card assignment for each player based on some already known cards.

After solving this, he tried two ways to perform the algorithm with his solution to the card assignment problem:

1) Guess a card assignment for each UCT-tree and look at the average of multiple trees. He calls this strategy ensemble UCT.

2) Take a single uct tree and guess for each rollout a new assignment. In the selection phase of UCT you simply ignore all inconsistent children. He calls this strategy single UCT.

My feeling is that 2) makes a stronger AI but it seemed to be weaker, which he pointed out more clearly in a follow-up conference paper from 2015.

Inspired by AlphaGo's success, I started a project with a friend for his bachelor thesis and he made a policy neural network using a character based LSTM to guide the selection process of the UCT algorithm. His bachelor thesis only covers some test results for the ensemble-UCT but I already tested it for the single UCT player too and it makes a much stronger AI. I guess this is because the single UCT player benefits alot more from reducing the search space more effectively.

So this answer is more or less the same as @charley have been given but a little more concrete.

like image 97
Maikel Avatar answered Oct 20 '22 01:10

Maikel