Fictitious Play

In real life, players may be myopic and only best respond to other players without calculating or playing a Nash Equilibrium strategy. However, when playing a game repeatedly, they may adapt their strategy based on the history of other players’ actions and eventually converge to a Nash equilibrium. This process is learning in games, as players adapt their beliefs about other players and act rationally w.r.t the beliefs.

Fictitious play is an iterative method of calculating Nash Equilibrium using the above idea without actually playing the game repeatedly. Formally, each player maintain a belief that uniformly mixes the other players’ historical actions $μ_{i} = \frac{1}{t} \sum_{τ = 1}^{t} s_{- i}^{τ}$ , and best respond $σ_{t + 1} \in BR_{i} (μ_{i})$ . Note that in fictitious play, players have a common belief as it’s generated by the observed history of actions.

Convergence of strategies

Let $(σ^{t})_{t \in N}$ be the strategy profile sequence generated by fictitious play. If there exists $T$ such that $σ^{t} = δ_{s}$ with $s \in S$ for all $t \geq T$ , then $s$ is a PSNE. And if there exists $T$ such that $σ^{T} = s^{*}$ with $s^{*}$ being a strict NE, then $σ^{t} = s^{*}$ for all $t \geq T$ .

Convergence of beliefs

We say $(σ^{t})_{t \in N}$ converges to a strategy profile $σ \in Σ$ in the time-average sense if $μ_{- i}^{t} (s_{i}) \to σ (s_{i})$ as $t \to \infty$ for all $i$ and $s_{i}$ , i.e.,
$t \to \infty lim \frac{1}{t} τ = 1 \sum t 1 {s_{i}^{τ} = s_{i}} = σ (s_{i}),$
where $σ (s_{i})$ is the marginal probability of $σ$ on $s_{i}$ .

Networked Networks

Backlinks

Graph View

Fictitious Play

Fictitious Play

Backlinks

Graph View