(3/n) Text tokens are compared to actions in AlphaGo. To decode each token, several “simulations” are performed, where each simulation has 4 stages: select, expand, evaluate, and backup. After all simulations, a token is decoded by referencing the visit counts of the next token.