Interesting, but seems like a fundamentally worse version of chain-of-thought, since that will also give extra "intermediate" tokens but can provide information beyond the null pause token.
Also wow it's still so crazy how big the gains from CoT are, what a cool paper