‘Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task’

“Language models show a surprising range of capabilities, but the source of their apparent competence is unclear. Do these networks just memorize a collection of surface statistics, or do they rely on internal representations of the process that generates the sequences they see? We investigate this question in a synthetic setting by applying a variant of the GPT model to the task of predicting legal moves in a simple board game, Othello.”

Find the paper and full list of authors at Open Review.

View on Site: ‘Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task’