'Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task'

July 31, 2023

“Language models show a surprising range of capabilities, but the source of their apparent competence is unclear. Do these networks just memorize a collection of surface statistics, or do they rely on internal representations of the process that generates the sequences they see? We investigate this question in a synthetic setting by applying a variant of the GPT model to the task of predicting legal moves in a simple board game, Othello.”

Find the paper and full list of authors at OpenReview.

David Bau

Computer Science

Related

NSF grant awarded for adaptive clothing

Patent for ‘lightweight pose estimation network’ goes to Fu

DARPA grant to enhance mixed reality security

Patents for experimental virtual reality methods

Patent for efficient computation

‘Human Mobility Is Well Described by Closed-Form Gravity-Like Models Learned Automatically from Data’

‘Foundations of Scalable Systems’

‘Network Coding for Engineers’

‘Practical Business Analytics Using R and Python’