'Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking'

June 2, 2024

“Fine-tuning on generalized tasks such as instruction following, code generation, and mathematics has been shown to enhance language models’ performance on a range of tasks. Nevertheless, explanations of how such fine-tuning influences the internal computations in these models remain elusive. We study how fine-tuning affects the internal mechanisms implemented in language models. As a case study, we explore the property of entity tracking, a crucial facet of language comprehension, where models fine-tuned on mathematics have substantial performance gains.”

Find the paper and full list of authors on arXiv.

David Bau

Computer Science
