'StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code'

‘StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code’

February 2, 2025

“Code LLMs have the potential to make it easier for non-experts to understand and write code. However, current CodeLLM benchmarks rely on a single expert-written prompt per problem, making it hard to generalize their success to non-expert users. In this paper, we present a new natural-language-to-code benchmark of prompts written by a key population of non-experts: beginning programmers. … We use StudentEval to evaluate 12 Code LLMs and find that StudentEval is a better discriminator of model performance than existing benchmarks.”

Find the paper and full list of authors in ACL Anthology.

View on Site

Arjun Guha

Computer Science, Education

‘StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code’

Related

Grant to support experiential learning and creation of offshore wind workforce

Best presentation award on just-in-time learning

NSF grant awarded for adaptive clothing

Patent for ‘lightweight pose estimation network’ goes to Fu

DARPA grant to enhance mixed reality security

Patents for experimental virtual reality methods

Patent for efficient computation

‘Human Mobility Is Well Described by Closed-Form Gravity-Like Models Learned Automatically from Data’

‘Foundations of Scalable Systems’