'MultiPL-E: A Scalable and Polyglot Approach to Benchmarking Neural Code Generation'

October 30, 2023

“Large language models have demonstrated the ability to generate both natural language and programming language text. Although contemporary code generation models are trained on corpora with several programming languages, they are tested using benchmarks that are typically monolingual. The most widely used code generation benchmarks only target Python, so there is little quantitative evidence of how code generation models perform on other programming languages. We propose MultiPL-E, a system for translating unit test-driven code generation benchmarks to new languages.”

Find the paper and full list of authors at IEEE Transactions on Software Engineering.

Arjun Guha

Computer Science
