“Although psycholinguists and psychologists have long studied the tendency of linguistic strings to evoke mental images in hearers or readers, most computational studies have applied this concept of imageability only to isolated words. Using recent developments in text-to-image generation models, such as DALLE mini, we propose computational methods that use generated images to measure the imageability of both single English words and connected text. We sample text prompts for image generation … [and] subject these prompts to different deformances to examine the model’s ability to detect changes in imageability caused by compositional change.”
Find the paper and full authors list at ArXiv.