'Blend: A Unified Data Discovery System'

by Noah Lloyd

October 26, 2023

“Data discovery is an iterative and incremental process that necessitates the execution of multiple data discovery queries to identify the desired tables from large and diverse data lakes. Current methodologies concentrate on single discovery tasks such as join, correlation, or union discovery. However, in practice, a series of these approaches and their corresponding index structures are necessary. … This paper presents BLEND, a comprehensive data discovery system that empowers users to develop ad-hoc discovery tasks without the need to develop new algorithms or build a new index structure.”

Find the paper and the full list of authors on arXiv.

Renée Miller