An Overview of End-to-End Entity Resolution for Big Data

Επιστημονική δημοσίευση - Άρθρο Περιοδικού uoadl:3063216 15 Αναγνώσεις

Μονάδα:
Ερευνητικό υλικό ΕΚΠΑ
Τίτλος:
An Overview of End-to-End Entity Resolution for Big Data
Γλώσσες Τεκμηρίου:
Αγγλικά
Περίληψη:
One of the most critical tasks for improving data quality and increasing the reliability of data analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to the same real-world entity. Despite several decades of research, ER remains a challenging problem. In this survey, we highlight the novel aspects of resolving Big Data entities when we should satisfy more than one of the Big Data characteristics simultaneously (i.e., Volume and Velocity with Variety). We present the basic concepts, processing steps, and execution strategies that have been proposed by database, semantic Web, and machine learning communities in order to cope with the loose structuredness, extreme diversity, high speed, and large scale of entity descriptions used by real-world applications. We provide an end-to-end view of ER workflows for Big Data, critically review the pros and cons of existing methods, and conclude with the main open research directions. © 2020 ACM.
Έτος δημοσίευσης:
2021
Συγγραφείς:
Christophides, V.
Efthymiou, V.
Palpanas, T.
Papadakis, G.
Stefanidis, K.
Περιοδικό:
ACM COMPUTING SURVEYS
Εκδότης:
ASSOCIATION FOR COMPUTING MACHINERY
Τόμος:
53
Αριθμός / τεύχος:
6
Λέξεις-κλειδιά:
Data Analytics, Basic concepts; Critical tasks; Data characteristics; Entity resolutions; Execution strategies; Machine learning communities; Processing steps; Real-world entities, Big data
Επίσημο URL (Εκδότης):
DOI:
10.1145/3418896
Το ψηφιακό υλικό του τεκμηρίου δεν είναι διαθέσιμο.