Deep learning models achieve state-of-the-art per-formance in solving the task of Entity Matching, which aims to identify records that refer to the same real-world entity. However, they act as black-box models for the user, who has limited insights into the rationales behind their decisions. Several explainers (e.g., LIME, Mojito, Landmark, LEMON, and CERTA) have been proposed in the literature to address this issue. Their main focus is to generate explanations that are faithful to the model without considering their comprehensibility to the user. For example, verbose explanations could be very complex to analyze, hindering the model's understanding. In this paper, we propose CREW, an explanation system for Entity Matching models that combines the comprehensibility of the explanations and fidelity to the model. To achieve this, CREW creates explanations as clusters of words. The clusters are created by exploiting three different forms of knowledge: the semantic similarity of the words, their arrangement into the dataset attributes, and their importance in explaining the model. Experiments show that CREW generates explanations that are more interpretable for the user and more faithful to the model than those generated by competing explanation techniques.

Explaining Entity Matching with Clusters of Words / Benassi, R.; Guerra, F.; Paganelli, M.; Tiano, D.. - (2024), pp. 2325-2337. (Intervento presentato al convegno 40th IEEE International Conference on Data Engineering, ICDE 2024 tenutosi a Utrecht (NL) nel 13-17 May 2024) [10.1109/ICDE60146.2024.00184].

Explaining Entity Matching with Clusters of Words

Benassi R.;Guerra F.;Paganelli M.;Tiano D.
2024

Abstract

Deep learning models achieve state-of-the-art per-formance in solving the task of Entity Matching, which aims to identify records that refer to the same real-world entity. However, they act as black-box models for the user, who has limited insights into the rationales behind their decisions. Several explainers (e.g., LIME, Mojito, Landmark, LEMON, and CERTA) have been proposed in the literature to address this issue. Their main focus is to generate explanations that are faithful to the model without considering their comprehensibility to the user. For example, verbose explanations could be very complex to analyze, hindering the model's understanding. In this paper, we propose CREW, an explanation system for Entity Matching models that combines the comprehensibility of the explanations and fidelity to the model. To achieve this, CREW creates explanations as clusters of words. The clusters are created by exploiting three different forms of knowledge: the semantic similarity of the words, their arrangement into the dataset attributes, and their importance in explaining the model. Experiments show that CREW generates explanations that are more interpretable for the user and more faithful to the model than those generated by competing explanation techniques.
2024
40th IEEE International Conference on Data Engineering, ICDE 2024
Utrecht (NL)
13-17 May 2024
2325
2337
Benassi, R.; Guerra, F.; Paganelli, M.; Tiano, D.
Explaining Entity Matching with Clusters of Words / Benassi, R.; Guerra, F.; Paganelli, M.; Tiano, D.. - (2024), pp. 2325-2337. (Intervento presentato al convegno 40th IEEE International Conference on Data Engineering, ICDE 2024 tenutosi a Utrecht (NL) nel 13-17 May 2024) [10.1109/ICDE60146.2024.00184].
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1351366
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact