Stream inequality join aims to combine tuples coming from differ- ent streams based on inequality conditions and is a fundamental operator in distributed data stream processing. It is known to be computationally expensive as indexing data structures for determining matching tuples must be continuously updated. To significantly alleviate this problem, we propose SPO-Join, a novel solution that combines a mutable B+-tree for efficient insertions and an immutable sorted-array-based data structure for efficient searching. Furthermore, our proposed method is designed to be efficiently executed with distributed stream pro- cessing engines. Our experiments on real-world and synthesized datasets suggest that the proposed SPO-Join exhibits superior performance compared to state-of-the-art index-based stream inequality join solutions.

SPO-Join: Efficient Stream Inequality Join / Aslam, Adeel; Beedkar, Kaustubh; Simonini, Giovanni. - 28:1(2025), pp. 145-157. ( 28th International Conference on Extending Database Technology (EDBT), 25th March-28th March, 2025 Barcelona, esp 25-28 March 2025) [10.48786/edbt.2025.12].

SPO-Join: Efficient Stream Inequality Join.

Adeel Aslam
;
Giovanni Simonini
2025

Abstract

Stream inequality join aims to combine tuples coming from differ- ent streams based on inequality conditions and is a fundamental operator in distributed data stream processing. It is known to be computationally expensive as indexing data structures for determining matching tuples must be continuously updated. To significantly alleviate this problem, we propose SPO-Join, a novel solution that combines a mutable B+-tree for efficient insertions and an immutable sorted-array-based data structure for efficient searching. Furthermore, our proposed method is designed to be efficiently executed with distributed stream pro- cessing engines. Our experiments on real-world and synthesized datasets suggest that the proposed SPO-Join exhibits superior performance compared to state-of-the-art index-based stream inequality join solutions.
2025
1-mar-2025
28th International Conference on Extending Database Technology (EDBT), 25th March-28th March, 2025
Barcelona, esp
25-28 March 2025
28
145
157
Aslam, Adeel; Beedkar, Kaustubh; Simonini, Giovanni
SPO-Join: Efficient Stream Inequality Join / Aslam, Adeel; Beedkar, Kaustubh; Simonini, Giovanni. - 28:1(2025), pp. 145-157. ( 28th International Conference on Extending Database Technology (EDBT), 25th March-28th March, 2025 Barcelona, esp 25-28 March 2025) [10.48786/edbt.2025.12].
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1346826
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact