Stream inequality join aims to combine tuples coming from differ- ent streams based on inequality conditions and is a fundamental operator in distributed data stream processing. It is known to be computationally expensive as indexing data structures for determining matching tuples must be continuously updated. To significantly alleviate this problem, we propose SPO-Join, a novel solution that combines a mutable B+-tree for efficient insertions and an immutable sorted-array-based data structure for efficient searching. Furthermore, our proposed method is designed to be efficiently executed with distributed stream pro- cessing engines. Our experiments on real-world and synthesized datasets suggest that the proposed SPO-Join exhibits superior performance compared to state-of-the-art index-based stream inequality join solutions.
SPO-Join: Efficient Stream Inequality Join / Aslam, Adeel; Beedkar, Kaustubh; Simonini, Giovanni. - 28:1(2025), pp. 145-157. ( 28th International Conference on Extending Database Technology (EDBT), 25th March-28th March, 2025 Barcelona, esp 25-28 March 2025) [10.48786/edbt.2025.12].
SPO-Join: Efficient Stream Inequality Join.
Adeel Aslam
;Giovanni Simonini
2025
Abstract
Stream inequality join aims to combine tuples coming from differ- ent streams based on inequality conditions and is a fundamental operator in distributed data stream processing. It is known to be computationally expensive as indexing data structures for determining matching tuples must be continuously updated. To significantly alleviate this problem, we propose SPO-Join, a novel solution that combines a mutable B+-tree for efficient insertions and an immutable sorted-array-based data structure for efficient searching. Furthermore, our proposed method is designed to be efficiently executed with distributed stream pro- cessing engines. Our experiments on real-world and synthesized datasets suggest that the proposed SPO-Join exhibits superior performance compared to state-of-the-art index-based stream inequality join solutions.Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris




