How improve Set Similarity Join based on prefix approach in distributed environment