A NOVEL SIMILARITY MEASURE FOR TRACE CLUSTERING BASED ON NORMALIZED GOOGLE DISTANCE
In trace clustering, a problem of process mining, traditional distance measures only focus on the local relationship between trace pairs. In this paper, we propose a new method to measure the global relationship of the traces based on the Normalized Google Distance. Experimental results show that our method not only outperforms alternatives but also helps to speed up the trace clustering.
process mining, process discovery, trace clustering, Normalized Google Distance, similarity measure.