Comparing sets of patterns with the Jaccard index

Sam Fletcher; Md Zahidul Islam

doi:10.3127/ajis.v22i0.1538

Comparing sets of patterns with the Jaccard index

Authors

Sam Fletcher Charles Sturt University
Md Zahidul Islam Charles Sturt University

DOI:

https://doi.org/10.3127/ajis.v22i0.1538

Keywords:

Machine Learning, Metrics, Data Mining, Patterns, Rules, Utility Measures, Quality Evaluation

Abstract

The ability to extract knowledge from data has been the driving force of Data Mining since its inception, and of statistical modeling long before even that. Actionable knowledge often takes the form of patterns, where a set of antecedents can be used to infer a consequent. In this paper we offer a solution to the problem of comparing different sets of patterns. Our solution allows comparisons between sets of patterns that were derived from different techniques (such as different classification algorithms), or made from different samples of data (such as temporal data or data perturbed for privacy reasons). We propose using the Jaccard index to measure the similarity between sets of patterns by converting each pattern into a single element within the set. Our measure focuses on providing conceptual simplicity, computational simplicity, interpretability, and wide applicability. The results of this measure are compared to prediction accuracy in the context of a real-world data mining scenario.

Downloads

Published

2018-03-07

How to Cite

Fletcher, S., & Islam, M. Z. (2018). Comparing sets of patterns with the Jaccard index. Australasian Journal of Information Systems, 22. https://doi.org/10.3127/ajis.v22i0.1538

Download Citation

Issue

Vol. 22 (2018)

Section

Research Articles

License

AJIS publishes open-access articles distributed under the terms of a Creative Commons Non-Commercial and Attribution License which permits non-commercial use, distribution, and reproduction in any medium, provided the original author and AJIS are credited. All other rights including granting permissions beyond those in the above license remain the property of the author(s).

Comparing sets of patterns with the Jaccard index

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Current Issue

Information