Abstract
Introduction: An important quality of association rules is novelty. However, evaluating rule novelty is AI-hard and has been a serious challenge for most data mining systems.
Objective: In this paper, we introduce functional novelty, a new non-pairwise approach to evaluating rule novelty. A functionally novel rule is interesting as it suggests previously unknown relations between user hypotheses.
Methods: We developed a novel domain-driven KDD framework for discovering functionally novel association rules. Association rules were mined from cardiovascular data sets. At post-processing, domain knowledge-compliant rules were discovered by applying semantic-based filtering based on UMLS ontology. Their knowledge compliance scores were computed against medical knowledge in Pubmed literature. A cardiologist explored possible relationships between several pairs of unknown hypotheses. The functional novelty of each rule was computed based on its likelihood to mediate these relationships.
Results: Highly interesting rules were successfully discovered. For instance, common rules such as diabetes mellitus⇔coronary arteriosclerosis was functionally novel as it mediated a rare association between von Willebrand factor and intracardiac thrombus.
Conclusion: The proposed post-mining domain-driven rule evaluation technique and measures proved to be useful for estimating candidate functionally novel rules with the results validated by a cardiologist.
Original language | English |
---|---|
Pages (from-to) | 609-620 |
Number of pages | 12 |
Journal | Knowledge-Based Systems |
Volume | 24 |
Issue number | 5 |
DOIs | |
Publication status | Published - Jul 2011 |
Externally published | Yes |