z-logo
open-access-imgOpen Access
EHAUPM: Efficient High Average-Utility Pattern Mining With Tighter Upper Bounds
Author(s) -
Jerry Chun-Wei Lin,
Shifeng Ren,
Philippe Fournier-Viger,
Tzung-Pei Hong
Publication year - 2017
Publication title -
ieee access
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.587
H-Index - 127
ISSN - 2169-3536
DOI - 10.1109/access.2017.2717438
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
High-utility itemset mining (HUIM) has become a popular data mining task, as it can reveal patterns that have a high-utility, contrarily to frequent pattern mining, which focuses on discovering frequent patterns. High average-utility itemset mining (HAUIM) is a variation of HUIM that provides an alternative measure, called the average utility, to select patterns by considering both their utilities and lengths. In the last decades, several algorithms have been developed to mine high average-utility itemsets (HAUIs). But most of them consume large amounts of memory and have long execution times, since they generally utilize the average-utility upper-bound (auub) model to overestimate the average utilities of itemsets. To improve the performance of HAUIM, this paper proposes two novel tighter upper-bound models as alternative to the traditional auub model for mining HAUIs. The looser upper-bound model considers the remaining-maximum utility in transactions to reduce the upper bound on the utilities of itemsets. The second upper-bound model ignores irrelevant items in transactions to further tighten the upper bound. Three pruning strategies are also designed to reduce the search space for mining HAUIs by a greater amount compared with the state-of-the-art HAUI-Miner algorithm. Experiments conducted on several benchmark data sets show that the designed algorithm integrating the two novel upper-bound models outperforms the traditional HAUI-Miner algorithm in terms of runtime, memory usage, number of join operations, and scalability.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom