Mining Interesting Infrequent Itemsets from Very Large Data based on MapReduce Framework

Open Access

Author(s) - Tene Ramakrishnudu, R. B. V. Subramanyam

Publication year - 2015

Publication title - International Journal of Intelligent Systems and Applications

Language(s) - English

Resource type - Journals

eISSN - 2074-9058

pISSN - 2074-904X

DOI - 10.5815/ijisa.2015.07.06

Subject(s) - computer science, data mining, association rule learning, field (mathematics), node (physics), cluster (spacecraft), mathematics, structural engineering, pure mathematics, engineering, programming language

Mining frequent and infrequent itemsets from a given dataset is an important area of data mining. When frequent and infrequent itemsets are mined simultaneously, the infrequent itemsets become very important because they contain many valuable negative association rules. Mining frequent itemsets is highly expensive when the minimum threshold is low, whereas mining infrequent itemsets is highly expensive when the minimum threshold is high. When the dataset is very large, both the memory usage and the computational cost of mining infrequent itemsets are very high. In addition, the memory and CPU resources of a single processor are not enough to handle very large datasets. Parallel and distributed computing are effective approaches for handling large datasets. In this paper we propose a method based on the Hadoop MapReduce model that can handle massive datasets when mining infrequent itemsets. Experiments are performed on an 8-node cluster with a synthetic dataset. The performance study shows that the proposed method is efficient in handling very large datasets.
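
The abstract does not describe the algorithm itself, so the following is only a minimal single-machine sketch of the general idea of counting itemsets in a map/reduce style and then separating them by a minimum threshold. It is not the authors' method and does not use Hadoop. The function names (`map_partition`, `reduce_counts`, `split_by_support`), the toy transactions, and the threshold value are all hypothetical, and the sketch only sees itemsets that occur at least once in the data.

```python
from collections import Counter
from itertools import combinations


def map_partition(transactions, k):
    """Map step: count every k-itemset seen in one data split."""
    counts = Counter()
    for transaction in transactions:
        # Sort and deduplicate so each itemset has one canonical tuple form.
        for itemset in combinations(sorted(set(transaction)), k):
            counts[itemset] += 1
    return counts


def reduce_counts(partial_counts):
    """Reduce step: sum the local counts of each itemset across all splits."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)
    return total


def split_by_support(total, n_transactions, min_support):
    """Separate itemsets into frequent and infrequent by relative support."""
    frequent, infrequent = {}, {}
    for itemset, count in total.items():
        support = count / n_transactions
        target = frequent if support >= min_support else infrequent
        target[itemset] = support
    return frequent, infrequent


# Hypothetical toy data, pre-split into two partitions (one per worker).
partitions = [
    [["bread", "milk"], ["bread", "butter"], ["bread", "milk", "butter"]],
    [["milk"], ["bread", "milk"], ["tea", "bread"]],
]
n = sum(len(p) for p in partitions)
total = reduce_counts(map_partition(p, 2) for p in partitions)
frequent, infrequent = split_by_support(total, n, min_support=0.5)
```

With these toy transactions, only the pair (bread, milk) reaches the 0.5 threshold; the other three observed pairs fall into the infrequent set. In a real Hadoop job the map and reduce functions would run on separate nodes and exchange key/value pairs, which this sketch only imitates with in-memory counters.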
