Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

what is the difference between Association rule mining & frequent itemset mining

Tags:

data-mining

i am new to data mining and confuse about Association rules and frequent item mining. for me i think both are same but i need views from experts on this forum

My question is

what is the difference between Association rule mining & frequent itemset mining? Thanks

like image 356
Zia Avatar asked Jun 16 '10 05:06

Zia


People also ask

What is the difference between association rule and sequence mining?

The sequential rules are found in sequences while association rules are found in records (transactions) containing items that are not ordered. In other words, the order between items such as time is not considered in association rule mining.

What are the different association rules in data mining?

Types of Association RulesMulti-relational association rules. Generalized association rules. Quantitative association rules. Interval information association rules.

What is the difference between classification and association in data mining?

Classification rule mining aims to discover a small set of rules in the database to form an accurate classifier (e.g., Quinlan 1992; Breiman et al 1984). Association rule mining finds all rules in the database that satisfy some minimum support and minimum confidence constraints (e.g., Agrawal and Srikant 1994).

What is association rule mining with example?

So, in a given transaction with multiple items, Association Rule Mining primarily tries to find the rules that govern how or why such products/items are often bought together. For example, peanut butter and jelly are frequently purchased together because a lot of people like to make PB&J sandwiches.


2 Answers

An association rule is something like "A,B → C", meaning that C tends to occur when A and B occur. An itemset is just a collection such as "A,B,C", and it is frequent if its items tend to co-occur. The usual way to look for association rules is to find all frequent itemsets and then postprocess them into rules.

like image 113
Jouni K. Seppänen Avatar answered Oct 20 '22 01:10

Jouni K. Seppänen


The input of frequent itemset mining is :

  • a transaction database
  • a minimum support threshold minsup

The output is :

  • the set of all itemsets appearing in at least minsup transactions. An itemset is just a set of items that is unordered.

The input of assocition rule mining is :

  • a transaction database
  • a minimum support threshold minsup
  • a minimum confidence threshold minconf

The output is :

  • the set of all valid association rule. An association rule X-->Y is a relationship between two itemsets X and Y such that X and Y are disjoint and are not empty. A valid rule is a rule having a support higher or equals to minsup and a confidence higher or equal to minconf. The support is defined as sup(x-->Y) = sup (X U Y) / (number of transactions). The confidence is defined as conf(x-->Y) = sup (X U Y) / sup (X).

Now the relationship between itemset and association rule mining is that it is very efficient to use the frequent itemset to generate rules (see the paper by Agrawal 1993) for more details about this idea. So association rule mining will be broken down into two steps: - mining frequent itemsets - generating all valid association rules by using the frequent itemsets.

like image 28
Phil Avatar answered Oct 20 '22 00:10

Phil