Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
How to Choose Minimum Support/Confidence Values | Mining Frequent Itemsets
Association Rule Mining

How to Choose Minimum Support/Confidence ValuesHow to Choose Minimum Support/Confidence Values

Choosing appropriate minimum support and confidence values is crucial when mining association rules from transactional datasets. The support and confidence thresholds determine the strength and relevance of the discovered rules. Selecting these values requires a balance between capturing meaningful associations and avoiding an overwhelming number of trivial rules.

Minimum threshold values in the context of association rule mining refer to the user-defined thresholds for support and confidence that are used to filter and identify meaningful associations between items in a dataset.

Factors influencing minimum threshold selection

  • Dataset Size: Large datasets may require lower support thresholds to capture meaningful associations due to the increased variability in item occurrences;
  • Data Sparsity: Sparse datasets, where items have low occurrence frequencies, may necessitate lower support thresholds to uncover significant associations;
  • Domain Knowledge: Understanding the domain and the context of the dataset can guide the selection of appropriate thresholds. Prior knowledge about item interactions can inform the choice of support and confidence values;
  • Objective of Analysis: The purpose of association rule mining influences the choice of thresholds. For exploratory analysis, higher support thresholds may be suitable to identify prominent associations, while lower thresholds may be preferred for comprehensive pattern discovery.

Example

Now you can conduct a simple experiment: change the min_support and min_confidence values in the code sample below and observe how your changes influence the results.

What is the minimum support threshold used for in association rule mining?

Select the correct answer

Everything was clear?

Section 2. Chapter 7
course content

Course Content

Association Rule Mining

How to Choose Minimum Support/Confidence ValuesHow to Choose Minimum Support/Confidence Values

Choosing appropriate minimum support and confidence values is crucial when mining association rules from transactional datasets. The support and confidence thresholds determine the strength and relevance of the discovered rules. Selecting these values requires a balance between capturing meaningful associations and avoiding an overwhelming number of trivial rules.

Minimum threshold values in the context of association rule mining refer to the user-defined thresholds for support and confidence that are used to filter and identify meaningful associations between items in a dataset.

Factors influencing minimum threshold selection

  • Dataset Size: Large datasets may require lower support thresholds to capture meaningful associations due to the increased variability in item occurrences;
  • Data Sparsity: Sparse datasets, where items have low occurrence frequencies, may necessitate lower support thresholds to uncover significant associations;
  • Domain Knowledge: Understanding the domain and the context of the dataset can guide the selection of appropriate thresholds. Prior knowledge about item interactions can inform the choice of support and confidence values;
  • Objective of Analysis: The purpose of association rule mining influences the choice of thresholds. For exploratory analysis, higher support thresholds may be suitable to identify prominent associations, while lower thresholds may be preferred for comprehensive pattern discovery.

Example

Now you can conduct a simple experiment: change the min_support and min_confidence values in the code sample below and observe how your changes influence the results.

What is the minimum support threshold used for in association rule mining?

Select the correct answer

Everything was clear?

Section 2. Chapter 7
some-alt