data-reduction
Meaning to reduce the volume of data while maintaining its integrity. This is important because large datasets can be time-consuming and expensive to analyze.
- Dimensionality reduction : Removing irrelevant or redundant attributes.
- Numerosity reduction : Using methods such as regression or clustering to summarize data into fewer data points.
- Data compression : Reducing the size of the dataset without losing important information.
Status: #idea
Tags: data-miningData Mining* [x] data-mining-uts-quiz
knowdledge discovery in databases
data-warehousing
schema
Apriori Algorithm
Step 1: Count Distinct Items
Step 2: Identify Association Rules
FP Growth Algorithm
Step 1: Count Distinct Items
Step 2: Rearrange Items based count in descending order
Step 3: Make FP Growth Tree
1. Make Null Root Node
1. And make children sequentially
Step 4: Make Table
|Ending with|Paths|Count of each item in path|Candidate itemset with count|Frequent itemset|
|-----------, kddkdddata-prepartion
data mining
pattern-evaluation
knowledge-presentation
Status: #idea
Tags: data-mining
References, data-prepartiondata-prepartionproperly preparing the data to ensure that it is clean, consistent, and ready for analysis.
data-cleaning
data-integration
data-transformation
data-reduction
Status: #idea
Tags: data-mining, kdd
References