đď¸ Explore my blog
Storage for Cloud
Managed service responsible for providing readily available cloud storage solutions
How storage works in Azure: Azure Storage Tiers, Azure Data Redundancy Options
Azure offerings:
* Azure Blob Storage
* Azure File Storage
* Azure Data Lake Storage
* Azure Backup and Recovery
Status: #idea
Tags: Cloud
References
cluster
distinct physical infrastructure that is housed in a data center
Status: #idea
Tags: Cloud
References
data-cleaning
Meaning to identify and correct errors.
* Handling missing data
* Ignore if insignificant
* Fill with a global constant (such as âUnknownâ, âN/Aâ, etc.)
* Fill with mean or median
* Fill with most probably value taken from similar data points (using decision trees or Bayesian methods)
* Smoothing noisy data
* binning
* regression
* clustering
Status: #idea
Tags: data-mining, kdd, data-prepartion
References
data-integration
Meaning to combine data from multiple sources into a unified dataset while ensuring consistency and resolves conflicts from merging.
* Schema Integration : making sure that the format and structure of data are the same across all sources.
* Entity Identification : linking together entries that represent the same thing, even if they have different names.
In other words, it can be said that in many cases, entity identification happens before schema integration.
1. First, you need to know which
Pre UTS quiz for Data Mining
question and answer for Henry Lucky DM UTS Quiz
data-prepartion
properly preparing the data to ensure that it is clean, consistent, and ready for analysis.
data-cleaning
data-integration
data-transformation
data-reduction
Status: #idea
Tags: data-mining, kdd
References