Answer the question below:
· Mention what is the difference between data mining and data
profiling?
· Explain what should be done with suspected or missing data?
Differences between data mining and data profiling :
Data mining is a process of considering the existing database and turning into useful information.
Data profiling is about analyzing the data that is already existing and collecting the statistics about the data. Helps in finding the data quality.it identifies wrong data in the data base and corrects it when necessary.
Data mining evaluates the data base and evaluate patterns in data.
Suspect or missing values handling
Simply not considering the values if our data set is a large one.
Taking median for that column and replacing the median value with that missing values.
If the missing values is between 5%to 10% then we can simply drop that but more than that percentage missing values should be replaced with median or mean of that particular column
Get Answers For Free
Most questions answered within 1 hours.