Question

Questions about clustering: #What's the difference between Hierarchical Clustering algorithm and K-means clustering? #How to compute...

Questions about clustering:

#What's the difference between Hierarchical Clustering algorithm and K-means clustering?

#How to compute the Hierarchical Clustering step-by-step (basic)?

#How to compute the Kmeans clustering step-by-step (basic)?

#each that algorithm what's the type of data that is suitable for?

Homework Answers

Answer #1

Question#1

Hierarchical Clustering:

In Hierarchical clustering we have an advantage of creating  hierarchical decomposition of the set of data that has been given to us. We can also  create hierarchical decomposition in two ways one way is to creating from  bottom to the top and the other is top to down. Where as in

K-Mean Clustering:

As the K-mean clustering is the process of making the partion of the set of data objects into subsets and in k mean clustering every subset is a cluster and it is also similar to the cluster that is greater and but the similarity between these clusters is less.

Question#2

  • In the start first we will assign all the points to an individual cluster
  • Now we will now look for the smallest distance that is been located in the proximity matrix and then merge those points with the smallest distance.
  • Then repeat the step 2 until only a single cluster is left.

Question#3

  • Assign k a value and make the intial clsuter object of that
  • The assign each point a similar center
  • Then identify cluster centroids
  • Reassign the points
  • Identify the new cluster centorids

Question#4

In all these algorithms the type of data is been required is that the data point should classigy the  the distance between that points

PLEASE GIVE A THUMBS UP!!!!!!!!!!

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Discuss the strengths and weaknesses of using K-Means clustering algorithm to cluster multi class data sets....
Discuss the strengths and weaknesses of using K-Means clustering algorithm to cluster multi class data sets. How do you compare it with a hierarchical clustering technique. Read 300 words with no plagrism
Let a configuration of the k-means algorithm correspond to the k way partition generated by the...
Let a configuration of the k-means algorithm correspond to the k way partition generated by the clustering at the end of each iteration. Is it possible for the k-means algorithm to revisit a configuration? Justify how your answer proves that the k-means algorithm converges in a finite number of steps.
Recall the method used to obtain a confidence interval for the difference between two population means...
Recall the method used to obtain a confidence interval for the difference between two population means for matched samples. (a) The following data are from matched samples taken from two populations. Compute the difference value for each element. (Use Population 1 − Population 2.) Element Population Difference 1 2 1 11 8 2 7 8 3 9 6 4 12 7 5 13 10 6 15 15 7 15 14 (b) Compute d. (c) Compute the standard deviation  sd.  (Round your answer...
This project is related to statistical inference about the difference between two population parameters and the...
This project is related to statistical inference about the difference between two population parameters and the analysis of variance. You need to formulate hypotheses testing and confidence estimation as you learned in class. Each group must collect data related to the topic of their choice. You should specify your parameter of interest and the claim, make the hypotheses, perform test calculations and estimation, and make a conclusion. specify your parameter of interest and the claim, make the hypotheses, perform test...
For this project, you will make decisions about how two parameters (proportions or means) compare using...
For this project, you will make decisions about how two parameters (proportions or means) compare using hypothesis tests, and you will estimate the difference between the two parameters using confidence intervals. For each confidence, report the following: the confidence interval limits rounded as directed (from StatCrunch) An interpretation of the confidence interval (e.g. "I am 95% confident ... .") Here is a template for reporting the answer for a sample confidence interval problem. Sample problem: Find a 90% confidence interval...
1.What is the difference between class/object? 2.Did you think about classes/objects around you since the last...
1.What is the difference between class/object? 2.Did you think about classes/objects around you since the last session? 3.What library do we need for processing file I/O? 4.What is the class for the input file stream? Give an example 5.What is the class for the output file stream? Give an example 6.Why do you want to use files instead of using input? 7.How do you read from a file? give an example 8.How do you write to a file? give an...
For the following experiments/questions, pick the most appropriate statistical test. You have the following statistical tests...
For the following experiments/questions, pick the most appropriate statistical test. You have the following statistical tests as choices: some may be used more than once, others not at all.  Assume homogeneity of variance (where applicable) and the validity of parametric tests (where applicable), unless something is directly stated (e.g., “the data are not at all normal”) or otherwise indicated (viz., by the inspection of the data) which would indicate a strong and obvious violation of an assumption. This means you must...
PUBH 6033—Week 7 Assignment 1 Comparing two means: When drink drove a student to statistics (Rubric...
PUBH 6033—Week 7 Assignment 1 Comparing two means: When drink drove a student to statistics (Rubric included)                                              Instructions For this assignment, you review this week’s Learning Resources and then perform a two-sample independent t test and an ANOVA related to the dataset that was utilized in the week 2 SPSS application assignment. Import the data into SPSS; or, if you correctly saved the data file in Week 2, you may open and use that saved file to complete this...
1) State the main difference between an ODE and a PDE? 2) Name two of the...
1) State the main difference between an ODE and a PDE? 2) Name two of the three archetypal PDEs? 3) Write the equation used to compute the Wronskian for two differentiable functions, y1 and y2. 4) What can you conclude about two differentiable functions, y1 and y2, if their Wronskian is nonzero? 5) (2 pts) If two functions, y1 and y2, solve a 2nd order DE, what does the Principle of Superposition guarantee? 6) (8 pts, 4 pts each) State...
As part of the quarterly reviews, the manager of a retail store analyzes the quality of...
As part of the quarterly reviews, the manager of a retail store analyzes the quality of customer service based on the periodic customer satisfaction ratings (on a scale of 1 to 10 with 1 = Poor and 10 = Excellent). To understand the level of service quality, which includes the waiting times of the customers in the checkout section, he collected data on 100 customers who visited the store; see the attached Excel file: ServiceQuality. Using Data Mining > Cluster,...