Question

Consider a data set where the objects are images from a weather satellite and each image consists of one million pixels. (Assume that each pixel consists of a real value representing the brightness. Also, assume that the images are snapshots of different areas and do not represent images of the same area at successive intervals in time.) The data can be represented as record data, where each image is a record (object) and each pixel is an attribute.

a. (10 pts) What are three techniques for handling missing values?

b. (10 pts) An image often has missing values for scattered pixels. (A pixel is missing, but those around it are not.) Which of the three techniques would be the most appropriate for this situation and why?

Homework Answers

Answer #1

a)

(i) Replace missing values by IMPUTATION.

(ii) Replace missing values with an INTERPOLATED ESTIMATE.

(iii) Replace missing values with the MEAN.

b)

REPLACING MISSING VALUES WITH IMPUTATION: Imputation uses the relationships between features to model one another, so that when one value is missing, the others can be used to fill the gap in a reasonable way. This is a very powerful method when features are related.

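REPLACING MISSING VALUES WITH AN INTERPOLATED ESTIMATE: Interpolation is a method by which related known values are used to estimate an unknown value. This is the most appropriate technique for scattered missing pixels: the pixels surrounding a missing one are known, and brightness usually varies smoothly over small areas, so the missing value can be estimated from its neighbors.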
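As a rough illustration of interpolating from surrounding pixels, here is a minimal Python sketch. It assumes the image is a 2-D NumPy array with missing pixels stored as NaN, and it uses the plain average of the known pixels in the 3x3 window as the estimate. The function name `fill_from_neighbors` is made up for this example, and other interpolation schemes (bilinear, distance-weighted) would work just as well.

```python
import numpy as np

def fill_from_neighbors(image):
    """Replace each NaN pixel with the mean of its known neighbors (3x3 window)."""
    filled = image.copy()
    rows, cols = image.shape
    for r, c in zip(*np.where(np.isnan(image))):
        # Clip the 3x3 window at the image border.
        window = image[max(r - 1, 0):min(r + 2, rows),
                       max(c - 1, 0):min(c + 2, cols)]
        if np.all(np.isnan(window)):
            continue  # no known neighbor, so leave the pixel missing
        # nanmean skips the missing center pixel and any missing neighbors.
        filled[r, c] = np.nanmean(window)
    return filled

image = np.array([[1.0, 2.0, 3.0],
                  [4.0, np.nan, 6.0],
                  [7.0, 8.0, 9.0]])
result = fill_from_neighbors(image)
print(result[1, 1])  # 5.0, the average of the eight surrounding pixels
```

The estimates are always computed from the original image, not from already-filled values, so the order in which missing pixels are visited does not change the result.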

REPLACING MISSING VALUES WITH MEAN: Replacing a missing value with one that represents the existing distribution, such as the mean, is a reasonable approach.
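A minimal Python sketch of mean replacement, under the assumption that the mean is taken over the known pixels of the same image and that missing pixels are stored as NaN. The function name `fill_with_image_mean` is hypothetical.

```python
import numpy as np

def fill_with_image_mean(image):
    """Replace every NaN pixel with the mean of the image's known pixels."""
    mean_value = np.nanmean(image)  # mean over non-missing pixels only
    return np.where(np.isnan(image), mean_value, image)

image = np.array([[10.0, 10.0, 10.0],
                  [10.0, np.nan, 10.0],
                  [90.0, 90.0, 90.0]])
filled = fill_with_image_mean(image)
print(filled[1, 1])  # 40.0, i.e. (5 * 10 + 3 * 90) / 8
```

Note that every missing pixel in an image gets the same value here, wherever it sits, so this approach uses no spatial information.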
