Question

5. This problem illustrates why it is better to use the squared errors rather than the...

5. This problem illustrates why it is better to use the squared errors rather than the absolute errors to assess a potential regression line. The Least Absolute Errors criterion says that we are looking for the regression line that minimizes the sum of the absolute errors, i.e., minimizes ∑ | ???? − ??�?? | . Consider the data: x - 0 0 1 1

y - 0 1 0 1

Here are three potential regression lines: (i) y = x

(ii) y = 0.5

(iii) y = 1 3 + 1 3 ??

(a) Calculate the sum of the absolute errors for each of these lines, and rank them from best to worst. What are your finding?

(b) Calculate the SSE for each of these lines, and rank them from best to worst by the SSE criterion.

(c) Find the Least-Squares regression line (by using the formulas). Is it one of the above three lines?

(d) What is your conclusion regarding the Least Absolute Errors criterion? i.e., if someone asks you why don’t we use the absolute errors rather than the squared errors, what would you tell them?

Homework Answers

Answer #1

a) and b)

x y predicted y=x Absolute error squared error
0 0 0 0 0
0 1 0 1 1
1 0 1 1 1
1 1 1 0 0
2 2
x y predicted y=0.5
0 0 0.5 0.5 0.25
0 1 0.5 0.5 0.25
1 0 0.5 0.5 0.25
1 1 0.5 0.5 0.25
2 1
x y predicted y=1/3 + x/3
0 0 0.333333333 0.333333333 0.111111111
0 1 0.333333333 0.666666667 0.444444444
1 0 0.666666667 0.666666667 0.444444444
1 1 0.666666667 0.333333333 0.111111111
2 1.111111111

Using Absolute error ,we see that sum is 2 for all models

Using SSE

Model 2 > 3 > 1    {better as SSE is lower in model 2 compared to model 3 }

c)

using regression

y^ = 0.5

yes, it is one of three lines it is model 2

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
1. The least squares criterion, SSE, SSR, and SST In the United States, tire tread depth...
1. The least squares criterion, SSE, SSR, and SST In the United States, tire tread depth is measured in 32nds of an inch. Car tires typically start out with 10/32 to 11/32 of an inch of tread depth. In most states, a tire is legally worn out when its tread depth reaches 2/32 of an inch. A random sample of four tires provides the following data on mileage and tread depth: Tire Mileage Tread Depth (10,000 miles) (32nds of an...
The health department of a large city has developed an air pollution index that measures the...
The health department of a large city has developed an air pollution index that measures the level of several air pollutants that cause respiratory distress in humans. The following table gives the pollution index (on a scale of 1 to 10, with 10 being the worst) for 7 randomly selected summer days and the number of patients with acute respiratory problems admitted to the emergency rooms of the city’s hospitals. Air pollution index 4.5 6.7 8.2 5.0 4.6 6.1 3.0...
Foreign Direct Investment and Economic Growth Economic theory suggests that foreign direct investment affect the economic...
Foreign Direct Investment and Economic Growth Economic theory suggests that foreign direct investment affect the economic growth (the growth of the Gross DomesticProduct (GDP)) in developing countries. The objective of this project is to carry out a simple linear regression analysisto examine this theory. Your independent and dependent variables are the growth of the foreign direct investment andthe economic growth (the growth of the Gross Domestic Product (GDP)) respectively. Required Tasks: State the regression model and determine the least squares...
Regression analysis procedures have as their primary purpose the development of an equation that can be...
Regression analysis procedures have as their primary purpose the development of an equation that can be used for predicting values on some DV for all members of a population. T F A secondary purpose is to use regression analysis as a means of explaining causal relationships among variables. T F In order to make predictions, three important facts about the regression line must be known. One of them is: The point at which the line crosses the X-axis. T F...
Question 1 How is a residual calculated in a regression model? i.e. what is the meaning...
Question 1 How is a residual calculated in a regression model? i.e. what is the meaning of a residual? a)The difference between the actual value, y, and the fitted value, y-hat b)The difference between the fitted value, y-hat, and the mean, y-bar c)The difference between the actual value, y, and the mean, y-ba d)The square of the difference between the fitted value, y-hat, and the mean, y-bar Question 2 Larger values of r-squared imply that the observations are more closely...
The data is for weekly sales in the dry goods department at the Methuen Wal*Mart store...
The data is for weekly sales in the dry goods department at the Methuen Wal*Mart store in fiscal 2002-2003. Peak values, i.e. spikes, usually occur at holiday periods. Week 1 is the first week of February 2002. Dollar values are adjusted in order to disguise true sales figures, but trends in the data are retained for analysis purposes. Week(x) Sales in $(y) Week(x) Sales in $(y) 26 15200 52 22400 27 15600 53 20900 28 16400 54 18800 29 15600...
Multiple Choice Select the best answer from the available choices for each question. Which of the...
Multiple Choice Select the best answer from the available choices for each question. Which of the following is NOT part of the definition of a sample space S? S can be discrete or continuous Each outcome must be in S at most once Each element in S is equally likely Each outcome must be in S at least once S is a set of possible outcomes in an experiment Three A’s, three B’s, and two C’s are arranged at random...
Question 1 2 pts Let x represent the height of first graders in a class. This...
Question 1 2 pts Let x represent the height of first graders in a class. This would be considered what type of variable: Nonsensical Lagging Continuous Discrete Flag this Question Question 2 2 pts Let x represent the height of corn in Oklahoma. This would be considered what type of variable: Discrete Inferential Distributed Continuous Flag this Question Question 3 2 pts Consider the following table. Age Group Frequency 18-29 9831 30-39 7845 40-49 6869 50-59 6323 60-69 5410 70...
QUESTION 1 Advanced Security Inc. was hired by the Treasury Bank Inc. for securing their systems....
QUESTION 1 Advanced Security Inc. was hired by the Treasury Bank Inc. for securing their systems. The first thing they did was implement the best practice if separation of domains. As a result of this The bank had to get a new domain name. any change made in the records points to only one party who could have made that change. If you are a technical person, you must have office in a particular area of the building. accessing outside...
I did already posted this question before, I did get the answer but i am not...
I did already posted this question before, I did get the answer but i am not satisfied with the answer i did the code as a solution not the description as my solution, so i am reposting this question again. Please send me the code as my solution not the description In this project, build a simple Unix shell. The shell is the heart of the command-line interface, and thus is central to the Unix/C programming environment. Mastering use of...
ADVERTISEMENT
Need Online Homework Help?

Get Answers For Free
Most questions answered within 1 hours.

Ask a Question
ADVERTISEMENT