Question

You are given the following data set: {(0,0), (0.5,0.6), (1,0.9), (1.1, 1), (1.5, 1.7)}, where the first coordinate is the independent (explanatory) variable, and the second coordinate is the dependent variable.

(a) Find a best fit model if the model is restricted to just be a constant (i.e. the best fit line has slope 0). (b) What is the mean squared error of (a)?

(c) What is the mean squared error of the model that has y-intercept 0 and slope 1? How much of the variation of the data can be explained by the independent variable (i.e. what is the R²) using the model from part (c)?

Homework Answers

Answer #1

Solution:

X      Y
0      0
0.5    0.6
1      0.9
1.1    1
1.5    1.7

Go to Data Analysis > Regression:

SUMMARY OUTPUT

Regression Statistics
Multiple R          0.978388
R Square            0.957243
Adjusted R Square   0.94299
Standard Error      0.147766
Observations        5

ANOVA
             df   SS         MS         F          Significance F
Regression   1    1.466496   1.466496   67.16317   0.003802
Residual     3    0.065504   0.021835
Total        4    1.532

             Coefficients   Standard Error   t Stat     P-value    Lower 95%   Upper 95%
Intercept    -0.01528       0.123525         -0.12371   0.909364   -0.40839    0.37783
X            1.043027       0.127271         8.195314   0.003802   0.637993    1.44806
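
The summary output can be cross-checked outside the spreadsheet. The following is a minimal sketch in Python using only the standard library; the variable names are illustrative and not part of the original answer. It recomputes the slope, intercept, sums of squares, R Square, and residual mean square from the closed-form formulas for simple linear regression.

```python
# Data set from the question (x = explanatory, y = dependent).
xs = [0, 0.5, 1, 1.1, 1.5]
ys = [0, 0.6, 0.9, 1, 1.7]
n = len(xs)

x_bar = sum(xs) / n
y_bar = sum(ys) / n

# Centered sums of squares and cross-products.
s_xx = sum((x - x_bar) ** 2 for x in xs)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
ss_total = sum((y - y_bar) ** 2 for y in ys)

# Least-squares estimates for y = intercept + slope * x.
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar

# ANOVA decomposition: total = regression + residual.
ss_residual = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
ss_regression = ss_total - ss_residual
r_square = ss_regression / ss_total
ms_residual = ss_residual / (n - 2)  # residual SS divided by its degrees of freedom

print(f"intercept={intercept:.5f} slope={slope:.6f}")
print(f"SS_reg={ss_regression:.6f} SS_res={ss_residual:.6f} SS_tot={ss_total:.3f}")
print(f"R^2={r_square:.6f} MS_res={ms_residual:.6f}")
```

The printed values should agree with the Coefficients, ANOVA, and R Square entries in the table up to rounding.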

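Solution (c):

From the regression output, MSE = 0.021835 (the residual mean square: residual SS 0.065504 divided by 3 degrees of freedom).

The fitted least-squares model is

y = -0.01528 + 1.043027X

R² = 0.9572, so about 95.72% of the variation in the dependent variable is explained by the independent variable, which indicates a good model. Note that these figures describe the fitted least-squares line, not the line with intercept 0 and slope 1 that part (c) specifies.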
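
Parts (a) to (c) can also be worked directly from the definitions, without fitting a regression. The sketch below rests on two assumptions that the question does not state explicitly: mean squared error is taken as the sum of squared residuals divided by n (no degrees-of-freedom correction), and R² is taken as 1 - SSE/SST with SST measured about the mean of y. Under squared-error loss, the best constant model is the mean of the y values. The helper name `mse` is illustrative.

```python
# Data set from the question (x = explanatory, y = dependent).
xs = [0, 0.5, 1, 1.1, 1.5]
ys = [0, 0.6, 0.9, 1, 1.7]
n = len(ys)


def mse(predictions, targets):
    """Mean squared error with divisor n (an assumption, see the text above)."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)


# (a) Best constant model under squared error: the mean of y.
constant = sum(ys) / n

# (b) MSE of that constant model.
mse_constant = mse([constant] * n, ys)

# (c) MSE of the model y = 0 + 1 * x, whose prediction is simply x.
mse_identity = mse(xs, ys)

# R^2 of the model in (c): 1 - SSE/SST. Both MSEs share the divisor n,
# so their ratio equals SSE/SST.
r_square_identity = 1 - mse_identity / mse_constant

print(f"(a) constant = {constant:.2f}")
print(f"(b) MSE of constant model = {mse_constant:.4f}")
print(f"(c) MSE of y = x = {mse_identity:.4f}, R^2 = {r_square_identity:.4f}")
```

Worked by hand under the same conventions: the constant is 4.2/5 = 0.84, the MSE in (b) is 1.532/5 = 0.3064, the MSE in (c) is 0.07/5 = 0.014, and R² = 1 - 0.07/1.532 ≈ 0.954.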
