Question

Use the dependent variable (labeled Y) and the independent variables (labeled X1, X2, and X3) in the data file. Use Excel to perform the regression and correlation analysis to answer the following.

Generate a scatterplot for the specified dependent variable (Y) and the X1 independent variable, including the graph of the "best fit" line. Interpret.

Determine the equation of the "best fit" line, which describes the relationship between the dependent variable and the selected independent variable.

Determine the coefficient of correlation. Interpret.

Determine the coefficient of determination. Interpret.

Test the utility of this regression model. Interpret results, including the p-value.

Based on the findings in Steps 1-5, analyze the ability of the independent variable to predict the designated dependent variable.

Compute the confidence interval for β1 (the population slope) using a 95% confidence level. Interpret this interval.

Using an interval, estimate the average for the dependent variable for a selected value of the independent variable. Interpret this interval.

Using an interval, predict the particular value of the dependent variable for a selected value of the independent variable. Interpret this interval.

What can be said about the value of the dependent variable for values of the independent variable that are outside the range of the sample values? Explain.

**In an attempt to improve the model, use a multiple
regression model to predict the dependent variable .Y, based on all
of the independent variables. X1, X2, and X3.**

Using Excel, run the multiple regression analysis using the designated dependent and three independent variables. State the equation for this multiple regression model.

Perform the Global Test for Utility (F-Test). Explain the conclusion.

Perform the t-test on each independent variable. Explain the conclusions and clearly state how the analysis should proceed. In particular, which independent variables should be kept and which should be discarded. If any independent variables are to be discarded, re-run the multiple regression, including only the significant independent variables, and summarize results with discussion of analysis.

Is this multiple regression model better than the linear model generated in parts 1-10? Explain.

Please use the data below, thank you for helping me with this in advance, I really appreciate it.

Sales (Y) | Calls (X1) | Time (X2) | Years (X3) | Type |

51 | 167 | 12.6 | 5 | ONLINE |

34 | 133 | 15.2 | 4 | GROUP |

49 | 161 | 16.1 | 3 | NONE |

45 | 185 | 13.3 | 1 | ONLINE |

47 | 176 | 14.1 | 2 | ONLINE |

47 | 183 | 12.8 | 2 | ONLINE |

38 | 122 | 19.3 | 3 | GROUP |

44 | 171 | 13.6 | 3 | GROUP |

47 | 157 | 14.3 | 1 | GROUP |

37 | 148 | 15.7 | 3 | GROUP |

51 | 177 | 11.4 | 4 | NONE |

40 | 144 | 17.4 | 0 | NONE |

48 | 136 | 13.3 | 2 | ONLINE |

52 | 197 | 14 | 2 | ONLINE |

46 | 145 | 16.8 | 0 | ONLINE |

42 | 167 | 17.7 | 3 | ONLINE |

37 | 120 | 12 | 2 | NONE |

42 | 148 | 16.9 | 1 | NONE |

43 | 131 | 18.5 | 1 | NONE |

49 | 184 | 16.7 | 2 | ONLINE |

44 | 150 | 18.4 | 1 | NONE |

43 | 148 | 15.9 | 1 | ONLINE |

55 | 189 | 12 | 1 | ONLINE |

37 | 152 | 19.8 | 0 | GROUP |

44 | 148 | 13.5 | 3 | GROUP |

43 | 169 | 13.3 | 4 | NONE |

49 | 188 | 20.4 | 1 | NONE |

45 | 164 | 16.7 | 3 | NONE |

45 | 146 | 12 | 3 | GROUP |

43 | 173 | 19.8 | 2 | ONLINE |

47 | 164 | 15.3 | 0 | ONLINE |

48 | 177 | 13.9 | 3 | ONLINE |

49 | 160 | 13.6 | 3 | GROUP |

51 | 190 | 11.3 | 1 | ONLINE |

42 | 135 | 16.1 | 0 | NONE |

37 | 137 | 18.1 | 1 | ONLINE |

51 | 167 | 16.2 | 1 | ONLINE |

44 | 169 | 8.9 | 0 | ONLINE |

46 | 149 | 17.8 | 3 | NONE |

42 | 153 | 15.5 | 2 | GROUP |

45 | 140 | 11 | 3 | GROUP |

37 | 133 | 19.8 | 2 | NONE |

52 | 173 | 18.6 | 0 | ONLINE |

39 | 156 | 13.3 | 4 | NONE |

45 | 130 | 20.6 | 3 | GROUP |

37 | 130 | 15.6 | 1 | GROUP |

40 | 125 | 12.2 | 4 | NONE |

44 | 182 | 15.5 | 4 | NONE |

48 | 165 | 19.8 | 5 | ONLINE |

42 | 154 | 14.8 | 2 | ONLINE |

53 | 178 | 13.2 | 2 | ONLINE |

37 | 142 | 18.5 | 1 | NONE |

46 | 153 | 14.1 | 1 | ONLINE |

43 | 166 | 17.6 | 3 | ONLINE |

45 | 138 | 18.9 | 2 | NONE |

42 | 167 | 18 | 2 | NONE |

48 | 171 | 13 | 2 | GROUP |

39 | 149 | 18.8 | 1 | GROUP |

46 | 151 | 16 | 1 | GROUP |

46 | 162 | 16.2 | 2 | ONLINE |

45 | 158 | 13.9 | 1 | ONLINE |

44 | 188 | 12.9 | 3 | GROUP |

49 | 149 | 21.1 | 2 | GROUP |

41 | 157 | 11.5 | 3 | ONLINE |

48 | 156 | 15.1 | 4 | ONLINE |

46 | 172 | 12.5 | 1 | ONLINE |

48 | 174 | 18.6 | 2 | GROUP |

47 | 188 | 16.3 | 1 | NONE |

54 | 180 | 11.8 | 4 | GROUP |

45 | 173 | 17.6 | 2 | ONLINE |

53 | 184 | 15.2 | 0 | ONLINE |

37 | 148 | 16.2 | 1 | GROUP |

45 | 155 | 18.9 | 2 | GROUP |

44 | 159 | 18.1 | 2 | ONLINE |

46 | 162 | 12.1 | 1 | GROUP |

52 | 177 | 14.5 | 1 | ONLINE |

54 | 174 | 10.8 | 2 | NONE |

48 | 175 | 13.7 | 1 | ONLINE |

44 | 139 | 15.2 | 2 | NONE |

41 | 158 | 19.3 | 2 | ONLINE |

43 | 145 | 18.6 | 2 | NONE |

40 | 150 | 10.8 | 1 | GROUP |

53 | 182 | 10.5 | 1 | ONLINE |

47 | 193 | 13.5 | 2 | ONLINE |

43 | 148 | 14.5 | 4 | ONLINE |

38 | 145 | 17.1 | 2 | NONE |

50 | 184 | 15.6 | 2 | ONLINE |

39 | 138 | 17.7 | 3 | GROUP |

54 | 197 | 11.8 | 1 | ONLINE |

41 | 155 | 13.6 | 3 | GROUP |

41 | 128 | 15.5 | 2 | NONE |

42 | 160 | 10.6 | 3 | NONE |

46 | 148 | 13.1 | 1 | GROUP |

45 | 177 | 14.2 | 2 | GROUP |

43 | 153 | 15.2 | 3 | GROUP |

41 | 153 | 14.7 | 1 | GROUP |

49 | 152 | 22.3 | 0 | ONLINE |

44 | 169 | 13.6 | 1 | ONLINE |

49 | 166 | 16.2 | 0 | ONLINE |

37 | 145 | 18 | 3 | NONE |

Answer #1

Data

Type is made cateogrical

1 for online, 2 for group , 0 for none

result for multiple regression

SUMMARY OUTPUT | |||||||

Regression Statistics | |||||||

Multiple R | 0.693545406 | ||||||

R Square | 0.48100523 | ||||||

Adjusted R Square | 0.459152819 | ||||||

Standard Error | 3.480531964 | ||||||

Observations | 100 | ||||||

ANOVA | |||||||

df | SS | MS | F | Significance F | |||

Regression | 4 | 1066.600238 | 266.6500595 | 22.01154018 | 7.04189E-13 |
||

Residual | 95 | 1150.839762 | 12.11410276 | ||||

Total | 99 | 2217.44 | |||||

Coefficients | Standard Error | t Stat | P-value |
Lower 95% | Upper 95% | Lower 95.0% | |

Intercept | 20.07981678 | 4.515261657 | 4.447099261 | 2.36175E-05 |
11.11588904 | 29.04374451 | 11.11588904 |

TYPE | -0.085965801 | 0.463839992 | -0.185335036 | 0.853361379 |
-1.006804611 | 0.834873008 | -1.006804611 |

Calls (X1) | 0.171766931 | 0.020281269 | 8.469239868 | 3.06065E-13 |
0.131503522 | 0.21203034 | 0.131503522 |

Time (X2) | -0.133904863 | 0.131953162 | -1.014790864 | 0.312783101 |
-0.395865011 | 0.128055284 | -0.395865011 |

Years (X3) | -0.257125411 | 0.294252939 | -0.873824446 | 0.384417446 |
-0.841291353 | 0.327040531 | -0.841291353 |

significance F = **7.04189E-13 << 0.05**

hence this model is significance

if p-value < 0.05 , that variable is significant

here only X1 has p-value (**3.06065E-13)** <
0.05

we can keep X1 and remove X2,x3 and type

Part C: Regression and Correlation Analysis
Use the dependent variable (labeled Y) and the independent
variables (labeled X1, X2, and X3) in the data file. Use Excel to
perform the regression and correlation analysis to answer the
following.
Generate a scatterplot for the specified dependent variable (Y)
and the X1 independent variable, including the graph of the "best
fit" line. Interpret.
Determine the equation of the "best fit" line, which describes
the relationship between the dependent variable and the selected...

Please compute the value, upper control limit, and lower
control limit for the x-bar chart, r-bar chart, p
chart, and np chart.
Sample #
Observations
Defects
1
44
49
44
44
40
0
2
42
46
51
51
42
2
3
45
45
49
49
46
2
4
40
39
42
42
47
7
5
44
51
53
53
44
2
6
41
44
43
43
46
8
7
40
44
46
46
48
3
8
54
49
48
48...

Given here are data for a dependent variable and four potential
predictors.
y
x1
x2
x3
x4
x5
96
8
60
2.4
48
51
73
6
64
2.1
42
43
108
2
76
1.8
34
20
124
5
74
2.2
11
14
82
6
50
1.5
61
29
89
9
57
1.6
53
22
76
1
72
2
72
38
109
3
74
2.8
36
40
123
2
99
2.6
17
50
125
6
81
2.5
48
55
101
2...

y x1 x2 x3
x4
64 74 22 24
17
43 63 29 15
30
51 78 20 9 25
49 52 17 38
29
39 45 12 19 37
Consider the set of dependent and independent variables given
below. Perform a best subsets regression and choose the most
appropriate model for these data.
Find the most appropriate model for the data. Note that the
coefficient is 0 for any variable that is not included in the
model.
y= _____+...

Consider the following data for a dependent variable y and two
independent variables, x1 and x2 . x 1 x 2 y 29 13 94 46 11 109 25
17 112 50 16 178 40 6 95 51 19 175 74 7 170 36 13 117 59 13 143 76
17 212 Round your all answers to two decimal places. Enter negative
values as negative numbers, if necessary. a. Develop an estimated
regression equation relating y to x1 . ŷ...

Consider the following data for a dependent variable y and two
independent variables, x1 and x2.
x1
x2
y
29
12
95
46
10
109
24
17
113
50
17
178
40
5
94
52
19
176
74
7
170
36
13
118
59
13
143
76
16
212
Round your all answers to two decimal places. Enter negative
values as negative numbers, if necessary.
a. Develop an estimated regression equation
relating y to x1.
y^=_____ + _____x1 (fill in...

One of the biggest changes in higher education in recent years
has been the growth of online universities. The Online Education
Database is an independent organization whose mission is to build a
comprehensive list of the top accredited online colleges. The
following table shows the retention rate (%) and the graduation
rate (%) for 29 online colleges. Retention Rate (%) Graduation Rate
(%) 7 25 51 25 4 28 29 32 33 33 47 33 63 34 45 36 60...

(By Hand) For the dependent variable Y and the independent
variables X1 and X2, the linear regression model is given by:
Y=0.08059*X1-0.16109*X2+5.26570. Complete the following table:
Actual Y
X1
X2
Predicted Y
Prediction Error
6
6.8
4.7
3.1
5.3
5.5
5.8
4.5
6.2
4.5
8.8
7
4.5
6.8
6.1
3.7
8.5
5.1
5.4
8.9
4.8
5.1
6.9
5.4
5.8
9.3
5.9
5.7
8.4
5.4

Kuya conducted a study to see if, among smokers, that there is a
significant difference in the sense of smoking urge between
individuals in various levels of administration. To
measure the urge to smoke, the Chronic Habit Obliging Killer
Emphysema (CHOKE) test was used. Type of profession was
categorized to Upper Management, Middle Management, and Lower
Management. The higher the score, the stronger the
urge. Is there a significant difference in their
urge?
UPPER
MIDDLE
LOWER
44
47
31
45
48
32
50
43...

Consider the following data for a dependent variable y
and two independent
variables, x1 and
x2.
x1
x2
y
30
12
94
47
10
108
25
17
112
51
16
178
40
5
94
51
19
175
74
7
170
36
12
117
59
13
142
76
16
211
The estimated regression equation for these data isŷ =
−18.37 + 2.01x1 +
4.74x2.
Here, SST = 15,182.9, SSR = 14,052.2,
sb1 =
0.2471, and sb2
= 0.9484.
(A) Test for a significant relationship...

ADVERTISEMENT

Get Answers For Free

Most questions answered within 1 hours.

ADVERTISEMENT

asked 7 minutes ago

asked 37 minutes ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago

asked 1 hour ago